2025-12-06T00:02:45,925 Created temporary directory: /tmp/pip-ephem-wheel-cache-__04737_ 2025-12-06T00:02:45,926 Created temporary directory: /tmp/pip-build-tracker-o94lek3y 2025-12-06T00:02:45,927 Initialized build tracking at /tmp/pip-build-tracker-o94lek3y 2025-12-06T00:02:45,927 Created build tracker: /tmp/pip-build-tracker-o94lek3y 2025-12-06T00:02:45,928 Entered build tracker: /tmp/pip-build-tracker-o94lek3y 2025-12-06T00:02:45,929 Created temporary directory: /tmp/pip-wheel-xxnppyvq 2025-12-06T00:02:45,932 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-06T00:02:45,934 Created temporary directory: /tmp/pip-ephem-wheel-cache-s12modma 2025-12-06T00:02:45,956 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-06T00:02:45,960 2 location(s) to search for versions of llm-benchmark-toolkit: 2025-12-06T00:02:45,960 * https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:45,960 * https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:45,961 Fetching project page and analyzing links: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:45,962 Getting page https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:45,964 Found index url https://pypi.org/simple 2025-12-06T00:02:46,187 Fetched page https://pypi.org/simple/llm-benchmark-toolkit/ as application/vnd.pypi.simple.v1+json 2025-12-06T00:02:46,193 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/95/b5/c528dad5028e188283baec8a39b34f2f5258c6f93a5984b194354ab3cb67/llm_benchmark_toolkit-0.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,194 Found link https://files.pythonhosted.org/packages/a7/41/73de3b2dd4005a66ae4a2860ae322e7de9cc75bd8b7ef7e8dc2f710f7a29/llm_benchmark_toolkit-0.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.0 2025-12-06T00:02:46,195 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/96/49/345527ecab82ff6ccaf832a6eb1fe2d6bccb98971861482a4c96c173ec68/llm_benchmark_toolkit-0.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,196 Found link https://files.pythonhosted.org/packages/d6/7c/960c3132ff0d4fab4b3b0f5dc2642fe4119a5ea7063d60ab2a8f5b8541de/llm_benchmark_toolkit-0.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.1 2025-12-06T00:02:46,197 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/86/df/8cf567d200bf116978d565bfa82022212049a4e077e2f4057020ba7b3ec1/llm_benchmark_toolkit-0.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,198 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/87/7d/18901d63a9d945df5a49aae05eac642327e2432951a027b39ac4e6e9d303/llm_benchmark_toolkit-0.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,198 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/4d/09/e0d3f5c7e8e533df93d488a0411eb15557ee120b1aa045a7f1fd2bfe8047/llm_benchmark_toolkit-0.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,199 Found link https://files.pythonhosted.org/packages/94/98/e41f2f6abc60c05c01e401fa39fed8d398305200e1ef23cef596be8809c5/llm_benchmark_toolkit-0.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.4.1 2025-12-06T00:02:46,200 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/19/3d/8ca9dc0f08263596dea0de71a92635fe8e23faa668182a4349d90ccb0d82/llm_benchmark_toolkit-2.0.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,201 Found link https://files.pythonhosted.org/packages/36/45/264df4505ca7ef5433049cc402373960ab82e5885cc789e8b85fc73f76bb/llm_benchmark_toolkit-2.0.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.0.0 2025-12-06T00:02:46,202 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/5b/43/32c2b3793610b688ec24cc490ad4c50bcdff3e33f1ccb172cc464972534c/llm_benchmark_toolkit-2.1.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,203 Found link https://files.pythonhosted.org/packages/be/d3/7152b5619a14d4c8a043cc4ee01e4c6b746c910b6df8b91ecb696da1ef4c/llm_benchmark_toolkit-2.1.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.1.0 2025-12-06T00:02:46,203 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/41/79/ca2033ee9ccfebcf708ae68a2007a6072c87c0e9bb24c5354d274bd57247/llm_benchmark_toolkit-2.2.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,204 Found link https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.0 2025-12-06T00:02:46,205 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ee/0b/7e7e6b6575046e5bf1d8cb1eceb88aba723ebde5bb0d761f50d971181902/llm_benchmark_toolkit-2.2.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,206 Found link https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.1 2025-12-06T00:02:46,207 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/90/d6/ee6f413ced99a3ee80d162870f1b759ab790bc4dfb45352a454c3ef9f663/llm_benchmark_toolkit-2.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,208 Found link https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.0 2025-12-06T00:02:46,208 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ca/84/53c0b5d7d479671dcb589ea79d730f626768c29dc4ecd5963929ec624ea9/llm_benchmark_toolkit-2.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,209 Found link https://files.pythonhosted.org/packages/75/6b/c3d742f7e9cf668df0c1dab60629839b213df2946819f34ee03218c7b434/llm_benchmark_toolkit-2.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.1 2025-12-06T00:02:46,210 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/48/2a/98f908c8ee338c89c42bdd846e3a8996eb909c778b38d4dd8068eecc7bd6/llm_benchmark_toolkit-2.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,211 Found link https://files.pythonhosted.org/packages/bb/ce/cd4b4fcd1be52032f03fb742184d3bf854e8c849593757753c6c0565f05a/llm_benchmark_toolkit-2.3.2.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.2 2025-12-06T00:02:46,212 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/39/b5/bccd38a047ddbde176f5576374dd174bdb427c3eda7c92d86b5ea3a5094c/llm_benchmark_toolkit-2.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,213 Found link https://files.pythonhosted.org/packages/86/65/75715f0021de2a69cbf293176b6eed5a7b2bc76d3f832f29367f5ed92fbc/llm_benchmark_toolkit-2.4.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.4.0 2025-12-06T00:02:46,213 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/fb/9a/23ffeb1e40ccac4d1b7fe4cdd3daaf777604970465b21a48114e4842a804/llm_benchmark_toolkit-2.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,214 Found link https://files.pythonhosted.org/packages/03/8c/c8af4e1c13946d342d9f72b9f499de990db9f6c5b7bdf126f4179f74bd0d/llm_benchmark_toolkit-2.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.4.1 2025-12-06T00:02:46,215 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/55/78/b1ef2ae73612640393d1fc73c52da4f80c9aa492bc19058ddd234437ebd8/llm_benchmark_toolkit-2.4.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,215 Found link https://files.pythonhosted.org/packages/e6/9f/94e53c1accb629b7506238a1a2dcddf0c2b66105d7dc27dce04b4ce30a11/llm_benchmark_toolkit-2.4.2.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.4.2 2025-12-06T00:02:46,216 Fetching project page and analyzing links: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:46,217 Getting page https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:46,218 Found index url https://www.piwheels.org/simple 2025-12-06T00:02:46,611 Fetched page https://www.piwheels.org/simple/llm-benchmark-toolkit/ as text/html 2025-12-06T00:02:46,616 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.4.1-py3-none-any.whl#sha256=5cc26dd85863c76a8253af274964ecf8a7bc9ae95d48de2ea1db3c21d00dc05c (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,617 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.4.0-py3-none-any.whl#sha256=b930ef8333ec881679b72a5ade68b3358903063b29a55d67ff13b3050c8874de (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,617 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.2-py3-none-any.whl#sha256=e33d43d5efc28788bbac71ec74fddd0e2253b5a2940038ec16551484b46806fc (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,618 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.1-py3-none-any.whl#sha256=e821a36a5ab2587c33a4f9aeb452387305b7fbfc61e38ac109eaac44afcfcf11 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,619 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.0-py3-none-any.whl#sha256=de403f7d45ce04034909c8e7022101a4bcd685517024ebe115314d8862ed7895 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,619 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.1-py3-none-any.whl#sha256=a910972ff3802aeeec4c93220e0a5373097fb87c071c065ce81cef88e41c0d7f (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,620 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.0-py3-none-any.whl#sha256=2fbbb501c0dd81fa057db285c3202e45adfe99ceba2398974482329d2b805a41 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,621 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.1.0-py3-none-any.whl#sha256=c85ebb550837b0faf5cb28798b4ec2f84c9d20796e69f43ce7933f3145f903ce (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,621 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.0.0-py3-none-any.whl#sha256=9956b86c6b62f17a14b176a906481dbb5053a5f4c7f8dc479a1afb0668f10336 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,622 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.4.1-py3-none-any.whl#sha256=c237cd524f90f123d8ecce03f9b033a198b328d9d44ca52c0df5aa132b3e60be (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,622 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.1-py3-none-any.whl#sha256=44752309369b0c2b9e5bb1a1093611cc8a51e84014c2d50b4087e7c9bf1e8a49 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,623 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.0-py3-none-any.whl#sha256=8abac7bd509fef8b4c7c3a124e9afb765a0340e7d6c4c2b800dc7117fd8f99b4 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-06T00:02:46,623 Skipping link: not a file: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:46,624 Skipping link: not a file: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-06T00:02:46,646 Given no hashes to check 1 links for project 'llm-benchmark-toolkit': discarding no candidates 2025-12-06T00:02:46,665 Collecting llm-benchmark-toolkit==2.4.2 2025-12-06T00:02:46,668 Created temporary directory: /tmp/pip-unpack-gftgt8lf 2025-12-06T00:02:46,882 Downloading llm_benchmark_toolkit-2.4.2.tar.gz (398 kB) 2025-12-06T00:02:47,272 Added llm-benchmark-toolkit==2.4.2 from https://files.pythonhosted.org/packages/e6/9f/94e53c1accb629b7506238a1a2dcddf0c2b66105d7dc27dce04b4ce30a11/llm_benchmark_toolkit-2.4.2.tar.gz to build tracker '/tmp/pip-build-tracker-o94lek3y' 2025-12-06T00:02:47,285 Created temporary directory: /tmp/pip-build-env-pmsnsh_b 2025-12-06T00:02:47,290 Installing build dependencies: started 2025-12-06T00:02:47,292 Running command pip subprocess to install build dependencies 2025-12-06T00:02:48,481 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-06T00:02:49,078 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-06T00:02:49,101 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-06T00:02:50,847 Collecting setuptools>=61.0 2025-12-06T00:02:50,958 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-12-06T00:02:51,245 Collecting wheel 2025-12-06T00:02:51,262 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2025-12-06T00:02:54,161 Installing collected packages: wheel, setuptools 2025-12-06T00:02:54,417 Creating /tmp/pip-build-env-pmsnsh_b/overlay/local/bin 2025-12-06T00:02:54,419 changing mode of /tmp/pip-build-env-pmsnsh_b/overlay/local/bin/wheel to 755 2025-12-06T00:02:58,098 Successfully installed setuptools-80.9.0 wheel-0.45.1 2025-12-06T00:02:58,375 Installing build dependencies: finished with status 'done' 2025-12-06T00:02:58,382 Getting requirements to build wheel: started 2025-12-06T00:02:58,383 Running command Getting requirements to build wheel 2025-12-06T00:02:59,016 /tmp/pip-build-env-pmsnsh_b/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-06T00:02:59,016 corresp(dist, value, root_dir) 2025-12-06T00:02:59,017 /tmp/pip-build-env-pmsnsh_b/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-06T00:02:59,017 corresp(dist, value, root_dir) 2025-12-06T00:02:59,114 running egg_info 2025-12-06T00:02:59,122 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-06T00:02:59,142 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-06T00:02:59,144 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-06T00:02:59,157 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-06T00:02:59,159 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-06T00:02:59,194 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:02:59,207 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:02:59,307 Getting requirements to build wheel: finished with status 'done' 2025-12-06T00:02:59,311 Created temporary directory: /tmp/pip-modern-metadata-8vmyyjln 2025-12-06T00:02:59,313 Preparing metadata (pyproject.toml): started 2025-12-06T00:02:59,315 Running command Preparing metadata (pyproject.toml) 2025-12-06T00:02:59,927 /tmp/pip-build-env-pmsnsh_b/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-06T00:02:59,927 corresp(dist, value, root_dir) 2025-12-06T00:02:59,928 /tmp/pip-build-env-pmsnsh_b/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-06T00:02:59,929 corresp(dist, value, root_dir) 2025-12-06T00:03:00,025 running dist_info 2025-12-06T00:03:00,038 creating /tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info 2025-12-06T00:03:00,039 writing /tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-06T00:03:00,058 writing dependency_links to /tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-06T00:03:00,060 writing entry points to /tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-06T00:03:00,072 writing requirements to /tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-06T00:03:00,073 writing top-level names to /tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-06T00:03:00,075 writing manifest file '/tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:03:00,104 reading manifest file '/tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:03:00,111 writing manifest file '/tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:03:00,112 creating '/tmp/pip-modern-metadata-8vmyyjln/llm_benchmark_toolkit-2.4.2.dist-info' 2025-12-06T00:03:00,237 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-06T00:03:00,244 Source in /tmp/pip-wheel-xxnppyvq/llm-benchmark-toolkit_9df19cb8d5c04129b0723749add5189c has version 2.4.2, which satisfies requirement llm-benchmark-toolkit==2.4.2 from https://files.pythonhosted.org/packages/e6/9f/94e53c1accb629b7506238a1a2dcddf0c2b66105d7dc27dce04b4ce30a11/llm_benchmark_toolkit-2.4.2.tar.gz 2025-12-06T00:03:00,245 Removed llm-benchmark-toolkit==2.4.2 from https://files.pythonhosted.org/packages/e6/9f/94e53c1accb629b7506238a1a2dcddf0c2b66105d7dc27dce04b4ce30a11/llm_benchmark_toolkit-2.4.2.tar.gz from build tracker '/tmp/pip-build-tracker-o94lek3y' 2025-12-06T00:03:00,254 Created temporary directory: /tmp/pip-unpack-er5fqpa8 2025-12-06T00:03:00,254 Building wheels for collected packages: llm-benchmark-toolkit 2025-12-06T00:03:00,259 Created temporary directory: /tmp/pip-wheel-mm7d012t 2025-12-06T00:03:00,259 Destination directory: /tmp/pip-wheel-mm7d012t 2025-12-06T00:03:00,262 Building wheel for llm-benchmark-toolkit (pyproject.toml): started 2025-12-06T00:03:00,263 Running command Building wheel for llm-benchmark-toolkit (pyproject.toml) 2025-12-06T00:03:00,834 /tmp/pip-build-env-pmsnsh_b/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-06T00:03:00,835 corresp(dist, value, root_dir) 2025-12-06T00:03:00,835 /tmp/pip-build-env-pmsnsh_b/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-06T00:03:00,836 corresp(dist, value, root_dir) 2025-12-06T00:03:00,919 running bdist_wheel 2025-12-06T00:03:00,940 running build 2025-12-06T00:03:00,941 running build_py 2025-12-06T00:03:00,948 creating build/lib/llm_evaluator 2025-12-06T00:03:00,950 copying src/llm_evaluator/statistical_metrics.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,953 copying src/llm_evaluator/evaluator.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,956 copying src/llm_evaluator/error_analysis.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,959 copying src/llm_evaluator/__init__.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,961 copying src/llm_evaluator/export.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,963 copying src/llm_evaluator/visualizations.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,966 copying src/llm_evaluator/dataset_loaders.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,968 copying src/llm_evaluator/config.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,970 copying src/llm_evaluator/metrics.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,972 copying src/llm_evaluator/academic_baselines.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,975 copying src/llm_evaluator/cli.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,979 copying src/llm_evaluator/system_info.py -> build/lib/llm_evaluator 2025-12-06T00:03:00,982 creating build/lib/llm_evaluator/providers 2025-12-06T00:03:00,983 copying src/llm_evaluator/providers/huggingface_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:00,985 copying src/llm_evaluator/providers/gemini_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:00,988 copying src/llm_evaluator/providers/deepseek_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:00,991 copying src/llm_evaluator/providers/anthropic_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:00,993 copying src/llm_evaluator/providers/groq_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:00,996 copying src/llm_evaluator/providers/__init__.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:00,998 copying src/llm_evaluator/providers/ollama_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:01,001 copying src/llm_evaluator/providers/base.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:01,003 copying src/llm_evaluator/providers/fireworks_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:01,005 copying src/llm_evaluator/providers/together_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:01,007 copying src/llm_evaluator/providers/openai_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:01,010 copying src/llm_evaluator/providers/cached_provider.py -> build/lib/llm_evaluator/providers 2025-12-06T00:03:01,013 creating build/lib/llm_evaluator/security 2025-12-06T00:03:01,014 copying src/llm_evaluator/security/toxicity.py -> build/lib/llm_evaluator/security 2025-12-06T00:03:01,016 copying src/llm_evaluator/security/pii_detector.py -> build/lib/llm_evaluator/security 2025-12-06T00:03:01,018 copying src/llm_evaluator/security/__init__.py -> build/lib/llm_evaluator/security 2025-12-06T00:03:01,020 copying src/llm_evaluator/security/red_team.py -> build/lib/llm_evaluator/security 2025-12-06T00:03:01,023 copying src/llm_evaluator/security/prompt_injection.py -> build/lib/llm_evaluator/security 2025-12-06T00:03:01,026 creating build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,027 copying src/llm_evaluator/benchmarks/hellaswag.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,029 copying src/llm_evaluator/benchmarks/truthfulqa.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,032 copying src/llm_evaluator/benchmarks/donotanswer.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,034 copying src/llm_evaluator/benchmarks/mmlu.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,036 copying src/llm_evaluator/benchmarks/boolq.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,038 copying src/llm_evaluator/benchmarks/__init__.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,040 copying src/llm_evaluator/benchmarks/winogrande.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,042 copying src/llm_evaluator/benchmarks/gsm8k.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,044 copying src/llm_evaluator/benchmarks/runner.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,046 copying src/llm_evaluator/benchmarks/base.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,049 copying src/llm_evaluator/benchmarks/commonsenseqa.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,051 copying src/llm_evaluator/benchmarks/arc.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,053 copying src/llm_evaluator/benchmarks/safetybench.py -> build/lib/llm_evaluator/benchmarks 2025-12-06T00:03:01,056 creating build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,057 copying src/llm_evaluator/dashboard/model_discovery.py -> build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,059 copying src/llm_evaluator/dashboard/__init__.py -> build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,061 copying src/llm_evaluator/dashboard/runner.py -> build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,063 copying src/llm_evaluator/dashboard/app.py -> build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,066 copying src/llm_evaluator/dashboard/__main__.py -> build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,068 copying src/llm_evaluator/dashboard/models.py -> build/lib/llm_evaluator/dashboard 2025-12-06T00:03:01,071 running egg_info 2025-12-06T00:03:01,083 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-06T00:03:01,101 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-06T00:03:01,103 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-06T00:03:01,115 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-06T00:03:01,116 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-06T00:03:01,135 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:03:01,148 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-06T00:03:01,155 creating build/lib/llm_evaluator/dashboard/static 2025-12-06T00:03:01,156 copying src/llm_evaluator/dashboard/static/favicon.svg -> build/lib/llm_evaluator/dashboard/static 2025-12-06T00:03:01,158 copying src/llm_evaluator/dashboard/static/index.html -> build/lib/llm_evaluator/dashboard/static 2025-12-06T00:03:01,160 creating build/lib/llm_evaluator/dashboard/static/assets 2025-12-06T00:03:01,161 copying src/llm_evaluator/dashboard/static/assets/index-DE0Rg7o1.css -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-06T00:03:01,165 copying src/llm_evaluator/dashboard/static/assets/index-CcvlFRmM.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-06T00:03:01,199 installing to build/bdist.linux-armv7l/wheel 2025-12-06T00:03:01,200 running install 2025-12-06T00:03:01,224 running install_lib 2025-12-06T00:03:01,230 creating build/bdist.linux-armv7l/wheel 2025-12-06T00:03:01,232 creating build/bdist.linux-armv7l/wheel/llm_evaluator 2025-12-06T00:03:01,233 copying build/lib/llm_evaluator/statistical_metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,236 copying build/lib/llm_evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,238 copying build/lib/llm_evaluator/error_analysis.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,241 copying build/lib/llm_evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,243 copying build/lib/llm_evaluator/export.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,245 copying build/lib/llm_evaluator/visualizations.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,248 copying build/lib/llm_evaluator/dataset_loaders.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,250 creating build/bdist.linux-armv7l/wheel/llm_evaluator/providers 2025-12-06T00:03:01,251 copying build/lib/llm_evaluator/providers/huggingface_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,254 copying build/lib/llm_evaluator/providers/gemini_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,256 copying build/lib/llm_evaluator/providers/deepseek_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,258 copying build/lib/llm_evaluator/providers/anthropic_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,261 copying build/lib/llm_evaluator/providers/groq_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,263 copying build/lib/llm_evaluator/providers/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,265 copying build/lib/llm_evaluator/providers/ollama_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,267 copying build/lib/llm_evaluator/providers/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,269 copying build/lib/llm_evaluator/providers/fireworks_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,271 copying build/lib/llm_evaluator/providers/together_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,273 copying build/lib/llm_evaluator/providers/openai_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,276 copying build/lib/llm_evaluator/providers/cached_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-06T00:03:01,278 creating build/bdist.linux-armv7l/wheel/llm_evaluator/security 2025-12-06T00:03:01,279 copying build/lib/llm_evaluator/security/toxicity.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-06T00:03:01,282 copying build/lib/llm_evaluator/security/pii_detector.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-06T00:03:01,284 copying build/lib/llm_evaluator/security/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-06T00:03:01,286 copying build/lib/llm_evaluator/security/red_team.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-06T00:03:01,288 copying build/lib/llm_evaluator/security/prompt_injection.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-06T00:03:01,290 copying build/lib/llm_evaluator/config.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,292 copying build/lib/llm_evaluator/metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,294 copying build/lib/llm_evaluator/academic_baselines.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,297 creating build/bdist.linux-armv7l/wheel/llm_evaluator/benchmarks 2025-12-06T00:03:01,298 copying build/lib/llm_evaluator/benchmarks/hellaswag.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,301 copying build/lib/llm_evaluator/benchmarks/truthfulqa.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,303 copying build/lib/llm_evaluator/benchmarks/donotanswer.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,305 copying build/lib/llm_evaluator/benchmarks/mmlu.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,307 copying build/lib/llm_evaluator/benchmarks/boolq.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,309 copying build/lib/llm_evaluator/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,310 copying build/lib/llm_evaluator/benchmarks/winogrande.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,312 copying build/lib/llm_evaluator/benchmarks/gsm8k.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,314 copying build/lib/llm_evaluator/benchmarks/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,316 copying build/lib/llm_evaluator/benchmarks/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,319 copying build/lib/llm_evaluator/benchmarks/commonsenseqa.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,321 copying build/lib/llm_evaluator/benchmarks/arc.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,323 copying build/lib/llm_evaluator/benchmarks/safetybench.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-06T00:03:01,325 copying build/lib/llm_evaluator/cli.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,329 copying build/lib/llm_evaluator/system_info.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-06T00:03:01,331 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard 2025-12-06T00:03:01,333 copying build/lib/llm_evaluator/dashboard/model_discovery.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-06T00:03:01,335 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static 2025-12-06T00:03:01,337 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static/assets 2025-12-06T00:03:01,338 copying build/lib/llm_evaluator/dashboard/static/assets/index-DE0Rg7o1.css -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-06T00:03:01,341 copying build/lib/llm_evaluator/dashboard/static/assets/index-CcvlFRmM.js -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-06T00:03:01,374 copying build/lib/llm_evaluator/dashboard/static/favicon.svg -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-06T00:03:01,376 copying build/lib/llm_evaluator/dashboard/static/index.html -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-06T00:03:01,378 copying build/lib/llm_evaluator/dashboard/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-06T00:03:01,379 copying build/lib/llm_evaluator/dashboard/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-06T00:03:01,383 copying build/lib/llm_evaluator/dashboard/app.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-06T00:03:01,386 copying build/lib/llm_evaluator/dashboard/__main__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-06T00:03:01,388 copying build/lib/llm_evaluator/dashboard/models.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-06T00:03:01,390 running install_egg_info 2025-12-06T00:03:01,395 Copying src/llm_benchmark_toolkit.egg-info to build/bdist.linux-armv7l/wheel/./llm_benchmark_toolkit-2.4.2-py3.11.egg-info 2025-12-06T00:03:01,407 running install_scripts 2025-12-06T00:03:01,416 creating build/bdist.linux-armv7l/wheel/llm_benchmark_toolkit-2.4.2.dist-info/WHEEL 2025-12-06T00:03:01,418 creating '/tmp/pip-wheel-mm7d012t/.tmp-3frdh49_/llm_benchmark_toolkit-2.4.2-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-06T00:03:01,422 adding 'llm_evaluator/__init__.py' 2025-12-06T00:03:01,424 adding 'llm_evaluator/academic_baselines.py' 2025-12-06T00:03:01,433 adding 'llm_evaluator/cli.py' 2025-12-06T00:03:01,436 adding 'llm_evaluator/config.py' 2025-12-06T00:03:01,437 adding 'llm_evaluator/dataset_loaders.py' 2025-12-06T00:03:01,439 adding 'llm_evaluator/error_analysis.py' 2025-12-06T00:03:01,443 adding 'llm_evaluator/evaluator.py' 2025-12-06T00:03:01,445 adding 'llm_evaluator/export.py' 2025-12-06T00:03:01,447 adding 'llm_evaluator/metrics.py' 2025-12-06T00:03:01,450 adding 'llm_evaluator/statistical_metrics.py' 2025-12-06T00:03:01,451 adding 'llm_evaluator/system_info.py' 2025-12-06T00:03:01,454 adding 'llm_evaluator/visualizations.py' 2025-12-06T00:03:01,456 adding 'llm_evaluator/benchmarks/__init__.py' 2025-12-06T00:03:01,457 adding 'llm_evaluator/benchmarks/arc.py' 2025-12-06T00:03:01,459 adding 'llm_evaluator/benchmarks/base.py' 2025-12-06T00:03:01,461 adding 'llm_evaluator/benchmarks/boolq.py' 2025-12-06T00:03:01,462 adding 'llm_evaluator/benchmarks/commonsenseqa.py' 2025-12-06T00:03:01,464 adding 'llm_evaluator/benchmarks/donotanswer.py' 2025-12-06T00:03:01,465 adding 'llm_evaluator/benchmarks/gsm8k.py' 2025-12-06T00:03:01,466 adding 'llm_evaluator/benchmarks/hellaswag.py' 2025-12-06T00:03:01,468 adding 'llm_evaluator/benchmarks/mmlu.py' 2025-12-06T00:03:01,470 adding 'llm_evaluator/benchmarks/runner.py' 2025-12-06T00:03:01,472 adding 'llm_evaluator/benchmarks/safetybench.py' 2025-12-06T00:03:01,473 adding 'llm_evaluator/benchmarks/truthfulqa.py' 2025-12-06T00:03:01,475 adding 'llm_evaluator/benchmarks/winogrande.py' 2025-12-06T00:03:01,476 adding 'llm_evaluator/dashboard/__init__.py' 2025-12-06T00:03:01,478 adding 'llm_evaluator/dashboard/__main__.py' 2025-12-06T00:03:01,482 adding 'llm_evaluator/dashboard/app.py' 2025-12-06T00:03:01,483 adding 'llm_evaluator/dashboard/model_discovery.py' 2025-12-06T00:03:01,485 adding 'llm_evaluator/dashboard/models.py' 2025-12-06T00:03:01,489 adding 'llm_evaluator/dashboard/runner.py' 2025-12-06T00:03:01,491 adding 'llm_evaluator/dashboard/static/favicon.svg' 2025-12-06T00:03:01,492 adding 'llm_evaluator/dashboard/static/index.html' 2025-12-06T00:03:01,573 adding 'llm_evaluator/dashboard/static/assets/index-CcvlFRmM.js' 2025-12-06T00:03:01,581 adding 'llm_evaluator/dashboard/static/assets/index-DE0Rg7o1.css' 2025-12-06T00:03:01,583 adding 'llm_evaluator/providers/__init__.py' 2025-12-06T00:03:01,585 adding 'llm_evaluator/providers/anthropic_provider.py' 2025-12-06T00:03:01,587 adding 'llm_evaluator/providers/base.py' 2025-12-06T00:03:01,589 adding 'llm_evaluator/providers/cached_provider.py' 2025-12-06T00:03:01,591 adding 'llm_evaluator/providers/deepseek_provider.py' 2025-12-06T00:03:01,593 adding 'llm_evaluator/providers/fireworks_provider.py' 2025-12-06T00:03:01,595 adding 'llm_evaluator/providers/gemini_provider.py' 2025-12-06T00:03:01,596 adding 'llm_evaluator/providers/groq_provider.py' 2025-12-06T00:03:01,598 adding 'llm_evaluator/providers/huggingface_provider.py' 2025-12-06T00:03:01,600 adding 'llm_evaluator/providers/ollama_provider.py' 2025-12-06T00:03:01,602 adding 'llm_evaluator/providers/openai_provider.py' 2025-12-06T00:03:01,604 adding 'llm_evaluator/providers/together_provider.py' 2025-12-06T00:03:01,606 adding 'llm_evaluator/security/__init__.py' 2025-12-06T00:03:01,607 adding 'llm_evaluator/security/pii_detector.py' 2025-12-06T00:03:01,609 adding 'llm_evaluator/security/prompt_injection.py' 2025-12-06T00:03:01,611 adding 'llm_evaluator/security/red_team.py' 2025-12-06T00:03:01,613 adding 'llm_evaluator/security/toxicity.py' 2025-12-06T00:03:01,615 adding 'llm_benchmark_toolkit-2.4.2.dist-info/METADATA' 2025-12-06T00:03:01,617 adding 'llm_benchmark_toolkit-2.4.2.dist-info/WHEEL' 2025-12-06T00:03:01,617 adding 'llm_benchmark_toolkit-2.4.2.dist-info/entry_points.txt' 2025-12-06T00:03:01,618 adding 'llm_benchmark_toolkit-2.4.2.dist-info/top_level.txt' 2025-12-06T00:03:01,620 adding 'llm_benchmark_toolkit-2.4.2.dist-info/RECORD' 2025-12-06T00:03:01,627 removing build/bdist.linux-armv7l/wheel 2025-12-06T00:03:01,747 Building wheel for llm-benchmark-toolkit (pyproject.toml): finished with status 'done' 2025-12-06T00:03:01,756 Created wheel for llm-benchmark-toolkit: filename=llm_benchmark_toolkit-2.4.2-py3-none-any.whl size=365229 sha256=0987f1d21b4f16511fe7d9e42438b993e8110423f0e42ad6abdd71409cd34726 2025-12-06T00:03:01,757 Stored in directory: /tmp/pip-ephem-wheel-cache-s12modma/wheels/de/44/07/fb93bdf6632080881738569bc828dcf52d1d9ab2dd185fb4da 2025-12-06T00:03:01,775 Successfully built llm-benchmark-toolkit 2025-12-06T00:03:01,786 Removed build tracker: '/tmp/pip-build-tracker-o94lek3y'