2025-12-05T23:47:50,233 Created temporary directory: /tmp/pip-ephem-wheel-cache-iuzk59ti 2025-12-05T23:47:50,234 Created temporary directory: /tmp/pip-build-tracker-6gpfuu1s 2025-12-05T23:47:50,235 Initialized build tracking at /tmp/pip-build-tracker-6gpfuu1s 2025-12-05T23:47:50,235 Created build tracker: /tmp/pip-build-tracker-6gpfuu1s 2025-12-05T23:47:50,236 Entered build tracker: /tmp/pip-build-tracker-6gpfuu1s 2025-12-05T23:47:50,237 Created temporary directory: /tmp/pip-wheel-mnrxagf_ 2025-12-05T23:47:50,240 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-05T23:47:50,242 Created temporary directory: /tmp/pip-ephem-wheel-cache-quq0xme_ 2025-12-05T23:47:50,266 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-05T23:47:50,270 2 location(s) to search for versions of llm-benchmark-toolkit: 2025-12-05T23:47:50,270 * https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,270 * https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,271 Fetching project page and analyzing links: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,272 Getting page https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,273 Found index url https://pypi.org/simple 2025-12-05T23:47:50,492 Fetched page https://pypi.org/simple/llm-benchmark-toolkit/ as application/vnd.pypi.simple.v1+json 2025-12-05T23:47:50,498 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/95/b5/c528dad5028e188283baec8a39b34f2f5258c6f93a5984b194354ab3cb67/llm_benchmark_toolkit-0.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,499 Found link https://files.pythonhosted.org/packages/a7/41/73de3b2dd4005a66ae4a2860ae322e7de9cc75bd8b7ef7e8dc2f710f7a29/llm_benchmark_toolkit-0.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.0 2025-12-05T23:47:50,500 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/96/49/345527ecab82ff6ccaf832a6eb1fe2d6bccb98971861482a4c96c173ec68/llm_benchmark_toolkit-0.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,501 Found link https://files.pythonhosted.org/packages/d6/7c/960c3132ff0d4fab4b3b0f5dc2642fe4119a5ea7063d60ab2a8f5b8541de/llm_benchmark_toolkit-0.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.1 2025-12-05T23:47:50,502 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/86/df/8cf567d200bf116978d565bfa82022212049a4e077e2f4057020ba7b3ec1/llm_benchmark_toolkit-0.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,502 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/87/7d/18901d63a9d945df5a49aae05eac642327e2432951a027b39ac4e6e9d303/llm_benchmark_toolkit-0.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,503 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/4d/09/e0d3f5c7e8e533df93d488a0411eb15557ee120b1aa045a7f1fd2bfe8047/llm_benchmark_toolkit-0.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,504 Found link https://files.pythonhosted.org/packages/94/98/e41f2f6abc60c05c01e401fa39fed8d398305200e1ef23cef596be8809c5/llm_benchmark_toolkit-0.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.4.1 2025-12-05T23:47:50,505 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/19/3d/8ca9dc0f08263596dea0de71a92635fe8e23faa668182a4349d90ccb0d82/llm_benchmark_toolkit-2.0.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,506 Found link https://files.pythonhosted.org/packages/36/45/264df4505ca7ef5433049cc402373960ab82e5885cc789e8b85fc73f76bb/llm_benchmark_toolkit-2.0.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.0.0 2025-12-05T23:47:50,507 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/5b/43/32c2b3793610b688ec24cc490ad4c50bcdff3e33f1ccb172cc464972534c/llm_benchmark_toolkit-2.1.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,507 Found link https://files.pythonhosted.org/packages/be/d3/7152b5619a14d4c8a043cc4ee01e4c6b746c910b6df8b91ecb696da1ef4c/llm_benchmark_toolkit-2.1.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.1.0 2025-12-05T23:47:50,508 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/41/79/ca2033ee9ccfebcf708ae68a2007a6072c87c0e9bb24c5354d274bd57247/llm_benchmark_toolkit-2.2.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,509 Found link https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.0 2025-12-05T23:47:50,509 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ee/0b/7e7e6b6575046e5bf1d8cb1eceb88aba723ebde5bb0d761f50d971181902/llm_benchmark_toolkit-2.2.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,510 Found link https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.1 2025-12-05T23:47:50,511 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/90/d6/ee6f413ced99a3ee80d162870f1b759ab790bc4dfb45352a454c3ef9f663/llm_benchmark_toolkit-2.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,512 Found link https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.0 2025-12-05T23:47:50,513 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ca/84/53c0b5d7d479671dcb589ea79d730f626768c29dc4ecd5963929ec624ea9/llm_benchmark_toolkit-2.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,514 Found link https://files.pythonhosted.org/packages/75/6b/c3d742f7e9cf668df0c1dab60629839b213df2946819f34ee03218c7b434/llm_benchmark_toolkit-2.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.1 2025-12-05T23:47:50,515 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/48/2a/98f908c8ee338c89c42bdd846e3a8996eb909c778b38d4dd8068eecc7bd6/llm_benchmark_toolkit-2.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,516 Found link https://files.pythonhosted.org/packages/bb/ce/cd4b4fcd1be52032f03fb742184d3bf854e8c849593757753c6c0565f05a/llm_benchmark_toolkit-2.3.2.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.2 2025-12-05T23:47:50,516 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/39/b5/bccd38a047ddbde176f5576374dd174bdb427c3eda7c92d86b5ea3a5094c/llm_benchmark_toolkit-2.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,518 Found link https://files.pythonhosted.org/packages/86/65/75715f0021de2a69cbf293176b6eed5a7b2bc76d3f832f29367f5ed92fbc/llm_benchmark_toolkit-2.4.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.4.0 2025-12-05T23:47:50,518 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/fb/9a/23ffeb1e40ccac4d1b7fe4cdd3daaf777604970465b21a48114e4842a804/llm_benchmark_toolkit-2.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,519 Found link https://files.pythonhosted.org/packages/03/8c/c8af4e1c13946d342d9f72b9f499de990db9f6c5b7bdf126f4179f74bd0d/llm_benchmark_toolkit-2.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.4.1 2025-12-05T23:47:50,520 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/55/78/b1ef2ae73612640393d1fc73c52da4f80c9aa492bc19058ddd234437ebd8/llm_benchmark_toolkit-2.4.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,520 Found link https://files.pythonhosted.org/packages/e6/9f/94e53c1accb629b7506238a1a2dcddf0c2b66105d7dc27dce04b4ce30a11/llm_benchmark_toolkit-2.4.2.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.4.2 2025-12-05T23:47:50,521 Fetching project page and analyzing links: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,522 Getting page https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,523 Found index url https://www.piwheels.org/simple 2025-12-05T23:47:50,886 Fetched page https://www.piwheels.org/simple/llm-benchmark-toolkit/ as text/html 2025-12-05T23:47:50,892 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.4.0-py3-none-any.whl#sha256=b930ef8333ec881679b72a5ade68b3358903063b29a55d67ff13b3050c8874de (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,893 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.2-py3-none-any.whl#sha256=e33d43d5efc28788bbac71ec74fddd0e2253b5a2940038ec16551484b46806fc (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,894 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.1-py3-none-any.whl#sha256=e821a36a5ab2587c33a4f9aeb452387305b7fbfc61e38ac109eaac44afcfcf11 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,895 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.0-py3-none-any.whl#sha256=de403f7d45ce04034909c8e7022101a4bcd685517024ebe115314d8862ed7895 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,895 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.1-py3-none-any.whl#sha256=a910972ff3802aeeec4c93220e0a5373097fb87c071c065ce81cef88e41c0d7f (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,896 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.0-py3-none-any.whl#sha256=2fbbb501c0dd81fa057db285c3202e45adfe99ceba2398974482329d2b805a41 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,897 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.1.0-py3-none-any.whl#sha256=c85ebb550837b0faf5cb28798b4ec2f84c9d20796e69f43ce7933f3145f903ce (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,898 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.0.0-py3-none-any.whl#sha256=9956b86c6b62f17a14b176a906481dbb5053a5f4c7f8dc479a1afb0668f10336 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,898 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.4.1-py3-none-any.whl#sha256=c237cd524f90f123d8ecce03f9b033a198b328d9d44ca52c0df5aa132b3e60be (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,899 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.1-py3-none-any.whl#sha256=44752309369b0c2b9e5bb1a1093611cc8a51e84014c2d50b4087e7c9bf1e8a49 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,899 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.0-py3-none-any.whl#sha256=8abac7bd509fef8b4c7c3a124e9afb765a0340e7d6c4c2b800dc7117fd8f99b4 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-05T23:47:50,900 Skipping link: not a file: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,901 Skipping link: not a file: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-05T23:47:50,933 Given no hashes to check 1 links for project 'llm-benchmark-toolkit': discarding no candidates 2025-12-05T23:47:50,961 Collecting llm-benchmark-toolkit==2.4.1 2025-12-05T23:47:50,963 Created temporary directory: /tmp/pip-unpack-fbtl0xud 2025-12-05T23:47:51,179 Downloading llm_benchmark_toolkit-2.4.1.tar.gz (398 kB) 2025-12-05T23:47:51,526 Added llm-benchmark-toolkit==2.4.1 from https://files.pythonhosted.org/packages/03/8c/c8af4e1c13946d342d9f72b9f499de990db9f6c5b7bdf126f4179f74bd0d/llm_benchmark_toolkit-2.4.1.tar.gz to build tracker '/tmp/pip-build-tracker-6gpfuu1s' 2025-12-05T23:47:51,535 Created temporary directory: /tmp/pip-build-env-hg82w3jk 2025-12-05T23:47:51,540 Installing build dependencies: started 2025-12-05T23:47:51,541 Running command pip subprocess to install build dependencies 2025-12-05T23:47:52,717 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-05T23:47:53,320 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-05T23:47:53,343 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-05T23:47:55,073 Collecting setuptools>=61.0 2025-12-05T23:47:55,224 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-12-05T23:47:55,522 Collecting wheel 2025-12-05T23:47:55,540 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2025-12-05T23:47:58,479 Installing collected packages: wheel, setuptools 2025-12-05T23:47:58,725 Creating /tmp/pip-build-env-hg82w3jk/overlay/local/bin 2025-12-05T23:47:58,727 changing mode of /tmp/pip-build-env-hg82w3jk/overlay/local/bin/wheel to 755 2025-12-05T23:48:02,386 Successfully installed setuptools-80.9.0 wheel-0.45.1 2025-12-05T23:48:02,663 Installing build dependencies: finished with status 'done' 2025-12-05T23:48:02,669 Getting requirements to build wheel: started 2025-12-05T23:48:02,671 Running command Getting requirements to build wheel 2025-12-05T23:48:03,319 /tmp/pip-build-env-hg82w3jk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-05T23:48:03,319 corresp(dist, value, root_dir) 2025-12-05T23:48:03,320 /tmp/pip-build-env-hg82w3jk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-05T23:48:03,320 corresp(dist, value, root_dir) 2025-12-05T23:48:03,416 running egg_info 2025-12-05T23:48:03,423 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-05T23:48:03,443 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-05T23:48:03,445 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-05T23:48:03,457 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-05T23:48:03,459 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-05T23:48:03,493 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:03,506 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:03,605 Getting requirements to build wheel: finished with status 'done' 2025-12-05T23:48:03,609 Created temporary directory: /tmp/pip-modern-metadata-3os4xq3x 2025-12-05T23:48:03,611 Preparing metadata (pyproject.toml): started 2025-12-05T23:48:03,612 Running command Preparing metadata (pyproject.toml) 2025-12-05T23:48:04,221 /tmp/pip-build-env-hg82w3jk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-05T23:48:04,221 corresp(dist, value, root_dir) 2025-12-05T23:48:04,222 /tmp/pip-build-env-hg82w3jk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-05T23:48:04,222 corresp(dist, value, root_dir) 2025-12-05T23:48:04,314 running dist_info 2025-12-05T23:48:04,326 creating /tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info 2025-12-05T23:48:04,327 writing /tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-05T23:48:04,346 writing dependency_links to /tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-05T23:48:04,347 writing entry points to /tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-05T23:48:04,359 writing requirements to /tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-05T23:48:04,360 writing top-level names to /tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-05T23:48:04,362 writing manifest file '/tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:04,390 reading manifest file '/tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:04,398 writing manifest file '/tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:04,399 creating '/tmp/pip-modern-metadata-3os4xq3x/llm_benchmark_toolkit-2.4.1.dist-info' 2025-12-05T23:48:04,526 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-05T23:48:04,533 Source in /tmp/pip-wheel-mnrxagf_/llm-benchmark-toolkit_a283f1bcd18c456aa96720919d8a23f0 has version 2.4.1, which satisfies requirement llm-benchmark-toolkit==2.4.1 from https://files.pythonhosted.org/packages/03/8c/c8af4e1c13946d342d9f72b9f499de990db9f6c5b7bdf126f4179f74bd0d/llm_benchmark_toolkit-2.4.1.tar.gz 2025-12-05T23:48:04,534 Removed llm-benchmark-toolkit==2.4.1 from https://files.pythonhosted.org/packages/03/8c/c8af4e1c13946d342d9f72b9f499de990db9f6c5b7bdf126f4179f74bd0d/llm_benchmark_toolkit-2.4.1.tar.gz from build tracker '/tmp/pip-build-tracker-6gpfuu1s' 2025-12-05T23:48:04,543 Created temporary directory: /tmp/pip-unpack-eyveqzdc 2025-12-05T23:48:04,544 Building wheels for collected packages: llm-benchmark-toolkit 2025-12-05T23:48:04,549 Created temporary directory: /tmp/pip-wheel-kao28s3w 2025-12-05T23:48:04,549 Destination directory: /tmp/pip-wheel-kao28s3w 2025-12-05T23:48:04,551 Building wheel for llm-benchmark-toolkit (pyproject.toml): started 2025-12-05T23:48:04,552 Running command Building wheel for llm-benchmark-toolkit (pyproject.toml) 2025-12-05T23:48:05,123 /tmp/pip-build-env-hg82w3jk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-05T23:48:05,123 corresp(dist, value, root_dir) 2025-12-05T23:48:05,124 /tmp/pip-build-env-hg82w3jk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-05T23:48:05,124 corresp(dist, value, root_dir) 2025-12-05T23:48:05,213 running bdist_wheel 2025-12-05T23:48:05,234 running build 2025-12-05T23:48:05,235 running build_py 2025-12-05T23:48:05,242 creating build/lib/llm_evaluator 2025-12-05T23:48:05,245 copying src/llm_evaluator/statistical_metrics.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,248 copying src/llm_evaluator/evaluator.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,252 copying src/llm_evaluator/error_analysis.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,255 copying src/llm_evaluator/__init__.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,257 copying src/llm_evaluator/export.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,260 copying src/llm_evaluator/visualizations.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,262 copying src/llm_evaluator/dataset_loaders.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,265 copying src/llm_evaluator/config.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,267 copying src/llm_evaluator/metrics.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,269 copying src/llm_evaluator/academic_baselines.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,272 copying src/llm_evaluator/cli.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,275 copying src/llm_evaluator/system_info.py -> build/lib/llm_evaluator 2025-12-05T23:48:05,278 creating build/lib/llm_evaluator/providers 2025-12-05T23:48:05,280 copying src/llm_evaluator/providers/huggingface_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,282 copying src/llm_evaluator/providers/gemini_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,285 copying src/llm_evaluator/providers/deepseek_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,287 copying src/llm_evaluator/providers/anthropic_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,290 copying src/llm_evaluator/providers/groq_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,292 copying src/llm_evaluator/providers/__init__.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,295 copying src/llm_evaluator/providers/ollama_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,297 copying src/llm_evaluator/providers/base.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,299 copying src/llm_evaluator/providers/fireworks_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,302 copying src/llm_evaluator/providers/together_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,305 copying src/llm_evaluator/providers/openai_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,307 copying src/llm_evaluator/providers/cached_provider.py -> build/lib/llm_evaluator/providers 2025-12-05T23:48:05,310 creating build/lib/llm_evaluator/security 2025-12-05T23:48:05,311 copying src/llm_evaluator/security/toxicity.py -> build/lib/llm_evaluator/security 2025-12-05T23:48:05,314 copying src/llm_evaluator/security/pii_detector.py -> build/lib/llm_evaluator/security 2025-12-05T23:48:05,316 copying src/llm_evaluator/security/__init__.py -> build/lib/llm_evaluator/security 2025-12-05T23:48:05,318 copying src/llm_evaluator/security/red_team.py -> build/lib/llm_evaluator/security 2025-12-05T23:48:05,321 copying src/llm_evaluator/security/prompt_injection.py -> build/lib/llm_evaluator/security 2025-12-05T23:48:05,324 creating build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,326 copying src/llm_evaluator/benchmarks/hellaswag.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,328 copying src/llm_evaluator/benchmarks/truthfulqa.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,331 copying src/llm_evaluator/benchmarks/donotanswer.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,333 copying src/llm_evaluator/benchmarks/mmlu.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,335 copying src/llm_evaluator/benchmarks/boolq.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,337 copying src/llm_evaluator/benchmarks/__init__.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,339 copying src/llm_evaluator/benchmarks/winogrande.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,341 copying src/llm_evaluator/benchmarks/gsm8k.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,343 copying src/llm_evaluator/benchmarks/runner.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,346 copying src/llm_evaluator/benchmarks/base.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,348 copying src/llm_evaluator/benchmarks/commonsenseqa.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,351 copying src/llm_evaluator/benchmarks/arc.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,353 copying src/llm_evaluator/benchmarks/safetybench.py -> build/lib/llm_evaluator/benchmarks 2025-12-05T23:48:05,356 creating build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,357 copying src/llm_evaluator/dashboard/model_discovery.py -> build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,360 copying src/llm_evaluator/dashboard/__init__.py -> build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,362 copying src/llm_evaluator/dashboard/runner.py -> build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,365 copying src/llm_evaluator/dashboard/app.py -> build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,368 copying src/llm_evaluator/dashboard/__main__.py -> build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,370 copying src/llm_evaluator/dashboard/models.py -> build/lib/llm_evaluator/dashboard 2025-12-05T23:48:05,374 running egg_info 2025-12-05T23:48:05,386 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-05T23:48:05,405 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-05T23:48:05,406 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-05T23:48:05,419 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-05T23:48:05,420 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-05T23:48:05,438 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:05,452 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-05T23:48:05,459 creating build/lib/llm_evaluator/dashboard/static 2025-12-05T23:48:05,461 copying src/llm_evaluator/dashboard/static/favicon.svg -> build/lib/llm_evaluator/dashboard/static 2025-12-05T23:48:05,463 copying src/llm_evaluator/dashboard/static/index.html -> build/lib/llm_evaluator/dashboard/static 2025-12-05T23:48:05,465 creating build/lib/llm_evaluator/dashboard/static/assets 2025-12-05T23:48:05,466 copying src/llm_evaluator/dashboard/static/assets/index-DE0Rg7o1.css -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-05T23:48:05,470 copying src/llm_evaluator/dashboard/static/assets/index-CcvlFRmM.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-05T23:48:05,500 installing to build/bdist.linux-armv7l/wheel 2025-12-05T23:48:05,501 running install 2025-12-05T23:48:05,524 running install_lib 2025-12-05T23:48:05,530 creating build/bdist.linux-armv7l/wheel 2025-12-05T23:48:05,533 creating build/bdist.linux-armv7l/wheel/llm_evaluator 2025-12-05T23:48:05,534 copying build/lib/llm_evaluator/statistical_metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,537 copying build/lib/llm_evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,540 copying build/lib/llm_evaluator/error_analysis.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,542 copying build/lib/llm_evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,544 copying build/lib/llm_evaluator/export.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,547 copying build/lib/llm_evaluator/visualizations.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,550 copying build/lib/llm_evaluator/dataset_loaders.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,553 creating build/bdist.linux-armv7l/wheel/llm_evaluator/providers 2025-12-05T23:48:05,554 copying build/lib/llm_evaluator/providers/huggingface_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,556 copying build/lib/llm_evaluator/providers/gemini_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,559 copying build/lib/llm_evaluator/providers/deepseek_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,561 copying build/lib/llm_evaluator/providers/anthropic_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,564 copying build/lib/llm_evaluator/providers/groq_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,566 copying build/lib/llm_evaluator/providers/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,568 copying build/lib/llm_evaluator/providers/ollama_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,571 copying build/lib/llm_evaluator/providers/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,573 copying build/lib/llm_evaluator/providers/fireworks_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,575 copying build/lib/llm_evaluator/providers/together_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,578 copying build/lib/llm_evaluator/providers/openai_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,580 copying build/lib/llm_evaluator/providers/cached_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-05T23:48:05,583 creating build/bdist.linux-armv7l/wheel/llm_evaluator/security 2025-12-05T23:48:05,584 copying build/lib/llm_evaluator/security/toxicity.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-05T23:48:05,587 copying build/lib/llm_evaluator/security/pii_detector.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-05T23:48:05,589 copying build/lib/llm_evaluator/security/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-05T23:48:05,591 copying build/lib/llm_evaluator/security/red_team.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-05T23:48:05,593 copying build/lib/llm_evaluator/security/prompt_injection.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/security 2025-12-05T23:48:05,596 copying build/lib/llm_evaluator/config.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,598 copying build/lib/llm_evaluator/metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,600 copying build/lib/llm_evaluator/academic_baselines.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,602 creating build/bdist.linux-armv7l/wheel/llm_evaluator/benchmarks 2025-12-05T23:48:05,603 copying build/lib/llm_evaluator/benchmarks/hellaswag.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,605 copying build/lib/llm_evaluator/benchmarks/truthfulqa.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,608 copying build/lib/llm_evaluator/benchmarks/donotanswer.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,610 copying build/lib/llm_evaluator/benchmarks/mmlu.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,612 copying build/lib/llm_evaluator/benchmarks/boolq.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,613 copying build/lib/llm_evaluator/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,615 copying build/lib/llm_evaluator/benchmarks/winogrande.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,617 copying build/lib/llm_evaluator/benchmarks/gsm8k.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,619 copying build/lib/llm_evaluator/benchmarks/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,621 copying build/lib/llm_evaluator/benchmarks/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,623 copying build/lib/llm_evaluator/benchmarks/commonsenseqa.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,626 copying build/lib/llm_evaluator/benchmarks/arc.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,628 copying build/lib/llm_evaluator/benchmarks/safetybench.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/benchmarks 2025-12-05T23:48:05,630 copying build/lib/llm_evaluator/cli.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,634 copying build/lib/llm_evaluator/system_info.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-05T23:48:05,637 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard 2025-12-05T23:48:05,638 copying build/lib/llm_evaluator/dashboard/model_discovery.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-05T23:48:05,640 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static 2025-12-05T23:48:05,642 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static/assets 2025-12-05T23:48:05,643 copying build/lib/llm_evaluator/dashboard/static/assets/index-DE0Rg7o1.css -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-05T23:48:05,646 copying build/lib/llm_evaluator/dashboard/static/assets/index-CcvlFRmM.js -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-05T23:48:05,660 copying build/lib/llm_evaluator/dashboard/static/favicon.svg -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-05T23:48:05,662 copying build/lib/llm_evaluator/dashboard/static/index.html -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-05T23:48:05,664 copying build/lib/llm_evaluator/dashboard/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-05T23:48:05,665 copying build/lib/llm_evaluator/dashboard/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-05T23:48:05,668 copying build/lib/llm_evaluator/dashboard/app.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-05T23:48:05,671 copying build/lib/llm_evaluator/dashboard/__main__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-05T23:48:05,673 copying build/lib/llm_evaluator/dashboard/models.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-05T23:48:05,674 running install_egg_info 2025-12-05T23:48:05,680 Copying src/llm_benchmark_toolkit.egg-info to build/bdist.linux-armv7l/wheel/./llm_benchmark_toolkit-2.4.1-py3.11.egg-info 2025-12-05T23:48:05,691 running install_scripts 2025-12-05T23:48:05,700 creating build/bdist.linux-armv7l/wheel/llm_benchmark_toolkit-2.4.1.dist-info/WHEEL 2025-12-05T23:48:05,702 creating '/tmp/pip-wheel-kao28s3w/.tmp-y0o_65nd/llm_benchmark_toolkit-2.4.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-05T23:48:05,706 adding 'llm_evaluator/__init__.py' 2025-12-05T23:48:05,708 adding 'llm_evaluator/academic_baselines.py' 2025-12-05T23:48:05,717 adding 'llm_evaluator/cli.py' 2025-12-05T23:48:05,720 adding 'llm_evaluator/config.py' 2025-12-05T23:48:05,721 adding 'llm_evaluator/dataset_loaders.py' 2025-12-05T23:48:05,723 adding 'llm_evaluator/error_analysis.py' 2025-12-05T23:48:05,726 adding 'llm_evaluator/evaluator.py' 2025-12-05T23:48:05,729 adding 'llm_evaluator/export.py' 2025-12-05T23:48:05,730 adding 'llm_evaluator/metrics.py' 2025-12-05T23:48:05,733 adding 'llm_evaluator/statistical_metrics.py' 2025-12-05T23:48:05,735 adding 'llm_evaluator/system_info.py' 2025-12-05T23:48:05,737 adding 'llm_evaluator/visualizations.py' 2025-12-05T23:48:05,739 adding 'llm_evaluator/benchmarks/__init__.py' 2025-12-05T23:48:05,741 adding 'llm_evaluator/benchmarks/arc.py' 2025-12-05T23:48:05,743 adding 'llm_evaluator/benchmarks/base.py' 2025-12-05T23:48:05,744 adding 'llm_evaluator/benchmarks/boolq.py' 2025-12-05T23:48:05,746 adding 'llm_evaluator/benchmarks/commonsenseqa.py' 2025-12-05T23:48:05,747 adding 'llm_evaluator/benchmarks/donotanswer.py' 2025-12-05T23:48:05,748 adding 'llm_evaluator/benchmarks/gsm8k.py' 2025-12-05T23:48:05,750 adding 'llm_evaluator/benchmarks/hellaswag.py' 2025-12-05T23:48:05,751 adding 'llm_evaluator/benchmarks/mmlu.py' 2025-12-05T23:48:05,753 adding 'llm_evaluator/benchmarks/runner.py' 2025-12-05T23:48:05,755 adding 'llm_evaluator/benchmarks/safetybench.py' 2025-12-05T23:48:05,756 adding 'llm_evaluator/benchmarks/truthfulqa.py' 2025-12-05T23:48:05,758 adding 'llm_evaluator/benchmarks/winogrande.py' 2025-12-05T23:48:05,760 adding 'llm_evaluator/dashboard/__init__.py' 2025-12-05T23:48:05,761 adding 'llm_evaluator/dashboard/__main__.py' 2025-12-05T23:48:05,765 adding 'llm_evaluator/dashboard/app.py' 2025-12-05T23:48:05,767 adding 'llm_evaluator/dashboard/model_discovery.py' 2025-12-05T23:48:05,769 adding 'llm_evaluator/dashboard/models.py' 2025-12-05T23:48:05,772 adding 'llm_evaluator/dashboard/runner.py' 2025-12-05T23:48:05,775 adding 'llm_evaluator/dashboard/static/favicon.svg' 2025-12-05T23:48:05,776 adding 'llm_evaluator/dashboard/static/index.html' 2025-12-05T23:48:05,857 adding 'llm_evaluator/dashboard/static/assets/index-CcvlFRmM.js' 2025-12-05T23:48:05,865 adding 'llm_evaluator/dashboard/static/assets/index-DE0Rg7o1.css' 2025-12-05T23:48:05,868 adding 'llm_evaluator/providers/__init__.py' 2025-12-05T23:48:05,870 adding 'llm_evaluator/providers/anthropic_provider.py' 2025-12-05T23:48:05,871 adding 'llm_evaluator/providers/base.py' 2025-12-05T23:48:05,873 adding 'llm_evaluator/providers/cached_provider.py' 2025-12-05T23:48:05,875 adding 'llm_evaluator/providers/deepseek_provider.py' 2025-12-05T23:48:05,877 adding 'llm_evaluator/providers/fireworks_provider.py' 2025-12-05T23:48:05,879 adding 'llm_evaluator/providers/gemini_provider.py' 2025-12-05T23:48:05,881 adding 'llm_evaluator/providers/groq_provider.py' 2025-12-05T23:48:05,883 adding 'llm_evaluator/providers/huggingface_provider.py' 2025-12-05T23:48:05,885 adding 'llm_evaluator/providers/ollama_provider.py' 2025-12-05T23:48:05,887 adding 'llm_evaluator/providers/openai_provider.py' 2025-12-05T23:48:05,889 adding 'llm_evaluator/providers/together_provider.py' 2025-12-05T23:48:05,891 adding 'llm_evaluator/security/__init__.py' 2025-12-05T23:48:05,892 adding 'llm_evaluator/security/pii_detector.py' 2025-12-05T23:48:05,894 adding 'llm_evaluator/security/prompt_injection.py' 2025-12-05T23:48:05,896 adding 'llm_evaluator/security/red_team.py' 2025-12-05T23:48:05,898 adding 'llm_evaluator/security/toxicity.py' 2025-12-05T23:48:05,900 adding 'llm_benchmark_toolkit-2.4.1.dist-info/METADATA' 2025-12-05T23:48:05,901 adding 'llm_benchmark_toolkit-2.4.1.dist-info/WHEEL' 2025-12-05T23:48:05,902 adding 'llm_benchmark_toolkit-2.4.1.dist-info/entry_points.txt' 2025-12-05T23:48:05,903 adding 'llm_benchmark_toolkit-2.4.1.dist-info/top_level.txt' 2025-12-05T23:48:05,905 adding 'llm_benchmark_toolkit-2.4.1.dist-info/RECORD' 2025-12-05T23:48:05,911 removing build/bdist.linux-armv7l/wheel 2025-12-05T23:48:06,025 Building wheel for llm-benchmark-toolkit (pyproject.toml): finished with status 'done' 2025-12-05T23:48:06,038 Created wheel for llm-benchmark-toolkit: filename=llm_benchmark_toolkit-2.4.1-py3-none-any.whl size=365231 sha256=5cc26dd85863c76a8253af274964ecf8a7bc9ae95d48de2ea1db3c21d00dc05c 2025-12-05T23:48:06,039 Stored in directory: /tmp/pip-ephem-wheel-cache-quq0xme_/wheels/a8/c8/13/dd0290c5925dc0ccb7d9e105d92f140b9ab4b5bd69540222de 2025-12-05T23:48:06,057 Successfully built llm-benchmark-toolkit 2025-12-05T23:48:06,073 Removed build tracker: '/tmp/pip-build-tracker-6gpfuu1s'