2025-12-02T23:55:34,859 Created temporary directory: /tmp/pip-ephem-wheel-cache-5vqpb5ch 2025-12-02T23:55:34,861 Created temporary directory: /tmp/pip-build-tracker-l3o80d1u 2025-12-02T23:55:34,861 Initialized build tracking at /tmp/pip-build-tracker-l3o80d1u 2025-12-02T23:55:34,862 Created build tracker: /tmp/pip-build-tracker-l3o80d1u 2025-12-02T23:55:34,862 Entered build tracker: /tmp/pip-build-tracker-l3o80d1u 2025-12-02T23:55:34,863 Created temporary directory: /tmp/pip-wheel-9liwvfrl 2025-12-02T23:55:34,866 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-02T23:55:34,868 Created temporary directory: /tmp/pip-ephem-wheel-cache-hwvdgm05 2025-12-02T23:55:34,892 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-02T23:55:34,896 2 location(s) to search for versions of llm-benchmark-toolkit: 2025-12-02T23:55:34,896 * https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:34,896 * https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:34,897 Fetching project page and analyzing links: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:34,897 Getting page https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:34,899 Found index url https://pypi.org/simple 2025-12-02T23:55:35,112 Fetched page https://pypi.org/simple/llm-benchmark-toolkit/ as application/vnd.pypi.simple.v1+json 2025-12-02T23:55:35,116 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/95/b5/c528dad5028e188283baec8a39b34f2f5258c6f93a5984b194354ab3cb67/llm_benchmark_toolkit-0.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,117 Found link https://files.pythonhosted.org/packages/a7/41/73de3b2dd4005a66ae4a2860ae322e7de9cc75bd8b7ef7e8dc2f710f7a29/llm_benchmark_toolkit-0.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.0 2025-12-02T23:55:35,118 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/96/49/345527ecab82ff6ccaf832a6eb1fe2d6bccb98971861482a4c96c173ec68/llm_benchmark_toolkit-0.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,119 Found link https://files.pythonhosted.org/packages/d6/7c/960c3132ff0d4fab4b3b0f5dc2642fe4119a5ea7063d60ab2a8f5b8541de/llm_benchmark_toolkit-0.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.1 2025-12-02T23:55:35,120 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/86/df/8cf567d200bf116978d565bfa82022212049a4e077e2f4057020ba7b3ec1/llm_benchmark_toolkit-0.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,121 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/87/7d/18901d63a9d945df5a49aae05eac642327e2432951a027b39ac4e6e9d303/llm_benchmark_toolkit-0.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,121 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/4d/09/e0d3f5c7e8e533df93d488a0411eb15557ee120b1aa045a7f1fd2bfe8047/llm_benchmark_toolkit-0.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,122 Found link https://files.pythonhosted.org/packages/94/98/e41f2f6abc60c05c01e401fa39fed8d398305200e1ef23cef596be8809c5/llm_benchmark_toolkit-0.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.4.1 2025-12-02T23:55:35,123 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/19/3d/8ca9dc0f08263596dea0de71a92635fe8e23faa668182a4349d90ccb0d82/llm_benchmark_toolkit-2.0.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,124 Found link https://files.pythonhosted.org/packages/36/45/264df4505ca7ef5433049cc402373960ab82e5885cc789e8b85fc73f76bb/llm_benchmark_toolkit-2.0.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.0.0 2025-12-02T23:55:35,125 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/5b/43/32c2b3793610b688ec24cc490ad4c50bcdff3e33f1ccb172cc464972534c/llm_benchmark_toolkit-2.1.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,125 Found link https://files.pythonhosted.org/packages/be/d3/7152b5619a14d4c8a043cc4ee01e4c6b746c910b6df8b91ecb696da1ef4c/llm_benchmark_toolkit-2.1.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.1.0 2025-12-02T23:55:35,126 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/41/79/ca2033ee9ccfebcf708ae68a2007a6072c87c0e9bb24c5354d274bd57247/llm_benchmark_toolkit-2.2.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,127 Found link https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.0 2025-12-02T23:55:35,127 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ee/0b/7e7e6b6575046e5bf1d8cb1eceb88aba723ebde5bb0d761f50d971181902/llm_benchmark_toolkit-2.2.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,128 Found link https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.1 2025-12-02T23:55:35,129 Fetching project page and analyzing links: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:35,130 Getting page https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:35,131 Found index url https://www.piwheels.org/simple 2025-12-02T23:55:35,313 Fetched page https://www.piwheels.org/simple/llm-benchmark-toolkit/ as text/html 2025-12-02T23:55:35,315 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.1.0-py3-none-any.whl#sha256=c85ebb550837b0faf5cb28798b4ec2f84c9d20796e69f43ce7933f3145f903ce (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,316 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.0.0-py3-none-any.whl#sha256=9956b86c6b62f17a14b176a906481dbb5053a5f4c7f8dc479a1afb0668f10336 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,317 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.4.1-py3-none-any.whl#sha256=c237cd524f90f123d8ecce03f9b033a198b328d9d44ca52c0df5aa132b3e60be (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,317 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.1-py3-none-any.whl#sha256=44752309369b0c2b9e5bb1a1093611cc8a51e84014c2d50b4087e7c9bf1e8a49 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,318 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.0-py3-none-any.whl#sha256=8abac7bd509fef8b4c7c3a124e9afb765a0340e7d6c4c2b800dc7117fd8f99b4 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:55:35,319 Skipping link: not a file: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:35,319 Skipping link: not a file: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:55:35,339 Given no hashes to check 1 links for project 'llm-benchmark-toolkit': discarding no candidates 2025-12-02T23:55:35,356 Collecting llm-benchmark-toolkit==2.2.0 2025-12-02T23:55:35,358 Created temporary directory: /tmp/pip-unpack-oasnel8y 2025-12-02T23:55:35,568 Downloading llm_benchmark_toolkit-2.2.0.tar.gz (333 kB) 2025-12-02T23:55:35,881 Added llm-benchmark-toolkit==2.2.0 from https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz to build tracker '/tmp/pip-build-tracker-l3o80d1u' 2025-12-02T23:55:35,889 Created temporary directory: /tmp/pip-build-env-xkt_0o76 2025-12-02T23:55:35,894 Installing build dependencies: started 2025-12-02T23:55:35,895 Running command pip subprocess to install build dependencies 2025-12-02T23:55:37,056 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-02T23:55:37,680 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-02T23:55:37,704 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-02T23:55:39,421 Collecting setuptools>=61.0 2025-12-02T23:55:39,510 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-12-02T23:55:39,780 Collecting wheel 2025-12-02T23:55:39,795 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2025-12-02T23:55:42,700 Installing collected packages: wheel, setuptools 2025-12-02T23:55:42,942 Creating /tmp/pip-build-env-xkt_0o76/overlay/local/bin 2025-12-02T23:55:42,944 changing mode of /tmp/pip-build-env-xkt_0o76/overlay/local/bin/wheel to 755 2025-12-02T23:55:46,539 Successfully installed setuptools-80.9.0 wheel-0.45.1 2025-12-02T23:55:46,813 Installing build dependencies: finished with status 'done' 2025-12-02T23:55:46,820 Getting requirements to build wheel: started 2025-12-02T23:55:46,821 Running command Getting requirements to build wheel 2025-12-02T23:55:47,445 /tmp/pip-build-env-xkt_0o76/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-02T23:55:47,445 corresp(dist, value, root_dir) 2025-12-02T23:55:47,446 /tmp/pip-build-env-xkt_0o76/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-02T23:55:47,447 corresp(dist, value, root_dir) 2025-12-02T23:55:47,544 running egg_info 2025-12-02T23:55:47,550 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-02T23:55:47,567 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-02T23:55:47,569 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-02T23:55:47,579 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-02T23:55:47,581 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-02T23:55:47,626 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:47,650 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:47,749 Getting requirements to build wheel: finished with status 'done' 2025-12-02T23:55:47,752 Created temporary directory: /tmp/pip-modern-metadata-isigsrdu 2025-12-02T23:55:47,754 Preparing metadata (pyproject.toml): started 2025-12-02T23:55:47,755 Running command Preparing metadata (pyproject.toml) 2025-12-02T23:55:48,329 /tmp/pip-build-env-xkt_0o76/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-02T23:55:48,329 corresp(dist, value, root_dir) 2025-12-02T23:55:48,330 /tmp/pip-build-env-xkt_0o76/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-02T23:55:48,330 corresp(dist, value, root_dir) 2025-12-02T23:55:48,424 running dist_info 2025-12-02T23:55:48,436 creating /tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info 2025-12-02T23:55:48,437 writing /tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-02T23:55:48,454 writing dependency_links to /tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-02T23:55:48,456 writing entry points to /tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-02T23:55:48,465 writing requirements to /tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-02T23:55:48,467 writing top-level names to /tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-02T23:55:48,468 writing manifest file '/tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:48,498 reading manifest file '/tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:48,504 writing manifest file '/tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:48,505 creating '/tmp/pip-modern-metadata-isigsrdu/llm_benchmark_toolkit-2.2.0.dist-info' 2025-12-02T23:55:48,631 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-02T23:55:48,637 Source in /tmp/pip-wheel-9liwvfrl/llm-benchmark-toolkit_2024c5b0e8594228ab1e6927f36edb25 has version 2.2.0, which satisfies requirement llm-benchmark-toolkit==2.2.0 from https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz 2025-12-02T23:55:48,638 Removed llm-benchmark-toolkit==2.2.0 from https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz from build tracker '/tmp/pip-build-tracker-l3o80d1u' 2025-12-02T23:55:48,646 Created temporary directory: /tmp/pip-unpack-7ivyagfe 2025-12-02T23:55:48,647 Building wheels for collected packages: llm-benchmark-toolkit 2025-12-02T23:55:48,651 Created temporary directory: /tmp/pip-wheel-5noyj2u1 2025-12-02T23:55:48,652 Destination directory: /tmp/pip-wheel-5noyj2u1 2025-12-02T23:55:48,654 Building wheel for llm-benchmark-toolkit (pyproject.toml): started 2025-12-02T23:55:48,655 Running command Building wheel for llm-benchmark-toolkit (pyproject.toml) 2025-12-02T23:55:49,229 /tmp/pip-build-env-xkt_0o76/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-02T23:55:49,229 corresp(dist, value, root_dir) 2025-12-02T23:55:49,230 /tmp/pip-build-env-xkt_0o76/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-02T23:55:49,231 corresp(dist, value, root_dir) 2025-12-02T23:55:49,314 running bdist_wheel 2025-12-02T23:55:49,335 running build 2025-12-02T23:55:49,336 running build_py 2025-12-02T23:55:49,343 creating build/lib/llm_evaluator 2025-12-02T23:55:49,345 copying src/llm_evaluator/benchmarks.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,349 copying src/llm_evaluator/cli.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,353 copying src/llm_evaluator/config.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,355 copying src/llm_evaluator/error_analysis.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,358 copying src/llm_evaluator/visualizations.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,361 copying src/llm_evaluator/system_info.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,363 copying src/llm_evaluator/__init__.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,365 copying src/llm_evaluator/export.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,368 copying src/llm_evaluator/evaluator.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,370 copying src/llm_evaluator/statistical_metrics.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,373 copying src/llm_evaluator/metrics.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,375 copying src/llm_evaluator/academic_baselines.py -> build/lib/llm_evaluator 2025-12-02T23:55:49,379 creating build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,380 copying src/llm_evaluator/dashboard/runner.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,383 copying src/llm_evaluator/dashboard/models.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,385 copying src/llm_evaluator/dashboard/app.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,388 copying src/llm_evaluator/dashboard/__init__.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,390 copying src/llm_evaluator/dashboard/model_discovery.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,392 copying src/llm_evaluator/dashboard/__main__.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:55:49,395 creating build/lib/llm_evaluator/providers 2025-12-02T23:55:49,395 copying src/llm_evaluator/providers/openai_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,398 copying src/llm_evaluator/providers/huggingface_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,400 copying src/llm_evaluator/providers/anthropic_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,403 copying src/llm_evaluator/providers/cached_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,405 copying src/llm_evaluator/providers/base.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,408 copying src/llm_evaluator/providers/__init__.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,409 copying src/llm_evaluator/providers/deepseek_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,412 copying src/llm_evaluator/providers/ollama_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:55:49,415 running egg_info 2025-12-02T23:55:49,426 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-02T23:55:49,442 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-02T23:55:49,444 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-02T23:55:49,453 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-02T23:55:49,454 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-02T23:55:49,470 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:49,481 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:55:49,487 creating build/lib/llm_evaluator/dashboard/static 2025-12-02T23:55:49,488 copying src/llm_evaluator/dashboard/static/favicon.svg -> build/lib/llm_evaluator/dashboard/static 2025-12-02T23:55:49,491 copying src/llm_evaluator/dashboard/static/index.html -> build/lib/llm_evaluator/dashboard/static 2025-12-02T23:55:49,493 creating build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,494 copying src/llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,507 copying src/llm_evaluator/dashboard/static/assets/index-BGRAUHhM.css -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,511 copying src/llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,537 installing to build/bdist.linux-armv7l/wheel 2025-12-02T23:55:49,537 running install 2025-12-02T23:55:49,561 running install_lib 2025-12-02T23:55:49,567 creating build/bdist.linux-armv7l/wheel 2025-12-02T23:55:49,569 creating build/bdist.linux-armv7l/wheel/llm_evaluator 2025-12-02T23:55:49,570 copying build/lib/llm_evaluator/benchmarks.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,574 copying build/lib/llm_evaluator/cli.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,577 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard 2025-12-02T23:55:49,578 copying build/lib/llm_evaluator/dashboard/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:55:49,581 copying build/lib/llm_evaluator/dashboard/models.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:55:49,583 copying build/lib/llm_evaluator/dashboard/app.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:55:49,586 copying build/lib/llm_evaluator/dashboard/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:55:49,588 copying build/lib/llm_evaluator/dashboard/model_discovery.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:55:49,590 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static 2025-12-02T23:55:49,591 copying build/lib/llm_evaluator/dashboard/static/favicon.svg -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-02T23:55:49,594 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,595 copying build/lib/llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,607 copying build/lib/llm_evaluator/dashboard/static/assets/index-BGRAUHhM.css -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-02T23:55:49,610 copying build/lib/llm_evaluator/dashboard/static/index.html -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-02T23:55:49,612 copying build/lib/llm_evaluator/dashboard/__main__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:55:49,614 copying build/lib/llm_evaluator/config.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,617 copying build/lib/llm_evaluator/error_analysis.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,619 copying build/lib/llm_evaluator/visualizations.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,623 creating build/bdist.linux-armv7l/wheel/llm_evaluator/providers 2025-12-02T23:55:49,624 copying build/lib/llm_evaluator/providers/openai_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,627 copying build/lib/llm_evaluator/providers/huggingface_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,630 copying build/lib/llm_evaluator/providers/anthropic_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,632 copying build/lib/llm_evaluator/providers/cached_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,635 copying build/lib/llm_evaluator/providers/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,638 copying build/lib/llm_evaluator/providers/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,640 copying build/lib/llm_evaluator/providers/deepseek_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,642 copying build/lib/llm_evaluator/providers/ollama_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:55:49,645 copying build/lib/llm_evaluator/system_info.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,647 copying build/lib/llm_evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,649 copying build/lib/llm_evaluator/export.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,652 copying build/lib/llm_evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,655 copying build/lib/llm_evaluator/statistical_metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,658 copying build/lib/llm_evaluator/metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,660 copying build/lib/llm_evaluator/academic_baselines.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:55:49,663 running install_egg_info 2025-12-02T23:55:49,669 Copying src/llm_benchmark_toolkit.egg-info to build/bdist.linux-armv7l/wheel/./llm_benchmark_toolkit-2.2.0-py3.11.egg-info 2025-12-02T23:55:49,681 running install_scripts 2025-12-02T23:55:49,691 creating build/bdist.linux-armv7l/wheel/llm_benchmark_toolkit-2.2.0.dist-info/WHEEL 2025-12-02T23:55:49,694 creating '/tmp/pip-wheel-5noyj2u1/.tmp-g2ij8_jt/llm_benchmark_toolkit-2.2.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-02T23:55:49,698 adding 'llm_evaluator/__init__.py' 2025-12-02T23:55:49,700 adding 'llm_evaluator/academic_baselines.py' 2025-12-02T23:55:49,706 adding 'llm_evaluator/benchmarks.py' 2025-12-02T23:55:49,713 adding 'llm_evaluator/cli.py' 2025-12-02T23:55:49,715 adding 'llm_evaluator/config.py' 2025-12-02T23:55:49,718 adding 'llm_evaluator/error_analysis.py' 2025-12-02T23:55:49,721 adding 'llm_evaluator/evaluator.py' 2025-12-02T23:55:49,724 adding 'llm_evaluator/export.py' 2025-12-02T23:55:49,726 adding 'llm_evaluator/metrics.py' 2025-12-02T23:55:49,729 adding 'llm_evaluator/statistical_metrics.py' 2025-12-02T23:55:49,731 adding 'llm_evaluator/system_info.py' 2025-12-02T23:55:49,734 adding 'llm_evaluator/visualizations.py' 2025-12-02T23:55:49,736 adding 'llm_evaluator/dashboard/__init__.py' 2025-12-02T23:55:49,738 adding 'llm_evaluator/dashboard/__main__.py' 2025-12-02T23:55:49,742 adding 'llm_evaluator/dashboard/app.py' 2025-12-02T23:55:49,744 adding 'llm_evaluator/dashboard/model_discovery.py' 2025-12-02T23:55:49,745 adding 'llm_evaluator/dashboard/models.py' 2025-12-02T23:55:49,749 adding 'llm_evaluator/dashboard/runner.py' 2025-12-02T23:55:49,751 adding 'llm_evaluator/dashboard/static/favicon.svg' 2025-12-02T23:55:49,753 adding 'llm_evaluator/dashboard/static/index.html' 2025-12-02T23:55:49,829 adding 'llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js' 2025-12-02T23:55:49,836 adding 'llm_evaluator/dashboard/static/assets/index-BGRAUHhM.css' 2025-12-02T23:55:49,839 adding 'llm_evaluator/providers/__init__.py' 2025-12-02T23:55:49,841 adding 'llm_evaluator/providers/anthropic_provider.py' 2025-12-02T23:55:49,843 adding 'llm_evaluator/providers/base.py' 2025-12-02T23:55:49,845 adding 'llm_evaluator/providers/cached_provider.py' 2025-12-02T23:55:49,848 adding 'llm_evaluator/providers/deepseek_provider.py' 2025-12-02T23:55:49,850 adding 'llm_evaluator/providers/huggingface_provider.py' 2025-12-02T23:55:49,852 adding 'llm_evaluator/providers/ollama_provider.py' 2025-12-02T23:55:49,855 adding 'llm_evaluator/providers/openai_provider.py' 2025-12-02T23:55:49,857 adding 'llm_benchmark_toolkit-2.2.0.dist-info/METADATA' 2025-12-02T23:55:49,858 adding 'llm_benchmark_toolkit-2.2.0.dist-info/WHEEL' 2025-12-02T23:55:49,860 adding 'llm_benchmark_toolkit-2.2.0.dist-info/entry_points.txt' 2025-12-02T23:55:49,861 adding 'llm_benchmark_toolkit-2.2.0.dist-info/top_level.txt' 2025-12-02T23:55:49,862 adding 'llm_benchmark_toolkit-2.2.0.dist-info/RECORD' 2025-12-02T23:55:49,867 removing build/bdist.linux-armv7l/wheel 2025-12-02T23:55:49,977 Building wheel for llm-benchmark-toolkit (pyproject.toml): finished with status 'done' 2025-12-02T23:55:49,988 Created wheel for llm-benchmark-toolkit: filename=llm_benchmark_toolkit-2.2.0-py3-none-any.whl size=294130 sha256=2fbbb501c0dd81fa057db285c3202e45adfe99ceba2398974482329d2b805a41 2025-12-02T23:55:49,989 Stored in directory: /tmp/pip-ephem-wheel-cache-hwvdgm05/wheels/c9/01/c0/b0d77eca7a1e2eff53cdf202f85346061be4fb49befe3248ff 2025-12-02T23:55:50,004 Successfully built llm-benchmark-toolkit 2025-12-02T23:55:50,017 Removed build tracker: '/tmp/pip-build-tracker-l3o80d1u'