2025-12-03T23:32:35,060 Created temporary directory: /tmp/pip-ephem-wheel-cache-zib6md57 2025-12-03T23:32:35,062 Created temporary directory: /tmp/pip-build-tracker-4yrgh521 2025-12-03T23:32:35,063 Initialized build tracking at /tmp/pip-build-tracker-4yrgh521 2025-12-03T23:32:35,063 Created build tracker: /tmp/pip-build-tracker-4yrgh521 2025-12-03T23:32:35,064 Entered build tracker: /tmp/pip-build-tracker-4yrgh521 2025-12-03T23:32:35,065 Created temporary directory: /tmp/pip-wheel-fzdq6e6n 2025-12-03T23:32:35,068 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-03T23:32:35,070 Created temporary directory: /tmp/pip-ephem-wheel-cache-zi_owu2y 2025-12-03T23:32:35,096 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-03T23:32:35,100 2 location(s) to search for versions of llm-benchmark-toolkit: 2025-12-03T23:32:35,100 * https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,100 * https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,101 Fetching project page and analyzing links: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,101 Getting page https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,103 Found index url https://pypi.org/simple 2025-12-03T23:32:35,322 Fetched page https://pypi.org/simple/llm-benchmark-toolkit/ as application/vnd.pypi.simple.v1+json 2025-12-03T23:32:35,326 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/95/b5/c528dad5028e188283baec8a39b34f2f5258c6f93a5984b194354ab3cb67/llm_benchmark_toolkit-0.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,327 Found link https://files.pythonhosted.org/packages/a7/41/73de3b2dd4005a66ae4a2860ae322e7de9cc75bd8b7ef7e8dc2f710f7a29/llm_benchmark_toolkit-0.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.0 2025-12-03T23:32:35,328 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/96/49/345527ecab82ff6ccaf832a6eb1fe2d6bccb98971861482a4c96c173ec68/llm_benchmark_toolkit-0.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,329 Found link https://files.pythonhosted.org/packages/d6/7c/960c3132ff0d4fab4b3b0f5dc2642fe4119a5ea7063d60ab2a8f5b8541de/llm_benchmark_toolkit-0.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.1 2025-12-03T23:32:35,330 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/86/df/8cf567d200bf116978d565bfa82022212049a4e077e2f4057020ba7b3ec1/llm_benchmark_toolkit-0.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,331 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/87/7d/18901d63a9d945df5a49aae05eac642327e2432951a027b39ac4e6e9d303/llm_benchmark_toolkit-0.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,331 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/4d/09/e0d3f5c7e8e533df93d488a0411eb15557ee120b1aa045a7f1fd2bfe8047/llm_benchmark_toolkit-0.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,332 Found link https://files.pythonhosted.org/packages/94/98/e41f2f6abc60c05c01e401fa39fed8d398305200e1ef23cef596be8809c5/llm_benchmark_toolkit-0.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.4.1 2025-12-03T23:32:35,333 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/19/3d/8ca9dc0f08263596dea0de71a92635fe8e23faa668182a4349d90ccb0d82/llm_benchmark_toolkit-2.0.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,335 Found link https://files.pythonhosted.org/packages/36/45/264df4505ca7ef5433049cc402373960ab82e5885cc789e8b85fc73f76bb/llm_benchmark_toolkit-2.0.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.0.0 2025-12-03T23:32:35,336 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/5b/43/32c2b3793610b688ec24cc490ad4c50bcdff3e33f1ccb172cc464972534c/llm_benchmark_toolkit-2.1.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,337 Found link https://files.pythonhosted.org/packages/be/d3/7152b5619a14d4c8a043cc4ee01e4c6b746c910b6df8b91ecb696da1ef4c/llm_benchmark_toolkit-2.1.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.1.0 2025-12-03T23:32:35,338 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/41/79/ca2033ee9ccfebcf708ae68a2007a6072c87c0e9bb24c5354d274bd57247/llm_benchmark_toolkit-2.2.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,339 Found link https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.0 2025-12-03T23:32:35,340 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ee/0b/7e7e6b6575046e5bf1d8cb1eceb88aba723ebde5bb0d761f50d971181902/llm_benchmark_toolkit-2.2.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,341 Found link https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.1 2025-12-03T23:32:35,342 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/90/d6/ee6f413ced99a3ee80d162870f1b759ab790bc4dfb45352a454c3ef9f663/llm_benchmark_toolkit-2.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,343 Found link https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.0 2025-12-03T23:32:35,344 Fetching project page and analyzing links: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,345 Getting page https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,347 Found index url https://www.piwheels.org/simple 2025-12-03T23:32:35,504 Fetched page https://www.piwheels.org/simple/llm-benchmark-toolkit/ as text/html 2025-12-03T23:32:35,507 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.1-py3-none-any.whl#sha256=a910972ff3802aeeec4c93220e0a5373097fb87c071c065ce81cef88e41c0d7f (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,508 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.0-py3-none-any.whl#sha256=2fbbb501c0dd81fa057db285c3202e45adfe99ceba2398974482329d2b805a41 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,508 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.1.0-py3-none-any.whl#sha256=c85ebb550837b0faf5cb28798b4ec2f84c9d20796e69f43ce7933f3145f903ce (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,509 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.0.0-py3-none-any.whl#sha256=9956b86c6b62f17a14b176a906481dbb5053a5f4c7f8dc479a1afb0668f10336 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,510 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.4.1-py3-none-any.whl#sha256=c237cd524f90f123d8ecce03f9b033a198b328d9d44ca52c0df5aa132b3e60be (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,510 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.1-py3-none-any.whl#sha256=44752309369b0c2b9e5bb1a1093611cc8a51e84014c2d50b4087e7c9bf1e8a49 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,511 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.0-py3-none-any.whl#sha256=8abac7bd509fef8b4c7c3a124e9afb765a0340e7d6c4c2b800dc7117fd8f99b4 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-03T23:32:35,511 Skipping link: not a file: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,512 Skipping link: not a file: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-03T23:32:35,531 Given no hashes to check 1 links for project 'llm-benchmark-toolkit': discarding no candidates 2025-12-03T23:32:35,549 Collecting llm-benchmark-toolkit==2.3.0 2025-12-03T23:32:35,551 Created temporary directory: /tmp/pip-unpack-2bphlg18 2025-12-03T23:32:35,764 Downloading llm_benchmark_toolkit-2.3.0.tar.gz (369 kB) 2025-12-03T23:32:36,105 Added llm-benchmark-toolkit==2.3.0 from https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz to build tracker '/tmp/pip-build-tracker-4yrgh521' 2025-12-03T23:32:36,114 Created temporary directory: /tmp/pip-build-env-gm3jbbcm 2025-12-03T23:32:36,119 Installing build dependencies: started 2025-12-03T23:32:36,120 Running command pip subprocess to install build dependencies 2025-12-03T23:32:37,284 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-03T23:32:37,917 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-03T23:32:37,941 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-03T23:32:39,656 Collecting setuptools>=61.0 2025-12-03T23:32:39,741 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-12-03T23:32:40,013 Collecting wheel 2025-12-03T23:32:40,030 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2025-12-03T23:32:42,918 Installing collected packages: wheel, setuptools 2025-12-03T23:32:43,173 Creating /tmp/pip-build-env-gm3jbbcm/overlay/local/bin 2025-12-03T23:32:43,176 changing mode of /tmp/pip-build-env-gm3jbbcm/overlay/local/bin/wheel to 755 2025-12-03T23:32:47,418 Successfully installed setuptools-80.9.0 wheel-0.45.1 2025-12-03T23:32:47,705 Installing build dependencies: finished with status 'done' 2025-12-03T23:32:47,714 Getting requirements to build wheel: started 2025-12-03T23:32:47,717 Running command Getting requirements to build wheel 2025-12-03T23:32:48,437 /tmp/pip-build-env-gm3jbbcm/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-03T23:32:48,437 corresp(dist, value, root_dir) 2025-12-03T23:32:48,438 /tmp/pip-build-env-gm3jbbcm/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-03T23:32:48,438 corresp(dist, value, root_dir) 2025-12-03T23:32:48,537 running egg_info 2025-12-03T23:32:48,545 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-03T23:32:48,565 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-03T23:32:48,567 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-03T23:32:48,579 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-03T23:32:48,581 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-03T23:32:48,619 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:48,631 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:48,729 Getting requirements to build wheel: finished with status 'done' 2025-12-03T23:32:48,733 Created temporary directory: /tmp/pip-modern-metadata-sa360yj4 2025-12-03T23:32:48,735 Preparing metadata (pyproject.toml): started 2025-12-03T23:32:48,736 Running command Preparing metadata (pyproject.toml) 2025-12-03T23:32:49,321 /tmp/pip-build-env-gm3jbbcm/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-03T23:32:49,321 corresp(dist, value, root_dir) 2025-12-03T23:32:49,322 /tmp/pip-build-env-gm3jbbcm/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-03T23:32:49,322 corresp(dist, value, root_dir) 2025-12-03T23:32:49,423 running dist_info 2025-12-03T23:32:49,435 creating /tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info 2025-12-03T23:32:49,436 writing /tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-03T23:32:49,456 writing dependency_links to /tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-03T23:32:49,458 writing entry points to /tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-03T23:32:49,470 writing requirements to /tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-03T23:32:49,472 writing top-level names to /tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-03T23:32:49,473 writing manifest file '/tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:49,500 reading manifest file '/tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:49,507 writing manifest file '/tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:49,508 creating '/tmp/pip-modern-metadata-sa360yj4/llm_benchmark_toolkit-2.3.0.dist-info' 2025-12-03T23:32:49,636 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-03T23:32:49,643 Source in /tmp/pip-wheel-fzdq6e6n/llm-benchmark-toolkit_f0ca2f1bbc2b4c48b10bb9c0d6aa13e5 has version 2.3.0, which satisfies requirement llm-benchmark-toolkit==2.3.0 from https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz 2025-12-03T23:32:49,644 Removed llm-benchmark-toolkit==2.3.0 from https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz from build tracker '/tmp/pip-build-tracker-4yrgh521' 2025-12-03T23:32:49,653 Created temporary directory: /tmp/pip-unpack-ncabtn7o 2025-12-03T23:32:49,654 Building wheels for collected packages: llm-benchmark-toolkit 2025-12-03T23:32:49,659 Created temporary directory: /tmp/pip-wheel-sd1vt_ny 2025-12-03T23:32:49,659 Destination directory: /tmp/pip-wheel-sd1vt_ny 2025-12-03T23:32:49,662 Building wheel for llm-benchmark-toolkit (pyproject.toml): started 2025-12-03T23:32:49,663 Running command Building wheel for llm-benchmark-toolkit (pyproject.toml) 2025-12-03T23:32:50,237 /tmp/pip-build-env-gm3jbbcm/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-03T23:32:50,237 corresp(dist, value, root_dir) 2025-12-03T23:32:50,238 /tmp/pip-build-env-gm3jbbcm/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-03T23:32:50,238 corresp(dist, value, root_dir) 2025-12-03T23:32:50,322 running bdist_wheel 2025-12-03T23:32:50,343 running build 2025-12-03T23:32:50,343 running build_py 2025-12-03T23:32:50,351 creating build/lib/llm_evaluator 2025-12-03T23:32:50,353 copying src/llm_evaluator/benchmarks.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,357 copying src/llm_evaluator/cli.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,361 copying src/llm_evaluator/config.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,363 copying src/llm_evaluator/error_analysis.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,366 copying src/llm_evaluator/visualizations.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,369 copying src/llm_evaluator/system_info.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,371 copying src/llm_evaluator/__init__.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,373 copying src/llm_evaluator/export.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,376 copying src/llm_evaluator/evaluator.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,378 copying src/llm_evaluator/statistical_metrics.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,381 copying src/llm_evaluator/metrics.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,383 copying src/llm_evaluator/academic_baselines.py -> build/lib/llm_evaluator 2025-12-03T23:32:50,386 creating build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,387 copying src/llm_evaluator/dashboard/runner.py -> build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,390 copying src/llm_evaluator/dashboard/models.py -> build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,392 copying src/llm_evaluator/dashboard/app.py -> build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,395 copying src/llm_evaluator/dashboard/__init__.py -> build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,397 copying src/llm_evaluator/dashboard/model_discovery.py -> build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,399 copying src/llm_evaluator/dashboard/__main__.py -> build/lib/llm_evaluator/dashboard 2025-12-03T23:32:50,402 creating build/lib/llm_evaluator/providers 2025-12-03T23:32:50,403 copying src/llm_evaluator/providers/groq_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,406 copying src/llm_evaluator/providers/gemini_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,408 copying src/llm_evaluator/providers/openai_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,411 copying src/llm_evaluator/providers/huggingface_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,413 copying src/llm_evaluator/providers/together_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,416 copying src/llm_evaluator/providers/anthropic_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,418 copying src/llm_evaluator/providers/cached_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,420 copying src/llm_evaluator/providers/base.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,423 copying src/llm_evaluator/providers/__init__.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,424 copying src/llm_evaluator/providers/deepseek_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,427 copying src/llm_evaluator/providers/fireworks_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,429 copying src/llm_evaluator/providers/ollama_provider.py -> build/lib/llm_evaluator/providers 2025-12-03T23:32:50,432 running egg_info 2025-12-03T23:32:50,443 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-03T23:32:50,462 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-03T23:32:50,463 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-03T23:32:50,475 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-03T23:32:50,476 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-03T23:32:50,493 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:50,504 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-03T23:32:50,510 creating build/lib/llm_evaluator/dashboard/static 2025-12-03T23:32:50,511 copying src/llm_evaluator/dashboard/static/favicon.svg -> build/lib/llm_evaluator/dashboard/static 2025-12-03T23:32:50,514 copying src/llm_evaluator/dashboard/static/index.html -> build/lib/llm_evaluator/dashboard/static 2025-12-03T23:32:50,516 creating build/lib/llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,517 copying src/llm_evaluator/dashboard/static/assets/index-6rBrVz97.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,532 copying src/llm_evaluator/dashboard/static/assets/index-BjvrLrQa.css -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,537 copying src/llm_evaluator/dashboard/static/assets/index-6rBrVz97.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,563 installing to build/bdist.linux-armv7l/wheel 2025-12-03T23:32:50,564 running install 2025-12-03T23:32:50,587 running install_lib 2025-12-03T23:32:50,594 creating build/bdist.linux-armv7l/wheel 2025-12-03T23:32:50,596 creating build/bdist.linux-armv7l/wheel/llm_evaluator 2025-12-03T23:32:50,597 copying build/lib/llm_evaluator/benchmarks.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,601 copying build/lib/llm_evaluator/cli.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,605 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard 2025-12-03T23:32:50,606 copying build/lib/llm_evaluator/dashboard/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-03T23:32:50,609 copying build/lib/llm_evaluator/dashboard/models.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-03T23:32:50,611 copying build/lib/llm_evaluator/dashboard/app.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-03T23:32:50,614 copying build/lib/llm_evaluator/dashboard/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-03T23:32:50,616 copying build/lib/llm_evaluator/dashboard/model_discovery.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-03T23:32:50,618 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static 2025-12-03T23:32:50,619 copying build/lib/llm_evaluator/dashboard/static/favicon.svg -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-03T23:32:50,622 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,623 copying build/lib/llm_evaluator/dashboard/static/assets/index-6rBrVz97.js -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,635 copying build/lib/llm_evaluator/dashboard/static/assets/index-BjvrLrQa.css -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-03T23:32:50,638 copying build/lib/llm_evaluator/dashboard/static/index.html -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-03T23:32:50,640 copying build/lib/llm_evaluator/dashboard/__main__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-03T23:32:50,642 copying build/lib/llm_evaluator/config.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,645 copying build/lib/llm_evaluator/error_analysis.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,647 copying build/lib/llm_evaluator/visualizations.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,650 creating build/bdist.linux-armv7l/wheel/llm_evaluator/providers 2025-12-03T23:32:50,651 copying build/lib/llm_evaluator/providers/groq_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,654 copying build/lib/llm_evaluator/providers/gemini_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,656 copying build/lib/llm_evaluator/providers/openai_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,659 copying build/lib/llm_evaluator/providers/huggingface_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,661 copying build/lib/llm_evaluator/providers/together_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,663 copying build/lib/llm_evaluator/providers/anthropic_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,666 copying build/lib/llm_evaluator/providers/cached_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,668 copying build/lib/llm_evaluator/providers/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,671 copying build/lib/llm_evaluator/providers/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,673 copying build/lib/llm_evaluator/providers/deepseek_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,675 copying build/lib/llm_evaluator/providers/fireworks_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,678 copying build/lib/llm_evaluator/providers/ollama_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-03T23:32:50,680 copying build/lib/llm_evaluator/system_info.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,682 copying build/lib/llm_evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,684 copying build/lib/llm_evaluator/export.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,687 copying build/lib/llm_evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,690 copying build/lib/llm_evaluator/statistical_metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,693 copying build/lib/llm_evaluator/metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,695 copying build/lib/llm_evaluator/academic_baselines.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-03T23:32:50,697 running install_egg_info 2025-12-03T23:32:50,703 Copying src/llm_benchmark_toolkit.egg-info to build/bdist.linux-armv7l/wheel/./llm_benchmark_toolkit-2.3.0-py3.11.egg-info 2025-12-03T23:32:50,716 running install_scripts 2025-12-03T23:32:50,726 creating build/bdist.linux-armv7l/wheel/llm_benchmark_toolkit-2.3.0.dist-info/WHEEL 2025-12-03T23:32:50,729 creating '/tmp/pip-wheel-sd1vt_ny/.tmp-xcom7844/llm_benchmark_toolkit-2.3.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-03T23:32:50,733 adding 'llm_evaluator/__init__.py' 2025-12-03T23:32:50,735 adding 'llm_evaluator/academic_baselines.py' 2025-12-03T23:32:50,742 adding 'llm_evaluator/benchmarks.py' 2025-12-03T23:32:50,751 adding 'llm_evaluator/cli.py' 2025-12-03T23:32:50,753 adding 'llm_evaluator/config.py' 2025-12-03T23:32:50,756 adding 'llm_evaluator/error_analysis.py' 2025-12-03T23:32:50,759 adding 'llm_evaluator/evaluator.py' 2025-12-03T23:32:50,761 adding 'llm_evaluator/export.py' 2025-12-03T23:32:50,763 adding 'llm_evaluator/metrics.py' 2025-12-03T23:32:50,766 adding 'llm_evaluator/statistical_metrics.py' 2025-12-03T23:32:50,767 adding 'llm_evaluator/system_info.py' 2025-12-03T23:32:50,770 adding 'llm_evaluator/visualizations.py' 2025-12-03T23:32:50,772 adding 'llm_evaluator/dashboard/__init__.py' 2025-12-03T23:32:50,773 adding 'llm_evaluator/dashboard/__main__.py' 2025-12-03T23:32:50,777 adding 'llm_evaluator/dashboard/app.py' 2025-12-03T23:32:50,778 adding 'llm_evaluator/dashboard/model_discovery.py' 2025-12-03T23:32:50,780 adding 'llm_evaluator/dashboard/models.py' 2025-12-03T23:32:50,784 adding 'llm_evaluator/dashboard/runner.py' 2025-12-03T23:32:50,786 adding 'llm_evaluator/dashboard/static/favicon.svg' 2025-12-03T23:32:50,787 adding 'llm_evaluator/dashboard/static/index.html' 2025-12-03T23:32:50,865 adding 'llm_evaluator/dashboard/static/assets/index-6rBrVz97.js' 2025-12-03T23:32:50,874 adding 'llm_evaluator/dashboard/static/assets/index-BjvrLrQa.css' 2025-12-03T23:32:50,876 adding 'llm_evaluator/providers/__init__.py' 2025-12-03T23:32:50,878 adding 'llm_evaluator/providers/anthropic_provider.py' 2025-12-03T23:32:50,879 adding 'llm_evaluator/providers/base.py' 2025-12-03T23:32:50,881 adding 'llm_evaluator/providers/cached_provider.py' 2025-12-03T23:32:50,883 adding 'llm_evaluator/providers/deepseek_provider.py' 2025-12-03T23:32:50,885 adding 'llm_evaluator/providers/fireworks_provider.py' 2025-12-03T23:32:50,887 adding 'llm_evaluator/providers/gemini_provider.py' 2025-12-03T23:32:50,889 adding 'llm_evaluator/providers/groq_provider.py' 2025-12-03T23:32:50,891 adding 'llm_evaluator/providers/huggingface_provider.py' 2025-12-03T23:32:50,892 adding 'llm_evaluator/providers/ollama_provider.py' 2025-12-03T23:32:50,894 adding 'llm_evaluator/providers/openai_provider.py' 2025-12-03T23:32:50,896 adding 'llm_evaluator/providers/together_provider.py' 2025-12-03T23:32:50,899 adding 'llm_benchmark_toolkit-2.3.0.dist-info/METADATA' 2025-12-03T23:32:50,900 adding 'llm_benchmark_toolkit-2.3.0.dist-info/WHEEL' 2025-12-03T23:32:50,901 adding 'llm_benchmark_toolkit-2.3.0.dist-info/entry_points.txt' 2025-12-03T23:32:50,902 adding 'llm_benchmark_toolkit-2.3.0.dist-info/top_level.txt' 2025-12-03T23:32:50,903 adding 'llm_benchmark_toolkit-2.3.0.dist-info/RECORD' 2025-12-03T23:32:50,908 removing build/bdist.linux-armv7l/wheel 2025-12-03T23:32:51,021 Building wheel for llm-benchmark-toolkit (pyproject.toml): finished with status 'done' 2025-12-03T23:32:51,035 Created wheel for llm-benchmark-toolkit: filename=llm_benchmark_toolkit-2.3.0-py3-none-any.whl size=326824 sha256=de403f7d45ce04034909c8e7022101a4bcd685517024ebe115314d8862ed7895 2025-12-03T23:32:51,037 Stored in directory: /tmp/pip-ephem-wheel-cache-zi_owu2y/wheels/7c/9c/1d/eb8c79d5bb9635dc34d64c26f2e58c9ee807eee99f2eb1bd68 2025-12-03T23:32:51,059 Successfully built llm-benchmark-toolkit 2025-12-03T23:32:51,074 Removed build tracker: '/tmp/pip-build-tracker-4yrgh521'