2025-12-04T01:55:56,720 Created temporary directory: /tmp/pip-ephem-wheel-cache-slpo8er1 2025-12-04T01:55:56,722 Created temporary directory: /tmp/pip-build-tracker-jve71343 2025-12-04T01:55:56,723 Initialized build tracking at /tmp/pip-build-tracker-jve71343 2025-12-04T01:55:56,723 Created build tracker: /tmp/pip-build-tracker-jve71343 2025-12-04T01:55:56,724 Entered build tracker: /tmp/pip-build-tracker-jve71343 2025-12-04T01:55:56,725 Created temporary directory: /tmp/pip-wheel-7yf4juq7 2025-12-04T01:55:56,728 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-04T01:55:56,730 Created temporary directory: /tmp/pip-ephem-wheel-cache-0xvo9_9x 2025-12-04T01:55:56,753 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-04T01:55:56,756 2 location(s) to search for versions of llm-benchmark-toolkit: 2025-12-04T01:55:56,756 * https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:56,756 * https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:56,757 Fetching project page and analyzing links: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:56,758 Getting page https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:56,759 Found index url https://pypi.org/simple 2025-12-04T01:55:57,011 Fetched page https://pypi.org/simple/llm-benchmark-toolkit/ as application/vnd.pypi.simple.v1+json 2025-12-04T01:55:57,018 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/95/b5/c528dad5028e188283baec8a39b34f2f5258c6f93a5984b194354ab3cb67/llm_benchmark_toolkit-0.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,020 Found link https://files.pythonhosted.org/packages/a7/41/73de3b2dd4005a66ae4a2860ae322e7de9cc75bd8b7ef7e8dc2f710f7a29/llm_benchmark_toolkit-0.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.0 2025-12-04T01:55:57,023 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/96/49/345527ecab82ff6ccaf832a6eb1fe2d6bccb98971861482a4c96c173ec68/llm_benchmark_toolkit-0.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,026 Found link https://files.pythonhosted.org/packages/d6/7c/960c3132ff0d4fab4b3b0f5dc2642fe4119a5ea7063d60ab2a8f5b8541de/llm_benchmark_toolkit-0.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.1 2025-12-04T01:55:57,028 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/86/df/8cf567d200bf116978d565bfa82022212049a4e077e2f4057020ba7b3ec1/llm_benchmark_toolkit-0.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,031 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/87/7d/18901d63a9d945df5a49aae05eac642327e2432951a027b39ac4e6e9d303/llm_benchmark_toolkit-0.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,033 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/4d/09/e0d3f5c7e8e533df93d488a0411eb15557ee120b1aa045a7f1fd2bfe8047/llm_benchmark_toolkit-0.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,036 Found link https://files.pythonhosted.org/packages/94/98/e41f2f6abc60c05c01e401fa39fed8d398305200e1ef23cef596be8809c5/llm_benchmark_toolkit-0.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.4.1 2025-12-04T01:55:57,039 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/19/3d/8ca9dc0f08263596dea0de71a92635fe8e23faa668182a4349d90ccb0d82/llm_benchmark_toolkit-2.0.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,044 Found link https://files.pythonhosted.org/packages/36/45/264df4505ca7ef5433049cc402373960ab82e5885cc789e8b85fc73f76bb/llm_benchmark_toolkit-2.0.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.0.0 2025-12-04T01:55:57,047 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/5b/43/32c2b3793610b688ec24cc490ad4c50bcdff3e33f1ccb172cc464972534c/llm_benchmark_toolkit-2.1.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,049 Found link https://files.pythonhosted.org/packages/be/d3/7152b5619a14d4c8a043cc4ee01e4c6b746c910b6df8b91ecb696da1ef4c/llm_benchmark_toolkit-2.1.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.1.0 2025-12-04T01:55:57,052 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/41/79/ca2033ee9ccfebcf708ae68a2007a6072c87c0e9bb24c5354d274bd57247/llm_benchmark_toolkit-2.2.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,055 Found link https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.0 2025-12-04T01:55:57,057 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ee/0b/7e7e6b6575046e5bf1d8cb1eceb88aba723ebde5bb0d761f50d971181902/llm_benchmark_toolkit-2.2.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,060 Found link https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.1 2025-12-04T01:55:57,063 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/90/d6/ee6f413ced99a3ee80d162870f1b759ab790bc4dfb45352a454c3ef9f663/llm_benchmark_toolkit-2.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,066 Found link https://files.pythonhosted.org/packages/26/eb/11a05fa90f5bac82d1d4f4bec7b6e13e3a9cb11502db8e53a3dc74270f34/llm_benchmark_toolkit-2.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.0 2025-12-04T01:55:57,069 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ca/84/53c0b5d7d479671dcb589ea79d730f626768c29dc4ecd5963929ec624ea9/llm_benchmark_toolkit-2.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,071 Found link https://files.pythonhosted.org/packages/75/6b/c3d742f7e9cf668df0c1dab60629839b213df2946819f34ee03218c7b434/llm_benchmark_toolkit-2.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.3.1 2025-12-04T01:55:57,073 Fetching project page and analyzing links: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:57,075 Getting page https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:57,078 Found index url https://www.piwheels.org/simple 2025-12-04T01:55:57,467 Fetched page https://www.piwheels.org/simple/llm-benchmark-toolkit/ as text/html 2025-12-04T01:55:57,471 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.3.0-py3-none-any.whl#sha256=de403f7d45ce04034909c8e7022101a4bcd685517024ebe115314d8862ed7895 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,472 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.1-py3-none-any.whl#sha256=a910972ff3802aeeec4c93220e0a5373097fb87c071c065ce81cef88e41c0d7f (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,473 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.0-py3-none-any.whl#sha256=2fbbb501c0dd81fa057db285c3202e45adfe99ceba2398974482329d2b805a41 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,473 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.1.0-py3-none-any.whl#sha256=c85ebb550837b0faf5cb28798b4ec2f84c9d20796e69f43ce7933f3145f903ce (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,474 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.0.0-py3-none-any.whl#sha256=9956b86c6b62f17a14b176a906481dbb5053a5f4c7f8dc479a1afb0668f10336 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,474 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.4.1-py3-none-any.whl#sha256=c237cd524f90f123d8ecce03f9b033a198b328d9d44ca52c0df5aa132b3e60be (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,475 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.1-py3-none-any.whl#sha256=44752309369b0c2b9e5bb1a1093611cc8a51e84014c2d50b4087e7c9bf1e8a49 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,476 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.0-py3-none-any.whl#sha256=8abac7bd509fef8b4c7c3a124e9afb765a0340e7d6c4c2b800dc7117fd8f99b4 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-04T01:55:57,476 Skipping link: not a file: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:57,477 Skipping link: not a file: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-04T01:55:57,497 Given no hashes to check 1 links for project 'llm-benchmark-toolkit': discarding no candidates 2025-12-04T01:55:57,515 Collecting llm-benchmark-toolkit==2.3.1 2025-12-04T01:55:57,517 Created temporary directory: /tmp/pip-unpack-1dm1akjb 2025-12-04T01:55:57,762 Downloading llm_benchmark_toolkit-2.3.1.tar.gz (369 kB) 2025-12-04T01:55:58,094 Added llm-benchmark-toolkit==2.3.1 from https://files.pythonhosted.org/packages/75/6b/c3d742f7e9cf668df0c1dab60629839b213df2946819f34ee03218c7b434/llm_benchmark_toolkit-2.3.1.tar.gz to build tracker '/tmp/pip-build-tracker-jve71343' 2025-12-04T01:55:58,102 Created temporary directory: /tmp/pip-build-env-47xv7of6 2025-12-04T01:55:58,107 Installing build dependencies: started 2025-12-04T01:55:58,108 Running command pip subprocess to install build dependencies 2025-12-04T01:55:59,267 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-04T01:55:59,885 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-04T01:55:59,908 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-04T01:56:01,633 Collecting setuptools>=61.0 2025-12-04T01:56:01,786 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-12-04T01:56:02,054 Collecting wheel 2025-12-04T01:56:02,076 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2025-12-04T01:56:05,015 Installing collected packages: wheel, setuptools 2025-12-04T01:56:05,248 Creating /tmp/pip-build-env-47xv7of6/overlay/local/bin 2025-12-04T01:56:05,250 changing mode of /tmp/pip-build-env-47xv7of6/overlay/local/bin/wheel to 755 2025-12-04T01:56:09,167 Successfully installed setuptools-80.9.0 wheel-0.45.1 2025-12-04T01:56:09,444 Installing build dependencies: finished with status 'done' 2025-12-04T01:56:09,452 Getting requirements to build wheel: started 2025-12-04T01:56:09,454 Running command Getting requirements to build wheel 2025-12-04T01:56:10,078 /tmp/pip-build-env-47xv7of6/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-04T01:56:10,078 corresp(dist, value, root_dir) 2025-12-04T01:56:10,079 /tmp/pip-build-env-47xv7of6/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-04T01:56:10,080 corresp(dist, value, root_dir) 2025-12-04T01:56:10,177 running egg_info 2025-12-04T01:56:10,184 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-04T01:56:10,203 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-04T01:56:10,205 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-04T01:56:10,217 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-04T01:56:10,218 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-04T01:56:10,248 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:10,260 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:10,359 Getting requirements to build wheel: finished with status 'done' 2025-12-04T01:56:10,362 Created temporary directory: /tmp/pip-modern-metadata-j5_k03wr 2025-12-04T01:56:10,365 Preparing metadata (pyproject.toml): started 2025-12-04T01:56:10,366 Running command Preparing metadata (pyproject.toml) 2025-12-04T01:56:10,975 /tmp/pip-build-env-47xv7of6/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-04T01:56:10,975 corresp(dist, value, root_dir) 2025-12-04T01:56:10,976 /tmp/pip-build-env-47xv7of6/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-04T01:56:10,976 corresp(dist, value, root_dir) 2025-12-04T01:56:11,070 running dist_info 2025-12-04T01:56:11,082 creating /tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info 2025-12-04T01:56:11,083 writing /tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-04T01:56:11,103 writing dependency_links to /tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-04T01:56:11,104 writing entry points to /tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-04T01:56:11,117 writing requirements to /tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-04T01:56:11,118 writing top-level names to /tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-04T01:56:11,119 writing manifest file '/tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:11,146 reading manifest file '/tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:11,153 writing manifest file '/tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:11,154 creating '/tmp/pip-modern-metadata-j5_k03wr/llm_benchmark_toolkit-2.3.1.dist-info' 2025-12-04T01:56:11,280 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-04T01:56:11,287 Source in /tmp/pip-wheel-7yf4juq7/llm-benchmark-toolkit_ae510749a9db4367a2e6f40063e8eec7 has version 2.3.1, which satisfies requirement llm-benchmark-toolkit==2.3.1 from https://files.pythonhosted.org/packages/75/6b/c3d742f7e9cf668df0c1dab60629839b213df2946819f34ee03218c7b434/llm_benchmark_toolkit-2.3.1.tar.gz 2025-12-04T01:56:11,288 Removed llm-benchmark-toolkit==2.3.1 from https://files.pythonhosted.org/packages/75/6b/c3d742f7e9cf668df0c1dab60629839b213df2946819f34ee03218c7b434/llm_benchmark_toolkit-2.3.1.tar.gz from build tracker '/tmp/pip-build-tracker-jve71343' 2025-12-04T01:56:11,297 Created temporary directory: /tmp/pip-unpack-wn7k0cq4 2025-12-04T01:56:11,298 Building wheels for collected packages: llm-benchmark-toolkit 2025-12-04T01:56:11,302 Created temporary directory: /tmp/pip-wheel-nlmo0v9s 2025-12-04T01:56:11,303 Destination directory: /tmp/pip-wheel-nlmo0v9s 2025-12-04T01:56:11,305 Building wheel for llm-benchmark-toolkit (pyproject.toml): started 2025-12-04T01:56:11,306 Running command Building wheel for llm-benchmark-toolkit (pyproject.toml) 2025-12-04T01:56:11,874 /tmp/pip-build-env-47xv7of6/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-04T01:56:11,875 corresp(dist, value, root_dir) 2025-12-04T01:56:11,875 /tmp/pip-build-env-47xv7of6/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-04T01:56:11,876 corresp(dist, value, root_dir) 2025-12-04T01:56:11,960 running bdist_wheel 2025-12-04T01:56:11,981 running build 2025-12-04T01:56:11,981 running build_py 2025-12-04T01:56:11,988 creating build/lib/llm_evaluator 2025-12-04T01:56:11,990 copying src/llm_evaluator/metrics.py -> build/lib/llm_evaluator 2025-12-04T01:56:11,992 copying src/llm_evaluator/export.py -> build/lib/llm_evaluator 2025-12-04T01:56:11,996 copying src/llm_evaluator/statistical_metrics.py -> build/lib/llm_evaluator 2025-12-04T01:56:11,999 copying src/llm_evaluator/visualizations.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,001 copying src/llm_evaluator/academic_baselines.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,004 copying src/llm_evaluator/config.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,006 copying src/llm_evaluator/__init__.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,008 copying src/llm_evaluator/evaluator.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,011 copying src/llm_evaluator/cli.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,014 copying src/llm_evaluator/benchmarks.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,018 copying src/llm_evaluator/error_analysis.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,021 copying src/llm_evaluator/system_info.py -> build/lib/llm_evaluator 2025-12-04T01:56:12,024 creating build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,025 copying src/llm_evaluator/dashboard/runner.py -> build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,028 copying src/llm_evaluator/dashboard/model_discovery.py -> build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,030 copying src/llm_evaluator/dashboard/__init__.py -> build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,032 copying src/llm_evaluator/dashboard/models.py -> build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,034 copying src/llm_evaluator/dashboard/app.py -> build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,037 copying src/llm_evaluator/dashboard/__main__.py -> build/lib/llm_evaluator/dashboard 2025-12-04T01:56:12,039 creating build/lib/llm_evaluator/providers 2025-12-04T01:56:12,040 copying src/llm_evaluator/providers/huggingface_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,043 copying src/llm_evaluator/providers/ollama_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,045 copying src/llm_evaluator/providers/base.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,047 copying src/llm_evaluator/providers/groq_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,049 copying src/llm_evaluator/providers/fireworks_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,051 copying src/llm_evaluator/providers/__init__.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,053 copying src/llm_evaluator/providers/gemini_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,056 copying src/llm_evaluator/providers/openai_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,058 copying src/llm_evaluator/providers/anthropic_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,060 copying src/llm_evaluator/providers/deepseek_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,063 copying src/llm_evaluator/providers/together_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,065 copying src/llm_evaluator/providers/cached_provider.py -> build/lib/llm_evaluator/providers 2025-12-04T01:56:12,068 running egg_info 2025-12-04T01:56:12,080 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-04T01:56:12,099 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-04T01:56:12,101 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-04T01:56:12,113 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-04T01:56:12,114 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-04T01:56:12,130 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:12,141 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-04T01:56:12,148 creating build/lib/llm_evaluator/dashboard/static 2025-12-04T01:56:12,149 copying src/llm_evaluator/dashboard/static/favicon.svg -> build/lib/llm_evaluator/dashboard/static 2025-12-04T01:56:12,152 copying src/llm_evaluator/dashboard/static/index.html -> build/lib/llm_evaluator/dashboard/static 2025-12-04T01:56:12,154 creating build/lib/llm_evaluator/dashboard/static/assets 2025-12-04T01:56:12,155 copying src/llm_evaluator/dashboard/static/assets/index-CbmYu4aA.css -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-04T01:56:12,159 copying src/llm_evaluator/dashboard/static/assets/index-BFAiCS6Z.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-04T01:56:12,190 installing to build/bdist.linux-armv7l/wheel 2025-12-04T01:56:12,191 running install 2025-12-04T01:56:12,214 running install_lib 2025-12-04T01:56:12,221 creating build/bdist.linux-armv7l/wheel 2025-12-04T01:56:12,223 creating build/bdist.linux-armv7l/wheel/llm_evaluator 2025-12-04T01:56:12,224 copying build/lib/llm_evaluator/metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,226 copying build/lib/llm_evaluator/export.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,229 copying build/lib/llm_evaluator/statistical_metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,231 copying build/lib/llm_evaluator/visualizations.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,234 copying build/lib/llm_evaluator/academic_baselines.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,236 copying build/lib/llm_evaluator/config.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,239 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard 2025-12-04T01:56:12,240 copying build/lib/llm_evaluator/dashboard/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-04T01:56:12,242 copying build/lib/llm_evaluator/dashboard/model_discovery.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-04T01:56:12,244 copying build/lib/llm_evaluator/dashboard/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-04T01:56:12,246 copying build/lib/llm_evaluator/dashboard/models.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-04T01:56:12,248 copying build/lib/llm_evaluator/dashboard/app.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-04T01:56:12,251 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static 2025-12-04T01:56:12,252 copying build/lib/llm_evaluator/dashboard/static/favicon.svg -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-04T01:56:12,254 copying build/lib/llm_evaluator/dashboard/static/index.html -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-04T01:56:12,256 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static/assets 2025-12-04T01:56:12,257 copying build/lib/llm_evaluator/dashboard/static/assets/index-CbmYu4aA.css -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-04T01:56:12,260 copying build/lib/llm_evaluator/dashboard/static/assets/index-BFAiCS6Z.js -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-04T01:56:12,275 copying build/lib/llm_evaluator/dashboard/__main__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-04T01:56:12,277 creating build/bdist.linux-armv7l/wheel/llm_evaluator/providers 2025-12-04T01:56:12,278 copying build/lib/llm_evaluator/providers/huggingface_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,281 copying build/lib/llm_evaluator/providers/ollama_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,283 copying build/lib/llm_evaluator/providers/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,285 copying build/lib/llm_evaluator/providers/groq_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,287 copying build/lib/llm_evaluator/providers/fireworks_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,289 copying build/lib/llm_evaluator/providers/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,291 copying build/lib/llm_evaluator/providers/gemini_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,294 copying build/lib/llm_evaluator/providers/openai_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,296 copying build/lib/llm_evaluator/providers/anthropic_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,298 copying build/lib/llm_evaluator/providers/deepseek_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,301 copying build/lib/llm_evaluator/providers/together_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,303 copying build/lib/llm_evaluator/providers/cached_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-04T01:56:12,306 copying build/lib/llm_evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,308 copying build/lib/llm_evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,310 copying build/lib/llm_evaluator/cli.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,314 copying build/lib/llm_evaluator/benchmarks.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,317 copying build/lib/llm_evaluator/error_analysis.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,320 copying build/lib/llm_evaluator/system_info.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-04T01:56:12,322 running install_egg_info 2025-12-04T01:56:12,327 Copying src/llm_benchmark_toolkit.egg-info to build/bdist.linux-armv7l/wheel/./llm_benchmark_toolkit-2.3.1-py3.11.egg-info 2025-12-04T01:56:12,339 running install_scripts 2025-12-04T01:56:12,348 creating build/bdist.linux-armv7l/wheel/llm_benchmark_toolkit-2.3.1.dist-info/WHEEL 2025-12-04T01:56:12,351 creating '/tmp/pip-wheel-nlmo0v9s/.tmp-jr4hoptw/llm_benchmark_toolkit-2.3.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-04T01:56:12,354 adding 'llm_evaluator/__init__.py' 2025-12-04T01:56:12,357 adding 'llm_evaluator/academic_baselines.py' 2025-12-04T01:56:12,364 adding 'llm_evaluator/benchmarks.py' 2025-12-04T01:56:12,373 adding 'llm_evaluator/cli.py' 2025-12-04T01:56:12,375 adding 'llm_evaluator/config.py' 2025-12-04T01:56:12,377 adding 'llm_evaluator/error_analysis.py' 2025-12-04T01:56:12,380 adding 'llm_evaluator/evaluator.py' 2025-12-04T01:56:12,383 adding 'llm_evaluator/export.py' 2025-12-04T01:56:12,384 adding 'llm_evaluator/metrics.py' 2025-12-04T01:56:12,387 adding 'llm_evaluator/statistical_metrics.py' 2025-12-04T01:56:12,389 adding 'llm_evaluator/system_info.py' 2025-12-04T01:56:12,392 adding 'llm_evaluator/visualizations.py' 2025-12-04T01:56:12,394 adding 'llm_evaluator/dashboard/__init__.py' 2025-12-04T01:56:12,395 adding 'llm_evaluator/dashboard/__main__.py' 2025-12-04T01:56:12,398 adding 'llm_evaluator/dashboard/app.py' 2025-12-04T01:56:12,400 adding 'llm_evaluator/dashboard/model_discovery.py' 2025-12-04T01:56:12,402 adding 'llm_evaluator/dashboard/models.py' 2025-12-04T01:56:12,405 adding 'llm_evaluator/dashboard/runner.py' 2025-12-04T01:56:12,407 adding 'llm_evaluator/dashboard/static/favicon.svg' 2025-12-04T01:56:12,408 adding 'llm_evaluator/dashboard/static/index.html' 2025-12-04T01:56:12,486 adding 'llm_evaluator/dashboard/static/assets/index-BFAiCS6Z.js' 2025-12-04T01:56:12,494 adding 'llm_evaluator/dashboard/static/assets/index-CbmYu4aA.css' 2025-12-04T01:56:12,496 adding 'llm_evaluator/providers/__init__.py' 2025-12-04T01:56:12,498 adding 'llm_evaluator/providers/anthropic_provider.py' 2025-12-04T01:56:12,500 adding 'llm_evaluator/providers/base.py' 2025-12-04T01:56:12,501 adding 'llm_evaluator/providers/cached_provider.py' 2025-12-04T01:56:12,503 adding 'llm_evaluator/providers/deepseek_provider.py' 2025-12-04T01:56:12,505 adding 'llm_evaluator/providers/fireworks_provider.py' 2025-12-04T01:56:12,507 adding 'llm_evaluator/providers/gemini_provider.py' 2025-12-04T01:56:12,508 adding 'llm_evaluator/providers/groq_provider.py' 2025-12-04T01:56:12,510 adding 'llm_evaluator/providers/huggingface_provider.py' 2025-12-04T01:56:12,512 adding 'llm_evaluator/providers/ollama_provider.py' 2025-12-04T01:56:12,514 adding 'llm_evaluator/providers/openai_provider.py' 2025-12-04T01:56:12,516 adding 'llm_evaluator/providers/together_provider.py' 2025-12-04T01:56:12,518 adding 'llm_benchmark_toolkit-2.3.1.dist-info/METADATA' 2025-12-04T01:56:12,519 adding 'llm_benchmark_toolkit-2.3.1.dist-info/WHEEL' 2025-12-04T01:56:12,520 adding 'llm_benchmark_toolkit-2.3.1.dist-info/entry_points.txt' 2025-12-04T01:56:12,521 adding 'llm_benchmark_toolkit-2.3.1.dist-info/top_level.txt' 2025-12-04T01:56:12,522 adding 'llm_benchmark_toolkit-2.3.1.dist-info/RECORD' 2025-12-04T01:56:12,527 removing build/bdist.linux-armv7l/wheel 2025-12-04T01:56:12,638 Building wheel for llm-benchmark-toolkit (pyproject.toml): finished with status 'done' 2025-12-04T01:56:12,649 Created wheel for llm-benchmark-toolkit: filename=llm_benchmark_toolkit-2.3.1-py3-none-any.whl size=327109 sha256=e821a36a5ab2587c33a4f9aeb452387305b7fbfc61e38ac109eaac44afcfcf11 2025-12-04T01:56:12,650 Stored in directory: /tmp/pip-ephem-wheel-cache-0xvo9_9x/wheels/1d/1e/aa/0eb565fa19d7abfaa414f9a8b6dfa23e9a9a0a59601ce4006a 2025-12-04T01:56:12,667 Successfully built llm-benchmark-toolkit 2025-12-04T01:56:12,678 Removed build tracker: '/tmp/pip-build-tracker-jve71343'