2025-12-02T23:58:28,507 Created temporary directory: /tmp/pip-ephem-wheel-cache-aqjqha1b 2025-12-02T23:58:28,508 Created temporary directory: /tmp/pip-build-tracker-70grt20_ 2025-12-02T23:58:28,509 Initialized build tracking at /tmp/pip-build-tracker-70grt20_ 2025-12-02T23:58:28,510 Created build tracker: /tmp/pip-build-tracker-70grt20_ 2025-12-02T23:58:28,510 Entered build tracker: /tmp/pip-build-tracker-70grt20_ 2025-12-02T23:58:28,511 Created temporary directory: /tmp/pip-wheel-jak1iak9 2025-12-02T23:58:28,514 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-02T23:58:28,517 Created temporary directory: /tmp/pip-ephem-wheel-cache-dkxntvl7 2025-12-02T23:58:28,539 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-02T23:58:28,542 2 location(s) to search for versions of llm-benchmark-toolkit: 2025-12-02T23:58:28,542 * https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,542 * https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,543 Fetching project page and analyzing links: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,544 Getting page https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,546 Found index url https://pypi.org/simple 2025-12-02T23:58:28,760 Fetched page https://pypi.org/simple/llm-benchmark-toolkit/ as application/vnd.pypi.simple.v1+json 2025-12-02T23:58:28,764 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/95/b5/c528dad5028e188283baec8a39b34f2f5258c6f93a5984b194354ab3cb67/llm_benchmark_toolkit-0.3.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,765 Found link https://files.pythonhosted.org/packages/a7/41/73de3b2dd4005a66ae4a2860ae322e7de9cc75bd8b7ef7e8dc2f710f7a29/llm_benchmark_toolkit-0.3.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.0 2025-12-02T23:58:28,766 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/96/49/345527ecab82ff6ccaf832a6eb1fe2d6bccb98971861482a4c96c173ec68/llm_benchmark_toolkit-0.3.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,767 Found link https://files.pythonhosted.org/packages/d6/7c/960c3132ff0d4fab4b3b0f5dc2642fe4119a5ea7063d60ab2a8f5b8541de/llm_benchmark_toolkit-0.3.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.3.1 2025-12-02T23:58:28,768 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/86/df/8cf567d200bf116978d565bfa82022212049a4e077e2f4057020ba7b3ec1/llm_benchmark_toolkit-0.3.2-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,769 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/87/7d/18901d63a9d945df5a49aae05eac642327e2432951a027b39ac4e6e9d303/llm_benchmark_toolkit-0.4.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,770 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/4d/09/e0d3f5c7e8e533df93d488a0411eb15557ee120b1aa045a7f1fd2bfe8047/llm_benchmark_toolkit-0.4.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,770 Found link https://files.pythonhosted.org/packages/94/98/e41f2f6abc60c05c01e401fa39fed8d398305200e1ef23cef596be8809c5/llm_benchmark_toolkit-0.4.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 0.4.1 2025-12-02T23:58:28,771 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/19/3d/8ca9dc0f08263596dea0de71a92635fe8e23faa668182a4349d90ccb0d82/llm_benchmark_toolkit-2.0.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,773 Found link https://files.pythonhosted.org/packages/36/45/264df4505ca7ef5433049cc402373960ab82e5885cc789e8b85fc73f76bb/llm_benchmark_toolkit-2.0.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.0.0 2025-12-02T23:58:28,773 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/5b/43/32c2b3793610b688ec24cc490ad4c50bcdff3e33f1ccb172cc464972534c/llm_benchmark_toolkit-2.1.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,774 Found link https://files.pythonhosted.org/packages/be/d3/7152b5619a14d4c8a043cc4ee01e4c6b746c910b6df8b91ecb696da1ef4c/llm_benchmark_toolkit-2.1.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.1.0 2025-12-02T23:58:28,775 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/41/79/ca2033ee9ccfebcf708ae68a2007a6072c87c0e9bb24c5354d274bd57247/llm_benchmark_toolkit-2.2.0-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,775 Found link https://files.pythonhosted.org/packages/b9/ba/602dbec3514ff6b5a611d98e44e81a298dd4a7904c8b378a8e5a7fdc3c87/llm_benchmark_toolkit-2.2.0.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.0 2025-12-02T23:58:28,776 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://files.pythonhosted.org/packages/ee/0b/7e7e6b6575046e5bf1d8cb1eceb88aba723ebde5bb0d761f50d971181902/llm_benchmark_toolkit-2.2.1-py3-none-any.whl (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,777 Found link https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz (from https://pypi.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11), version: 2.2.1 2025-12-02T23:58:28,778 Fetching project page and analyzing links: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,779 Getting page https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,780 Found index url https://www.piwheels.org/simple 2025-12-02T23:58:28,938 Fetched page https://www.piwheels.org/simple/llm-benchmark-toolkit/ as text/html 2025-12-02T23:58:28,941 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.2.0-py3-none-any.whl#sha256=2fbbb501c0dd81fa057db285c3202e45adfe99ceba2398974482329d2b805a41 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,942 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.1.0-py3-none-any.whl#sha256=c85ebb550837b0faf5cb28798b4ec2f84c9d20796e69f43ce7933f3145f903ce (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,943 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-2.0.0-py3-none-any.whl#sha256=9956b86c6b62f17a14b176a906481dbb5053a5f4c7f8dc479a1afb0668f10336 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,943 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.4.1-py3-none-any.whl#sha256=c237cd524f90f123d8ecce03f9b033a198b328d9d44ca52c0df5aa132b3e60be (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,944 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.1-py3-none-any.whl#sha256=44752309369b0c2b9e5bb1a1093611cc8a51e84014c2d50b4087e7c9bf1e8a49 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,945 Skipping link: No binaries permitted for llm-benchmark-toolkit: https://www.piwheels.org/simple/llm-benchmark-toolkit/llm_benchmark_toolkit-0.3.0-py3-none-any.whl#sha256=8abac7bd509fef8b4c7c3a124e9afb765a0340e7d6c4c2b800dc7117fd8f99b4 (from https://www.piwheels.org/simple/llm-benchmark-toolkit/) (requires-python:>=3.11) 2025-12-02T23:58:28,945 Skipping link: not a file: https://www.piwheels.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,946 Skipping link: not a file: https://pypi.org/simple/llm-benchmark-toolkit/ 2025-12-02T23:58:28,965 Given no hashes to check 1 links for project 'llm-benchmark-toolkit': discarding no candidates 2025-12-02T23:58:28,983 Collecting llm-benchmark-toolkit==2.2.1 2025-12-02T23:58:28,986 Created temporary directory: /tmp/pip-unpack-o0wrs15e 2025-12-02T23:58:29,239 Downloading llm_benchmark_toolkit-2.2.1.tar.gz (333 kB) 2025-12-02T23:58:29,535 Added llm-benchmark-toolkit==2.2.1 from https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz to build tracker '/tmp/pip-build-tracker-70grt20_' 2025-12-02T23:58:29,543 Created temporary directory: /tmp/pip-build-env-yyaw1l3g 2025-12-02T23:58:29,547 Installing build dependencies: started 2025-12-02T23:58:29,549 Running command pip subprocess to install build dependencies 2025-12-02T23:58:30,756 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-02T23:58:31,409 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-02T23:58:31,435 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-02T23:58:33,169 Collecting setuptools>=61.0 2025-12-02T23:58:33,255 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-12-02T23:58:33,524 Collecting wheel 2025-12-02T23:58:33,542 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2025-12-02T23:58:36,409 Installing collected packages: wheel, setuptools 2025-12-02T23:58:36,659 Creating /tmp/pip-build-env-yyaw1l3g/overlay/local/bin 2025-12-02T23:58:36,661 changing mode of /tmp/pip-build-env-yyaw1l3g/overlay/local/bin/wheel to 755 2025-12-02T23:58:40,246 Successfully installed setuptools-80.9.0 wheel-0.45.1 2025-12-02T23:58:40,519 Installing build dependencies: finished with status 'done' 2025-12-02T23:58:40,526 Getting requirements to build wheel: started 2025-12-02T23:58:40,527 Running command Getting requirements to build wheel 2025-12-02T23:58:41,180 /tmp/pip-build-env-yyaw1l3g/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-02T23:58:41,180 corresp(dist, value, root_dir) 2025-12-02T23:58:41,181 /tmp/pip-build-env-yyaw1l3g/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-02T23:58:41,182 corresp(dist, value, root_dir) 2025-12-02T23:58:41,279 running egg_info 2025-12-02T23:58:41,286 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-02T23:58:41,302 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-02T23:58:41,305 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-02T23:58:41,314 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-02T23:58:41,316 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-02T23:58:41,346 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:41,357 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:41,455 Getting requirements to build wheel: finished with status 'done' 2025-12-02T23:58:41,458 Created temporary directory: /tmp/pip-modern-metadata-cv_mt5j1 2025-12-02T23:58:41,461 Preparing metadata (pyproject.toml): started 2025-12-02T23:58:41,462 Running command Preparing metadata (pyproject.toml) 2025-12-02T23:58:42,039 /tmp/pip-build-env-yyaw1l3g/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-02T23:58:42,039 corresp(dist, value, root_dir) 2025-12-02T23:58:42,039 /tmp/pip-build-env-yyaw1l3g/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-02T23:58:42,040 corresp(dist, value, root_dir) 2025-12-02T23:58:42,134 running dist_info 2025-12-02T23:58:42,146 creating /tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info 2025-12-02T23:58:42,147 writing /tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-02T23:58:42,162 writing dependency_links to /tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-02T23:58:42,164 writing entry points to /tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-02T23:58:42,174 writing requirements to /tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-02T23:58:42,175 writing top-level names to /tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-02T23:58:42,176 writing manifest file '/tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:42,202 reading manifest file '/tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:42,209 writing manifest file '/tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:42,210 creating '/tmp/pip-modern-metadata-cv_mt5j1/llm_benchmark_toolkit-2.2.1.dist-info' 2025-12-02T23:58:42,334 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-02T23:58:42,339 Source in /tmp/pip-wheel-jak1iak9/llm-benchmark-toolkit_19820534eb944a10aa7017c74d43b152 has version 2.2.1, which satisfies requirement llm-benchmark-toolkit==2.2.1 from https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz 2025-12-02T23:58:42,340 Removed llm-benchmark-toolkit==2.2.1 from https://files.pythonhosted.org/packages/f8/1d/ee3da6b517008793f0444ec55d964cb2afee0a21c475f89e9be9e90479cf/llm_benchmark_toolkit-2.2.1.tar.gz from build tracker '/tmp/pip-build-tracker-70grt20_' 2025-12-02T23:58:42,348 Created temporary directory: /tmp/pip-unpack-g6wjzswl 2025-12-02T23:58:42,349 Building wheels for collected packages: llm-benchmark-toolkit 2025-12-02T23:58:42,353 Created temporary directory: /tmp/pip-wheel-1cu6uoj1 2025-12-02T23:58:42,354 Destination directory: /tmp/pip-wheel-1cu6uoj1 2025-12-02T23:58:42,356 Building wheel for llm-benchmark-toolkit (pyproject.toml): started 2025-12-02T23:58:42,357 Running command Building wheel for llm-benchmark-toolkit (pyproject.toml) 2025-12-02T23:58:42,919 /tmp/pip-build-env-yyaw1l3g/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `install_requires` overwritten in `pyproject.toml` (dependencies) 2025-12-02T23:58:42,920 corresp(dist, value, root_dir) 2025-12-02T23:58:42,920 /tmp/pip-build-env-yyaw1l3g/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsWarning: `extras_require` overwritten in `pyproject.toml` (optional-dependencies) 2025-12-02T23:58:42,921 corresp(dist, value, root_dir) 2025-12-02T23:58:43,003 running bdist_wheel 2025-12-02T23:58:43,025 running build 2025-12-02T23:58:43,025 running build_py 2025-12-02T23:58:43,034 creating build/lib/llm_evaluator 2025-12-02T23:58:43,036 copying src/llm_evaluator/metrics.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,038 copying src/llm_evaluator/export.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,040 copying src/llm_evaluator/statistical_metrics.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,043 copying src/llm_evaluator/visualizations.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,046 copying src/llm_evaluator/academic_baselines.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,048 copying src/llm_evaluator/config.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,050 copying src/llm_evaluator/__init__.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,053 copying src/llm_evaluator/evaluator.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,055 copying src/llm_evaluator/cli.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,058 copying src/llm_evaluator/benchmarks.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,062 copying src/llm_evaluator/error_analysis.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,065 copying src/llm_evaluator/system_info.py -> build/lib/llm_evaluator 2025-12-02T23:58:43,068 creating build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,069 copying src/llm_evaluator/dashboard/runner.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,072 copying src/llm_evaluator/dashboard/model_discovery.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,074 copying src/llm_evaluator/dashboard/__init__.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,075 copying src/llm_evaluator/dashboard/models.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,078 copying src/llm_evaluator/dashboard/app.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,081 copying src/llm_evaluator/dashboard/__main__.py -> build/lib/llm_evaluator/dashboard 2025-12-02T23:58:43,083 creating build/lib/llm_evaluator/providers 2025-12-02T23:58:43,084 copying src/llm_evaluator/providers/huggingface_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,087 copying src/llm_evaluator/providers/ollama_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,089 copying src/llm_evaluator/providers/base.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,091 copying src/llm_evaluator/providers/__init__.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,093 copying src/llm_evaluator/providers/openai_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,096 copying src/llm_evaluator/providers/anthropic_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,099 copying src/llm_evaluator/providers/deepseek_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,101 copying src/llm_evaluator/providers/cached_provider.py -> build/lib/llm_evaluator/providers 2025-12-02T23:58:43,104 running egg_info 2025-12-02T23:58:43,116 writing src/llm_benchmark_toolkit.egg-info/PKG-INFO 2025-12-02T23:58:43,131 writing dependency_links to src/llm_benchmark_toolkit.egg-info/dependency_links.txt 2025-12-02T23:58:43,133 writing entry points to src/llm_benchmark_toolkit.egg-info/entry_points.txt 2025-12-02T23:58:43,142 writing requirements to src/llm_benchmark_toolkit.egg-info/requires.txt 2025-12-02T23:58:43,144 writing top-level names to src/llm_benchmark_toolkit.egg-info/top_level.txt 2025-12-02T23:58:43,160 reading manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:43,171 writing manifest file 'src/llm_benchmark_toolkit.egg-info/SOURCES.txt' 2025-12-02T23:58:43,177 creating build/lib/llm_evaluator/dashboard/static 2025-12-02T23:58:43,178 copying src/llm_evaluator/dashboard/static/favicon.svg -> build/lib/llm_evaluator/dashboard/static 2025-12-02T23:58:43,180 copying src/llm_evaluator/dashboard/static/index.html -> build/lib/llm_evaluator/dashboard/static 2025-12-02T23:58:43,182 creating build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:58:43,183 copying src/llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:58:43,212 copying src/llm_evaluator/dashboard/static/assets/index-BGRAUHhM.css -> build/lib/llm_evaluator/dashboard/static/assets 2025-12-02T23:58:43,229 installing to build/bdist.linux-armv7l/wheel 2025-12-02T23:58:43,230 running install 2025-12-02T23:58:43,254 running install_lib 2025-12-02T23:58:43,261 creating build/bdist.linux-armv7l/wheel 2025-12-02T23:58:43,263 creating build/bdist.linux-armv7l/wheel/llm_evaluator 2025-12-02T23:58:43,264 copying build/lib/llm_evaluator/metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,267 copying build/lib/llm_evaluator/export.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,269 copying build/lib/llm_evaluator/statistical_metrics.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,273 copying build/lib/llm_evaluator/visualizations.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,276 copying build/lib/llm_evaluator/academic_baselines.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,279 copying build/lib/llm_evaluator/config.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,282 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard 2025-12-02T23:58:43,283 copying build/lib/llm_evaluator/dashboard/runner.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:58:43,287 copying build/lib/llm_evaluator/dashboard/model_discovery.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:58:43,289 copying build/lib/llm_evaluator/dashboard/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:58:43,291 copying build/lib/llm_evaluator/dashboard/models.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:58:43,293 copying build/lib/llm_evaluator/dashboard/app.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:58:43,297 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static 2025-12-02T23:58:43,299 copying build/lib/llm_evaluator/dashboard/static/favicon.svg -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-02T23:58:43,301 copying build/lib/llm_evaluator/dashboard/static/index.html -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static 2025-12-02T23:58:43,304 creating build/bdist.linux-armv7l/wheel/llm_evaluator/dashboard/static/assets 2025-12-02T23:58:43,305 copying build/lib/llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-02T23:58:43,338 copying build/lib/llm_evaluator/dashboard/static/assets/index-BGRAUHhM.css -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard/static/assets 2025-12-02T23:58:43,341 copying build/lib/llm_evaluator/dashboard/__main__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/dashboard 2025-12-02T23:58:43,344 creating build/bdist.linux-armv7l/wheel/llm_evaluator/providers 2025-12-02T23:58:43,345 copying build/lib/llm_evaluator/providers/huggingface_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,348 copying build/lib/llm_evaluator/providers/ollama_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,351 copying build/lib/llm_evaluator/providers/base.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,353 copying build/lib/llm_evaluator/providers/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,356 copying build/lib/llm_evaluator/providers/openai_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,358 copying build/lib/llm_evaluator/providers/anthropic_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,361 copying build/lib/llm_evaluator/providers/deepseek_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,364 copying build/lib/llm_evaluator/providers/cached_provider.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator/providers 2025-12-02T23:58:43,366 copying build/lib/llm_evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,368 copying build/lib/llm_evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,372 copying build/lib/llm_evaluator/cli.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,376 copying build/lib/llm_evaluator/benchmarks.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,381 copying build/lib/llm_evaluator/error_analysis.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,386 copying build/lib/llm_evaluator/system_info.py -> build/bdist.linux-armv7l/wheel/./llm_evaluator 2025-12-02T23:58:43,388 running install_egg_info 2025-12-02T23:58:43,394 Copying src/llm_benchmark_toolkit.egg-info to build/bdist.linux-armv7l/wheel/./llm_benchmark_toolkit-2.2.1-py3.11.egg-info 2025-12-02T23:58:43,407 running install_scripts 2025-12-02T23:58:43,416 creating build/bdist.linux-armv7l/wheel/llm_benchmark_toolkit-2.2.1.dist-info/WHEEL 2025-12-02T23:58:43,419 creating '/tmp/pip-wheel-1cu6uoj1/.tmp-w_92juku/llm_benchmark_toolkit-2.2.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-02T23:58:43,423 adding 'llm_evaluator/__init__.py' 2025-12-02T23:58:43,426 adding 'llm_evaluator/academic_baselines.py' 2025-12-02T23:58:43,432 adding 'llm_evaluator/benchmarks.py' 2025-12-02T23:58:43,439 adding 'llm_evaluator/cli.py' 2025-12-02T23:58:43,441 adding 'llm_evaluator/config.py' 2025-12-02T23:58:43,444 adding 'llm_evaluator/error_analysis.py' 2025-12-02T23:58:43,448 adding 'llm_evaluator/evaluator.py' 2025-12-02T23:58:43,450 adding 'llm_evaluator/export.py' 2025-12-02T23:58:43,452 adding 'llm_evaluator/metrics.py' 2025-12-02T23:58:43,456 adding 'llm_evaluator/statistical_metrics.py' 2025-12-02T23:58:43,458 adding 'llm_evaluator/system_info.py' 2025-12-02T23:58:43,461 adding 'llm_evaluator/visualizations.py' 2025-12-02T23:58:43,463 adding 'llm_evaluator/dashboard/__init__.py' 2025-12-02T23:58:43,464 adding 'llm_evaluator/dashboard/__main__.py' 2025-12-02T23:58:43,468 adding 'llm_evaluator/dashboard/app.py' 2025-12-02T23:58:43,470 adding 'llm_evaluator/dashboard/model_discovery.py' 2025-12-02T23:58:43,471 adding 'llm_evaluator/dashboard/models.py' 2025-12-02T23:58:43,474 adding 'llm_evaluator/dashboard/runner.py' 2025-12-02T23:58:43,477 adding 'llm_evaluator/dashboard/static/favicon.svg' 2025-12-02T23:58:43,478 adding 'llm_evaluator/dashboard/static/index.html' 2025-12-02T23:58:43,558 adding 'llm_evaluator/dashboard/static/assets/index-B1T9cNmb.js' 2025-12-02T23:58:43,565 adding 'llm_evaluator/dashboard/static/assets/index-BGRAUHhM.css' 2025-12-02T23:58:43,567 adding 'llm_evaluator/providers/__init__.py' 2025-12-02T23:58:43,569 adding 'llm_evaluator/providers/anthropic_provider.py' 2025-12-02T23:58:43,571 adding 'llm_evaluator/providers/base.py' 2025-12-02T23:58:43,573 adding 'llm_evaluator/providers/cached_provider.py' 2025-12-02T23:58:43,575 adding 'llm_evaluator/providers/deepseek_provider.py' 2025-12-02T23:58:43,577 adding 'llm_evaluator/providers/huggingface_provider.py' 2025-12-02T23:58:43,578 adding 'llm_evaluator/providers/ollama_provider.py' 2025-12-02T23:58:43,580 adding 'llm_evaluator/providers/openai_provider.py' 2025-12-02T23:58:43,582 adding 'llm_benchmark_toolkit-2.2.1.dist-info/METADATA' 2025-12-02T23:58:43,583 adding 'llm_benchmark_toolkit-2.2.1.dist-info/WHEEL' 2025-12-02T23:58:43,584 adding 'llm_benchmark_toolkit-2.2.1.dist-info/entry_points.txt' 2025-12-02T23:58:43,585 adding 'llm_benchmark_toolkit-2.2.1.dist-info/top_level.txt' 2025-12-02T23:58:43,586 adding 'llm_benchmark_toolkit-2.2.1.dist-info/RECORD' 2025-12-02T23:58:43,590 removing build/bdist.linux-armv7l/wheel 2025-12-02T23:58:43,699 Building wheel for llm-benchmark-toolkit (pyproject.toml): finished with status 'done' 2025-12-02T23:58:43,711 Created wheel for llm-benchmark-toolkit: filename=llm_benchmark_toolkit-2.2.1-py3-none-any.whl size=294290 sha256=a910972ff3802aeeec4c93220e0a5373097fb87c071c065ce81cef88e41c0d7f 2025-12-02T23:58:43,712 Stored in directory: /tmp/pip-ephem-wheel-cache-dkxntvl7/wheels/87/b5/a0/a541bb12f21ffc73a9cbde9afdf59cf12509d3052d3f1cd65d 2025-12-02T23:58:43,727 Successfully built llm-benchmark-toolkit 2025-12-02T23:58:43,741 Removed build tracker: '/tmp/pip-build-tracker-70grt20_'