2025-12-04T11:53:51,171 Created temporary directory: /tmp/pip-ephem-wheel-cache-vnhlphxj 2025-12-04T11:53:51,173 Created temporary directory: /tmp/pip-build-tracker-yzea_7jl 2025-12-04T11:53:51,173 Initialized build tracking at /tmp/pip-build-tracker-yzea_7jl 2025-12-04T11:53:51,174 Created build tracker: /tmp/pip-build-tracker-yzea_7jl 2025-12-04T11:53:51,174 Entered build tracker: /tmp/pip-build-tracker-yzea_7jl 2025-12-04T11:53:51,175 Created temporary directory: /tmp/pip-wheel-i1wy0112 2025-12-04T11:53:51,178 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-04T11:53:51,181 Created temporary directory: /tmp/pip-ephem-wheel-cache-2yqqqic9 2025-12-04T11:53:51,202 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-04T11:53:51,206 2 location(s) to search for versions of flow-benchmark-tools: 2025-12-04T11:53:51,206 * https://pypi.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,206 * https://www.piwheels.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,207 Fetching project page and analyzing links: https://pypi.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,207 Getting page https://pypi.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,209 Found index url https://pypi.org/simple 2025-12-04T11:53:51,421 Fetched page https://pypi.org/simple/flow-benchmark-tools/ as application/vnd.pypi.simple.v1+json 2025-12-04T11:53:51,425 Skipping link: No binaries permitted for flow-benchmark-tools: https://files.pythonhosted.org/packages/1e/18/482a4688a7627145ad3c1be08ad30e4950ba2f90749c350a00e3e0148098/flow_benchmark_tools-1.0.0-py3-none-any.whl (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,426 Found link https://files.pythonhosted.org/packages/f2/66/cbae0de4838f58e2a0e21a241fd288548c0f6be90827800036af77d4178a/flow_benchmark_tools-1.0.0.tar.gz (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8), version: 1.0.0 2025-12-04T11:53:51,427 Skipping link: No binaries permitted for flow-benchmark-tools: https://files.pythonhosted.org/packages/b9/79/1583303d9bbfde0f67e07ab62035eb48c5e27b7c6c8a7c13512ed11f97ab/flow_benchmark_tools-1.1.0-py3-none-any.whl (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,428 Found link https://files.pythonhosted.org/packages/42/67/7a21ee4a4a46373ba1973dfd7ae22cf52370327ac4ccff7a3c576239710c/flow_benchmark_tools-1.1.0.tar.gz (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8), version: 1.1.0 2025-12-04T11:53:51,429 Skipping link: No binaries permitted for flow-benchmark-tools: https://files.pythonhosted.org/packages/b4/b8/2a95f759d5f4bfe49d1908009483b7a102fd68c70d6b645e7aaa5016a54e/flow_benchmark_tools-1.2.0-py3-none-any.whl (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,430 Found link https://files.pythonhosted.org/packages/f2/46/94b9a1c492c6c012b50828c84dc3ea36295dc9c2dae128791bbcb15752ad/flow_benchmark_tools-1.2.0.tar.gz (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8), version: 1.2.0 2025-12-04T11:53:51,431 Skipping link: No binaries permitted for flow-benchmark-tools: https://files.pythonhosted.org/packages/2b/36/82c9f77073cdfb0f8a0bd4097e0911620f20e9cb2dfceb8a69844ab47ea2/flow_benchmark_tools-1.3.0-py3-none-any.whl (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,432 Found link https://files.pythonhosted.org/packages/9e/59/63bc76dbad7b4aa28f4e64afc26f8e56a393905b1a3ae37d74ccb0b8cb35/flow_benchmark_tools-1.3.0.tar.gz (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8), version: 1.3.0 2025-12-04T11:53:51,433 Skipping link: No binaries permitted for flow-benchmark-tools: https://files.pythonhosted.org/packages/12/f8/cf5d492119eebc8d63375529b99150e5a31640273fc8b119663904c3b939/flow_benchmark_tools-1.4.0-py3-none-any.whl (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,434 Found link https://files.pythonhosted.org/packages/87/66/22706b1934d66105d4321c3e4a55225be145e1ed545f704f1e2b69a7abf8/flow_benchmark_tools-1.4.0.tar.gz (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8), version: 1.4.0 2025-12-04T11:53:51,434 Skipping link: No binaries permitted for flow-benchmark-tools: https://files.pythonhosted.org/packages/bc/bb/09ae6ddf9abdacf7325f9d80b625d0f7493d27de2dad458c0608647399be/flow_benchmark_tools-1.5.0-py3-none-any.whl (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,435 Found link https://files.pythonhosted.org/packages/3b/6a/b73d1267120af320b8b10552fef68bc6e92bcb2a5c98d66c1cf9d525e799/flow_benchmark_tools-1.5.0.tar.gz (from https://pypi.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8), version: 1.5.0 2025-12-04T11:53:51,436 Fetching project page and analyzing links: https://www.piwheels.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,437 Getting page https://www.piwheels.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,438 Found index url https://www.piwheels.org/simple 2025-12-04T11:53:51,595 Fetched page https://www.piwheels.org/simple/flow-benchmark-tools/ as text/html 2025-12-04T11:53:51,598 Skipping link: No binaries permitted for flow-benchmark-tools: https://archive1.piwheels.org/simple/flow-benchmark-tools/flow_benchmark_tools-1.4.0-py3-none-any.whl#sha256=d7a5b4b4474efd65e3048861633fb541e8699a7830f3918c1e761067b42205a8 (from https://www.piwheels.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,599 Skipping link: No binaries permitted for flow-benchmark-tools: https://archive1.piwheels.org/simple/flow-benchmark-tools/flow_benchmark_tools-1.3.0-py3-none-any.whl#sha256=21439ee7a4a5d67aebe0b5c81f1d1ad312eb4ba52a18958cfdc9d58279d974d0 (from https://www.piwheels.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,599 Skipping link: No binaries permitted for flow-benchmark-tools: https://archive1.piwheels.org/simple/flow-benchmark-tools/flow_benchmark_tools-1.2.0-py3-none-any.whl#sha256=c820853859283a7c44ad9c75ffc3d6b4f4eeb96a520e8e218d453050f3c15813 (from https://www.piwheels.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,600 Skipping link: No binaries permitted for flow-benchmark-tools: https://archive1.piwheels.org/simple/flow-benchmark-tools/flow_benchmark_tools-1.1.0-py3-none-any.whl#sha256=6b5009d3c71377c4f61ee047d05573338fa81047f54962d14e309d9944f6f08f (from https://www.piwheels.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,600 Skipping link: No binaries permitted for flow-benchmark-tools: https://archive1.piwheels.org/simple/flow-benchmark-tools/flow_benchmark_tools-1.0.0-py3-none-any.whl#sha256=b1e4b8637dbf6adff5001456fc99e5dfdf8ead0e419cc481ab95de5e394b12d7 (from https://www.piwheels.org/simple/flow-benchmark-tools/) (requires-python:<3.12,>=3.8) 2025-12-04T11:53:51,601 Skipping link: not a file: https://www.piwheels.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,602 Skipping link: not a file: https://pypi.org/simple/flow-benchmark-tools/ 2025-12-04T11:53:51,623 Given no hashes to check 1 links for project 'flow-benchmark-tools': discarding no candidates 2025-12-04T11:53:51,642 Collecting flow-benchmark-tools==1.5.0 2025-12-04T11:53:51,644 Created temporary directory: /tmp/pip-unpack-po6c1jzu 2025-12-04T11:53:51,878 Downloading flow_benchmark_tools-1.5.0.tar.gz (855 kB) 2025-12-04T11:53:52,180 Added flow-benchmark-tools==1.5.0 from https://files.pythonhosted.org/packages/3b/6a/b73d1267120af320b8b10552fef68bc6e92bcb2a5c98d66c1cf9d525e799/flow_benchmark_tools-1.5.0.tar.gz to build tracker '/tmp/pip-build-tracker-yzea_7jl' 2025-12-04T11:53:52,187 Created temporary directory: /tmp/pip-build-env-uycxmn1a 2025-12-04T11:53:52,192 Installing build dependencies: started 2025-12-04T11:53:52,193 Running command pip subprocess to install build dependencies 2025-12-04T11:53:53,325 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-12-04T11:53:53,976 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-12-04T11:53:53,998 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-12-04T11:53:55,824 Collecting setuptools<77,>=70.3.0 2025-12-04T11:53:55,936 Using cached https://www.piwheels.org/simple/setuptools/setuptools-76.1.0-py3-none-any.whl (1.2 MB) 2025-12-04T11:53:56,312 Collecting setuptools-scm>=8 2025-12-04T11:53:56,342 Using cached https://www.piwheels.org/simple/setuptools-scm/setuptools_scm-9.2.2-py3-none-any.whl (62 kB) 2025-12-04T11:53:56,552 Collecting packaging>=20 2025-12-04T11:53:56,568 Using cached https://www.piwheels.org/simple/packaging/packaging-25.0-py3-none-any.whl (66 kB) 2025-12-04T11:53:59,481 Installing collected packages: setuptools, packaging, setuptools-scm 2025-12-04T11:54:03,693 Creating /tmp/pip-build-env-uycxmn1a/overlay/local/bin 2025-12-04T11:54:03,696 changing mode of /tmp/pip-build-env-uycxmn1a/overlay/local/bin/setuptools-scm to 755 2025-12-04T11:54:03,719 Successfully installed packaging-25.0 setuptools-76.1.0 setuptools-scm-9.2.2 2025-12-04T11:54:04,002 Installing build dependencies: finished with status 'done' 2025-12-04T11:54:04,009 Getting requirements to build wheel: started 2025-12-04T11:54:04,010 Running command Getting requirements to build wheel 2025-12-04T11:54:04,691 toml section missing PosixPath('pyproject.toml') does not contain a tool.setuptools_scm section 2025-12-04T11:54:04,926 running egg_info 2025-12-04T11:54:04,937 writing src/flow_benchmark_tools.egg-info/PKG-INFO 2025-12-04T11:54:04,950 writing dependency_links to src/flow_benchmark_tools.egg-info/dependency_links.txt 2025-12-04T11:54:04,958 writing requirements to src/flow_benchmark_tools.egg-info/requires.txt 2025-12-04T11:54:04,960 writing top-level names to src/flow_benchmark_tools.egg-info/top_level.txt 2025-12-04T11:54:05,021 listing git files failed - pretending there aren't any 2025-12-04T11:54:05,029 reading manifest file 'src/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:05,039 writing manifest file 'src/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:05,151 Getting requirements to build wheel: finished with status 'done' 2025-12-04T11:54:05,155 Created temporary directory: /tmp/pip-modern-metadata-ge8d72ko 2025-12-04T11:54:05,157 Preparing metadata (pyproject.toml): started 2025-12-04T11:54:05,158 Running command Preparing metadata (pyproject.toml) 2025-12-04T11:54:05,749 toml section missing PosixPath('pyproject.toml') does not contain a tool.setuptools_scm section 2025-12-04T11:54:05,957 running dist_info 2025-12-04T11:54:05,975 creating /tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info 2025-12-04T11:54:05,976 writing /tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/PKG-INFO 2025-12-04T11:54:05,989 writing dependency_links to /tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/dependency_links.txt 2025-12-04T11:54:05,997 writing requirements to /tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/requires.txt 2025-12-04T11:54:05,998 writing top-level names to /tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/top_level.txt 2025-12-04T11:54:06,000 writing manifest file '/tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:06,055 listing git files failed - pretending there aren't any 2025-12-04T11:54:06,062 reading manifest file '/tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:06,067 writing manifest file '/tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:06,068 creating '/tmp/pip-modern-metadata-ge8d72ko/flow_benchmark_tools-1.5.0.dist-info' 2025-12-04T11:54:06,206 Preparing metadata (pyproject.toml): finished with status 'done' 2025-12-04T11:54:06,211 Source in /tmp/pip-wheel-i1wy0112/flow-benchmark-tools_28e3d288cd084776b681e3ee07de0113 has version 1.5.0, which satisfies requirement flow-benchmark-tools==1.5.0 from https://files.pythonhosted.org/packages/3b/6a/b73d1267120af320b8b10552fef68bc6e92bcb2a5c98d66c1cf9d525e799/flow_benchmark_tools-1.5.0.tar.gz 2025-12-04T11:54:06,212 Removed flow-benchmark-tools==1.5.0 from https://files.pythonhosted.org/packages/3b/6a/b73d1267120af320b8b10552fef68bc6e92bcb2a5c98d66c1cf9d525e799/flow_benchmark_tools-1.5.0.tar.gz from build tracker '/tmp/pip-build-tracker-yzea_7jl' 2025-12-04T11:54:06,218 Created temporary directory: /tmp/pip-unpack-kh9oqqvt 2025-12-04T11:54:06,218 Building wheels for collected packages: flow-benchmark-tools 2025-12-04T11:54:06,223 Created temporary directory: /tmp/pip-wheel-ktjta0ut 2025-12-04T11:54:06,224 Destination directory: /tmp/pip-wheel-ktjta0ut 2025-12-04T11:54:06,226 Building wheel for flow-benchmark-tools (pyproject.toml): started 2025-12-04T11:54:06,227 Running command Building wheel for flow-benchmark-tools (pyproject.toml) 2025-12-04T11:54:06,839 toml section missing PosixPath('pyproject.toml') does not contain a tool.setuptools_scm section 2025-12-04T11:54:07,034 running bdist_wheel 2025-12-04T11:54:07,062 running build 2025-12-04T11:54:07,063 running build_py 2025-12-04T11:54:07,072 creating build/lib/recursiveai/benchmark 2025-12-04T11:54:07,074 copying src/recursiveai/benchmark/__init__.py -> build/lib/recursiveai/benchmark 2025-12-04T11:54:07,077 creating build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,078 copying src/recursiveai/benchmark/_internal/_run_output.py -> build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,080 copying src/recursiveai/benchmark/_internal/_evaluation.py -> build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,082 copying src/recursiveai/benchmark/_internal/_benchmark_evaluator.py -> build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,084 copying src/recursiveai/benchmark/_internal/_criteria_evaluator.py -> build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,086 copying src/recursiveai/benchmark/_internal/__init__.py -> build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,088 copying src/recursiveai/benchmark/_internal/_benchmark_output.py -> build/lib/recursiveai/benchmark/_internal 2025-12-04T11:54:07,091 creating build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,092 copying src/recursiveai/benchmark/api/benchmark.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,094 copying src/recursiveai/benchmark/api/benchmark_case.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,096 copying src/recursiveai/benchmark/api/exit_code.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,097 copying src/recursiveai/benchmark/api/benchmark_agent.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,099 copying src/recursiveai/benchmark/api/benchmark_runner.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,102 copying src/recursiveai/benchmark/api/__init__.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,103 copying src/recursiveai/benchmark/api/benchmark_run.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,105 copying src/recursiveai/benchmark/api/benchmark_evaluator.py -> build/lib/recursiveai/benchmark/api 2025-12-04T11:54:07,108 creating build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,109 copying src/recursiveai/benchmark/_internal/_evaluators/_llm_criteria_jury.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,111 copying src/recursiveai/benchmark/_internal/_evaluators/_llm_jury.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,113 copying src/recursiveai/benchmark/_internal/_evaluators/_strict_match.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,115 copying src/recursiveai/benchmark/_internal/_evaluators/_llm_judge.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,117 copying src/recursiveai/benchmark/_internal/_evaluators/_happy.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,118 copying src/recursiveai/benchmark/_internal/_evaluators/_regex_match.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,120 copying src/recursiveai/benchmark/_internal/_evaluators/__init__.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,122 copying src/recursiveai/benchmark/_internal/_evaluators/_llm_criteria_judge.py -> build/lib/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,125 creating build/lib/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,126 copying src/recursiveai/benchmark/_internal/_llm/_google_gemini_model.py -> build/lib/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,128 copying src/recursiveai/benchmark/_internal/_llm/_llm_model.py -> build/lib/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,130 copying src/recursiveai/benchmark/_internal/_llm/_openai_gpt_model.py -> build/lib/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,132 copying src/recursiveai/benchmark/_internal/_llm/_azure_openai_gpt_model.py -> build/lib/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,134 copying src/recursiveai/benchmark/_internal/_llm/_anthropic_claude_model.py -> build/lib/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,137 creating build/lib/recursiveai/benchmark/_internal/_metrics 2025-12-04T11:54:07,138 copying src/recursiveai/benchmark/_internal/_metrics/_run_metrics.py -> build/lib/recursiveai/benchmark/_internal/_metrics 2025-12-04T11:54:07,140 copying src/recursiveai/benchmark/_internal/_metrics/_benchmark_metrics.py -> build/lib/recursiveai/benchmark/_internal/_metrics 2025-12-04T11:54:07,142 creating build/lib/recursiveai/benchmark/_internal/_util 2025-12-04T11:54:07,143 copying src/recursiveai/benchmark/_internal/_util/__init__.py -> build/lib/recursiveai/benchmark/_internal/_util 2025-12-04T11:54:07,146 creating build/lib/recursiveai/benchmark/api/util 2025-12-04T11:54:07,147 copying src/recursiveai/benchmark/api/util/__init__.py -> build/lib/recursiveai/benchmark/api/util 2025-12-04T11:54:07,149 creating build/lib/recursiveai/benchmark/api/agents 2025-12-04T11:54:07,150 copying src/recursiveai/benchmark/api/agents/async_callback_agent.py -> build/lib/recursiveai/benchmark/api/agents 2025-12-04T11:54:07,152 copying src/recursiveai/benchmark/api/agents/callback_agent.py -> build/lib/recursiveai/benchmark/api/agents 2025-12-04T11:54:07,154 copying src/recursiveai/benchmark/api/agents/__init__.py -> build/lib/recursiveai/benchmark/api/agents 2025-12-04T11:54:07,156 running egg_info 2025-12-04T11:54:07,170 writing src/flow_benchmark_tools.egg-info/PKG-INFO 2025-12-04T11:54:07,183 writing dependency_links to src/flow_benchmark_tools.egg-info/dependency_links.txt 2025-12-04T11:54:07,190 writing requirements to src/flow_benchmark_tools.egg-info/requires.txt 2025-12-04T11:54:07,191 writing top-level names to src/flow_benchmark_tools.egg-info/top_level.txt 2025-12-04T11:54:07,233 listing git files failed - pretending there aren't any 2025-12-04T11:54:07,239 reading manifest file 'src/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:07,249 writing manifest file 'src/flow_benchmark_tools.egg-info/SOURCES.txt' 2025-12-04T11:54:07,274 installing to build/bdist.linux-armv7l/wheel 2025-12-04T11:54:07,275 running install 2025-12-04T11:54:07,299 running install_lib 2025-12-04T11:54:07,306 creating build/bdist.linux-armv7l/wheel 2025-12-04T11:54:07,308 creating build/bdist.linux-armv7l/wheel/recursiveai 2025-12-04T11:54:07,310 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark 2025-12-04T11:54:07,312 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/_internal 2025-12-04T11:54:07,313 copying build/lib/recursiveai/benchmark/_internal/_run_output.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal 2025-12-04T11:54:07,315 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,316 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_llm_criteria_jury.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,318 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_llm_jury.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,320 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_strict_match.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,322 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_llm_judge.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,324 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_happy.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,326 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_regex_match.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,327 copying build/lib/recursiveai/benchmark/_internal/_evaluators/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,330 copying build/lib/recursiveai/benchmark/_internal/_evaluators/_llm_criteria_judge.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_evaluators 2025-12-04T11:54:07,332 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,333 copying build/lib/recursiveai/benchmark/_internal/_llm/_google_gemini_model.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,336 copying build/lib/recursiveai/benchmark/_internal/_llm/_llm_model.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,338 copying build/lib/recursiveai/benchmark/_internal/_llm/_openai_gpt_model.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,340 copying build/lib/recursiveai/benchmark/_internal/_llm/_azure_openai_gpt_model.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,342 copying build/lib/recursiveai/benchmark/_internal/_llm/_anthropic_claude_model.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_llm 2025-12-04T11:54:07,344 copying build/lib/recursiveai/benchmark/_internal/_evaluation.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal 2025-12-04T11:54:07,346 copying build/lib/recursiveai/benchmark/_internal/_benchmark_evaluator.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal 2025-12-04T11:54:07,348 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/_internal/_metrics 2025-12-04T11:54:07,349 copying build/lib/recursiveai/benchmark/_internal/_metrics/_run_metrics.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_metrics 2025-12-04T11:54:07,351 copying build/lib/recursiveai/benchmark/_internal/_metrics/_benchmark_metrics.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_metrics 2025-12-04T11:54:07,353 copying build/lib/recursiveai/benchmark/_internal/_criteria_evaluator.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal 2025-12-04T11:54:07,356 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/_internal/_util 2025-12-04T11:54:07,357 copying build/lib/recursiveai/benchmark/_internal/_util/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal/_util 2025-12-04T11:54:07,359 copying build/lib/recursiveai/benchmark/_internal/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal 2025-12-04T11:54:07,360 copying build/lib/recursiveai/benchmark/_internal/_benchmark_output.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/_internal 2025-12-04T11:54:07,363 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/api 2025-12-04T11:54:07,364 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/api/util 2025-12-04T11:54:07,365 copying build/lib/recursiveai/benchmark/api/util/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api/util 2025-12-04T11:54:07,367 copying build/lib/recursiveai/benchmark/api/benchmark.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,369 copying build/lib/recursiveai/benchmark/api/benchmark_case.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,371 copying build/lib/recursiveai/benchmark/api/exit_code.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,372 copying build/lib/recursiveai/benchmark/api/benchmark_agent.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,374 copying build/lib/recursiveai/benchmark/api/benchmark_runner.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,376 copying build/lib/recursiveai/benchmark/api/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,378 copying build/lib/recursiveai/benchmark/api/benchmark_run.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,380 copying build/lib/recursiveai/benchmark/api/benchmark_evaluator.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api 2025-12-04T11:54:07,383 creating build/bdist.linux-armv7l/wheel/recursiveai/benchmark/api/agents 2025-12-04T11:54:07,384 copying build/lib/recursiveai/benchmark/api/agents/async_callback_agent.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api/agents 2025-12-04T11:54:07,386 copying build/lib/recursiveai/benchmark/api/agents/callback_agent.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api/agents 2025-12-04T11:54:07,387 copying build/lib/recursiveai/benchmark/api/agents/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark/api/agents 2025-12-04T11:54:07,389 copying build/lib/recursiveai/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./recursiveai/benchmark 2025-12-04T11:54:07,391 running install_egg_info 2025-12-04T11:54:07,398 Copying src/flow_benchmark_tools.egg-info to build/bdist.linux-armv7l/wheel/./flow_benchmark_tools-1.5.0-py3.11.egg-info 2025-12-04T11:54:07,408 running install_scripts 2025-12-04T11:54:07,416 creating build/bdist.linux-armv7l/wheel/flow_benchmark_tools-1.5.0.dist-info/WHEEL 2025-12-04T11:54:07,419 creating '/tmp/pip-wheel-ktjta0ut/.tmp-2k0iuc6r/flow_benchmark_tools-1.5.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-12-04T11:54:07,422 adding 'recursiveai/benchmark/__init__.py' 2025-12-04T11:54:07,424 adding 'recursiveai/benchmark/_internal/__init__.py' 2025-12-04T11:54:07,425 adding 'recursiveai/benchmark/_internal/_benchmark_evaluator.py' 2025-12-04T11:54:07,426 adding 'recursiveai/benchmark/_internal/_benchmark_output.py' 2025-12-04T11:54:07,427 adding 'recursiveai/benchmark/_internal/_criteria_evaluator.py' 2025-12-04T11:54:07,429 adding 'recursiveai/benchmark/_internal/_evaluation.py' 2025-12-04T11:54:07,430 adding 'recursiveai/benchmark/_internal/_run_output.py' 2025-12-04T11:54:07,432 adding 'recursiveai/benchmark/_internal/_evaluators/__init__.py' 2025-12-04T11:54:07,433 adding 'recursiveai/benchmark/_internal/_evaluators/_happy.py' 2025-12-04T11:54:07,435 adding 'recursiveai/benchmark/_internal/_evaluators/_llm_criteria_judge.py' 2025-12-04T11:54:07,436 adding 'recursiveai/benchmark/_internal/_evaluators/_llm_criteria_jury.py' 2025-12-04T11:54:07,437 adding 'recursiveai/benchmark/_internal/_evaluators/_llm_judge.py' 2025-12-04T11:54:07,438 adding 'recursiveai/benchmark/_internal/_evaluators/_llm_jury.py' 2025-12-04T11:54:07,440 adding 'recursiveai/benchmark/_internal/_evaluators/_regex_match.py' 2025-12-04T11:54:07,441 adding 'recursiveai/benchmark/_internal/_evaluators/_strict_match.py' 2025-12-04T11:54:07,443 adding 'recursiveai/benchmark/_internal/_llm/_anthropic_claude_model.py' 2025-12-04T11:54:07,444 adding 'recursiveai/benchmark/_internal/_llm/_azure_openai_gpt_model.py' 2025-12-04T11:54:07,445 adding 'recursiveai/benchmark/_internal/_llm/_google_gemini_model.py' 2025-12-04T11:54:07,446 adding 'recursiveai/benchmark/_internal/_llm/_llm_model.py' 2025-12-04T11:54:07,448 adding 'recursiveai/benchmark/_internal/_llm/_openai_gpt_model.py' 2025-12-04T11:54:07,449 adding 'recursiveai/benchmark/_internal/_metrics/_benchmark_metrics.py' 2025-12-04T11:54:07,451 adding 'recursiveai/benchmark/_internal/_metrics/_run_metrics.py' 2025-12-04T11:54:07,452 adding 'recursiveai/benchmark/_internal/_util/__init__.py' 2025-12-04T11:54:07,454 adding 'recursiveai/benchmark/api/__init__.py' 2025-12-04T11:54:07,455 adding 'recursiveai/benchmark/api/benchmark.py' 2025-12-04T11:54:07,457 adding 'recursiveai/benchmark/api/benchmark_agent.py' 2025-12-04T11:54:07,458 adding 'recursiveai/benchmark/api/benchmark_case.py' 2025-12-04T11:54:07,459 adding 'recursiveai/benchmark/api/benchmark_evaluator.py' 2025-12-04T11:54:07,460 adding 'recursiveai/benchmark/api/benchmark_run.py' 2025-12-04T11:54:07,462 adding 'recursiveai/benchmark/api/benchmark_runner.py' 2025-12-04T11:54:07,463 adding 'recursiveai/benchmark/api/exit_code.py' 2025-12-04T11:54:07,464 adding 'recursiveai/benchmark/api/agents/__init__.py' 2025-12-04T11:54:07,466 adding 'recursiveai/benchmark/api/agents/async_callback_agent.py' 2025-12-04T11:54:07,467 adding 'recursiveai/benchmark/api/agents/callback_agent.py' 2025-12-04T11:54:07,469 adding 'recursiveai/benchmark/api/util/__init__.py' 2025-12-04T11:54:07,470 adding 'flow_benchmark_tools-1.5.0.dist-info/METADATA' 2025-12-04T11:54:07,471 adding 'flow_benchmark_tools-1.5.0.dist-info/WHEEL' 2025-12-04T11:54:07,472 adding 'flow_benchmark_tools-1.5.0.dist-info/top_level.txt' 2025-12-04T11:54:07,473 adding 'flow_benchmark_tools-1.5.0.dist-info/RECORD' 2025-12-04T11:54:07,475 removing build/bdist.linux-armv7l/wheel 2025-12-04T11:54:07,600 Building wheel for flow-benchmark-tools (pyproject.toml): finished with status 'done' 2025-12-04T11:54:07,605 Created wheel for flow-benchmark-tools: filename=flow_benchmark_tools-1.5.0-py3-none-any.whl size=26895 sha256=05fd4c4d46253637d8986947f6fec0b003f139d8c6f1f937003a5e2c8208aa0c 2025-12-04T11:54:07,606 Stored in directory: /tmp/pip-ephem-wheel-cache-2yqqqic9/wheels/b1/52/82/c3df22f6509cc5e4c2d8a8355f8c8a9a212fcf34e432b89138 2025-12-04T11:54:07,617 Successfully built flow-benchmark-tools 2025-12-04T11:54:07,621 Removed build tracker: '/tmp/pip-build-tracker-yzea_7jl'