2026-05-28T10:08:08,062 Created temporary directory: /tmp/pip-ephem-wheel-cache-h38if3eq 2026-05-28T10:08:08,064 Created temporary directory: /tmp/pip-build-tracker-f2nuu9mq 2026-05-28T10:08:08,065 Initialized build tracking at /tmp/pip-build-tracker-f2nuu9mq 2026-05-28T10:08:08,065 Created build tracker: /tmp/pip-build-tracker-f2nuu9mq 2026-05-28T10:08:08,066 Entered build tracker: /tmp/pip-build-tracker-f2nuu9mq 2026-05-28T10:08:08,067 Created temporary directory: /tmp/pip-wheel-u7bz1_vl 2026-05-28T10:08:08,070 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-05-28T10:08:08,072 Created temporary directory: /tmp/pip-ephem-wheel-cache-8lzx138m 2026-05-28T10:08:08,094 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-05-28T10:08:08,097 2 location(s) to search for versions of evalscope: 2026-05-28T10:08:08,097 * https://pypi.org/simple/evalscope/ 2026-05-28T10:08:08,097 * https://www.piwheels.org/simple/evalscope/ 2026-05-28T10:08:08,098 Fetching project page and analyzing links: https://pypi.org/simple/evalscope/ 2026-05-28T10:08:08,099 Getting page https://pypi.org/simple/evalscope/ 2026-05-28T10:08:08,100 Found index url https://pypi.org/simple 2026-05-28T10:08:08,329 Fetched page https://pypi.org/simple/evalscope/ as application/vnd.pypi.simple.v1+json 2026-05-28T10:08:08,348 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/94/d2/dc5e929802776bf4e662a46d794b765876bb93e2300189cafd113cac74d6/evalscope-0.5.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,349 Found link https://files.pythonhosted.org/packages/e5/70/45a5dad24b1fa535bff194b99a4668e7f5f328be972b51b3b91eafb4cdbb/evalscope-0.5.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0rc0 2026-05-28T10:08:08,350 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ac/eb/341fe367df2bc9a0ae7ef5eb2037a5d549d9bb8c0d7ad84844c9926e0947/evalscope-0.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,351 Found link https://files.pythonhosted.org/packages/3b/3f/585f7f1cf2ce90b234c1cfd654bb26977be4d889c0e5eed0122cb3024c45/evalscope-0.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0 2026-05-28T10:08:08,352 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ad/58/c0ce004159cfac6df9b5736c011576d52bdceb778943de8a022a419d86eb/evalscope-0.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,353 Found link https://files.pythonhosted.org/packages/94/33/7ad2f2285f5b68953ad4466d23dc5de1a2e57e7cc63d5924ab0e84d156ba/evalscope-0.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.2 2026-05-28T10:08:08,354 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/59/a9e1c4cf88018ece1fdd8d8b7fa976e28f9b4b181ef7ceb74a5e2db533ab/evalscope-0.5.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,354 Found link https://files.pythonhosted.org/packages/04/57/9ca7b1fd68f2acc32802b22236c83b597a5690a483d5938d38183b549d22/evalscope-0.5.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.3 2026-05-28T10:08:08,355 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/fa/c8/fcbaf01b7486c3b29b7790167c2cda560f00a04d100cec808ee9a3349ca0/evalscope-0.5.4-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,356 Found link https://files.pythonhosted.org/packages/8c/cc/abd412bad714c0266be1f0159b49a817d45db099c3bd031134d223589e93/evalscope-0.5.4.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.4 2026-05-28T10:08:08,357 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4d/da/d705d457683223f289e8c5d6cadbaad15d15e098692c53ea7e6196a94373/evalscope-0.5.5rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,358 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7e/4c/414dd545a1833245a53797d70a35a78ae1b9cfbcc81b35a4e1763e678437/evalscope-0.5.5rc1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,358 Found link https://files.pythonhosted.org/packages/56/5f/aa7fcf62102694dd66b69e88cb2523094bf04b53f785854e17ee6b7234a7/evalscope-0.5.5rc1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5rc1 2026-05-28T10:08:08,359 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3e/d0/91a7a1f95f3fa19dd8d4e434dc711768abe5c006f32a514a8602b429e049/evalscope-0.5.5-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,360 Found link https://files.pythonhosted.org/packages/04/7e/a7f065d6ebac15fe172d3b0906ff5b26a71df5a9975c0f14978044211cf1/evalscope-0.5.5.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5 2026-05-28T10:08:08,361 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/89/b812b01a5ed91fc079dea052e1341860cd65d25d463c75c90e5a30ab6ae8/evalscope-0.6.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,361 Found link https://files.pythonhosted.org/packages/d5/ad/57a2d5f33c5b7d5066f8a5dcb1d34f14bf246112f7900228b9f2fb41b21b/evalscope-0.6.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0rc0 2026-05-28T10:08:08,362 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/25/3f03d9d924f1b65610724c9f10727b48ef952afdfee8687a461949c88c78/evalscope-0.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,363 Found link https://files.pythonhosted.org/packages/9d/ac/1f432bcc46ccb8348b869b80d2aaabde5e583b370418ba48714083e31068/evalscope-0.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0 2026-05-28T10:08:08,364 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7b/e1/42c9e58b4690f23ef48bce841fc95cbf0744c1579cdc80fa6f33b0453344/evalscope-0.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,365 Found link https://files.pythonhosted.org/packages/cd/d6/1d9d2db9acda6e61d4210074f51e7e3dee4d0212fabdd94999105db23eed/evalscope-0.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.1 2026-05-28T10:08:08,366 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/92/6fff1dfd12a4f73489c451dab56351b6a3c1095a92bb55025c2934fc625d/evalscope-0.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,367 Found link https://files.pythonhosted.org/packages/fe/3c/500d655a27ca80e1aba3fb2b1e8886951942732b869ad1516422d9e6ac97/evalscope-0.7.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.0 2026-05-28T10:08:08,368 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a5/3a/ae22d4d9a44ad37ac887da21b948bf6e784307001a09802d36c5bf04018d/evalscope-0.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,368 Found link https://files.pythonhosted.org/packages/88/47/69d067f0d3d784a7975cc4ea067fdc55c8f785ece6fbe86e5e21edc8b36f/evalscope-0.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.1 2026-05-28T10:08:08,369 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/23/acbcbc2ed6f00d3bb81220054651e6a2b1b02714d42b1aeb018a2f5574c4/evalscope-0.7.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,370 Found link https://files.pythonhosted.org/packages/a4/38/9126b9329cd2ad6ccfd4a73f04402bf71b65921564c3c12cb0d62b3b421d/evalscope-0.7.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.2 2026-05-28T10:08:08,371 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/1f/9598183da3026696adc19a6727a19d57f9abbf4fd7aeb20cdf12faee7693/evalscope-0.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,372 Found link https://files.pythonhosted.org/packages/63/7e/fc44d30e3a83dbc3070396d78279c9e3e8716cbc4cc05811a70f1b463bfb/evalscope-0.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.0 2026-05-28T10:08:08,373 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/37/a9/a9c6cda95a6b837c9303e9a1598999f2c4e605abb507365c6ff70b372a5a/evalscope-0.8.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,373 Found link https://files.pythonhosted.org/packages/63/5a/bbc230bb06a7bc40dd3985dde4615a8a71f111bf95761e40f6f0f8a7e1a6/evalscope-0.8.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.1 2026-05-28T10:08:08,374 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/07/39/a1eb2efed77d21e8253daa37f082669a004c4d288813a2ee9e15398f2e80/evalscope-0.8.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,375 Found link https://files.pythonhosted.org/packages/57/bc/5ff6d538e459d8b3c567577c991eec72ab6adbc19497dc164d96cd634d2f/evalscope-0.8.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.2 2026-05-28T10:08:08,376 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f6/29/81a188c03b272bf7def0c9bb556b9e9465adbc68ecb18907b636f1e8cbd7/evalscope-0.9.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,377 Found link https://files.pythonhosted.org/packages/c0/c2/19da3be1fbd6b548ecdc877d47269e92503518de53acbbfe96120c5c9753/evalscope-0.9.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.9.0 2026-05-28T10:08:08,377 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/6d/45d7407f31d6878494c3b493d7e49a8318b1839161839293c1a2e66aadcf/evalscope-0.10.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,378 Found link https://files.pythonhosted.org/packages/16/b1/b6cef37a0dd0acfa5873ca4763ac6b4ac4b19a0b15ca6bdc8f30d4443682/evalscope-0.10.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.0 2026-05-28T10:08:08,379 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a7/8f/907045290d359b4e07e7acc96ec60173380748b49e9f3c91b7ddd8e8342d/evalscope-0.10.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,380 Found link https://files.pythonhosted.org/packages/e3/d3/dda2ac0513904bff8aa0c2efe77bc851d3acc6d514707db36648e4a903d2/evalscope-0.10.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.1 2026-05-28T10:08:08,380 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d6/e5/852326943d86c85b5ca6b548a5f3753c217b771d93968c61ec2ca46ee0b1/evalscope-0.11.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,381 Found link https://files.pythonhosted.org/packages/5f/3f/e2816b99487b4ead257453a242b5282a85881f9d26fbb5efb21cc5cf88fa/evalscope-0.11.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.11.0 2026-05-28T10:08:08,382 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7d/8e/f9eceecf8bc7d740603f915eef7fab3e9d657a01f5de2c523e531445299c/evalscope-0.12.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,383 Found link https://files.pythonhosted.org/packages/33/82/7765517ff80a73eac7465369767aa45a5be3d5e0fb7c4f4a3ff743811f8c/evalscope-0.12.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.0 2026-05-28T10:08:08,384 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ee/4c/84a5e18985e149eb4283fef2b58a81ff2cec2d099c017684938ec3a3935f/evalscope-0.12.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,384 Found link https://files.pythonhosted.org/packages/b2/24/83b5530319bdb02142289e04640c6008dda1b988043c42feb7f0a5eab3b7/evalscope-0.12.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.1 2026-05-28T10:08:08,385 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/a5/65faf0660cd8ae2660354b002b2b3a586b9419bc894120fea97efd506cb6/evalscope-0.13.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,386 Found link https://files.pythonhosted.org/packages/d0/39/d5eb469a94191760c61d1bfcb235e28be1d2a080d88b44792f53d76c45d1/evalscope-0.13.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.0 2026-05-28T10:08:08,387 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f7/fc/91b7b4379131d2e15ca1f575f533daea589357293e492ef2c93e0aac6b55/evalscope-0.13.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,387 Found link https://files.pythonhosted.org/packages/9e/a3/33b4ce270d5500fe7c8f32fa2160749b607f248141328d4785b6032c8f2a/evalscope-0.13.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.1 2026-05-28T10:08:08,388 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e6/66/83752c305879cf3dea54170398839ab08a046485bc18c41a34f41aca11ab/evalscope-0.13.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,389 Found link https://files.pythonhosted.org/packages/b7/8d/711ae30b80329e2dd7da760c001d9a5b45e4d8e5292f317f1ea10c744c29/evalscope-0.13.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.2 2026-05-28T10:08:08,390 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a8/b4/b22c7e52e6a7381333bdfa0bf92fae0258e7812064b1e208cbab56a62d08/evalscope-0.14.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,391 Found link https://files.pythonhosted.org/packages/40/44/7db2cb90e6ca0c9db92124f10c0273d7c6ef4b81523e1c98c34a88e67faa/evalscope-0.14.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.14.0 2026-05-28T10:08:08,392 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b2/47/976633e0f29b58b8c9f3faacf7373a2da734771a7915ac45d721d96e0ad7/evalscope-0.15.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,392 Found link https://files.pythonhosted.org/packages/b8/e3/bd534d69328afa98bdd497b5eaaf4b7416da9e8f56109d045c332b17d016/evalscope-0.15.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.0 2026-05-28T10:08:08,393 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5a/54/237aee5f0317fa04450c9ad67c3bf28b730460b2e6dc1e65b74b4bf2cd67/evalscope-0.15.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,394 Found link https://files.pythonhosted.org/packages/7a/d4/8b87e83a3a08f87ce5b4325f0cd5ab9bc54d296dc3f3492a1d3216a97a6d/evalscope-0.15.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.1 2026-05-28T10:08:08,395 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/1e/b5/6fec1cbb02a41ab79430eb3fb51eea7709525df0f9753ae2c54fbc4633f1/evalscope-0.16.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,396 Found link https://files.pythonhosted.org/packages/99/69/63997bedfa6fd33af671b539f77c375b111017eff23313f76693e24b8872/evalscope-0.16.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.0 2026-05-28T10:08:08,396 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ea/76/639641578bdc92d25211eb8dae24d1fae19e40cd1649e17946f6ad8a5dc3/evalscope-0.16.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,397 Found link https://files.pythonhosted.org/packages/94/13/616bf9c33b0769db44a2bae32b54d33cf7129874392682aa76326a51e085/evalscope-0.16.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.1 2026-05-28T10:08:08,398 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f1/d3/9e83cc1b5a132342a05ef6ee79018bd7561f90a6406dc9db5c85fb0a281f/evalscope-0.16.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,399 Found link https://files.pythonhosted.org/packages/94/47/edd3faaddd321e464ae72db6e7bf82246dd4ca2f0f67127ca8c427cac664/evalscope-0.16.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.2 2026-05-28T10:08:08,399 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5d/4e/34c56086cdfe7d1ad912241af819b3e67f20373f382016e33ac89dd43dde/evalscope-0.16.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,401 Found link https://files.pythonhosted.org/packages/22/07/14603c038a8019472881f574f1c47bd4193481f256b6dc702c65d8b8f984/evalscope-0.16.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.3 2026-05-28T10:08:08,401 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f3/e2/104156f74779cf2849f53566bba585015492d7320c36a7cb76c7196b0ef5/evalscope-0.17.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,402 Found link https://files.pythonhosted.org/packages/c5/d0/b66d1b97ec67d65b6df54f18682e30cfeb6401604b93c9b1bdd1e97b8d79/evalscope-0.17.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.17.0 2026-05-28T10:08:08,403 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a9/b5/630d2c5dc5c32e9fbad5034e04d8aba6f6461dc08f255df77dd8d463857f/evalscope-0.17.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,404 Found link https://files.pythonhosted.org/packages/38/88/326e48929bb9577a6a36e07afa65bbf6bd870c1b644f82e5713874ae3238/evalscope-0.17.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 0.17.1 2026-05-28T10:08:08,405 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/21/18/9a6c208d2bc119ac67b5537b60851c6bccc99f25229eaa96cbe6e38721bd/evalscope-1.0.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,406 Found link https://files.pythonhosted.org/packages/d8/44/3d727dd28fcc50317c95d12e5bb850ed2863105f812373ac120877875434/evalscope-1.0.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.0 2026-05-28T10:08:08,407 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e1/8a/50456fa7dd77be4c3a0ea0b3d96cb7ae5b2557454fdd35cbf0009a9d792f/evalscope-1.0.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,408 Found link https://files.pythonhosted.org/packages/8d/52/93569134b3d8dea2a0d0bc2134c03056f0ffee1840f7299eb83d475457df/evalscope-1.0.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.1 2026-05-28T10:08:08,408 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/22/f6/32fb0fef08a6c881ac840117455a5697a0d63226db8a24cce5208b720829/evalscope-1.0.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,409 Found link https://files.pythonhosted.org/packages/3d/f5/025baefe432d9af1ed845ab5738b638b7b97f2dd3767e9478b8eee10966f/evalscope-1.0.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.2 2026-05-28T10:08:08,410 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/6e/12/f31dbb18daa7e3c6cfecf856ddc323a303e115271166c100e06af58ea6b6/evalscope-1.1.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,410 Found link https://files.pythonhosted.org/packages/b4/98/7449040e89beaa4556bf35ba1e171b2d4955ff15b2b4c43f2ff55b048aeb/evalscope-1.1.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.1.0 2026-05-28T10:08:08,411 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/79/d1/afc8b23345ad8f11a5e1f8c6c3112a8679604d833bdbc02aa06787952fd1/evalscope-1.1.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,412 Found link https://files.pythonhosted.org/packages/e8/3f/d67b73ce19789e914d6a78740fa7bfd0c07f161bc239b92cd3c26541f2fb/evalscope-1.1.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.1.1 2026-05-28T10:08:08,413 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/82/0a/d984751e4b5751f064209da3745b6aff6cc0f1d9d93f13cab0a1017a8639/evalscope-1.2.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,414 Found link https://files.pythonhosted.org/packages/7f/f9/0a2a069ee4500666ec5c3d10b302fc71d176c17bbe70447336e610953e1a/evalscope-1.2.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.2.0 2026-05-28T10:08:08,415 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/58/823d009dfa49cfdc750ac257e744eb97456d69528a09ac108ee8cab15316/evalscope-1.3.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,416 Found link https://files.pythonhosted.org/packages/42/5a/a309f7ce1fbe2b39e4b0a1f26cfcd7864eaa90e4792a5290e8cdd2ce3b4f/evalscope-1.3.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.3.0 2026-05-28T10:08:08,417 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c6/57/005d3ef07ecdd5163e1bbed3413537b653b637d9c8b62a2bcdd97546607b/evalscope-1.4.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,417 Found link https://files.pythonhosted.org/packages/11/d5/268f610ac7db9c5c2109936f65ac4df8b4bc52106ed4369509d3b3c4f127/evalscope-1.4.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.0 2026-05-28T10:08:08,418 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b8/96/a2b4fabf6fa6cf09ac6669aa01dc483ac53576a8fd7c2c4be21ea281840f/evalscope-1.4.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,419 Found link https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.1 2026-05-28T10:08:08,420 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/bf/0f/97e68e89f7925160df49ea1dbbcef7f3f8e808a51756c199aaaadc75f5a5/evalscope-1.4.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,421 Found link https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.2 2026-05-28T10:08:08,421 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c0/fb/e6b1a396bad204e38591a6d6de1172dac2ce3e0d15b87e812d57e22d0e4f/evalscope-1.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,422 Found link https://files.pythonhosted.org/packages/a7/32/518a920ac8a73c4c6e39f7e443df6da6ea9a3be6567c4a425def866b8f5e/evalscope-1.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.0 2026-05-28T10:08:08,423 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/eb/68/0c870a84e38d5a8d3e7c9df918739a4ba6a45c3ddb624d2792a41a8d3293/evalscope-1.5.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,424 Found link https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.1 2026-05-28T10:08:08,424 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/55/1f/b1b087b1d646f635e9e225c8b610f80b1e6e2590802228c15d1d58ae026e/evalscope-1.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,425 Found link https://files.pythonhosted.org/packages/ff/13/56b351e22964e93e6d74dbfdb71a4d5e2f96b4ae716f76d5b5ed4d88bae7/evalscope-1.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.2 2026-05-28T10:08:08,426 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/53/5b/3d5e1067f98e08cc2aac5c45f4fa67e6a183d62471439e61528469bc5e61/evalscope-1.5.2.post1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,427 Found link https://files.pythonhosted.org/packages/64/d7/ab3fad322268613661de3c4451df7f236475ef4aef7645619e6998b3199d/evalscope-1.5.2.post1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.2.post1 2026-05-28T10:08:08,428 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/02/f3/d38eb7c4488f85d92caf2a6c4b9af852e72ac4d23f5132715d6d5062a82a/evalscope-1.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,429 Found link https://files.pythonhosted.org/packages/ba/13/b96ac9df484c1ff1f3e202bd103bc715c87c338c465ac9dff585109bee98/evalscope-1.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.6.0 2026-05-28T10:08:08,429 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/57/898c44800b1a77a2c4e31afad151f237dad2004666db5ce69a1fec2f654e/evalscope-1.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,430 Found link https://files.pythonhosted.org/packages/e6/6e/9ebaec0d2d4b09ca3fa6aed3d9ed4b61e52824c8cc8f5b9ff2dfa58d6ea4/evalscope-1.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.6.1 2026-05-28T10:08:08,431 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/63/c3/26996e51ee2a45159c7463e17957bc4adee1f258f7d728926e6dd5702d7d/evalscope-1.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,432 Found link https://files.pythonhosted.org/packages/b4/44/e0c2403f07c2588591d5210cdca6e42543e7c11ea1b603871a09805acc11/evalscope-1.7.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.7.0 2026-05-28T10:08:08,433 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/2c/be/052195ab65785e780453caad1d0414d2eb5d075c49f97f1eeb6812166bb9/evalscope-1.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,433 Found link https://files.pythonhosted.org/packages/9d/5f/ca9819998d01e2f5141ac53196476f36cef306f4a5c7a9294d04bef69287/evalscope-1.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.7.1 2026-05-28T10:08:08,434 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3d/be/918691d67eed3d52426464439e70a6e744e1d79b52b9406927458171f7ba/evalscope-1.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,435 Found link https://files.pythonhosted.org/packages/c5/7d/497553f73dc4cacfbf736c3f39bf003286a871dd761aaf9d695cd237a89b/evalscope-1.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.8.0 2026-05-28T10:08:08,435 Fetching project page and analyzing links: https://www.piwheels.org/simple/evalscope/ 2026-05-28T10:08:08,436 Getting page https://www.piwheels.org/simple/evalscope/ 2026-05-28T10:08:08,437 Found index url https://www.piwheels.org/simple 2026-05-28T10:08:08,637 Fetched page https://www.piwheels.org/simple/evalscope/ as text/html 2026-05-28T10:08:08,650 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.7.1-py3-none-any.whl#sha256=4fa39f59314c73264ed6a3d32e9c20df0ed42275b496e3389eb52cb93dde24ee (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,650 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.7.0-py3-none-any.whl#sha256=cef2a54ebfb484365f3b7040f9c432f1116728a79320384b4096a7d031b26e8c (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,651 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.6.1-py3-none-any.whl#sha256=603f6982401ac313daa073da9590d3168948b1638a8a4d45faaa22d41ae6b92f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,652 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.6.0-py3-none-any.whl#sha256=5f6a31b463e6844867fdddb6488a17d304a770d4a30ad0a3615f853d4650f220 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,652 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.2.post1-py3-none-any.whl#sha256=76e17bb53fe492f1648148d0cebe8d500169cef934d52fee6e0e3edbf5351b90 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,653 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.2-py3-none-any.whl#sha256=db9427c4cca0bcaa951a6faac7db6c94854d1384ab0914a0ba0f9d377c947f66 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,653 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.1-py3-none-any.whl#sha256=a95eabf175191595bfeebe4e6face613a6c137a65067e8fd7dca613567bba440 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,654 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.0-py3-none-any.whl#sha256=933f1aa9915ed658bc3ae6901e0b96efbdbf80db96eca40c2d42453b26530d9b (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,655 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.2-py3-none-any.whl#sha256=222e938fe502394b9935f3c00677cf1372892caa530b6fb48476ae909a91399a (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,655 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.1-py3-none-any.whl#sha256=d74ddb7150b19de1eb995026c697b598cdfc0b75fee3a2d110219256c4241688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,656 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.0-py3-none-any.whl#sha256=307c9ed70f562ba776fdf9b0136b35fe4e361b1b974bb0f8ca39e425f4738e6e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,656 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.3.0-py3-none-any.whl#sha256=d9949bacf6c08b5ab341f872c6ee4b31995d724fe963f64ddf3c129e0e39145e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,656 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.2.0-py3-none-any.whl#sha256=5969cf4a3132f6a29f9ef39aa35a8a3be24f114bbcd7a77a19c145bbec432be9 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,657 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.1-py3-none-any.whl#sha256=5bfb8c55f45e1bcd5df5cb0cd4ecceae2ccb93cd09c9477b0b0e6c097ecb1d1f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-05-28T10:08:08,658 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.0-py3-none-any.whl#sha256=ca7c951fa316bb7ec6fb0e38ad6503632067f07f469e37828fe9e39e51591994 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,658 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.2-py3-none-any.whl#sha256=e14016040022bcd666c05ffa806f3713775e9de19a98290bd2a0a36e5c435409 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,659 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.1-py3-none-any.whl#sha256=61b9c14f3409804d84ddf31b749b587c7c7441f9d1aa0d453991b1bd0bbda74c (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,660 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.0-py3-none-any.whl#sha256=ea1044755177db8d9e94cc4ccafbd56e35b173a8425db7d1653dd9f66e1463ad (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,660 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.1-py3-none-any.whl#sha256=aa0054b8aed77684e56d0836d4568080fa4827799a16d62fef6ec13802cd4050 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-05-28T10:08:08,661 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.0-py3-none-any.whl#sha256=e7a2549f9f5ac5b0061d01f8ca99900e06f6340b91d3e546163423b896287862 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,661 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.3-py3-none-any.whl#sha256=75dcbd7cc0a0336f68d407a3925fe065ecaaea0fa9030ceebfaa3f22f0f3b417 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,662 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.2-py3-none-any.whl#sha256=14e00f16b506b723a359799e3cb271370a3da768da3667563612e458982d6847 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,662 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.1-py3-none-any.whl#sha256=296b90c06a33f69c9e7049a768f16929acd1279af0830f47f18fc598560d0e13 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,663 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.0-py3-none-any.whl#sha256=0a6f11a7a4d564d4a1dca5fe424cda19608cf947b9ab739079f5b60842651c7e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,663 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.1-py3-none-any.whl#sha256=504717cfc96a8fdbb0d4bc080d0292e80423891b658d61283385737794c4e95f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,664 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.0-py3-none-any.whl#sha256=263020a7b7f7788e17515c738604ea64904ca0da34d09664d2b7ee16d3522a00 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,664 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.14.0-py3-none-any.whl#sha256=cdf26beb4b188e1dd5e09feeb1832ccf909c9d2a535eb76e702fb3c66fc65688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,665 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.2-py3-none-any.whl#sha256=7d360083cf9dd960996847cde2140085d0830ddc8b12aa8007b4f72d395c5211 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,665 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.1-py3-none-any.whl#sha256=6b26f11daca05d6b56da3cb1b78ea0a8f28de94901b7520171c66e2a16b1c638 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,666 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.0-py3-none-any.whl#sha256=8ca93c4011f04e35df239b5486248e03100a9c9e265664e01330ef4b1cc691f7 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,667 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.1-py3-none-any.whl#sha256=72e5815923789c6cab6c32425477393bf645f670edcc493f8e4dec6e93f3da23 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,667 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.0-py3-none-any.whl#sha256=f26c7317a5bacac8527806723d45bac74563a2608ed761d822ace03be3a5e45f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,668 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.11.0-py3-none-any.whl#sha256=6e7d3242c5bf97b54d24644a1300575fdf41c1d90eef7c1344c20bf0d1518671 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,668 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.1-py3-none-any.whl#sha256=a7e645410333c1aaec5a024be4c02166dfb0d6b4635b020c181ce672d31102d2 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,669 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.0-py3-none-any.whl#sha256=494f1742178def5f86e552c004562c877cd8b8b6f5d4267d29f961f20e8bf569 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,670 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.9.0-py3-none-any.whl#sha256=26fd59fd387850f3e84f995fd38357994460e71d0124bc09954ca8837027ec52 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,670 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.2-py3-none-any.whl#sha256=3c4ed25b5c39d7a706927607e552e6ced6d50532bc442bba74979f70720f4894 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,670 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.1-py3-none-any.whl#sha256=64e9306453082a95b0c0507d6fd1dbf50ed0ab210ebcfebdcb52c534769d1856 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,671 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.0-py3-none-any.whl#sha256=7758550ef3406d9c6096de05909cc1c97ed0c2f7be6edaf3fe5344244e34f233 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,671 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.2-py3-none-any.whl#sha256=21a1f4448ffca926853b7516b6375f801ef6a8067501dde546e7d794fb759f20 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,672 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.1-py3-none-any.whl#sha256=b959f2d9850544f2d96ac765b06ed112cf579a11899618fcc5245407f4e33843 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,673 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.0-py3-none-any.whl#sha256=2deddeb89bfb6fa02844b72a2be24627526136cbd45315e8e4dadb672d53c9fc (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-05-28T10:08:08,673 Skipping link: not a file: https://www.piwheels.org/simple/evalscope/ 2026-05-28T10:08:08,674 Skipping link: not a file: https://pypi.org/simple/evalscope/ 2026-05-28T10:08:08,699 Given no hashes to check 1 links for project 'evalscope': discarding no candidates 2026-05-28T10:08:08,717 Collecting evalscope==1.8.0 2026-05-28T10:08:08,720 Created temporary directory: /tmp/pip-unpack-j92e9w71 2026-05-28T10:08:08,949 Downloading evalscope-1.8.0.tar.gz (3.2 MB) 2026-05-28T10:08:12,021 Added evalscope==1.8.0 from https://files.pythonhosted.org/packages/c5/7d/497553f73dc4cacfbf736c3f39bf003286a871dd761aaf9d695cd237a89b/evalscope-1.8.0.tar.gz to build tracker '/tmp/pip-build-tracker-f2nuu9mq' 2026-05-28T10:08:12,029 Created temporary directory: /tmp/pip-build-env-et21ad1y 2026-05-28T10:08:12,033 Installing build dependencies: started 2026-05-28T10:08:12,035 Running command pip subprocess to install build dependencies 2026-05-28T10:08:13,169 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-05-28T10:08:13,625 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-05-28T10:08:13,649 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-05-28T10:08:15,406 Collecting setuptools>=69 2026-05-28T10:08:15,407 Obtaining dependency information for setuptools>=69 from https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl.metadata 2026-05-28T10:08:15,422 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl.metadata (6.5 kB) 2026-05-28T10:08:15,661 Collecting wheel 2026-05-28T10:08:15,662 Obtaining dependency information for wheel from https://www.piwheels.org/simple/wheel/wheel-0.47.0-py3-none-any.whl.metadata 2026-05-28T10:08:15,685 Using cached https://www.piwheels.org/simple/wheel/wheel-0.47.0-py3-none-any.whl.metadata (2.3 kB) 2026-05-28T10:08:15,865 Collecting packaging>=24.0 2026-05-28T10:08:15,866 Obtaining dependency information for packaging>=24.0 from https://www.piwheels.org/simple/packaging/packaging-26.2-py3-none-any.whl.metadata 2026-05-28T10:08:15,879 Using cached https://www.piwheels.org/simple/packaging/packaging-26.2-py3-none-any.whl.metadata (3.5 kB) 2026-05-28T10:08:16,067 Using cached https://www.piwheels.org/simple/wheel/wheel-0.47.0-py3-none-any.whl (32 kB) 2026-05-28T10:08:16,092 Using cached https://www.piwheels.org/simple/packaging/packaging-26.2-py3-none-any.whl (100 kB) 2026-05-28T10:08:16,204 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-05-28T10:08:16,242 Using cached https://www.piwheels.org/simple/wheel/wheel-0.47.0-py3-none-any.whl (32 kB) 2026-05-28T10:08:16,270 Using cached https://www.piwheels.org/simple/packaging/packaging-26.2-py3-none-any.whl (100 kB) 2026-05-28T10:08:16,365 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-05-28T10:08:19,109 Installing collected packages: setuptools, packaging, wheel 2026-05-28T10:08:22,497 Creating /tmp/pip-build-env-et21ad1y/overlay/local/bin 2026-05-28T10:08:22,500 changing mode of /tmp/pip-build-env-et21ad1y/overlay/local/bin/wheel to 755 2026-05-28T10:08:22,521 Successfully installed packaging-26.2 setuptools-82.0.1 wheel-0.47.0 2026-05-28T10:08:22,807 Installing build dependencies: finished with status 'done' 2026-05-28T10:08:22,813 Getting requirements to build wheel: started 2026-05-28T10:08:22,815 Running command Getting requirements to build wheel 2026-05-28T10:08:23,701 running egg_info 2026-05-28T10:08:23,707 writing evalscope.egg-info/PKG-INFO 2026-05-28T10:08:23,735 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-05-28T10:08:23,737 writing entry points to evalscope.egg-info/entry_points.txt 2026-05-28T10:08:23,752 writing requirements to evalscope.egg-info/requires.txt 2026-05-28T10:08:23,754 writing top-level names to evalscope.egg-info/top_level.txt 2026-05-28T10:08:24,055 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:24,119 reading manifest template 'MANIFEST.in' 2026-05-28T10:08:24,678 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-05-28T10:08:24,684 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-05-28T10:08:24,691 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-05-28T10:08:24,699 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-05-28T10:08:24,722 warning: no previously-included files matching '*.h5' found anywhere in distribution 2026-05-28T10:08:24,722 warning: no previously-included files matching '*.hdf5' found anywhere in distribution 2026-05-28T10:08:24,723 warning: no previously-included files matching '*.parquet' found anywhere in distribution 2026-05-28T10:08:24,728 warning: no previously-included files matching '*.bin' found anywhere in distribution 2026-05-28T10:08:24,736 warning: no previously-included files matching '*.safetensors' found anywhere in distribution 2026-05-28T10:08:24,743 warning: no previously-included files matching '*.gguf' found anywhere in distribution 2026-05-28T10:08:24,750 warning: no previously-included files matching '*.pth' found anywhere in distribution 2026-05-28T10:08:24,758 warning: no previously-included files matching '*.pt' found anywhere in distribution 2026-05-28T10:08:24,761 no previously-included directories found matching 'evalscope/web/node_modules' 2026-05-28T10:08:24,764 no previously-included directories found matching 'evalscope/web/src' 2026-05-28T10:08:24,767 warning: no previously-included files found matching 'evalscope/web/package.json' 2026-05-28T10:08:24,771 warning: no previously-included files found matching 'evalscope/web/package-lock.json' 2026-05-28T10:08:24,774 warning: no previously-included files found matching 'evalscope/web/tsconfig*.json' 2026-05-28T10:08:24,777 warning: no previously-included files found matching 'evalscope/web/vite.config.ts' 2026-05-28T10:08:24,780 warning: no previously-included files found matching 'evalscope/web/eslint.config.js' 2026-05-28T10:08:24,780 adding license file 'LICENSE' 2026-05-28T10:08:24,852 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:24,954 Getting requirements to build wheel: finished with status 'done' 2026-05-28T10:08:24,957 Created temporary directory: /tmp/pip-modern-metadata-k6mxwtnm 2026-05-28T10:08:24,960 Preparing metadata (pyproject.toml): started 2026-05-28T10:08:24,961 Running command Preparing metadata (pyproject.toml) 2026-05-28T10:08:25,944 running dist_info 2026-05-28T10:08:25,954 creating /tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info 2026-05-28T10:08:25,955 writing /tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/PKG-INFO 2026-05-28T10:08:25,983 writing dependency_links to /tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/dependency_links.txt 2026-05-28T10:08:25,985 writing entry points to /tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/entry_points.txt 2026-05-28T10:08:25,999 writing requirements to /tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/requires.txt 2026-05-28T10:08:26,000 writing top-level names to /tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/top_level.txt 2026-05-28T10:08:26,002 writing manifest file '/tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:26,248 reading manifest file '/tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:26,250 reading manifest template 'MANIFEST.in' 2026-05-28T10:08:26,756 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-05-28T10:08:26,760 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-05-28T10:08:26,765 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-05-28T10:08:26,770 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-05-28T10:08:26,775 warning: no previously-included files matching '*.h5' found anywhere in distribution 2026-05-28T10:08:26,780 warning: no previously-included files matching '*.hdf5' found anywhere in distribution 2026-05-28T10:08:26,785 warning: no previously-included files matching '*.parquet' found anywhere in distribution 2026-05-28T10:08:26,790 warning: no previously-included files matching '*.bin' found anywhere in distribution 2026-05-28T10:08:26,794 warning: no previously-included files matching '*.safetensors' found anywhere in distribution 2026-05-28T10:08:26,799 warning: no previously-included files matching '*.gguf' found anywhere in distribution 2026-05-28T10:08:26,804 warning: no previously-included files matching '*.pth' found anywhere in distribution 2026-05-28T10:08:26,809 warning: no previously-included files matching '*.pt' found anywhere in distribution 2026-05-28T10:08:26,811 no previously-included directories found matching 'evalscope/web/node_modules' 2026-05-28T10:08:26,813 no previously-included directories found matching 'evalscope/web/src' 2026-05-28T10:08:26,815 warning: no previously-included files found matching 'evalscope/web/package.json' 2026-05-28T10:08:26,818 warning: no previously-included files found matching 'evalscope/web/package-lock.json' 2026-05-28T10:08:26,820 warning: no previously-included files found matching 'evalscope/web/tsconfig*.json' 2026-05-28T10:08:26,822 warning: no previously-included files found matching 'evalscope/web/vite.config.ts' 2026-05-28T10:08:26,824 warning: no previously-included files found matching 'evalscope/web/eslint.config.js' 2026-05-28T10:08:26,825 adding license file 'LICENSE' 2026-05-28T10:08:26,876 writing manifest file '/tmp/pip-modern-metadata-k6mxwtnm/evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:26,879 creating '/tmp/pip-modern-metadata-k6mxwtnm/evalscope-1.8.0.dist-info' 2026-05-28T10:08:27,007 Preparing metadata (pyproject.toml): finished with status 'done' 2026-05-28T10:08:27,015 Source in /tmp/pip-wheel-u7bz1_vl/evalscope_93fee4571f184a9ab51ad490564d3d04 has version 1.8.0, which satisfies requirement evalscope==1.8.0 from https://files.pythonhosted.org/packages/c5/7d/497553f73dc4cacfbf736c3f39bf003286a871dd761aaf9d695cd237a89b/evalscope-1.8.0.tar.gz 2026-05-28T10:08:27,016 Removed evalscope==1.8.0 from https://files.pythonhosted.org/packages/c5/7d/497553f73dc4cacfbf736c3f39bf003286a871dd761aaf9d695cd237a89b/evalscope-1.8.0.tar.gz from build tracker '/tmp/pip-build-tracker-f2nuu9mq' 2026-05-28T10:08:27,028 Created temporary directory: /tmp/pip-unpack-ux50vvx2 2026-05-28T10:08:27,029 Building wheels for collected packages: evalscope 2026-05-28T10:08:27,033 Created temporary directory: /tmp/pip-wheel-p75lnt3d 2026-05-28T10:08:27,033 Destination directory: /tmp/pip-wheel-p75lnt3d 2026-05-28T10:08:27,037 Building wheel for evalscope (pyproject.toml): started 2026-05-28T10:08:27,038 Running command Building wheel for evalscope (pyproject.toml) 2026-05-28T10:08:27,791 running bdist_wheel 2026-05-28T10:08:27,808 running build 2026-05-28T10:08:27,809 running build_py 2026-05-28T10:08:27,816 creating build/lib/evalscope 2026-05-28T10:08:27,818 copying evalscope/run.py -> build/lib/evalscope 2026-05-28T10:08:27,821 copying evalscope/arguments.py -> build/lib/evalscope 2026-05-28T10:08:27,824 copying evalscope/constants.py -> build/lib/evalscope 2026-05-28T10:08:27,826 copying evalscope/config.py -> build/lib/evalscope 2026-05-28T10:08:27,829 copying evalscope/version.py -> build/lib/evalscope 2026-05-28T10:08:27,831 copying evalscope/__init__.py -> build/lib/evalscope 2026-05-28T10:08:27,835 creating build/lib/evalscope/benchmarks 2026-05-28T10:08:27,836 copying evalscope/benchmarks/__init__.py -> build/lib/evalscope/benchmarks 2026-05-28T10:08:27,839 creating build/lib/evalscope/filters 2026-05-28T10:08:27,840 copying evalscope/filters/selection.py -> build/lib/evalscope/filters 2026-05-28T10:08:27,843 copying evalscope/filters/extraction.py -> build/lib/evalscope/filters 2026-05-28T10:08:27,845 copying evalscope/filters/__init__.py -> build/lib/evalscope/filters 2026-05-28T10:08:27,848 creating build/lib/evalscope/utils 2026-05-28T10:08:27,850 copying evalscope/utils/data_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,853 copying evalscope/utils/model_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,855 copying evalscope/utils/ner.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,858 copying evalscope/utils/url_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,860 copying evalscope/utils/chat_service.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,863 copying evalscope/utils/deprecation_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,865 copying evalscope/utils/multi_choices.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,867 copying evalscope/utils/json_schema.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,870 copying evalscope/utils/argument_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,872 copying evalscope/utils/resource_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,874 copying evalscope/utils/code_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,877 copying evalscope/utils/function_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,880 copying evalscope/utils/import_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,882 copying evalscope/utils/__init__.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,884 copying evalscope/utils/logger.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,886 copying evalscope/utils/io_utils.py -> build/lib/evalscope/utils 2026-05-28T10:08:27,889 creating build/lib/evalscope/summarizer 2026-05-28T10:08:27,891 copying evalscope/summarizer/summarizer.py -> build/lib/evalscope/summarizer 2026-05-28T10:08:27,893 copying evalscope/summarizer/__init__.py -> build/lib/evalscope/summarizer 2026-05-28T10:08:27,895 creating build/lib/evalscope/evaluator 2026-05-28T10:08:27,896 copying evalscope/evaluator/batch_reviewer.py -> build/lib/evalscope/evaluator 2026-05-28T10:08:27,898 copying evalscope/evaluator/perf_collector.py -> build/lib/evalscope/evaluator 2026-05-28T10:08:27,901 copying evalscope/evaluator/evaluator.py -> build/lib/evalscope/evaluator 2026-05-28T10:08:27,903 copying evalscope/evaluator/__init__.py -> build/lib/evalscope/evaluator 2026-05-28T10:08:27,906 creating build/lib/evalscope/report 2026-05-28T10:08:27,907 copying evalscope/report/report.py -> build/lib/evalscope/report 2026-05-28T10:08:27,909 copying evalscope/report/combinator.py -> build/lib/evalscope/report 2026-05-28T10:08:27,912 copying evalscope/report/visualization.py -> build/lib/evalscope/report 2026-05-28T10:08:27,914 copying evalscope/report/renderer.py -> build/lib/evalscope/report 2026-05-28T10:08:27,917 copying evalscope/report/__init__.py -> build/lib/evalscope/report 2026-05-28T10:08:27,919 copying evalscope/report/generator.py -> build/lib/evalscope/report 2026-05-28T10:08:27,921 creating build/lib/evalscope/collections 2026-05-28T10:08:27,922 copying evalscope/collections/sampler.py -> build/lib/evalscope/collections 2026-05-28T10:08:27,925 copying evalscope/collections/schema.py -> build/lib/evalscope/collections 2026-05-28T10:08:27,927 copying evalscope/collections/__init__.py -> build/lib/evalscope/collections 2026-05-28T10:08:27,930 creating build/lib/evalscope/service 2026-05-28T10:08:27,931 copying evalscope/service/app.py -> build/lib/evalscope/service 2026-05-28T10:08:27,933 copying evalscope/service/__init__.py -> build/lib/evalscope/service 2026-05-28T10:08:27,935 creating build/lib/evalscope/web 2026-05-28T10:08:27,936 copying evalscope/web/__init__.py -> build/lib/evalscope/web 2026-05-28T10:08:27,939 creating build/lib/evalscope/models 2026-05-28T10:08:27,940 copying evalscope/models/openai_compatible.py -> build/lib/evalscope/models 2026-05-28T10:08:27,943 copying evalscope/models/model_apis.py -> build/lib/evalscope/models 2026-05-28T10:08:27,945 copying evalscope/models/image_edit_model.py -> build/lib/evalscope/models 2026-05-28T10:08:27,947 copying evalscope/models/mockllm.py -> build/lib/evalscope/models 2026-05-28T10:08:27,949 copying evalscope/models/litellm_compatible.py -> build/lib/evalscope/models 2026-05-28T10:08:27,951 copying evalscope/models/openai_responses.py -> build/lib/evalscope/models 2026-05-28T10:08:27,954 copying evalscope/models/text2image_model.py -> build/lib/evalscope/models 2026-05-28T10:08:27,956 copying evalscope/models/modelscope.py -> build/lib/evalscope/models 2026-05-28T10:08:27,959 copying evalscope/models/__init__.py -> build/lib/evalscope/models 2026-05-28T10:08:27,961 copying evalscope/models/anthropic_compatible.py -> build/lib/evalscope/models 2026-05-28T10:08:27,963 creating build/lib/evalscope/backend 2026-05-28T10:08:27,965 copying evalscope/backend/base.py -> build/lib/evalscope/backend 2026-05-28T10:08:27,967 copying evalscope/backend/__init__.py -> build/lib/evalscope/backend 2026-05-28T10:08:27,969 creating build/lib/evalscope/third_party 2026-05-28T10:08:27,970 copying evalscope/third_party/__init__.py -> build/lib/evalscope/third_party 2026-05-28T10:08:27,973 creating build/lib/evalscope/cli 2026-05-28T10:08:27,974 copying evalscope/cli/start_perf.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,976 copying evalscope/cli/start_eval.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,978 copying evalscope/cli/benchmark_info.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,980 copying evalscope/cli/cli.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,982 copying evalscope/cli/start_app.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,984 copying evalscope/cli/base.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,985 copying evalscope/cli/__init__.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,987 copying evalscope/cli/start_service.py -> build/lib/evalscope/cli 2026-05-28T10:08:27,990 creating build/lib/evalscope/api 2026-05-28T10:08:27,990 copying evalscope/api/registry.py -> build/lib/evalscope/api 2026-05-28T10:08:27,993 copying evalscope/api/__init__.py -> build/lib/evalscope/api 2026-05-28T10:08:27,995 creating build/lib/evalscope/agent 2026-05-28T10:08:27,996 copying evalscope/agent/runner.py -> build/lib/evalscope/agent 2026-05-28T10:08:27,998 copying evalscope/agent/__init__.py -> build/lib/evalscope/agent 2026-05-28T10:08:28,001 creating build/lib/evalscope/perf 2026-05-28T10:08:28,001 copying evalscope/perf/benchmark.py -> build/lib/evalscope/perf 2026-05-28T10:08:28,004 copying evalscope/perf/arguments.py -> build/lib/evalscope/perf 2026-05-28T10:08:28,007 copying evalscope/perf/multi_turn_benchmark.py -> build/lib/evalscope/perf 2026-05-28T10:08:28,009 copying evalscope/perf/__init__.py -> build/lib/evalscope/perf 2026-05-28T10:08:28,011 copying evalscope/perf/main.py -> build/lib/evalscope/perf 2026-05-28T10:08:28,013 copying evalscope/perf/multi_turn_args.py -> build/lib/evalscope/perf 2026-05-28T10:08:28,016 creating build/lib/evalscope/metrics 2026-05-28T10:08:28,017 copying evalscope/metrics/math_parser.py -> build/lib/evalscope/metrics 2026-05-28T10:08:28,020 copying evalscope/metrics/rouge_metric.py -> build/lib/evalscope/metrics 2026-05-28T10:08:28,022 copying evalscope/metrics/llm_judge.py -> build/lib/evalscope/metrics 2026-05-28T10:08:28,024 copying evalscope/metrics/__init__.py -> build/lib/evalscope/metrics 2026-05-28T10:08:28,026 copying evalscope/metrics/metric.py -> build/lib/evalscope/metrics 2026-05-28T10:08:28,029 copying evalscope/metrics/metrics.py -> build/lib/evalscope/metrics 2026-05-28T10:08:28,032 creating build/lib/evalscope/benchmarks/process_bench 2026-05-28T10:08:28,033 copying evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/lib/evalscope/benchmarks/process_bench 2026-05-28T10:08:28,035 copying evalscope/benchmarks/process_bench/__init__.py -> build/lib/evalscope/benchmarks/process_bench 2026-05-28T10:08:28,037 creating build/lib/evalscope/benchmarks/math_qa 2026-05-28T10:08:28,038 copying evalscope/benchmarks/math_qa/__init__.py -> build/lib/evalscope/benchmarks/math_qa 2026-05-28T10:08:28,040 copying evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/lib/evalscope/benchmarks/math_qa 2026-05-28T10:08:28,042 creating build/lib/evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:28,043 copying evalscope/benchmarks/olympiad_bench/utils.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:28,046 copying evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:28,048 copying evalscope/benchmarks/olympiad_bench/__init__.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:28,050 creating build/lib/evalscope/benchmarks/commonsense_qa 2026-05-28T10:08:28,051 copying evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-05-28T10:08:28,053 copying evalscope/benchmarks/commonsense_qa/__init__.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-05-28T10:08:28,055 creating build/lib/evalscope/benchmarks/mmlu_pro 2026-05-28T10:08:28,056 copying evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-05-28T10:08:28,059 copying evalscope/benchmarks/mmlu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-05-28T10:08:28,061 creating build/lib/evalscope/benchmarks/hellaswag 2026-05-28T10:08:28,062 copying evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/lib/evalscope/benchmarks/hellaswag 2026-05-28T10:08:28,064 copying evalscope/benchmarks/hellaswag/__init__.py -> build/lib/evalscope/benchmarks/hellaswag 2026-05-28T10:08:28,066 creating build/lib/evalscope/benchmarks/frames 2026-05-28T10:08:28,067 copying evalscope/benchmarks/frames/frames_adapter.py -> build/lib/evalscope/benchmarks/frames 2026-05-28T10:08:28,069 copying evalscope/benchmarks/frames/utils.py -> build/lib/evalscope/benchmarks/frames 2026-05-28T10:08:28,071 copying evalscope/benchmarks/frames/__init__.py -> build/lib/evalscope/benchmarks/frames 2026-05-28T10:08:28,073 creating build/lib/evalscope/benchmarks/general_arena 2026-05-28T10:08:28,074 copying evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/lib/evalscope/benchmarks/general_arena 2026-05-28T10:08:28,077 copying evalscope/benchmarks/general_arena/utils.py -> build/lib/evalscope/benchmarks/general_arena 2026-05-28T10:08:28,079 copying evalscope/benchmarks/general_arena/__init__.py -> build/lib/evalscope/benchmarks/general_arena 2026-05-28T10:08:28,081 creating build/lib/evalscope/benchmarks/micro_vqa 2026-05-28T10:08:28,082 copying evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-05-28T10:08:28,084 copying evalscope/benchmarks/micro_vqa/__init__.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-05-28T10:08:28,087 creating build/lib/evalscope/benchmarks/math_vista 2026-05-28T10:08:28,087 copying evalscope/benchmarks/math_vista/__init__.py -> build/lib/evalscope/benchmarks/math_vista 2026-05-28T10:08:28,089 copying evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/lib/evalscope/benchmarks/math_vista 2026-05-28T10:08:28,092 creating build/lib/evalscope/benchmarks/competition_math 2026-05-28T10:08:28,093 copying evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/lib/evalscope/benchmarks/competition_math 2026-05-28T10:08:28,095 copying evalscope/benchmarks/competition_math/__init__.py -> build/lib/evalscope/benchmarks/competition_math 2026-05-28T10:08:28,097 creating build/lib/evalscope/benchmarks/ai2d 2026-05-28T10:08:28,098 copying evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/lib/evalscope/benchmarks/ai2d 2026-05-28T10:08:28,101 copying evalscope/benchmarks/ai2d/__init__.py -> build/lib/evalscope/benchmarks/ai2d 2026-05-28T10:08:28,102 creating build/lib/evalscope/benchmarks/cmmlu 2026-05-28T10:08:28,103 copying evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/lib/evalscope/benchmarks/cmmlu 2026-05-28T10:08:28,106 copying evalscope/benchmarks/cmmlu/__init__.py -> build/lib/evalscope/benchmarks/cmmlu 2026-05-28T10:08:28,108 creating build/lib/evalscope/benchmarks/mm_star 2026-05-28T10:08:28,109 copying evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/lib/evalscope/benchmarks/mm_star 2026-05-28T10:08:28,111 copying evalscope/benchmarks/mm_star/__init__.py -> build/lib/evalscope/benchmarks/mm_star 2026-05-28T10:08:28,113 creating build/lib/evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:28,114 copying evalscope/benchmarks/swe_bench_pro/utils.py -> build/lib/evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:28,117 copying evalscope/benchmarks/swe_bench_pro/swe_bench_pro_agentic_adapter.py -> build/lib/evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:28,119 copying evalscope/benchmarks/swe_bench_pro/__init__.py -> build/lib/evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:28,122 creating build/lib/evalscope/benchmarks/mia_bench 2026-05-28T10:08:28,123 copying evalscope/benchmarks/mia_bench/utils.py -> build/lib/evalscope/benchmarks/mia_bench 2026-05-28T10:08:28,125 copying evalscope/benchmarks/mia_bench/__init__.py -> build/lib/evalscope/benchmarks/mia_bench 2026-05-28T10:08:28,127 copying evalscope/benchmarks/mia_bench/mia_bench_adapter.py -> build/lib/evalscope/benchmarks/mia_bench 2026-05-28T10:08:28,130 creating build/lib/evalscope/benchmarks/siqa 2026-05-28T10:08:28,131 copying evalscope/benchmarks/siqa/siqa_adapter.py -> build/lib/evalscope/benchmarks/siqa 2026-05-28T10:08:28,133 copying evalscope/benchmarks/siqa/__init__.py -> build/lib/evalscope/benchmarks/siqa 2026-05-28T10:08:28,135 creating build/lib/evalscope/benchmarks/pope 2026-05-28T10:08:28,135 copying evalscope/benchmarks/pope/pope_adapter.py -> build/lib/evalscope/benchmarks/pope 2026-05-28T10:08:28,138 copying evalscope/benchmarks/pope/__init__.py -> build/lib/evalscope/benchmarks/pope 2026-05-28T10:08:28,140 creating build/lib/evalscope/benchmarks/general_mcq 2026-05-28T10:08:28,141 copying evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/lib/evalscope/benchmarks/general_mcq 2026-05-28T10:08:28,143 copying evalscope/benchmarks/general_mcq/__init__.py -> build/lib/evalscope/benchmarks/general_mcq 2026-05-28T10:08:28,145 creating build/lib/evalscope/benchmarks/tir_bench 2026-05-28T10:08:28,146 copying evalscope/benchmarks/tir_bench/utils.py -> build/lib/evalscope/benchmarks/tir_bench 2026-05-28T10:08:28,148 copying evalscope/benchmarks/tir_bench/__init__.py -> build/lib/evalscope/benchmarks/tir_bench 2026-05-28T10:08:28,150 copying evalscope/benchmarks/tir_bench/tir_bench_adapter.py -> build/lib/evalscope/benchmarks/tir_bench 2026-05-28T10:08:28,153 creating build/lib/evalscope/benchmarks/image_edit 2026-05-28T10:08:28,154 copying evalscope/benchmarks/image_edit/__init__.py -> build/lib/evalscope/benchmarks/image_edit 2026-05-28T10:08:28,156 creating build/lib/evalscope/benchmarks/multi_if 2026-05-28T10:08:28,157 copying evalscope/benchmarks/multi_if/ifeval.py -> build/lib/evalscope/benchmarks/multi_if 2026-05-28T10:08:28,160 copying evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/lib/evalscope/benchmarks/multi_if 2026-05-28T10:08:28,163 copying evalscope/benchmarks/multi_if/__init__.py -> build/lib/evalscope/benchmarks/multi_if 2026-05-28T10:08:28,164 copying evalscope/benchmarks/multi_if/metrics.py -> build/lib/evalscope/benchmarks/multi_if 2026-05-28T10:08:28,167 creating build/lib/evalscope/benchmarks/mmmu 2026-05-28T10:08:28,168 copying evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/lib/evalscope/benchmarks/mmmu 2026-05-28T10:08:28,171 copying evalscope/benchmarks/mmmu/__init__.py -> build/lib/evalscope/benchmarks/mmmu 2026-05-28T10:08:28,173 creating build/lib/evalscope/benchmarks/arxivrollbench 2026-05-28T10:08:28,174 copying evalscope/benchmarks/arxivrollbench/arxivrollbench_adapter.py -> build/lib/evalscope/benchmarks/arxivrollbench 2026-05-28T10:08:28,176 copying evalscope/benchmarks/arxivrollbench/__init__.py -> build/lib/evalscope/benchmarks/arxivrollbench 2026-05-28T10:08:28,178 creating build/lib/evalscope/benchmarks/winogrande 2026-05-28T10:08:28,179 copying evalscope/benchmarks/winogrande/__init__.py -> build/lib/evalscope/benchmarks/winogrande 2026-05-28T10:08:28,181 copying evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/lib/evalscope/benchmarks/winogrande 2026-05-28T10:08:28,183 creating build/lib/evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:28,184 copying evalscope/benchmarks/zebralogicbench/utils.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:28,187 copying evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:28,189 copying evalscope/benchmarks/zebralogicbench/__init__.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:28,191 creating build/lib/evalscope/benchmarks/wmt 2026-05-28T10:08:28,192 copying evalscope/benchmarks/wmt/wmt24_adapter.py -> build/lib/evalscope/benchmarks/wmt 2026-05-28T10:08:28,195 copying evalscope/benchmarks/wmt/__init__.py -> build/lib/evalscope/benchmarks/wmt 2026-05-28T10:08:28,197 creating build/lib/evalscope/benchmarks/general_vqa 2026-05-28T10:08:28,198 copying evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/lib/evalscope/benchmarks/general_vqa 2026-05-28T10:08:28,200 copying evalscope/benchmarks/general_vqa/__init__.py -> build/lib/evalscope/benchmarks/general_vqa 2026-05-28T10:08:28,202 creating build/lib/evalscope/benchmarks/refcoco 2026-05-28T10:08:28,203 copying evalscope/benchmarks/refcoco/utils.py -> build/lib/evalscope/benchmarks/refcoco 2026-05-28T10:08:28,205 copying evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/lib/evalscope/benchmarks/refcoco 2026-05-28T10:08:28,207 copying evalscope/benchmarks/refcoco/__init__.py -> build/lib/evalscope/benchmarks/refcoco 2026-05-28T10:08:28,209 copying evalscope/benchmarks/refcoco/evaluation_lib.py -> build/lib/evalscope/benchmarks/refcoco 2026-05-28T10:08:28,211 creating build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,212 copying evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,214 copying evalscope/benchmarks/text2image/tifa_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,216 copying evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,218 copying evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,220 copying evalscope/benchmarks/text2image/__init__.py -> build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,221 copying evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-05-28T10:08:28,224 creating build/lib/evalscope/benchmarks/maritime_bench 2026-05-28T10:08:28,225 copying evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-05-28T10:08:28,227 copying evalscope/benchmarks/maritime_bench/__init__.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-05-28T10:08:28,229 creating build/lib/evalscope/benchmarks/mmmlu 2026-05-28T10:08:28,230 copying evalscope/benchmarks/mmmlu/__init__.py -> build/lib/evalscope/benchmarks/mmmlu 2026-05-28T10:08:28,232 copying evalscope/benchmarks/mmmlu/prompt.py -> build/lib/evalscope/benchmarks/mmmlu 2026-05-28T10:08:28,234 copying evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/lib/evalscope/benchmarks/mmmlu 2026-05-28T10:08:28,237 creating build/lib/evalscope/benchmarks/science_qa 2026-05-28T10:08:28,238 copying evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/lib/evalscope/benchmarks/science_qa 2026-05-28T10:08:28,240 copying evalscope/benchmarks/science_qa/__init__.py -> build/lib/evalscope/benchmarks/science_qa 2026-05-28T10:08:28,242 creating build/lib/evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:28,243 copying evalscope/benchmarks/kimi_verifier/kimi_verifier_adapter.py -> build/lib/evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:28,246 copying evalscope/benchmarks/kimi_verifier/param_spec.py -> build/lib/evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:28,247 copying evalscope/benchmarks/kimi_verifier/__init__.py -> build/lib/evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:28,249 creating build/lib/evalscope/benchmarks/math_verse 2026-05-28T10:08:28,250 copying evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/lib/evalscope/benchmarks/math_verse 2026-05-28T10:08:28,253 copying evalscope/benchmarks/math_verse/__init__.py -> build/lib/evalscope/benchmarks/math_verse 2026-05-28T10:08:28,255 creating build/lib/evalscope/benchmarks/gaia 2026-05-28T10:08:28,256 copying evalscope/benchmarks/gaia/scorer.py -> build/lib/evalscope/benchmarks/gaia 2026-05-28T10:08:28,258 copying evalscope/benchmarks/gaia/gaia_adapter.py -> build/lib/evalscope/benchmarks/gaia 2026-05-28T10:08:28,260 copying evalscope/benchmarks/gaia/__init__.py -> build/lib/evalscope/benchmarks/gaia 2026-05-28T10:08:28,262 creating build/lib/evalscope/benchmarks/cmmu 2026-05-28T10:08:28,263 copying evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmu 2026-05-28T10:08:28,266 copying evalscope/benchmarks/cmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmu 2026-05-28T10:08:28,267 copying evalscope/benchmarks/cmmu/prompt.py -> build/lib/evalscope/benchmarks/cmmu 2026-05-28T10:08:28,269 creating build/lib/evalscope/benchmarks/hle 2026-05-28T10:08:28,270 copying evalscope/benchmarks/hle/hle_adapter.py -> build/lib/evalscope/benchmarks/hle 2026-05-28T10:08:28,272 copying evalscope/benchmarks/hle/__init__.py -> build/lib/evalscope/benchmarks/hle 2026-05-28T10:08:28,274 creating build/lib/evalscope/benchmarks/trivia_qa 2026-05-28T10:08:28,275 copying evalscope/benchmarks/trivia_qa/__init__.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-05-28T10:08:28,277 copying evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-05-28T10:08:28,279 creating build/lib/evalscope/benchmarks/longbench_v2 2026-05-28T10:08:28,280 copying evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-05-28T10:08:28,282 copying evalscope/benchmarks/longbench_v2/__init__.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-05-28T10:08:28,284 creating build/lib/evalscope/benchmarks/mri_mcqa 2026-05-28T10:08:28,285 copying evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-05-28T10:08:28,287 copying evalscope/benchmarks/mri_mcqa/__init__.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-05-28T10:08:28,289 creating build/lib/evalscope/benchmarks/qasc 2026-05-28T10:08:28,290 copying evalscope/benchmarks/qasc/qasc_adapter.py -> build/lib/evalscope/benchmarks/qasc 2026-05-28T10:08:28,292 copying evalscope/benchmarks/qasc/__init__.py -> build/lib/evalscope/benchmarks/qasc 2026-05-28T10:08:28,294 creating build/lib/evalscope/benchmarks/mbpp 2026-05-28T10:08:28,295 copying evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/lib/evalscope/benchmarks/mbpp 2026-05-28T10:08:28,298 copying evalscope/benchmarks/mbpp/__init__.py -> build/lib/evalscope/benchmarks/mbpp 2026-05-28T10:08:28,300 creating build/lib/evalscope/benchmarks/infovqa 2026-05-28T10:08:28,301 copying evalscope/benchmarks/infovqa/__init__.py -> build/lib/evalscope/benchmarks/infovqa 2026-05-28T10:08:28,302 copying evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/lib/evalscope/benchmarks/infovqa 2026-05-28T10:08:28,305 creating build/lib/evalscope/benchmarks/mm_bench 2026-05-28T10:08:28,306 copying evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/lib/evalscope/benchmarks/mm_bench 2026-05-28T10:08:28,308 copying evalscope/benchmarks/mm_bench/__init__.py -> build/lib/evalscope/benchmarks/mm_bench 2026-05-28T10:08:28,310 creating build/lib/evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:28,311 copying evalscope/benchmarks/minimax_verifier/_validators.py -> build/lib/evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:28,314 copying evalscope/benchmarks/minimax_verifier/minimax_verifier_adapter.py -> build/lib/evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:28,317 copying evalscope/benchmarks/minimax_verifier/__init__.py -> build/lib/evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:28,319 creating build/lib/evalscope/benchmarks/aa_lcr 2026-05-28T10:08:28,320 copying evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-05-28T10:08:28,322 copying evalscope/benchmarks/aa_lcr/__init__.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-05-28T10:08:28,324 creating build/lib/evalscope/benchmarks/arc 2026-05-28T10:08:28,325 copying evalscope/benchmarks/arc/arc_adapter.py -> build/lib/evalscope/benchmarks/arc 2026-05-28T10:08:28,327 copying evalscope/benchmarks/arc/__init__.py -> build/lib/evalscope/benchmarks/arc 2026-05-28T10:08:28,330 creating build/lib/evalscope/benchmarks/mmlu 2026-05-28T10:08:28,331 copying evalscope/benchmarks/mmlu/__init__.py -> build/lib/evalscope/benchmarks/mmlu 2026-05-28T10:08:28,332 copying evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/lib/evalscope/benchmarks/mmlu 2026-05-28T10:08:28,335 creating build/lib/evalscope/benchmarks/piqa 2026-05-28T10:08:28,336 copying evalscope/benchmarks/piqa/piqa_adapter.py -> build/lib/evalscope/benchmarks/piqa 2026-05-28T10:08:28,338 copying evalscope/benchmarks/piqa/__init__.py -> build/lib/evalscope/benchmarks/piqa 2026-05-28T10:08:28,340 creating build/lib/evalscope/benchmarks/needle_haystack 2026-05-28T10:08:28,342 copying evalscope/benchmarks/needle_haystack/utils.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-05-28T10:08:28,344 copying evalscope/benchmarks/needle_haystack/__init__.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-05-28T10:08:28,346 copying evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-05-28T10:08:28,350 creating build/lib/evalscope/benchmarks/scicode 2026-05-28T10:08:28,351 copying evalscope/benchmarks/scicode/scicode_adapter.py -> build/lib/evalscope/benchmarks/scicode 2026-05-28T10:08:28,353 copying evalscope/benchmarks/scicode/util.py -> build/lib/evalscope/benchmarks/scicode 2026-05-28T10:08:28,356 copying evalscope/benchmarks/scicode/prompt_templates.py -> build/lib/evalscope/benchmarks/scicode 2026-05-28T10:08:28,358 copying evalscope/benchmarks/scicode/__init__.py -> build/lib/evalscope/benchmarks/scicode 2026-05-28T10:08:28,361 creating build/lib/evalscope/benchmarks/musr 2026-05-28T10:08:28,362 copying evalscope/benchmarks/musr/musr_adapter.py -> build/lib/evalscope/benchmarks/musr 2026-05-28T10:08:28,364 copying evalscope/benchmarks/musr/__init__.py -> build/lib/evalscope/benchmarks/musr 2026-05-28T10:08:28,367 creating build/lib/evalscope/benchmarks/humanevalplus 2026-05-28T10:08:28,368 copying evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-05-28T10:08:28,371 copying evalscope/benchmarks/humanevalplus/__init__.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-05-28T10:08:28,373 creating build/lib/evalscope/benchmarks/logi_qa 2026-05-28T10:08:28,374 copying evalscope/benchmarks/logi_qa/__int__.py -> build/lib/evalscope/benchmarks/logi_qa 2026-05-28T10:08:28,377 copying evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/lib/evalscope/benchmarks/logi_qa 2026-05-28T10:08:28,380 creating build/lib/evalscope/benchmarks/gsm8k_v 2026-05-28T10:08:28,381 copying evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-05-28T10:08:28,383 copying evalscope/benchmarks/gsm8k_v/__init__.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-05-28T10:08:28,386 creating build/lib/evalscope/benchmarks/chartqa 2026-05-28T10:08:28,387 copying evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/lib/evalscope/benchmarks/chartqa 2026-05-28T10:08:28,390 copying evalscope/benchmarks/chartqa/utils.py -> build/lib/evalscope/benchmarks/chartqa 2026-05-28T10:08:28,392 copying evalscope/benchmarks/chartqa/__init__.py -> build/lib/evalscope/benchmarks/chartqa 2026-05-28T10:08:28,395 creating build/lib/evalscope/benchmarks/amc 2026-05-28T10:08:28,396 copying evalscope/benchmarks/amc/amc_adapter.py -> build/lib/evalscope/benchmarks/amc 2026-05-28T10:08:28,398 copying evalscope/benchmarks/amc/__init__.py -> build/lib/evalscope/benchmarks/amc 2026-05-28T10:08:28,401 creating build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,403 copying evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,405 copying evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,408 copying evalscope/benchmarks/live_code_bench/prompts.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,411 copying evalscope/benchmarks/live_code_bench/extract_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,413 copying evalscope/benchmarks/live_code_bench/load_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,415 copying evalscope/benchmarks/live_code_bench/testing_util.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,418 copying evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,421 copying evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,424 copying evalscope/benchmarks/live_code_bench/__init__.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:28,426 creating build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,427 copying evalscope/benchmarks/ifbench/instructions_util.py -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,431 copying evalscope/benchmarks/ifbench/instructions.py -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,435 copying evalscope/benchmarks/ifbench/instructions_registry.py -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,438 copying evalscope/benchmarks/ifbench/__init__.py -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,439 copying evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,441 copying evalscope/benchmarks/ifbench/evaluation_lib.py -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:28,444 creating build/lib/evalscope/benchmarks/chinese_simple_qa 2026-05-28T10:08:28,445 copying evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-05-28T10:08:28,447 copying evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-05-28T10:08:28,449 creating build/lib/evalscope/benchmarks/terminal_bench 2026-05-28T10:08:28,450 copying evalscope/benchmarks/terminal_bench/utils.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-05-28T10:08:28,452 copying evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-05-28T10:08:28,455 copying evalscope/benchmarks/terminal_bench/__init__.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-05-28T10:08:28,457 creating build/lib/evalscope/benchmarks/super_gpqa 2026-05-28T10:08:28,458 copying evalscope/benchmarks/super_gpqa/utils.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-05-28T10:08:28,460 copying evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-05-28T10:08:28,462 copying evalscope/benchmarks/super_gpqa/__init__.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-05-28T10:08:28,464 copying evalscope/benchmarks/super_gpqa/prompt.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-05-28T10:08:28,466 creating build/lib/evalscope/benchmarks/fleurs 2026-05-28T10:08:28,467 copying evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/lib/evalscope/benchmarks/fleurs 2026-05-28T10:08:28,470 copying evalscope/benchmarks/fleurs/__init__.py -> build/lib/evalscope/benchmarks/fleurs 2026-05-28T10:08:28,472 creating build/lib/evalscope/benchmarks/k2_verifier 2026-05-28T10:08:28,473 copying evalscope/benchmarks/k2_verifier/k2_verifier_adapter.py -> build/lib/evalscope/benchmarks/k2_verifier 2026-05-28T10:08:28,475 copying evalscope/benchmarks/k2_verifier/__init__.py -> build/lib/evalscope/benchmarks/k2_verifier 2026-05-28T10:08:28,477 creating build/lib/evalscope/benchmarks/mvbench 2026-05-28T10:08:28,478 copying evalscope/benchmarks/mvbench/mvbench_adapter.py -> build/lib/evalscope/benchmarks/mvbench 2026-05-28T10:08:28,481 copying evalscope/benchmarks/mvbench/__init__.py -> build/lib/evalscope/benchmarks/mvbench 2026-05-28T10:08:28,483 creating build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:28,484 copying evalscope/benchmarks/swe_bench/utils.py -> build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:28,486 copying evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:28,489 copying evalscope/benchmarks/swe_bench/swe_bench_agentic_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:28,491 copying evalscope/benchmarks/swe_bench/build_images.py -> build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:28,493 copying evalscope/benchmarks/swe_bench/__init__.py -> build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:28,496 creating build/lib/evalscope/benchmarks/drop 2026-05-28T10:08:28,497 copying evalscope/benchmarks/drop/utils.py -> build/lib/evalscope/benchmarks/drop 2026-05-28T10:08:28,499 copying evalscope/benchmarks/drop/drop_adapter.py -> build/lib/evalscope/benchmarks/drop 2026-05-28T10:08:28,501 copying evalscope/benchmarks/drop/__init__.py -> build/lib/evalscope/benchmarks/drop 2026-05-28T10:08:28,503 creating build/lib/evalscope/benchmarks/ceval 2026-05-28T10:08:28,504 copying evalscope/benchmarks/ceval/ceval_adapter.py -> build/lib/evalscope/benchmarks/ceval 2026-05-28T10:08:28,507 copying evalscope/benchmarks/ceval/__init__.py -> build/lib/evalscope/benchmarks/ceval 2026-05-28T10:08:28,509 creating build/lib/evalscope/benchmarks/bbh 2026-05-28T10:08:28,510 copying evalscope/benchmarks/bbh/bbh_adapter.py -> build/lib/evalscope/benchmarks/bbh 2026-05-28T10:08:28,512 copying evalscope/benchmarks/bbh/__init__.py -> build/lib/evalscope/benchmarks/bbh 2026-05-28T10:08:28,515 creating build/lib/evalscope/benchmarks/math_500 2026-05-28T10:08:28,515 copying evalscope/benchmarks/math_500/math_500_adapter.py -> build/lib/evalscope/benchmarks/math_500 2026-05-28T10:08:28,518 copying evalscope/benchmarks/math_500/__init__.py -> build/lib/evalscope/benchmarks/math_500 2026-05-28T10:08:28,519 creating build/lib/evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:28,521 copying evalscope/benchmarks/openai_mrcr/utils.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:28,522 copying evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:28,525 copying evalscope/benchmarks/openai_mrcr/__init__.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:28,527 creating build/lib/evalscope/benchmarks/omni_bench 2026-05-28T10:08:28,528 copying evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/lib/evalscope/benchmarks/omni_bench 2026-05-28T10:08:28,530 copying evalscope/benchmarks/omni_bench/__init__.py -> build/lib/evalscope/benchmarks/omni_bench 2026-05-28T10:08:28,532 creating build/lib/evalscope/benchmarks/data_collection 2026-05-28T10:08:28,533 copying evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/lib/evalscope/benchmarks/data_collection 2026-05-28T10:08:28,536 copying evalscope/benchmarks/data_collection/__init__.py -> build/lib/evalscope/benchmarks/data_collection 2026-05-28T10:08:28,538 creating build/lib/evalscope/benchmarks/gsm8k 2026-05-28T10:08:28,539 copying evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/lib/evalscope/benchmarks/gsm8k 2026-05-28T10:08:28,541 copying evalscope/benchmarks/gsm8k/__init__.py -> build/lib/evalscope/benchmarks/gsm8k 2026-05-28T10:08:28,543 creating build/lib/evalscope/benchmarks/cmmmu 2026-05-28T10:08:28,544 copying evalscope/benchmarks/cmmmu/utils.py -> build/lib/evalscope/benchmarks/cmmmu 2026-05-28T10:08:28,547 copying evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmmu 2026-05-28T10:08:28,549 copying evalscope/benchmarks/cmmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmmu 2026-05-28T10:08:28,551 creating build/lib/evalscope/benchmarks/hmmt25 2026-05-28T10:08:28,552 copying evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/lib/evalscope/benchmarks/hmmt25 2026-05-28T10:08:28,555 creating build/lib/evalscope/benchmarks/mbppplus 2026-05-28T10:08:28,556 copying evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/lib/evalscope/benchmarks/mbppplus 2026-05-28T10:08:28,559 copying evalscope/benchmarks/mbppplus/__init__.py -> build/lib/evalscope/benchmarks/mbppplus 2026-05-28T10:08:28,560 creating build/lib/evalscope/benchmarks/blink 2026-05-28T10:08:28,561 copying evalscope/benchmarks/blink/blink_adapter.py -> build/lib/evalscope/benchmarks/blink 2026-05-28T10:08:28,563 copying evalscope/benchmarks/blink/__init__.py -> build/lib/evalscope/benchmarks/blink 2026-05-28T10:08:28,565 creating build/lib/evalscope/benchmarks/sciq 2026-05-28T10:08:28,566 copying evalscope/benchmarks/sciq/sciq_adapter.py -> build/lib/evalscope/benchmarks/sciq 2026-05-28T10:08:28,568 copying evalscope/benchmarks/sciq/__init__.py -> build/lib/evalscope/benchmarks/sciq 2026-05-28T10:08:28,570 creating build/lib/evalscope/benchmarks/eq_bench 2026-05-28T10:08:28,571 copying evalscope/benchmarks/eq_bench/__init__.py -> build/lib/evalscope/benchmarks/eq_bench 2026-05-28T10:08:28,573 copying evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/lib/evalscope/benchmarks/eq_bench 2026-05-28T10:08:28,575 copying evalscope/benchmarks/eq_bench/answer_validation.py -> build/lib/evalscope/benchmarks/eq_bench 2026-05-28T10:08:28,578 creating build/lib/evalscope/benchmarks/iquiz 2026-05-28T10:08:28,579 copying evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/lib/evalscope/benchmarks/iquiz 2026-05-28T10:08:28,581 copying evalscope/benchmarks/iquiz/__init__.py -> build/lib/evalscope/benchmarks/iquiz 2026-05-28T10:08:28,583 creating build/lib/evalscope/benchmarks/bfcl 2026-05-28T10:08:28,584 copying evalscope/benchmarks/bfcl/__init__.py -> build/lib/evalscope/benchmarks/bfcl 2026-05-28T10:08:28,586 creating build/lib/evalscope/benchmarks/multipl_e 2026-05-28T10:08:28,587 copying evalscope/benchmarks/multipl_e/utils.py -> build/lib/evalscope/benchmarks/multipl_e 2026-05-28T10:08:28,589 copying evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-05-28T10:08:28,592 copying evalscope/benchmarks/multipl_e/__init__.py -> build/lib/evalscope/benchmarks/multipl_e 2026-05-28T10:08:28,593 copying evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-05-28T10:08:28,596 creating build/lib/evalscope/benchmarks/ocr_bench 2026-05-28T10:08:28,597 copying evalscope/benchmarks/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench 2026-05-28T10:08:28,599 creating build/lib/evalscope/benchmarks/halu_eval 2026-05-28T10:08:28,600 copying evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/lib/evalscope/benchmarks/halu_eval 2026-05-28T10:08:28,603 copying evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/lib/evalscope/benchmarks/halu_eval 2026-05-28T10:08:28,605 copying evalscope/benchmarks/halu_eval/__init__.py -> build/lib/evalscope/benchmarks/halu_eval 2026-05-28T10:08:28,607 creating build/lib/evalscope/benchmarks/mmmu_pro 2026-05-28T10:08:28,608 copying evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-05-28T10:08:28,610 copying evalscope/benchmarks/mmmu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-05-28T10:08:28,612 creating build/lib/evalscope/benchmarks/simple_qa 2026-05-28T10:08:28,613 copying evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/lib/evalscope/benchmarks/simple_qa 2026-05-28T10:08:28,616 copying evalscope/benchmarks/simple_qa/__init__.py -> build/lib/evalscope/benchmarks/simple_qa 2026-05-28T10:08:28,618 creating build/lib/evalscope/benchmarks/med_mcqa 2026-05-28T10:08:28,619 copying evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-05-28T10:08:28,621 copying evalscope/benchmarks/med_mcqa/__init__.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-05-28T10:08:28,624 creating build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,625 copying evalscope/benchmarks/ner/jnlpba_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,627 copying evalscope/benchmarks/ner/conll2003_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,629 copying evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,631 copying evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,633 copying evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,635 copying evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,636 copying evalscope/benchmarks/ner/conllpp_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,638 copying evalscope/benchmarks/ner/bc2gm_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,640 copying evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,642 copying evalscope/benchmarks/ner/genia_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,644 copying evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,646 copying evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,647 copying evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,649 copying evalscope/benchmarks/ner/cross_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,651 copying evalscope/benchmarks/ner/copious_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,653 copying evalscope/benchmarks/ner/anat_em_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,655 copying evalscope/benchmarks/ner/fin_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,657 copying evalscope/benchmarks/ner/wnut2017_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,659 copying evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,661 copying evalscope/benchmarks/ner/ncbi_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,663 copying evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,665 copying evalscope/benchmarks/ner/__init__.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,668 copying evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-05-28T10:08:28,670 creating build/lib/evalscope/benchmarks/docvqa 2026-05-28T10:08:28,671 copying evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/lib/evalscope/benchmarks/docvqa 2026-05-28T10:08:28,673 copying evalscope/benchmarks/docvqa/__init__.py -> build/lib/evalscope/benchmarks/docvqa 2026-05-28T10:08:28,675 creating build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:28,677 copying evalscope/benchmarks/omnidoc_bench/utils.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:28,680 copying evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:28,683 copying evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:28,685 copying evalscope/benchmarks/omnidoc_bench/__init__.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:28,686 copying evalscope/benchmarks/omnidoc_bench/metrics.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:28,690 creating build/lib/evalscope/benchmarks/humaneval 2026-05-28T10:08:28,691 copying evalscope/benchmarks/humaneval/utils.py -> build/lib/evalscope/benchmarks/humaneval 2026-05-28T10:08:28,693 copying evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/lib/evalscope/benchmarks/humaneval 2026-05-28T10:08:28,695 copying evalscope/benchmarks/humaneval/__init__.py -> build/lib/evalscope/benchmarks/humaneval 2026-05-28T10:08:28,698 creating build/lib/evalscope/benchmarks/real_world_qa 2026-05-28T10:08:28,699 copying evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-05-28T10:08:28,701 copying evalscope/benchmarks/real_world_qa/__init__.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-05-28T10:08:28,702 creating build/lib/evalscope/benchmarks/arena_hard 2026-05-28T10:08:28,703 copying evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/lib/evalscope/benchmarks/arena_hard 2026-05-28T10:08:28,706 copying evalscope/benchmarks/arena_hard/utils.py -> build/lib/evalscope/benchmarks/arena_hard 2026-05-28T10:08:28,708 copying evalscope/benchmarks/arena_hard/__init__.py -> build/lib/evalscope/benchmarks/arena_hard 2026-05-28T10:08:28,710 creating build/lib/evalscope/benchmarks/tool_bench 2026-05-28T10:08:28,711 copying evalscope/benchmarks/tool_bench/utils.py -> build/lib/evalscope/benchmarks/tool_bench 2026-05-28T10:08:28,713 copying evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/lib/evalscope/benchmarks/tool_bench 2026-05-28T10:08:28,716 copying evalscope/benchmarks/tool_bench/__init__.py -> build/lib/evalscope/benchmarks/tool_bench 2026-05-28T10:08:28,718 creating build/lib/evalscope/benchmarks/pumed_qa 2026-05-28T10:08:28,719 copying evalscope/benchmarks/pumed_qa/__init__.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-05-28T10:08:28,720 copying evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-05-28T10:08:28,723 creating build/lib/evalscope/benchmarks/healthbench 2026-05-28T10:08:28,724 copying evalscope/benchmarks/healthbench/utils.py -> build/lib/evalscope/benchmarks/healthbench 2026-05-28T10:08:28,726 copying evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/lib/evalscope/benchmarks/healthbench 2026-05-28T10:08:28,729 copying evalscope/benchmarks/healthbench/__init__.py -> build/lib/evalscope/benchmarks/healthbench 2026-05-28T10:08:28,731 creating build/lib/evalscope/benchmarks/air_bench 2026-05-28T10:08:28,731 copying evalscope/benchmarks/air_bench/utils.py -> build/lib/evalscope/benchmarks/air_bench 2026-05-28T10:08:28,734 copying evalscope/benchmarks/air_bench/air_bench_foundation_adapter.py -> build/lib/evalscope/benchmarks/air_bench 2026-05-28T10:08:28,736 copying evalscope/benchmarks/air_bench/air_bench_chat_adapter.py -> build/lib/evalscope/benchmarks/air_bench 2026-05-28T10:08:28,739 copying evalscope/benchmarks/air_bench/__init__.py -> build/lib/evalscope/benchmarks/air_bench 2026-05-28T10:08:28,741 creating build/lib/evalscope/benchmarks/mmlu_redux 2026-05-28T10:08:28,742 copying evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-05-28T10:08:28,744 copying evalscope/benchmarks/mmlu_redux/__init__.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-05-28T10:08:28,746 creating build/lib/evalscope/benchmarks/biomix_qa 2026-05-28T10:08:28,747 copying evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-05-28T10:08:28,749 copying evalscope/benchmarks/biomix_qa/__init__.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-05-28T10:08:28,751 creating build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-05-28T10:08:28,752 copying evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-05-28T10:08:28,754 copying evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-05-28T10:08:28,756 creating build/lib/evalscope/benchmarks/general_qa 2026-05-28T10:08:28,757 copying evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/lib/evalscope/benchmarks/general_qa 2026-05-28T10:08:28,759 copying evalscope/benchmarks/general_qa/__init__.py -> build/lib/evalscope/benchmarks/general_qa 2026-05-28T10:08:28,762 creating build/lib/evalscope/benchmarks/videomme_v2 2026-05-28T10:08:28,763 copying evalscope/benchmarks/videomme_v2/videomme_v2_adapter.py -> build/lib/evalscope/benchmarks/videomme_v2 2026-05-28T10:08:28,765 copying evalscope/benchmarks/videomme_v2/__init__.py -> build/lib/evalscope/benchmarks/videomme_v2 2026-05-28T10:08:28,767 creating build/lib/evalscope/benchmarks/general_vmcq 2026-05-28T10:08:28,768 copying evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-05-28T10:08:28,770 copying evalscope/benchmarks/general_vmcq/__init__.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-05-28T10:08:28,772 creating build/lib/evalscope/benchmarks/alpaca_eval 2026-05-28T10:08:28,773 copying evalscope/benchmarks/alpaca_eval/__init__.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-05-28T10:08:28,775 copying evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-05-28T10:08:28,777 creating build/lib/evalscope/benchmarks/math_vision 2026-05-28T10:08:28,778 copying evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/lib/evalscope/benchmarks/math_vision 2026-05-28T10:08:28,781 copying evalscope/benchmarks/math_vision/__init__.py -> build/lib/evalscope/benchmarks/math_vision 2026-05-28T10:08:28,782 creating build/lib/evalscope/benchmarks/simple_vqa 2026-05-28T10:08:28,783 copying evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-05-28T10:08:28,786 copying evalscope/benchmarks/simple_vqa/__init__.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-05-28T10:08:28,788 creating build/lib/evalscope/benchmarks/a_okvqa 2026-05-28T10:08:28,789 copying evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-05-28T10:08:28,791 copying evalscope/benchmarks/a_okvqa/__init__.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-05-28T10:08:28,792 creating build/lib/evalscope/benchmarks/torgo 2026-05-28T10:08:28,793 copying evalscope/benchmarks/torgo/torgo_adapter.py -> build/lib/evalscope/benchmarks/torgo 2026-05-28T10:08:28,796 copying evalscope/benchmarks/torgo/__init__.py -> build/lib/evalscope/benchmarks/torgo 2026-05-28T10:08:28,798 creating build/lib/evalscope/benchmarks/librispeech 2026-05-28T10:08:28,799 copying evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/lib/evalscope/benchmarks/librispeech 2026-05-28T10:08:28,801 copying evalscope/benchmarks/librispeech/__init__.py -> build/lib/evalscope/benchmarks/librispeech 2026-05-28T10:08:28,804 creating build/lib/evalscope/benchmarks/gpqa 2026-05-28T10:08:28,805 copying evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/lib/evalscope/benchmarks/gpqa 2026-05-28T10:08:28,807 copying evalscope/benchmarks/gpqa/__init__.py -> build/lib/evalscope/benchmarks/gpqa 2026-05-28T10:08:28,809 copying evalscope/benchmarks/gpqa/prompt.py -> build/lib/evalscope/benchmarks/gpqa 2026-05-28T10:08:28,811 creating build/lib/evalscope/benchmarks/vstar_bench 2026-05-28T10:08:28,812 copying evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-05-28T10:08:28,814 copying evalscope/benchmarks/vstar_bench/__init__.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-05-28T10:08:28,816 creating build/lib/evalscope/benchmarks/zerobench 2026-05-28T10:08:28,817 copying evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/lib/evalscope/benchmarks/zerobench 2026-05-28T10:08:28,819 copying evalscope/benchmarks/zerobench/__init__.py -> build/lib/evalscope/benchmarks/zerobench 2026-05-28T10:08:28,821 creating build/lib/evalscope/benchmarks/minerva_math 2026-05-28T10:08:28,822 copying evalscope/benchmarks/minerva_math/__init__.py -> build/lib/evalscope/benchmarks/minerva_math 2026-05-28T10:08:28,824 copying evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/lib/evalscope/benchmarks/minerva_math 2026-05-28T10:08:28,826 creating build/lib/evalscope/benchmarks/docmath 2026-05-28T10:08:28,827 copying evalscope/benchmarks/docmath/utils.py -> build/lib/evalscope/benchmarks/docmath 2026-05-28T10:08:28,829 copying evalscope/benchmarks/docmath/docmath_adapter.py -> build/lib/evalscope/benchmarks/docmath 2026-05-28T10:08:28,831 copying evalscope/benchmarks/docmath/__init__.py -> build/lib/evalscope/benchmarks/docmath 2026-05-28T10:08:28,833 creating build/lib/evalscope/benchmarks/poly_math 2026-05-28T10:08:28,834 copying evalscope/benchmarks/poly_math/__init__.py -> build/lib/evalscope/benchmarks/poly_math 2026-05-28T10:08:28,836 copying evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/lib/evalscope/benchmarks/poly_math 2026-05-28T10:08:28,838 creating build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,839 copying evalscope/benchmarks/ifeval/instructions_util.py -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,842 copying evalscope/benchmarks/ifeval/utils.py -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,844 copying evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,846 copying evalscope/benchmarks/ifeval/instructions.py -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,849 copying evalscope/benchmarks/ifeval/instructions_registry.py -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,851 copying evalscope/benchmarks/ifeval/__init__.py -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:28,853 creating build/lib/evalscope/benchmarks/coin_flip 2026-05-28T10:08:28,854 copying evalscope/benchmarks/coin_flip/__init__.py -> build/lib/evalscope/benchmarks/coin_flip 2026-05-28T10:08:28,856 copying evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/lib/evalscope/benchmarks/coin_flip 2026-05-28T10:08:28,858 creating build/lib/evalscope/benchmarks/race 2026-05-28T10:08:28,859 copying evalscope/benchmarks/race/race_adapter.py -> build/lib/evalscope/benchmarks/race 2026-05-28T10:08:28,861 copying evalscope/benchmarks/race/__init__.py -> build/lib/evalscope/benchmarks/race 2026-05-28T10:08:28,864 creating build/lib/evalscope/benchmarks/visu_logic 2026-05-28T10:08:28,865 copying evalscope/benchmarks/visu_logic/__init__.py -> build/lib/evalscope/benchmarks/visu_logic 2026-05-28T10:08:28,866 copying evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/lib/evalscope/benchmarks/visu_logic 2026-05-28T10:08:28,869 creating build/lib/evalscope/benchmarks/hallusion_bench 2026-05-28T10:08:28,870 copying evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-05-28T10:08:28,872 copying evalscope/benchmarks/hallusion_bench/__init__.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-05-28T10:08:28,874 creating build/lib/evalscope/benchmarks/truthful_qa 2026-05-28T10:08:28,875 copying evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-05-28T10:08:28,878 copying evalscope/benchmarks/truthful_qa/__init__.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-05-28T10:08:28,880 creating build/lib/evalscope/benchmarks/cl_bench 2026-05-28T10:08:28,881 copying evalscope/benchmarks/cl_bench/utils.py -> build/lib/evalscope/benchmarks/cl_bench 2026-05-28T10:08:28,883 copying evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/lib/evalscope/benchmarks/cl_bench 2026-05-28T10:08:28,885 copying evalscope/benchmarks/cl_bench/__init__.py -> build/lib/evalscope/benchmarks/cl_bench 2026-05-28T10:08:28,887 creating build/lib/evalscope/benchmarks/aime 2026-05-28T10:08:28,888 copying evalscope/benchmarks/aime/math_normalize.py -> build/lib/evalscope/benchmarks/aime 2026-05-28T10:08:28,891 copying evalscope/benchmarks/aime/aime_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-05-28T10:08:28,893 copying evalscope/benchmarks/aime/grader.py -> build/lib/evalscope/benchmarks/aime 2026-05-28T10:08:28,895 copying evalscope/benchmarks/aime/__init__.py -> build/lib/evalscope/benchmarks/aime 2026-05-28T10:08:28,898 creating build/lib/evalscope/benchmarks/music_trivia 2026-05-28T10:08:28,899 copying evalscope/benchmarks/music_trivia/__init__.py -> build/lib/evalscope/benchmarks/music_trivia 2026-05-28T10:08:28,900 copying evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/lib/evalscope/benchmarks/music_trivia 2026-05-28T10:08:28,902 creating build/lib/evalscope/benchmarks/drivelology 2026-05-28T10:08:28,903 copying evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-05-28T10:08:28,906 copying evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-05-28T10:08:28,908 copying evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-05-28T10:08:28,910 copying evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-05-28T10:08:28,912 copying evalscope/benchmarks/drivelology/__init__.py -> build/lib/evalscope/benchmarks/drivelology 2026-05-28T10:08:28,914 creating build/lib/evalscope/benchmarks/mgsm 2026-05-28T10:08:28,915 copying evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/lib/evalscope/benchmarks/mgsm 2026-05-28T10:08:28,917 copying evalscope/benchmarks/mgsm/__init__.py -> build/lib/evalscope/benchmarks/mgsm 2026-05-28T10:08:28,919 creating build/lib/evalscope/benchmarks/tau_bench 2026-05-28T10:08:28,920 copying evalscope/benchmarks/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench 2026-05-28T10:08:28,922 creating build/lib/evalscope/benchmarks/general_fc 2026-05-28T10:08:28,923 copying evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/lib/evalscope/benchmarks/general_fc 2026-05-28T10:08:28,926 copying evalscope/benchmarks/general_fc/__init__.py -> build/lib/evalscope/benchmarks/general_fc 2026-05-28T10:08:28,928 creating build/lib/evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:28,929 copying evalscope/benchmarks/image_edit/gedit/utils.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:28,932 copying evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:28,934 copying evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:28,936 copying evalscope/benchmarks/image_edit/gedit/__init__.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:28,938 creating build/lib/evalscope/benchmarks/scicode/docker 2026-05-28T10:08:28,939 copying evalscope/benchmarks/scicode/docker/process_data.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-05-28T10:08:28,942 copying evalscope/benchmarks/scicode/docker/test_util.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-05-28T10:08:28,945 creating build/lib/evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:28,946 copying evalscope/benchmarks/bfcl/v4/utils.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:28,949 copying evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:28,952 copying evalscope/benchmarks/bfcl/v4/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:28,953 creating build/lib/evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:28,955 copying evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:28,957 copying evalscope/benchmarks/bfcl/v3/utils.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:28,959 copying evalscope/benchmarks/bfcl/v3/generation.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:28,962 copying evalscope/benchmarks/bfcl/v3/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:28,964 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-05-28T10:08:28,965 copying evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-05-28T10:08:28,966 copying evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-05-28T10:08:28,969 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,970 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,972 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,974 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,976 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,978 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,980 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,983 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,986 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,988 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:28,990 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:28,991 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:28,994 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:28,996 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:28,998 creating build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:28,999 copying evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:29,001 copying evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:29,003 copying evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:29,005 copying evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:29,007 copying evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:29,009 copying evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:29,011 creating build/lib/evalscope/benchmarks/poly_math/utils 2026-05-28T10:08:29,012 copying evalscope/benchmarks/poly_math/utils/instruction.py -> build/lib/evalscope/benchmarks/poly_math/utils 2026-05-28T10:08:29,015 creating build/lib/evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:29,016 copying evalscope/benchmarks/tau_bench/tau3_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:29,019 copying evalscope/benchmarks/tau_bench/tau3_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:29,020 copying evalscope/benchmarks/tau_bench/tau3_bench/tau3_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:29,023 creating build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:29,024 copying evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:29,027 copying evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:29,029 copying evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:29,031 creating build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:29,032 copying evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:29,034 copying evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:29,036 copying evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:29,038 creating build/lib/evalscope/utils/tqdm_utils 2026-05-28T10:08:29,039 copying evalscope/utils/tqdm_utils/tqdm_logging.py -> build/lib/evalscope/utils/tqdm_utils 2026-05-28T10:08:29,042 copying evalscope/utils/tqdm_utils/progress_tracker.py -> build/lib/evalscope/utils/tqdm_utils 2026-05-28T10:08:29,044 copying evalscope/utils/tqdm_utils/__init__.py -> build/lib/evalscope/utils/tqdm_utils 2026-05-28T10:08:29,046 creating build/lib/evalscope/utils/doc_utils 2026-05-28T10:08:29,047 copying evalscope/utils/doc_utils/translate_description.py -> build/lib/evalscope/utils/doc_utils 2026-05-28T10:08:29,050 copying evalscope/utils/doc_utils/generate_dataset_md.py -> build/lib/evalscope/utils/doc_utils 2026-05-28T10:08:29,052 copying evalscope/utils/doc_utils/benchmark_stats.py -> build/lib/evalscope/utils/doc_utils 2026-05-28T10:08:29,055 copying evalscope/utils/doc_utils/readme_generator.py -> build/lib/evalscope/utils/doc_utils 2026-05-28T10:08:29,058 copying evalscope/utils/doc_utils/__init__.py -> build/lib/evalscope/utils/doc_utils 2026-05-28T10:08:29,062 creating build/lib/evalscope/service/blueprints 2026-05-28T10:08:29,063 copying evalscope/service/blueprints/reports.py -> build/lib/evalscope/service/blueprints 2026-05-28T10:08:29,065 copying evalscope/service/blueprints/eval.py -> build/lib/evalscope/service/blueprints 2026-05-28T10:08:29,068 copying evalscope/service/blueprints/__init__.py -> build/lib/evalscope/service/blueprints 2026-05-28T10:08:29,070 copying evalscope/service/blueprints/perf.py -> build/lib/evalscope/service/blueprints 2026-05-28T10:08:29,073 creating build/lib/evalscope/service/utils 2026-05-28T10:08:29,074 copying evalscope/service/utils/benchmarks.py -> build/lib/evalscope/service/utils 2026-05-28T10:08:29,076 copying evalscope/service/utils/process.py -> build/lib/evalscope/service/utils 2026-05-28T10:08:29,078 copying evalscope/service/utils/log.py -> build/lib/evalscope/service/utils 2026-05-28T10:08:29,080 copying evalscope/service/utils/__init__.py -> build/lib/evalscope/service/utils 2026-05-28T10:08:29,084 creating build/lib/evalscope/models/utils 2026-05-28T10:08:29,085 copying evalscope/models/utils/openai_responses.py -> build/lib/evalscope/models/utils 2026-05-28T10:08:29,088 copying evalscope/models/utils/openai.py -> build/lib/evalscope/models/utils 2026-05-28T10:08:29,090 copying evalscope/models/utils/anthropic.py -> build/lib/evalscope/models/utils 2026-05-28T10:08:29,094 creating build/lib/evalscope/backend/vlm_eval_kit 2026-05-28T10:08:29,095 copying evalscope/backend/vlm_eval_kit/backend_manager.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-05-28T10:08:29,097 copying evalscope/backend/vlm_eval_kit/__init__.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-05-28T10:08:29,099 creating build/lib/evalscope/backend/opencompass 2026-05-28T10:08:29,100 copying evalscope/backend/opencompass/backend_manager.py -> build/lib/evalscope/backend/opencompass 2026-05-28T10:08:29,103 copying evalscope/backend/opencompass/api_meta_template.py -> build/lib/evalscope/backend/opencompass 2026-05-28T10:08:29,105 copying evalscope/backend/opencompass/__init__.py -> build/lib/evalscope/backend/opencompass 2026-05-28T10:08:29,107 creating build/lib/evalscope/backend/rag_eval 2026-05-28T10:08:29,108 copying evalscope/backend/rag_eval/backend_manager.py -> build/lib/evalscope/backend/rag_eval 2026-05-28T10:08:29,110 copying evalscope/backend/rag_eval/__init__.py -> build/lib/evalscope/backend/rag_eval 2026-05-28T10:08:29,112 creating build/lib/evalscope/backend/opencompass/tasks 2026-05-28T10:08:29,113 copying evalscope/backend/opencompass/tasks/eval_datasets.py -> build/lib/evalscope/backend/opencompass/tasks 2026-05-28T10:08:29,115 copying evalscope/backend/opencompass/tasks/eval_api.py -> build/lib/evalscope/backend/opencompass/tasks 2026-05-28T10:08:29,117 copying evalscope/backend/opencompass/tasks/__init__.py -> build/lib/evalscope/backend/opencompass/tasks 2026-05-28T10:08:29,119 creating build/lib/evalscope/backend/rag_eval/utils 2026-05-28T10:08:29,120 copying evalscope/backend/rag_eval/utils/tools.py -> build/lib/evalscope/backend/rag_eval/utils 2026-05-28T10:08:29,122 copying evalscope/backend/rag_eval/utils/embedding.py -> build/lib/evalscope/backend/rag_eval/utils 2026-05-28T10:08:29,124 copying evalscope/backend/rag_eval/utils/llm.py -> build/lib/evalscope/backend/rag_eval/utils 2026-05-28T10:08:29,126 copying evalscope/backend/rag_eval/utils/clip.py -> build/lib/evalscope/backend/rag_eval/utils 2026-05-28T10:08:29,128 copying evalscope/backend/rag_eval/utils/__init__.py -> build/lib/evalscope/backend/rag_eval/utils 2026-05-28T10:08:29,130 creating build/lib/evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:29,131 copying evalscope/backend/rag_eval/cmteb/arguments.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:29,134 copying evalscope/backend/rag_eval/cmteb/base.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:29,135 copying evalscope/backend/rag_eval/cmteb/task_template.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:29,137 copying evalscope/backend/rag_eval/cmteb/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:29,140 creating build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:29,141 copying evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:29,143 copying evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:29,145 copying evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:29,147 copying evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:29,149 creating build/lib/evalscope/backend/rag_eval/ragas 2026-05-28T10:08:29,150 copying evalscope/backend/rag_eval/ragas/arguments.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-05-28T10:08:29,152 copying evalscope/backend/rag_eval/ragas/task_template.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-05-28T10:08:29,154 copying evalscope/backend/rag_eval/ragas/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-05-28T10:08:29,156 creating build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,158 copying evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,160 copying evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,162 copying evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,164 copying evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,167 copying evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,169 copying evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,172 copying evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,173 copying evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:29,176 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-05-28T10:08:29,177 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-05-28T10:08:29,180 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:29,181 copying evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:29,183 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:29,185 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:29,187 copying evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:29,189 creating build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-05-28T10:08:29,190 copying evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-05-28T10:08:29,192 creating build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:29,193 copying evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:29,195 copying evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:29,197 copying evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:29,200 copying evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:29,202 copying evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:29,204 creating build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:29,205 copying evalscope/third_party/toolbench_static/eval.py -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:29,207 copying evalscope/third_party/toolbench_static/infer.py -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:29,210 copying evalscope/third_party/toolbench_static/toolbench_static.py -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:29,212 copying evalscope/third_party/toolbench_static/__init__.py -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:29,214 creating build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:29,215 copying evalscope/third_party/longbench_write/utils.py -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:29,217 copying evalscope/third_party/longbench_write/eval.py -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:29,219 copying evalscope/third_party/longbench_write/infer.py -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:29,222 copying evalscope/third_party/longbench_write/longbench_write.py -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:29,224 copying evalscope/third_party/longbench_write/__init__.py -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:29,226 creating build/lib/evalscope/third_party/thinkbench 2026-05-28T10:08:29,227 copying evalscope/third_party/thinkbench/eval.py -> build/lib/evalscope/third_party/thinkbench 2026-05-28T10:08:29,230 copying evalscope/third_party/thinkbench/infer.py -> build/lib/evalscope/third_party/thinkbench 2026-05-28T10:08:29,232 copying evalscope/third_party/thinkbench/__init__.py -> build/lib/evalscope/third_party/thinkbench 2026-05-28T10:08:29,234 creating build/lib/evalscope/third_party/toolbench_static/llm 2026-05-28T10:08:29,235 copying evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-05-28T10:08:29,237 copying evalscope/third_party/toolbench_static/llm/__init__.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-05-28T10:08:29,239 creating build/lib/evalscope/third_party/longbench_write/tools 2026-05-28T10:08:29,240 copying evalscope/third_party/longbench_write/tools/openai_api.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-05-28T10:08:29,242 copying evalscope/third_party/longbench_write/tools/__init__.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-05-28T10:08:29,244 copying evalscope/third_party/longbench_write/tools/data_etl.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-05-28T10:08:29,247 creating build/lib/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:29,248 copying evalscope/third_party/longbench_write/resources/__init__.py -> build/lib/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:29,250 creating build/lib/evalscope/third_party/thinkbench/tools 2026-05-28T10:08:29,251 copying evalscope/third_party/thinkbench/tools/utils.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-05-28T10:08:29,253 copying evalscope/third_party/thinkbench/tools/llm.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-05-28T10:08:29,255 copying evalscope/third_party/thinkbench/tools/__init__.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-05-28T10:08:29,258 creating build/lib/evalscope/api/mixin 2026-05-28T10:08:29,259 copying evalscope/api/mixin/sandbox_mixin.py -> build/lib/evalscope/api/mixin 2026-05-28T10:08:29,261 copying evalscope/api/mixin/llm_judge_mixin.py -> build/lib/evalscope/api/mixin 2026-05-28T10:08:29,263 copying evalscope/api/mixin/__init__.py -> build/lib/evalscope/api/mixin 2026-05-28T10:08:29,265 creating build/lib/evalscope/api/evaluator 2026-05-28T10:08:29,267 copying evalscope/api/evaluator/inference_result.py -> build/lib/evalscope/api/evaluator 2026-05-28T10:08:29,268 copying evalscope/api/evaluator/cache.py -> build/lib/evalscope/api/evaluator 2026-05-28T10:08:29,271 copying evalscope/api/evaluator/evaluator.py -> build/lib/evalscope/api/evaluator 2026-05-28T10:08:29,273 copying evalscope/api/evaluator/state.py -> build/lib/evalscope/api/evaluator 2026-05-28T10:08:29,275 copying evalscope/api/evaluator/__init__.py -> build/lib/evalscope/api/evaluator 2026-05-28T10:08:29,277 creating build/lib/evalscope/api/filter 2026-05-28T10:08:29,278 copying evalscope/api/filter/__init__.py -> build/lib/evalscope/api/filter 2026-05-28T10:08:29,280 copying evalscope/api/filter/filter.py -> build/lib/evalscope/api/filter 2026-05-28T10:08:29,283 creating build/lib/evalscope/api/tool 2026-05-28T10:08:29,284 copying evalscope/api/tool/utils.py -> build/lib/evalscope/api/tool 2026-05-28T10:08:29,286 copying evalscope/api/tool/tool_info.py -> build/lib/evalscope/api/tool 2026-05-28T10:08:29,288 copying evalscope/api/tool/tool_call.py -> build/lib/evalscope/api/tool 2026-05-28T10:08:29,290 copying evalscope/api/tool/__init__.py -> build/lib/evalscope/api/tool 2026-05-28T10:08:29,292 creating build/lib/evalscope/api/dataset 2026-05-28T10:08:29,293 copying evalscope/api/dataset/utils.py -> build/lib/evalscope/api/dataset 2026-05-28T10:08:29,296 copying evalscope/api/dataset/loader.py -> build/lib/evalscope/api/dataset 2026-05-28T10:08:29,298 copying evalscope/api/dataset/hub.py -> build/lib/evalscope/api/dataset 2026-05-28T10:08:29,301 copying evalscope/api/dataset/dataset.py -> build/lib/evalscope/api/dataset 2026-05-28T10:08:29,303 copying evalscope/api/dataset/__init__.py -> build/lib/evalscope/api/dataset 2026-05-28T10:08:29,306 creating build/lib/evalscope/api/messages 2026-05-28T10:08:29,307 copying evalscope/api/messages/utils.py -> build/lib/evalscope/api/messages 2026-05-28T10:08:29,309 copying evalscope/api/messages/perf_metrics.py -> build/lib/evalscope/api/messages 2026-05-28T10:08:29,311 copying evalscope/api/messages/chat_message.py -> build/lib/evalscope/api/messages 2026-05-28T10:08:29,313 copying evalscope/api/messages/content.py -> build/lib/evalscope/api/messages 2026-05-28T10:08:29,315 copying evalscope/api/messages/__init__.py -> build/lib/evalscope/api/messages 2026-05-28T10:08:29,317 creating build/lib/evalscope/api/model 2026-05-28T10:08:29,318 copying evalscope/api/model/lazy_model.py -> build/lib/evalscope/api/model 2026-05-28T10:08:29,320 copying evalscope/api/model/model_output.py -> build/lib/evalscope/api/model 2026-05-28T10:08:29,323 copying evalscope/api/model/model.py -> build/lib/evalscope/api/model 2026-05-28T10:08:29,325 copying evalscope/api/model/__init__.py -> build/lib/evalscope/api/model 2026-05-28T10:08:29,326 copying evalscope/api/model/generate_config.py -> build/lib/evalscope/api/model 2026-05-28T10:08:29,329 creating build/lib/evalscope/api/benchmark 2026-05-28T10:08:29,330 copying evalscope/api/benchmark/benchmark.py -> build/lib/evalscope/api/benchmark 2026-05-28T10:08:29,332 copying evalscope/api/benchmark/statistics.py -> build/lib/evalscope/api/benchmark 2026-05-28T10:08:29,335 copying evalscope/api/benchmark/__init__.py -> build/lib/evalscope/api/benchmark 2026-05-28T10:08:29,336 copying evalscope/api/benchmark/meta.py -> build/lib/evalscope/api/benchmark 2026-05-28T10:08:29,339 creating build/lib/evalscope/api/sandbox 2026-05-28T10:08:29,340 copying evalscope/api/sandbox/service.py -> build/lib/evalscope/api/sandbox 2026-05-28T10:08:29,343 copying evalscope/api/sandbox/engine.py -> build/lib/evalscope/api/sandbox 2026-05-28T10:08:29,345 copying evalscope/api/sandbox/__init__.py -> build/lib/evalscope/api/sandbox 2026-05-28T10:08:29,347 copying evalscope/api/sandbox/config_builder.py -> build/lib/evalscope/api/sandbox 2026-05-28T10:08:29,349 creating build/lib/evalscope/api/metric 2026-05-28T10:08:29,350 copying evalscope/api/metric/scorer.py -> build/lib/evalscope/api/metric 2026-05-28T10:08:29,352 copying evalscope/api/metric/__init__.py -> build/lib/evalscope/api/metric 2026-05-28T10:08:29,354 copying evalscope/api/metric/metric.py -> build/lib/evalscope/api/metric 2026-05-28T10:08:29,356 creating build/lib/evalscope/api/agent 2026-05-28T10:08:29,357 copying evalscope/api/agent/environment.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,359 copying evalscope/api/agent/strategy.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,361 copying evalscope/api/agent/tool_executor.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,363 copying evalscope/api/agent/constants.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,365 copying evalscope/api/agent/types.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,367 copying evalscope/api/agent/loop.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,370 copying evalscope/api/agent/__init__.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,371 copying evalscope/api/agent/trace.py -> build/lib/evalscope/api/agent 2026-05-28T10:08:29,374 creating build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,375 copying evalscope/api/benchmark/adapters/ner_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,378 copying evalscope/api/benchmark/adapters/vendor_verifier_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,380 copying evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,383 copying evalscope/api/benchmark/adapters/_agent_loop_runner.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,385 copying evalscope/api/benchmark/adapters/agent_loop_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,387 copying evalscope/api/benchmark/adapters/text2image_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,389 copying evalscope/api/benchmark/adapters/multi_turn_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,392 copying evalscope/api/benchmark/adapters/agent_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,393 copying evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,395 copying evalscope/api/benchmark/adapters/default_data_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,398 copying evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,400 copying evalscope/api/benchmark/adapters/__init__.py -> build/lib/evalscope/api/benchmark/adapters 2026-05-28T10:08:29,402 creating build/lib/evalscope/api/agent/mcp 2026-05-28T10:08:29,403 copying evalscope/api/agent/mcp/source.py -> build/lib/evalscope/api/agent/mcp 2026-05-28T10:08:29,406 copying evalscope/api/agent/mcp/types.py -> build/lib/evalscope/api/agent/mcp 2026-05-28T10:08:29,408 copying evalscope/api/agent/mcp/client.py -> build/lib/evalscope/api/agent/mcp 2026-05-28T10:08:29,410 copying evalscope/api/agent/mcp/__init__.py -> build/lib/evalscope/api/agent/mcp 2026-05-28T10:08:29,412 creating build/lib/evalscope/agent/tools 2026-05-28T10:08:29,413 copying evalscope/agent/tools/python_exec.py -> build/lib/evalscope/agent/tools 2026-05-28T10:08:29,415 copying evalscope/agent/tools/bash.py -> build/lib/evalscope/agent/tools 2026-05-28T10:08:29,418 copying evalscope/agent/tools/submit.py -> build/lib/evalscope/agent/tools 2026-05-28T10:08:29,420 copying evalscope/agent/tools/__init__.py -> build/lib/evalscope/agent/tools 2026-05-28T10:08:29,422 creating build/lib/evalscope/agent/environments 2026-05-28T10:08:29,423 copying evalscope/agent/environments/local.py -> build/lib/evalscope/agent/environments 2026-05-28T10:08:29,425 copying evalscope/agent/environments/enclave.py -> build/lib/evalscope/agent/environments 2026-05-28T10:08:29,427 copying evalscope/agent/environments/__init__.py -> build/lib/evalscope/agent/environments 2026-05-28T10:08:29,430 creating build/lib/evalscope/agent/strategies 2026-05-28T10:08:29,431 copying evalscope/agent/strategies/react.py -> build/lib/evalscope/agent/strategies 2026-05-28T10:08:29,433 copying evalscope/agent/strategies/__init__.py -> build/lib/evalscope/agent/strategies 2026-05-28T10:08:29,435 copying evalscope/agent/strategies/function_calling.py -> build/lib/evalscope/agent/strategies 2026-05-28T10:08:29,437 creating build/lib/evalscope/agent/external 2026-05-28T10:08:29,439 copying evalscope/agent/external/adapter.py -> build/lib/evalscope/agent/external 2026-05-28T10:08:29,442 copying evalscope/agent/external/config.py -> build/lib/evalscope/agent/external 2026-05-28T10:08:29,444 copying evalscope/agent/external/__init__.py -> build/lib/evalscope/agent/external 2026-05-28T10:08:29,447 creating build/lib/evalscope/agent/strategies/swe_bench 2026-05-28T10:08:29,448 copying evalscope/agent/strategies/swe_bench/swe_bench_toolcall.py -> build/lib/evalscope/agent/strategies/swe_bench 2026-05-28T10:08:29,450 copying evalscope/agent/strategies/swe_bench/_observation.py -> build/lib/evalscope/agent/strategies/swe_bench 2026-05-28T10:08:29,453 copying evalscope/agent/strategies/swe_bench/swe_bench_backticks.py -> build/lib/evalscope/agent/strategies/swe_bench 2026-05-28T10:08:29,455 copying evalscope/agent/strategies/swe_bench/__init__.py -> build/lib/evalscope/agent/strategies/swe_bench 2026-05-28T10:08:29,458 creating build/lib/evalscope/agent/external/runners 2026-05-28T10:08:29,459 copying evalscope/agent/external/runners/mock.py -> build/lib/evalscope/agent/external/runners 2026-05-28T10:08:29,461 copying evalscope/agent/external/runners/claude_code.py -> build/lib/evalscope/agent/external/runners 2026-05-28T10:08:29,463 copying evalscope/agent/external/runners/base.py -> build/lib/evalscope/agent/external/runners 2026-05-28T10:08:29,465 copying evalscope/agent/external/runners/codex.py -> build/lib/evalscope/agent/external/runners 2026-05-28T10:08:29,469 copying evalscope/agent/external/runners/__init__.py -> build/lib/evalscope/agent/external/runners 2026-05-28T10:08:29,471 creating build/lib/evalscope/agent/external/helpers 2026-05-28T10:08:29,472 copying evalscope/agent/external/helpers/patch.py -> build/lib/evalscope/agent/external/helpers 2026-05-28T10:08:29,474 copying evalscope/agent/external/helpers/__init__.py -> build/lib/evalscope/agent/external/helpers 2026-05-28T10:08:29,477 creating build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,478 copying evalscope/agent/external/bridge/sse_anthropic.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,480 copying evalscope/agent/external/bridge/translate_openai.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,482 copying evalscope/agent/external/bridge/sse_responses.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,485 copying evalscope/agent/external/bridge/trace_recorder.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,487 copying evalscope/agent/external/bridge/_sse_common.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,489 copying evalscope/agent/external/bridge/sse_openai.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,491 copying evalscope/agent/external/bridge/translate_responses.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,494 copying evalscope/agent/external/bridge/translate_anthropic.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,496 copying evalscope/agent/external/bridge/server.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,499 copying evalscope/agent/external/bridge/__init__.py -> build/lib/evalscope/agent/external/bridge 2026-05-28T10:08:29,502 creating build/lib/evalscope/perf/core 2026-05-28T10:08:29,503 copying evalscope/perf/core/http_client.py -> build/lib/evalscope/perf/core 2026-05-28T10:08:29,505 copying evalscope/perf/core/metrics_consumer.py -> build/lib/evalscope/perf/core 2026-05-28T10:08:29,507 copying evalscope/perf/core/__init__.py -> build/lib/evalscope/perf/core 2026-05-28T10:08:29,510 creating build/lib/evalscope/perf/utils 2026-05-28T10:08:29,510 copying evalscope/perf/utils/log_utils.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,513 copying evalscope/perf/utils/rich_display.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,516 copying evalscope/perf/utils/handler.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,518 copying evalscope/perf/utils/perf_constants.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,520 copying evalscope/perf/utils/benchmark_util.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,522 copying evalscope/perf/utils/workload_timeline.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,525 copying evalscope/perf/utils/analysis_result.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,527 copying evalscope/perf/utils/local_server.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,529 copying evalscope/perf/utils/db_util.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,532 copying evalscope/perf/utils/__init__.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,533 copying evalscope/perf/utils/trace_metrics.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,536 copying evalscope/perf/utils/perf_models.py -> build/lib/evalscope/perf/utils 2026-05-28T10:08:29,539 creating build/lib/evalscope/perf/sla 2026-05-28T10:08:29,540 copying evalscope/perf/sla/sla_criterion.py -> build/lib/evalscope/perf/sla 2026-05-28T10:08:29,542 copying evalscope/perf/sla/sla_run.py -> build/lib/evalscope/perf/sla 2026-05-28T10:08:29,545 copying evalscope/perf/sla/__init__.py -> build/lib/evalscope/perf/sla 2026-05-28T10:08:29,547 creating build/lib/evalscope/perf/plugin 2026-05-28T10:08:29,548 copying evalscope/perf/plugin/registry.py -> build/lib/evalscope/perf/plugin 2026-05-28T10:08:29,550 copying evalscope/perf/plugin/__init__.py -> build/lib/evalscope/perf/plugin 2026-05-28T10:08:29,552 creating build/lib/evalscope/perf/core/strategies 2026-05-28T10:08:29,553 copying evalscope/perf/core/strategies/multi_turn.py -> build/lib/evalscope/perf/core/strategies 2026-05-28T10:08:29,555 copying evalscope/perf/core/strategies/closed_loop.py -> build/lib/evalscope/perf/core/strategies 2026-05-28T10:08:29,558 copying evalscope/perf/core/strategies/base.py -> build/lib/evalscope/perf/core/strategies 2026-05-28T10:08:29,559 copying evalscope/perf/core/strategies/__init__.py -> build/lib/evalscope/perf/core/strategies 2026-05-28T10:08:29,561 copying evalscope/perf/core/strategies/open_loop.py -> build/lib/evalscope/perf/core/strategies 2026-05-28T10:08:29,564 creating build/lib/evalscope/perf/utils/report 2026-05-28T10:08:29,565 copying evalscope/perf/utils/report/perf_charts.py -> build/lib/evalscope/perf/utils/report 2026-05-28T10:08:29,567 copying evalscope/perf/utils/report/perf_data.py -> build/lib/evalscope/perf/utils/report 2026-05-28T10:08:29,570 copying evalscope/perf/utils/report/__init__.py -> build/lib/evalscope/perf/utils/report 2026-05-28T10:08:29,571 copying evalscope/perf/utils/report/generate_report.py -> build/lib/evalscope/perf/utils/report 2026-05-28T10:08:29,574 creating build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,575 copying evalscope/perf/plugin/api/openai_rerank_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,578 copying evalscope/perf/plugin/api/openai_embedding_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,580 copying evalscope/perf/plugin/api/openai_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,583 copying evalscope/perf/plugin/api/dashscope_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,585 copying evalscope/perf/plugin/api/openai_responses_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,588 copying evalscope/perf/plugin/api/base.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,590 copying evalscope/perf/plugin/api/custom_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,592 copying evalscope/perf/plugin/api/default_api.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,595 copying evalscope/perf/plugin/api/__init__.py -> build/lib/evalscope/perf/plugin/api 2026-05-28T10:08:29,597 creating build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,598 copying evalscope/perf/plugin/datasets/flickr8k.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,600 copying evalscope/perf/plugin/datasets/random_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,603 copying evalscope/perf/plugin/datasets/multi_turn.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,605 copying evalscope/perf/plugin/datasets/utils.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,608 copying evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,610 copying evalscope/perf/plugin/datasets/custom.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,612 copying evalscope/perf/plugin/datasets/share_gpt.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,614 copying evalscope/perf/plugin/datasets/swe_smith.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,617 copying evalscope/perf/plugin/datasets/openqa.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,619 copying evalscope/perf/plugin/datasets/rerank_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,621 copying evalscope/perf/plugin/datasets/longalpaca.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,623 copying evalscope/perf/plugin/datasets/base.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,625 copying evalscope/perf/plugin/datasets/line_by_line.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,627 copying evalscope/perf/plugin/datasets/trie.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,630 copying evalscope/perf/plugin/datasets/speed_benchmark.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,631 copying evalscope/perf/plugin/datasets/__init__.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,633 copying evalscope/perf/plugin/datasets/kontext_bench.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,635 copying evalscope/perf/plugin/datasets/embedding_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-05-28T10:08:29,638 creating build/lib/evalscope/metrics/sem_score 2026-05-28T10:08:29,639 copying evalscope/metrics/sem_score/scorer.py -> build/lib/evalscope/metrics/sem_score 2026-05-28T10:08:29,641 copying evalscope/metrics/sem_score/__init__.py -> build/lib/evalscope/metrics/sem_score 2026-05-28T10:08:29,643 creating build/lib/evalscope/metrics/bundled_rouge_score 2026-05-28T10:08:29,644 copying evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-05-28T10:08:29,647 copying evalscope/metrics/bundled_rouge_score/__init__.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-05-28T10:08:29,649 creating build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:29,650 copying evalscope/metrics/text_normalizer/basic.py -> build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:29,652 copying evalscope/metrics/text_normalizer/chinese.py -> build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:29,657 copying evalscope/metrics/text_normalizer/wer.py -> build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:29,660 copying evalscope/metrics/text_normalizer/english.py -> build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:29,663 copying evalscope/metrics/text_normalizer/__init__.py -> build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:29,665 creating build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,666 copying evalscope/metrics/t2v_metrics/clipscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,668 copying evalscope/metrics/t2v_metrics/itmscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,670 copying evalscope/metrics/t2v_metrics/constants.py -> build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,672 copying evalscope/metrics/t2v_metrics/vqascore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,673 copying evalscope/metrics/t2v_metrics/score.py -> build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,675 copying evalscope/metrics/t2v_metrics/__init__.py -> build/lib/evalscope/metrics/t2v_metrics 2026-05-28T10:08:29,677 creating build/lib/evalscope/metrics/bert_score 2026-05-28T10:08:29,678 copying evalscope/metrics/bert_score/utils.py -> build/lib/evalscope/metrics/bert_score 2026-05-28T10:08:29,681 copying evalscope/metrics/bert_score/scorer.py -> build/lib/evalscope/metrics/bert_score 2026-05-28T10:08:29,683 copying evalscope/metrics/bert_score/__init__.py -> build/lib/evalscope/metrics/bert_score 2026-05-28T10:08:29,685 creating build/lib/evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:29,687 copying evalscope/metrics/t2v_metrics/models/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:29,689 copying evalscope/metrics/t2v_metrics/models/model.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:29,691 copying evalscope/metrics/t2v_metrics/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:29,693 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:29,694 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:29,696 copying evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:29,699 copying evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:29,701 copying evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:29,703 copying evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:29,705 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:29,706 copying evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:29,708 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:29,710 copying evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:29,712 copying evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:29,714 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:29,715 copying evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:29,717 copying evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:29,719 copying evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:29,721 copying evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:29,723 copying evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:29,725 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-05-28T10:08:29,726 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-05-28T10:08:29,729 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-05-28T10:08:29,729 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-05-28T10:08:29,732 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-05-28T10:08:29,733 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-05-28T10:08:29,735 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-05-28T10:08:29,737 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-05-28T10:08:29,739 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-05-28T10:08:29,740 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-05-28T10:08:29,742 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-05-28T10:08:29,744 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-05-28T10:08:29,745 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-05-28T10:08:29,748 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,749 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,752 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,754 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,758 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,760 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,762 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:29,765 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,766 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,769 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,771 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,773 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,776 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,778 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,780 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:29,783 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:29,784 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:29,786 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:29,789 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:29,791 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:29,794 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,795 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,797 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,799 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,801 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,803 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,806 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,808 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,811 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,813 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,815 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,818 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:29,821 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,822 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,825 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,828 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,830 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,833 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,835 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,839 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,840 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,842 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,845 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:29,848 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:29,849 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:29,851 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:29,854 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:29,857 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:29,858 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:29,860 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:29,862 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:29,864 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:29,865 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:29,868 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:29,870 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:29,871 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:29,876 running egg_info 2026-05-28T10:08:29,886 writing evalscope.egg-info/PKG-INFO 2026-05-28T10:08:29,912 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-05-28T10:08:29,914 writing entry points to evalscope.egg-info/entry_points.txt 2026-05-28T10:08:29,929 writing requirements to evalscope.egg-info/requires.txt 2026-05-28T10:08:29,930 writing top-level names to evalscope.egg-info/top_level.txt 2026-05-28T10:08:30,153 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:30,214 reading manifest template 'MANIFEST.in' 2026-05-28T10:08:30,718 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-05-28T10:08:30,724 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-05-28T10:08:30,731 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-05-28T10:08:30,738 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-05-28T10:08:30,746 warning: no previously-included files matching '*.h5' found anywhere in distribution 2026-05-28T10:08:30,753 warning: no previously-included files matching '*.hdf5' found anywhere in distribution 2026-05-28T10:08:30,761 warning: no previously-included files matching '*.parquet' found anywhere in distribution 2026-05-28T10:08:30,768 warning: no previously-included files matching '*.bin' found anywhere in distribution 2026-05-28T10:08:30,775 warning: no previously-included files matching '*.safetensors' found anywhere in distribution 2026-05-28T10:08:30,782 warning: no previously-included files matching '*.gguf' found anywhere in distribution 2026-05-28T10:08:30,790 warning: no previously-included files matching '*.pth' found anywhere in distribution 2026-05-28T10:08:30,797 warning: no previously-included files matching '*.pt' found anywhere in distribution 2026-05-28T10:08:30,800 no previously-included directories found matching 'evalscope/web/node_modules' 2026-05-28T10:08:30,803 no previously-included directories found matching 'evalscope/web/src' 2026-05-28T10:08:30,805 warning: no previously-included files found matching 'evalscope/web/package.json' 2026-05-28T10:08:30,808 warning: no previously-included files found matching 'evalscope/web/package-lock.json' 2026-05-28T10:08:30,811 warning: no previously-included files found matching 'evalscope/web/tsconfig*.json' 2026-05-28T10:08:30,814 warning: no previously-included files found matching 'evalscope/web/vite.config.ts' 2026-05-28T10:08:30,817 warning: no previously-included files found matching 'evalscope/web/eslint.config.js' 2026-05-28T10:08:30,817 adding license file 'LICENSE' 2026-05-28T10:08:30,885 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-05-28T10:08:31,028 copying evalscope/web/.gitignore -> build/lib/evalscope/web 2026-05-28T10:08:31,030 copying evalscope/web/README.md -> build/lib/evalscope/web 2026-05-28T10:08:31,033 copying evalscope/web/index.html -> build/lib/evalscope/web 2026-05-28T10:08:31,035 creating build/lib/evalscope/web/dist 2026-05-28T10:08:31,036 copying evalscope/web/dist/favicon.svg -> build/lib/evalscope/web/dist 2026-05-28T10:08:31,039 copying evalscope/web/dist/index.html -> build/lib/evalscope/web/dist 2026-05-28T10:08:31,041 creating build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,042 copying evalscope/web/dist/assets/Breadcrumb-CXh_Hjqj.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,044 copying evalscope/web/dist/assets/KaTeX_Size4-Regular-DWFBv043.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,047 copying evalscope/web/dist/assets/DashboardPage-Dy_K4Uk2.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,049 copying evalscope/web/dist/assets/SearchInput-Qu-jjIwJ.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,051 copying evalscope/web/dist/assets/KaTeX_Math-Italic-flOr_0UB.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,054 copying evalscope/web/dist/assets/KaTeX_AMS-Regular-DMm9YOAa.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,057 copying evalscope/web/dist/assets/KaTeX_SansSerif-Italic-YYjJ1zSn.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,060 copying evalscope/web/dist/assets/KaTeX_Fraktur-Bold-BsDP51OF.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,063 copying evalscope/web/dist/assets/Tabs-rdY0A_mU.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,089 copying evalscope/web/dist/assets/LocaleContext-3RqCP440.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,092 copying evalscope/web/dist/assets/KaTeX_Fraktur-Regular-Dxdc4cR9.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,095 copying evalscope/web/dist/assets/KaTeX_SansSerif-Regular-BNo7hRIc.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,098 copying evalscope/web/dist/assets/FilterChip-GjoWt8ON.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,101 copying evalscope/web/dist/assets/KaTeX_Script-Regular-D5yQViql.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,103 copying evalscope/web/dist/assets/KaTeX_Math-BoldItalic-iY-2wyZ7.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,106 copying evalscope/web/dist/assets/ComparePage-DIqHa7xH.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,109 copying evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-Di6jR-x-.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,111 copying evalscope/web/dist/assets/KaTeX_Main-Regular-B22Nviop.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,114 copying evalscope/web/dist/assets/search-_j2hWq0v.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,116 copying evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-BEiXGLvX.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,118 copying evalscope/web/dist/assets/KaTeX_SansSerif-Bold-DbIhKOiC.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,121 copying evalscope/web/dist/assets/KaTeX_Main-Italic-NWA7e6Wa.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,124 copying evalscope/web/dist/assets/KaTeX_Size4-Regular-Dl5lxZxV.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,126 copying evalscope/web/dist/assets/BenchmarksPage-XDO3_q9P.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,129 copying evalscope/web/dist/assets/KaTeX_Math-Italic-t53AETM-.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,132 copying evalscope/web/dist/assets/PerfTaskPage-CeXqhvKL.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,135 copying evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-ATXxdsX0.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,137 copying evalscope/web/dist/assets/KaTeX_Main-Italic-BMLOBm91.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,140 copying evalscope/web/dist/assets/KaTeX_Fraktur-Bold-BdnERNNW.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,143 copying evalscope/web/dist/assets/KaTeX_Size1-Regular-mCD8mA8B.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,145 copying evalscope/web/dist/assets/KaTeX_Math-Italic-DA0__PXp.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,148 copying evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-Dq_IR9rO.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,150 copying evalscope/web/dist/assets/database-OIFuo_-B.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,152 copying evalscope/web/dist/assets/KaTeX_Main-Bold-Jm3AIy58.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,156 copying evalscope/web/dist/assets/ReportViewerPage-DawiLh7a.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,158 copying evalscope/web/dist/assets/KaTeX_Size2-Regular-oD1tc_U0.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,160 copying evalscope/web/dist/assets/KaTeX_SansSerif-Bold-D1sUS0GD.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,163 copying evalscope/web/dist/assets/KaTeX_Main-Regular-ypZvNtVU.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,166 copying evalscope/web/dist/assets/KaTeX_SansSerif-Regular-DDBCnlJ7.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,169 copying evalscope/web/dist/assets/KaTeX_Fraktur-Regular-CB_wures.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,171 copying evalscope/web/dist/assets/KaTeX_Size3-Regular-CTq5MqoE.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,174 copying evalscope/web/dist/assets/folder-open-C4sVuGAY.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,176 copying evalscope/web/dist/assets/KaTeX_Main-Italic-3WenGoN9.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,179 copying evalscope/web/dist/assets/KaTeX_Size1-Regular-Dbsnue_I.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,182 copying evalscope/web/dist/assets/KaTeX_Size4-Regular-BF-4gkZK.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,185 copying evalscope/web/dist/assets/KaTeX_Main-BoldItalic-DxDJ3AOS.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,187 copying evalscope/web/dist/assets/ReportDetailPage-CXjObGG4.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,190 copying evalscope/web/dist/assets/KaTeX_AMS-Regular-BQhdFMY1.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,193 copying evalscope/web/dist/assets/KaTeX_Main-BoldItalic-SpSLRI95.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,196 copying evalscope/web/dist/assets/external-link-Dk_VDyv-.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,198 copying evalscope/web/dist/assets/KaTeX_Fraktur-Regular-CTYiF6lA.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,200 copying evalscope/web/dist/assets/EvalTaskPage-CfhiPFIu.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,203 copying evalscope/web/dist/assets/KaTeX_Typewriter-Regular-C0xS9mPB.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,206 copying evalscope/web/dist/assets/KaTeX_Main-Regular-Dr94JaBh.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,209 copying evalscope/web/dist/assets/KaTeX_Math-BoldItalic-B3XSjfu4.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,212 copying evalscope/web/dist/assets/usePolling-9BwH1egZ.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,214 copying evalscope/web/dist/assets/ChatView-CrwMded-.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,218 copying evalscope/web/dist/assets/useQueryParams-B0TIRhYM.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,220 copying evalscope/web/dist/assets/ScoreBadge-BKgzeJIw.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,222 copying evalscope/web/dist/assets/KaTeX_Typewriter-Regular-CO6r4hn1.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,225 copying evalscope/web/dist/assets/utils-Bt-jremC.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,228 copying evalscope/web/dist/assets/KaTeX_Main-BoldItalic-DzxPMmG6.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,231 copying evalscope/web/dist/assets/KaTeX_Fraktur-Bold-CL6g_b3V.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,234 copying evalscope/web/dist/assets/KaTeX_Script-Regular-C5JkGWo-.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,236 copying evalscope/web/dist/assets/KaTeX_Size1-Regular-C195tn64.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,239 copying evalscope/web/dist/assets/ReportsPage-CwZoFJCa.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,242 copying evalscope/web/dist/assets/Button-Bw30ctK3.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,244 copying evalscope/web/dist/assets/index-4AAMqHB4.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,251 copying evalscope/web/dist/assets/Skeleton-DD8J4WTY.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,253 copying evalscope/web/dist/assets/KaTeX_SansSerif-Bold-CFMepnvq.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,256 copying evalscope/web/dist/assets/KaTeX_Size3-Regular-DgpXs0kz.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,259 copying evalscope/web/dist/assets/index-DXtIjaXa.css -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,263 copying evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-wX97UBjC.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,266 copying evalscope/web/dist/assets/Card-vB7q1dY3.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,269 copying evalscope/web/dist/assets/chevron-up-BykVlddk.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,271 copying evalscope/web/dist/assets/KaTeX_Typewriter-Regular-D3Ib7_Hf.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,275 copying evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-CTRA-rTL.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,277 copying evalscope/web/dist/assets/KaTeX_SansSerif-Italic-DN2j7dab.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,280 copying evalscope/web/dist/assets/KaTeX_Main-Bold-Cx986IdX.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,283 copying evalscope/web/dist/assets/KaTeX_Math-BoldItalic-CZnvNsCZ.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,286 copying evalscope/web/dist/assets/KaTeX_SansSerif-Italic-C3H0VqGB.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,288 copying evalscope/web/dist/assets/KaTeX_Size2-Regular-B7gKUWhC.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,291 copying evalscope/web/dist/assets/Badge-BM4410Li.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,293 copying evalscope/web/dist/assets/KaTeX_AMS-Regular-DRggAlZN.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,297 copying evalscope/web/dist/assets/eval-CCjFVVv8.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,299 copying evalscope/web/dist/assets/square-CF_-z8TO.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,302 copying evalscope/web/dist/assets/KaTeX_Script-Regular-D3wIWfF6.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,304 copying evalscope/web/dist/assets/KaTeX_SansSerif-Regular-CS6fqUqJ.woff -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,307 copying evalscope/web/dist/assets/loader-circle-DukK1tEn.js -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,310 copying evalscope/web/dist/assets/KaTeX_Main-Bold-waoOVXN0.ttf -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,313 copying evalscope/web/dist/assets/KaTeX_Size2-Regular-Dy4dx90m.woff2 -> build/lib/evalscope/web/dist/assets 2026-05-28T10:08:31,316 copying evalscope/benchmarks/olympiad_bench/requirements.txt -> build/lib/evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:31,318 copying evalscope/benchmarks/general_arena/requirements.txt -> build/lib/evalscope/benchmarks/general_arena 2026-05-28T10:08:31,320 copying evalscope/benchmarks/multi_if/requirements.txt -> build/lib/evalscope/benchmarks/multi_if 2026-05-28T10:08:31,322 copying evalscope/benchmarks/wmt/requirements.txt -> build/lib/evalscope/benchmarks/wmt 2026-05-28T10:08:31,323 copying evalscope/benchmarks/refcoco/requirements.txt -> build/lib/evalscope/benchmarks/refcoco 2026-05-28T10:08:31,325 copying evalscope/benchmarks/trivia_qa/samples.jsonl -> build/lib/evalscope/benchmarks/trivia_qa 2026-05-28T10:08:31,328 copying evalscope/benchmarks/needle_haystack/requirements.txt -> build/lib/evalscope/benchmarks/needle_haystack 2026-05-28T10:08:31,330 copying evalscope/benchmarks/ifbench/requirements.txt -> build/lib/evalscope/benchmarks/ifbench 2026-05-28T10:08:31,332 copying evalscope/benchmarks/terminal_bench/requirements.txt -> build/lib/evalscope/benchmarks/terminal_bench 2026-05-28T10:08:31,333 copying evalscope/benchmarks/swe_bench/requirements.txt -> build/lib/evalscope/benchmarks/swe_bench 2026-05-28T10:08:31,335 copying evalscope/benchmarks/openai_mrcr/requirements.txt -> build/lib/evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:31,337 copying evalscope/benchmarks/bfcl/requirements.txt -> build/lib/evalscope/benchmarks/bfcl 2026-05-28T10:08:31,339 copying evalscope/benchmarks/ocr_bench/requirements.txt -> build/lib/evalscope/benchmarks/ocr_bench 2026-05-28T10:08:31,341 copying evalscope/benchmarks/omnidoc_bench/requirements.txt -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:31,343 copying evalscope/benchmarks/arena_hard/requirements.txt -> build/lib/evalscope/benchmarks/arena_hard 2026-05-28T10:08:31,345 copying evalscope/benchmarks/air_bench/requirements.txt -> build/lib/evalscope/benchmarks/air_bench 2026-05-28T10:08:31,347 copying evalscope/benchmarks/torgo/requirements.txt -> build/lib/evalscope/benchmarks/torgo 2026-05-28T10:08:31,349 creating build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,350 copying evalscope/benchmarks/_meta/a_okvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,353 copying evalscope/benchmarks/_meta/aa_lcr.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,357 copying evalscope/benchmarks/_meta/ai2d.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,360 copying evalscope/benchmarks/_meta/aime24.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,364 copying evalscope/benchmarks/_meta/aime25.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,367 copying evalscope/benchmarks/_meta/aime26.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,370 copying evalscope/benchmarks/_meta/air_bench_chat.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,373 copying evalscope/benchmarks/_meta/air_bench_foundation.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,376 copying evalscope/benchmarks/_meta/alpaca_eval.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,379 copying evalscope/benchmarks/_meta/amc.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,381 copying evalscope/benchmarks/_meta/anat_em.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,385 copying evalscope/benchmarks/_meta/arc.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,387 copying evalscope/benchmarks/_meta/arena_hard.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,390 copying evalscope/benchmarks/_meta/arxivrollbench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,393 copying evalscope/benchmarks/_meta/arxivrollbench_full.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,397 copying evalscope/benchmarks/_meta/bbh.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,400 copying evalscope/benchmarks/_meta/bc2gm.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,402 copying evalscope/benchmarks/_meta/bc4chemd.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,405 copying evalscope/benchmarks/_meta/bc5cdr.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,408 copying evalscope/benchmarks/_meta/bfcl_v3.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,411 copying evalscope/benchmarks/_meta/bfcl_v4.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,414 copying evalscope/benchmarks/_meta/biomix_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,417 copying evalscope/benchmarks/_meta/blink.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,420 copying evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,423 copying evalscope/benchmarks/_meta/cc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,426 copying evalscope/benchmarks/_meta/ceval.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,429 copying evalscope/benchmarks/_meta/chartqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,432 copying evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,435 copying evalscope/benchmarks/_meta/cl_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,438 copying evalscope/benchmarks/_meta/cmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,441 copying evalscope/benchmarks/_meta/cmmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,444 copying evalscope/benchmarks/_meta/cmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,448 copying evalscope/benchmarks/_meta/coin_flip.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,450 copying evalscope/benchmarks/_meta/commonsense_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,453 copying evalscope/benchmarks/_meta/competition_math.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,456 copying evalscope/benchmarks/_meta/conll2003.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,459 copying evalscope/benchmarks/_meta/conllpp.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,461 copying evalscope/benchmarks/_meta/copious.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,465 copying evalscope/benchmarks/_meta/cross_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,467 copying evalscope/benchmarks/_meta/data_collection.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,470 copying evalscope/benchmarks/_meta/docmath.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,472 copying evalscope/benchmarks/_meta/docvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,475 copying evalscope/benchmarks/_meta/drivel_binary.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,478 copying evalscope/benchmarks/_meta/drivel_multilabel.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,480 copying evalscope/benchmarks/_meta/drivel_selection.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,483 copying evalscope/benchmarks/_meta/drivel_writing.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,486 copying evalscope/benchmarks/_meta/drop.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,488 copying evalscope/benchmarks/_meta/eq_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,491 copying evalscope/benchmarks/_meta/evalmuse.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,494 copying evalscope/benchmarks/_meta/fin_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,497 copying evalscope/benchmarks/_meta/fleurs.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,499 copying evalscope/benchmarks/_meta/frames.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,502 copying evalscope/benchmarks/_meta/gaia.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,505 copying evalscope/benchmarks/_meta/gedit.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,508 copying evalscope/benchmarks/_meta/genai_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,510 copying evalscope/benchmarks/_meta/general_arena.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,513 copying evalscope/benchmarks/_meta/general_fc.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,516 copying evalscope/benchmarks/_meta/general_mcq.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,519 copying evalscope/benchmarks/_meta/general_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,521 copying evalscope/benchmarks/_meta/general_t2i.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,524 copying evalscope/benchmarks/_meta/general_vmcq.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,526 copying evalscope/benchmarks/_meta/general_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,529 copying evalscope/benchmarks/_meta/genia_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,532 copying evalscope/benchmarks/_meta/gpqa_diamond.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,534 copying evalscope/benchmarks/_meta/gsm8k.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,537 copying evalscope/benchmarks/_meta/gsm8k_v.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,540 copying evalscope/benchmarks/_meta/hallusion_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,543 copying evalscope/benchmarks/_meta/halueval.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,546 copying evalscope/benchmarks/_meta/harvey_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,549 copying evalscope/benchmarks/_meta/health_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,552 copying evalscope/benchmarks/_meta/hellaswag.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,554 copying evalscope/benchmarks/_meta/hle.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,557 copying evalscope/benchmarks/_meta/hmmt25.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,560 copying evalscope/benchmarks/_meta/hpdv2.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,562 copying evalscope/benchmarks/_meta/humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,565 copying evalscope/benchmarks/_meta/humaneval_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,567 copying evalscope/benchmarks/_meta/ifbench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,570 copying evalscope/benchmarks/_meta/ifeval.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,573 copying evalscope/benchmarks/_meta/infovqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,575 copying evalscope/benchmarks/_meta/iquiz.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,578 copying evalscope/benchmarks/_meta/jnlpba.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,580 copying evalscope/benchmarks/_meta/jnlpba_rare.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,583 copying evalscope/benchmarks/_meta/k2_verifier.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,586 copying evalscope/benchmarks/_meta/kimi_verifier.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,588 copying evalscope/benchmarks/_meta/librispeech.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,591 copying evalscope/benchmarks/_meta/live_code_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,594 copying evalscope/benchmarks/_meta/logi_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,596 copying evalscope/benchmarks/_meta/longbench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,599 copying evalscope/benchmarks/_meta/maritime_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,601 copying evalscope/benchmarks/_meta/math_500.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,604 copying evalscope/benchmarks/_meta/math_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,606 copying evalscope/benchmarks/_meta/math_verse.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,609 copying evalscope/benchmarks/_meta/math_vision.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,612 copying evalscope/benchmarks/_meta/math_vista.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,615 copying evalscope/benchmarks/_meta/mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,618 copying evalscope/benchmarks/_meta/mbpp_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,621 copying evalscope/benchmarks/_meta/med_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,623 copying evalscope/benchmarks/_meta/mgsm.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,626 copying evalscope/benchmarks/_meta/mia_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,629 copying evalscope/benchmarks/_meta/micro_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,632 copying evalscope/benchmarks/_meta/minerva_math.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,635 copying evalscope/benchmarks/_meta/minimax_verifier.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,637 copying evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,640 copying evalscope/benchmarks/_meta/mit_restaurant.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,643 copying evalscope/benchmarks/_meta/mm_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,646 copying evalscope/benchmarks/_meta/mm_star.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,648 copying evalscope/benchmarks/_meta/mmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,651 copying evalscope/benchmarks/_meta/mmlu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,654 copying evalscope/benchmarks/_meta/mmlu_redux.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,657 copying evalscope/benchmarks/_meta/mmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,660 copying evalscope/benchmarks/_meta/mmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,663 copying evalscope/benchmarks/_meta/mmmu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,666 copying evalscope/benchmarks/_meta/mri_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,669 copying evalscope/benchmarks/_meta/multi_if.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,672 copying evalscope/benchmarks/_meta/multi_nerd.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,674 copying evalscope/benchmarks/_meta/multiple_humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,677 copying evalscope/benchmarks/_meta/multiple_mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,681 copying evalscope/benchmarks/_meta/music_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,683 copying evalscope/benchmarks/_meta/musr.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,686 copying evalscope/benchmarks/_meta/mvbench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,688 copying evalscope/benchmarks/_meta/ncbi.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,691 copying evalscope/benchmarks/_meta/needle_haystack.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,694 copying evalscope/benchmarks/_meta/ocr_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,696 copying evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,700 copying evalscope/benchmarks/_meta/olympiad_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,702 copying evalscope/benchmarks/_meta/omni_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,705 copying evalscope/benchmarks/_meta/omni_doc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,709 copying evalscope/benchmarks/_meta/ontonotes5.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,712 copying evalscope/benchmarks/_meta/openai_mrcr.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,715 copying evalscope/benchmarks/_meta/piqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,717 copying evalscope/benchmarks/_meta/poly_math.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,720 copying evalscope/benchmarks/_meta/pope.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,723 copying evalscope/benchmarks/_meta/process_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,726 copying evalscope/benchmarks/_meta/pubmedqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,729 copying evalscope/benchmarks/_meta/qasc.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,732 copying evalscope/benchmarks/_meta/race.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,734 copying evalscope/benchmarks/_meta/real_world_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,737 copying evalscope/benchmarks/_meta/refcoco.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,740 copying evalscope/benchmarks/_meta/scicode.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,744 copying evalscope/benchmarks/_meta/science_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,747 copying evalscope/benchmarks/_meta/sciq.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,749 copying evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,751 copying evalscope/benchmarks/_meta/simple_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,754 copying evalscope/benchmarks/_meta/simple_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,757 copying evalscope/benchmarks/_meta/siqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,759 copying evalscope/benchmarks/_meta/super_gpqa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,762 copying evalscope/benchmarks/_meta/swe_bench_lite.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,765 copying evalscope/benchmarks/_meta/swe_bench_lite_agentic.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,768 copying evalscope/benchmarks/_meta/swe_bench_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,771 copying evalscope/benchmarks/_meta/swe_bench_verified.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,773 copying evalscope/benchmarks/_meta/swe_bench_verified_agentic.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,777 copying evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,779 copying evalscope/benchmarks/_meta/swe_bench_verified_mini_agentic.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,782 copying evalscope/benchmarks/_meta/tau2_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,785 copying evalscope/benchmarks/_meta/tau3_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,788 copying evalscope/benchmarks/_meta/tau_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,790 copying evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,793 copying evalscope/benchmarks/_meta/terminal_bench_v2_1.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,795 copying evalscope/benchmarks/_meta/tifa160.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,797 copying evalscope/benchmarks/_meta/tir_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,800 copying evalscope/benchmarks/_meta/tool_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,803 copying evalscope/benchmarks/_meta/torgo.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,806 copying evalscope/benchmarks/_meta/trivia_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,808 copying evalscope/benchmarks/_meta/truthful_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,811 copying evalscope/benchmarks/_meta/tweebank_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,814 copying evalscope/benchmarks/_meta/tweet_ner_7.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,817 copying evalscope/benchmarks/_meta/videomme_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,820 copying evalscope/benchmarks/_meta/visulogic.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,822 copying evalscope/benchmarks/_meta/vstar_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,826 copying evalscope/benchmarks/_meta/winogrande.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,828 copying evalscope/benchmarks/_meta/wmt24pp.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,831 copying evalscope/benchmarks/_meta/wnut2017.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,834 copying evalscope/benchmarks/_meta/zebralogicbench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,837 copying evalscope/benchmarks/_meta/zerobench.json -> build/lib/evalscope/benchmarks/_meta 2026-05-28T10:08:31,840 copying evalscope/benchmarks/ifeval/requirements.txt -> build/lib/evalscope/benchmarks/ifeval 2026-05-28T10:08:31,841 copying evalscope/benchmarks/scicode/docker/Dockerfile -> build/lib/evalscope/benchmarks/scicode/docker 2026-05-28T10:08:31,843 copying evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/lib/evalscope/benchmarks/scicode/docker 2026-05-28T10:08:31,845 creating build/lib/evalscope/benchmarks/humanevalplus/docker 2026-05-28T10:08:31,846 copying evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/lib/evalscope/benchmarks/humanevalplus/docker 2026-05-28T10:08:31,848 creating build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,849 copying evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,851 copying evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,854 copying evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,856 copying evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,858 copying evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,860 copying evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,862 copying evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,864 copying evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,867 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,869 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,871 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,873 copying evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,875 copying evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,877 copying evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,879 copying evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,881 copying evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,883 copying evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,885 copying evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,888 copying evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,890 copying evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,892 copying evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,894 copying evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,897 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,899 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,901 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,903 copying evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,905 copying evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:31,907 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:31,910 copying evalscope/benchmarks/tau_bench/tau3_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:31,912 copying evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:31,914 copying evalscope/benchmarks/tau_bench/tau_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:31,916 creating build/lib/evalscope/report/template 2026-05-28T10:08:31,917 copying evalscope/report/template/perf_report.html.j2 -> build/lib/evalscope/report/template 2026-05-28T10:08:31,920 copying evalscope/report/template/report.html.j2 -> build/lib/evalscope/report/template 2026-05-28T10:08:31,922 creating build/lib/evalscope/report/template/js 2026-05-28T10:08:31,923 copying evalscope/report/template/js/eval_extra.js -> build/lib/evalscope/report/template/js 2026-05-28T10:08:31,926 copying evalscope/report/template/js/i18n_eval.js -> build/lib/evalscope/report/template/js 2026-05-28T10:08:31,928 copying evalscope/report/template/js/i18n_perf.js -> build/lib/evalscope/report/template/js 2026-05-28T10:08:31,931 copying evalscope/report/template/js/perf_extra.js -> build/lib/evalscope/report/template/js 2026-05-28T10:08:31,932 copying evalscope/report/template/js/shared.js -> build/lib/evalscope/report/template/js 2026-05-28T10:08:31,935 creating build/lib/evalscope/report/template/css 2026-05-28T10:08:31,936 copying evalscope/report/template/css/base.css -> build/lib/evalscope/report/template/css 2026-05-28T10:08:31,939 copying evalscope/report/template/css/perf_extra.css -> build/lib/evalscope/report/template/css 2026-05-28T10:08:31,941 creating build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,942 copying evalscope/report/template/partials/brand_logo.html -> build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,944 copying evalscope/report/template/partials/footer.html -> build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,946 copying evalscope/report/template/partials/header_eval.html -> build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,949 copying evalscope/report/template/partials/header_perf.html -> build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,951 copying evalscope/report/template/partials/toc_eval.html -> build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,953 copying evalscope/report/template/partials/toc_perf.html -> build/lib/evalscope/report/template/partials 2026-05-28T10:08:31,955 creating build/lib/evalscope/web/public 2026-05-28T10:08:31,956 copying evalscope/web/public/favicon.svg -> build/lib/evalscope/web/public 2026-05-28T10:08:32,009 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-05-28T10:08:32,012 copying evalscope/third_party/toolbench_static/README.md -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:32,015 copying evalscope/third_party/toolbench_static/config_default.json -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:32,018 copying evalscope/third_party/toolbench_static/config_default.yaml -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:32,021 copying evalscope/third_party/toolbench_static/requirements.txt -> build/lib/evalscope/third_party/toolbench_static 2026-05-28T10:08:32,024 copying evalscope/third_party/longbench_write/README.md -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:32,027 copying evalscope/third_party/longbench_write/default_task.json -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:32,030 copying evalscope/third_party/longbench_write/default_task.yaml -> build/lib/evalscope/third_party/longbench_write 2026-05-28T10:08:32,033 copying evalscope/third_party/longbench_write/resources/judge.txt -> build/lib/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:32,036 copying evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:32,041 copying evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:32,044 copying evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:32,047 creating build/lib/evalscope/third_party/thinkbench/resources 2026-05-28T10:08:32,049 copying evalscope/third_party/thinkbench/resources/critique_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-05-28T10:08:32,052 copying evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-05-28T10:08:32,054 creating build/lib/evalscope/agent/external/runners/_assets 2026-05-28T10:08:32,056 copying evalscope/agent/external/runners/_assets/build_claude_code_image.sh -> build/lib/evalscope/agent/external/runners/_assets 2026-05-28T10:08:32,059 copying evalscope/agent/external/runners/_assets/claude_code.Dockerfile -> build/lib/evalscope/agent/external/runners/_assets 2026-05-28T10:08:32,062 copying evalscope/metrics/text_normalizer/english.json -> build/lib/evalscope/metrics/text_normalizer 2026-05-28T10:08:32,066 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-05-28T10:08:32,068 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-05-28T10:08:32,070 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:32,071 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:32,073 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:32,075 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:32,077 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,078 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,081 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,083 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,085 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,087 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,089 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,091 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,094 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,096 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,097 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,099 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,101 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,104 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,106 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,108 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,110 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,113 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,115 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,117 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:32,238 installing to build/bdist.linux-armv7l/wheel 2026-05-28T10:08:32,239 running install 2026-05-28T10:08:32,262 running install_lib 2026-05-28T10:08:32,269 creating build/bdist.linux-armv7l/wheel 2026-05-28T10:08:32,271 creating build/bdist.linux-armv7l/wheel/evalscope 2026-05-28T10:08:32,275 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks 2026-05-28T10:08:32,277 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/process_bench 2026-05-28T10:08:32,278 copying build/lib/evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-05-28T10:08:32,281 copying build/lib/evalscope/benchmarks/process_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-05-28T10:08:32,282 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_qa 2026-05-28T10:08:32,283 copying build/lib/evalscope/benchmarks/math_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-05-28T10:08:32,285 copying build/lib/evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-05-28T10:08:32,287 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:32,288 copying build/lib/evalscope/benchmarks/olympiad_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:32,291 copying build/lib/evalscope/benchmarks/olympiad_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:32,292 copying build/lib/evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:32,295 copying build/lib/evalscope/benchmarks/olympiad_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-05-28T10:08:32,296 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/commonsense_qa 2026-05-28T10:08:32,297 copying build/lib/evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-05-28T10:08:32,300 copying build/lib/evalscope/benchmarks/commonsense_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-05-28T10:08:32,302 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_pro 2026-05-28T10:08:32,303 copying build/lib/evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-05-28T10:08:32,305 copying build/lib/evalscope/benchmarks/mmlu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-05-28T10:08:32,307 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hellaswag 2026-05-28T10:08:32,308 copying build/lib/evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-05-28T10:08:32,310 copying build/lib/evalscope/benchmarks/hellaswag/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-05-28T10:08:32,312 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/frames 2026-05-28T10:08:32,314 copying build/lib/evalscope/benchmarks/frames/frames_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-05-28T10:08:32,316 copying build/lib/evalscope/benchmarks/frames/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-05-28T10:08:32,318 copying build/lib/evalscope/benchmarks/frames/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-05-28T10:08:32,320 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_arena 2026-05-28T10:08:32,321 copying build/lib/evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-05-28T10:08:32,324 copying build/lib/evalscope/benchmarks/general_arena/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-05-28T10:08:32,327 copying build/lib/evalscope/benchmarks/general_arena/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-05-28T10:08:32,328 copying build/lib/evalscope/benchmarks/general_arena/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-05-28T10:08:32,330 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/micro_vqa 2026-05-28T10:08:32,331 copying build/lib/evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-05-28T10:08:32,333 copying build/lib/evalscope/benchmarks/micro_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-05-28T10:08:32,335 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vista 2026-05-28T10:08:32,336 copying build/lib/evalscope/benchmarks/math_vista/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-05-28T10:08:32,338 copying build/lib/evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-05-28T10:08:32,340 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/competition_math 2026-05-28T10:08:32,341 copying build/lib/evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-05-28T10:08:32,343 copying build/lib/evalscope/benchmarks/competition_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-05-28T10:08:32,345 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ai2d 2026-05-28T10:08:32,346 copying build/lib/evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-05-28T10:08:32,348 copying build/lib/evalscope/benchmarks/ai2d/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-05-28T10:08:32,350 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmlu 2026-05-28T10:08:32,351 copying build/lib/evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-05-28T10:08:32,353 copying build/lib/evalscope/benchmarks/cmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-05-28T10:08:32,355 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_star 2026-05-28T10:08:32,357 copying build/lib/evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-05-28T10:08:32,359 copying build/lib/evalscope/benchmarks/mm_star/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-05-28T10:08:32,360 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:32,361 copying build/lib/evalscope/benchmarks/swe_bench_pro/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:32,364 copying build/lib/evalscope/benchmarks/swe_bench_pro/swe_bench_pro_agentic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:32,366 copying build/lib/evalscope/benchmarks/swe_bench_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench_pro 2026-05-28T10:08:32,368 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mia_bench 2026-05-28T10:08:32,370 copying build/lib/evalscope/benchmarks/mia_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-05-28T10:08:32,372 copying build/lib/evalscope/benchmarks/mia_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-05-28T10:08:32,374 copying build/lib/evalscope/benchmarks/mia_bench/mia_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-05-28T10:08:32,376 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/siqa 2026-05-28T10:08:32,377 copying build/lib/evalscope/benchmarks/siqa/siqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-05-28T10:08:32,379 copying build/lib/evalscope/benchmarks/siqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-05-28T10:08:32,381 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pope 2026-05-28T10:08:32,383 copying build/lib/evalscope/benchmarks/pope/pope_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-05-28T10:08:32,385 copying build/lib/evalscope/benchmarks/pope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-05-28T10:08:32,387 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_mcq 2026-05-28T10:08:32,388 copying build/lib/evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-05-28T10:08:32,390 copying build/lib/evalscope/benchmarks/general_mcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-05-28T10:08:32,392 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tir_bench 2026-05-28T10:08:32,393 copying build/lib/evalscope/benchmarks/tir_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tir_bench 2026-05-28T10:08:32,395 copying build/lib/evalscope/benchmarks/tir_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tir_bench 2026-05-28T10:08:32,397 copying build/lib/evalscope/benchmarks/tir_bench/tir_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tir_bench 2026-05-28T10:08:32,399 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit 2026-05-28T10:08:32,400 copying build/lib/evalscope/benchmarks/image_edit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit 2026-05-28T10:08:32,402 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:32,403 copying build/lib/evalscope/benchmarks/image_edit/gedit/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:32,406 copying build/lib/evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:32,408 copying build/lib/evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:32,410 copying build/lib/evalscope/benchmarks/image_edit/gedit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-05-28T10:08:32,413 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multi_if 2026-05-28T10:08:32,414 copying build/lib/evalscope/benchmarks/multi_if/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-05-28T10:08:32,417 copying build/lib/evalscope/benchmarks/multi_if/ifeval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-05-28T10:08:32,420 copying build/lib/evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-05-28T10:08:32,422 copying build/lib/evalscope/benchmarks/multi_if/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-05-28T10:08:32,424 copying build/lib/evalscope/benchmarks/multi_if/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-05-28T10:08:32,427 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu 2026-05-28T10:08:32,428 copying build/lib/evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-05-28T10:08:32,430 copying build/lib/evalscope/benchmarks/mmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-05-28T10:08:32,432 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arxivrollbench 2026-05-28T10:08:32,433 copying build/lib/evalscope/benchmarks/arxivrollbench/arxivrollbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arxivrollbench 2026-05-28T10:08:32,436 copying build/lib/evalscope/benchmarks/arxivrollbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arxivrollbench 2026-05-28T10:08:32,438 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/winogrande 2026-05-28T10:08:32,439 copying build/lib/evalscope/benchmarks/winogrande/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-05-28T10:08:32,441 copying build/lib/evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-05-28T10:08:32,443 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:32,444 copying build/lib/evalscope/benchmarks/zebralogicbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:32,446 copying build/lib/evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:32,448 copying build/lib/evalscope/benchmarks/zebralogicbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-05-28T10:08:32,450 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/wmt 2026-05-28T10:08:32,451 copying build/lib/evalscope/benchmarks/wmt/wmt24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-05-28T10:08:32,454 copying build/lib/evalscope/benchmarks/wmt/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-05-28T10:08:32,455 copying build/lib/evalscope/benchmarks/wmt/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-05-28T10:08:32,457 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vqa 2026-05-28T10:08:32,458 copying build/lib/evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-05-28T10:08:32,461 copying build/lib/evalscope/benchmarks/general_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-05-28T10:08:32,463 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/refcoco 2026-05-28T10:08:32,464 copying build/lib/evalscope/benchmarks/refcoco/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-05-28T10:08:32,466 copying build/lib/evalscope/benchmarks/refcoco/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-05-28T10:08:32,467 copying build/lib/evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-05-28T10:08:32,470 copying build/lib/evalscope/benchmarks/refcoco/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-05-28T10:08:32,471 copying build/lib/evalscope/benchmarks/refcoco/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-05-28T10:08:32,474 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/text2image 2026-05-28T10:08:32,475 copying build/lib/evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-05-28T10:08:32,477 copying build/lib/evalscope/benchmarks/text2image/tifa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-05-28T10:08:32,479 copying build/lib/evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-05-28T10:08:32,481 copying build/lib/evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-05-28T10:08:32,483 copying build/lib/evalscope/benchmarks/text2image/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-05-28T10:08:32,484 copying build/lib/evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-05-28T10:08:32,487 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/maritime_bench 2026-05-28T10:08:32,488 copying build/lib/evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-05-28T10:08:32,490 copying build/lib/evalscope/benchmarks/maritime_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-05-28T10:08:32,492 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmlu 2026-05-28T10:08:32,493 copying build/lib/evalscope/benchmarks/mmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-05-28T10:08:32,495 copying build/lib/evalscope/benchmarks/mmmlu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-05-28T10:08:32,497 copying build/lib/evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-05-28T10:08:32,499 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/science_qa 2026-05-28T10:08:32,500 copying build/lib/evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-05-28T10:08:32,502 copying build/lib/evalscope/benchmarks/science_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-05-28T10:08:32,504 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:32,505 copying build/lib/evalscope/benchmarks/kimi_verifier/kimi_verifier_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:32,507 copying build/lib/evalscope/benchmarks/kimi_verifier/param_spec.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:32,509 copying build/lib/evalscope/benchmarks/kimi_verifier/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/kimi_verifier 2026-05-28T10:08:32,511 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_verse 2026-05-28T10:08:32,512 copying build/lib/evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-05-28T10:08:32,514 copying build/lib/evalscope/benchmarks/math_verse/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-05-28T10:08:32,517 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gaia 2026-05-28T10:08:32,518 copying build/lib/evalscope/benchmarks/gaia/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gaia 2026-05-28T10:08:32,520 copying build/lib/evalscope/benchmarks/gaia/gaia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gaia 2026-05-28T10:08:32,522 copying build/lib/evalscope/benchmarks/gaia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gaia 2026-05-28T10:08:32,524 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmu 2026-05-28T10:08:32,525 copying build/lib/evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-05-28T10:08:32,527 copying build/lib/evalscope/benchmarks/cmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-05-28T10:08:32,529 copying build/lib/evalscope/benchmarks/cmmu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-05-28T10:08:32,531 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hle 2026-05-28T10:08:32,532 copying build/lib/evalscope/benchmarks/hle/hle_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-05-28T10:08:32,535 copying build/lib/evalscope/benchmarks/hle/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-05-28T10:08:32,537 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/trivia_qa 2026-05-28T10:08:32,538 copying build/lib/evalscope/benchmarks/trivia_qa/samples.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-05-28T10:08:32,540 copying build/lib/evalscope/benchmarks/trivia_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-05-28T10:08:32,542 copying build/lib/evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-05-28T10:08:32,545 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/longbench_v2 2026-05-28T10:08:32,546 copying build/lib/evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-05-28T10:08:32,548 copying build/lib/evalscope/benchmarks/longbench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-05-28T10:08:32,550 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mri_mcqa 2026-05-28T10:08:32,551 copying build/lib/evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-05-28T10:08:32,553 copying build/lib/evalscope/benchmarks/mri_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-05-28T10:08:32,555 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/qasc 2026-05-28T10:08:32,556 copying build/lib/evalscope/benchmarks/qasc/qasc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-05-28T10:08:32,558 copying build/lib/evalscope/benchmarks/qasc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-05-28T10:08:32,560 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbpp 2026-05-28T10:08:32,561 copying build/lib/evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-05-28T10:08:32,563 copying build/lib/evalscope/benchmarks/mbpp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-05-28T10:08:32,565 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/infovqa 2026-05-28T10:08:32,567 copying build/lib/evalscope/benchmarks/infovqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-05-28T10:08:32,568 copying build/lib/evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-05-28T10:08:32,571 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_bench 2026-05-28T10:08:32,572 copying build/lib/evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-05-28T10:08:32,574 copying build/lib/evalscope/benchmarks/mm_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-05-28T10:08:32,576 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:32,577 copying build/lib/evalscope/benchmarks/minimax_verifier/_validators.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:32,579 copying build/lib/evalscope/benchmarks/minimax_verifier/minimax_verifier_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:32,582 copying build/lib/evalscope/benchmarks/minimax_verifier/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minimax_verifier 2026-05-28T10:08:32,584 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aa_lcr 2026-05-28T10:08:32,585 copying build/lib/evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-05-28T10:08:32,588 copying build/lib/evalscope/benchmarks/aa_lcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-05-28T10:08:32,590 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arc 2026-05-28T10:08:32,591 copying build/lib/evalscope/benchmarks/arc/arc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-05-28T10:08:32,593 copying build/lib/evalscope/benchmarks/arc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-05-28T10:08:32,595 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu 2026-05-28T10:08:32,597 copying build/lib/evalscope/benchmarks/mmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-05-28T10:08:32,599 copying build/lib/evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-05-28T10:08:32,601 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/piqa 2026-05-28T10:08:32,602 copying build/lib/evalscope/benchmarks/piqa/piqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-05-28T10:08:32,604 copying build/lib/evalscope/benchmarks/piqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-05-28T10:08:32,606 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/needle_haystack 2026-05-28T10:08:32,607 copying build/lib/evalscope/benchmarks/needle_haystack/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-05-28T10:08:32,609 copying build/lib/evalscope/benchmarks/needle_haystack/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-05-28T10:08:32,611 copying build/lib/evalscope/benchmarks/needle_haystack/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-05-28T10:08:32,612 copying build/lib/evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-05-28T10:08:32,615 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode 2026-05-28T10:08:32,617 copying build/lib/evalscope/benchmarks/scicode/scicode_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-05-28T10:08:32,619 copying build/lib/evalscope/benchmarks/scicode/util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-05-28T10:08:32,622 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode/docker 2026-05-28T10:08:32,623 copying build/lib/evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-05-28T10:08:32,625 copying build/lib/evalscope/benchmarks/scicode/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-05-28T10:08:32,627 copying build/lib/evalscope/benchmarks/scicode/docker/process_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-05-28T10:08:32,629 copying build/lib/evalscope/benchmarks/scicode/docker/test_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-05-28T10:08:32,630 copying build/lib/evalscope/benchmarks/scicode/prompt_templates.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-05-28T10:08:32,632 copying build/lib/evalscope/benchmarks/scicode/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-05-28T10:08:32,634 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/musr 2026-05-28T10:08:32,635 copying build/lib/evalscope/benchmarks/musr/musr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-05-28T10:08:32,638 copying build/lib/evalscope/benchmarks/musr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-05-28T10:08:32,640 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus 2026-05-28T10:08:32,641 copying build/lib/evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-05-28T10:08:32,644 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus/docker 2026-05-28T10:08:32,645 copying build/lib/evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus/docker 2026-05-28T10:08:32,647 copying build/lib/evalscope/benchmarks/humanevalplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-05-28T10:08:32,649 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/logi_qa 2026-05-28T10:08:32,650 copying build/lib/evalscope/benchmarks/logi_qa/__int__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-05-28T10:08:32,652 copying build/lib/evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-05-28T10:08:32,654 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k_v 2026-05-28T10:08:32,655 copying build/lib/evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-05-28T10:08:32,657 copying build/lib/evalscope/benchmarks/gsm8k_v/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-05-28T10:08:32,659 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chartqa 2026-05-28T10:08:32,660 copying build/lib/evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-05-28T10:08:32,662 copying build/lib/evalscope/benchmarks/chartqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-05-28T10:08:32,664 copying build/lib/evalscope/benchmarks/chartqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-05-28T10:08:32,666 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/amc 2026-05-28T10:08:32,667 copying build/lib/evalscope/benchmarks/amc/amc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-05-28T10:08:32,669 copying build/lib/evalscope/benchmarks/amc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-05-28T10:08:32,671 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,672 copying build/lib/evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,674 copying build/lib/evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,676 copying build/lib/evalscope/benchmarks/live_code_bench/prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,678 copying build/lib/evalscope/benchmarks/live_code_bench/extract_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,680 copying build/lib/evalscope/benchmarks/live_code_bench/load_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,682 copying build/lib/evalscope/benchmarks/live_code_bench/testing_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,684 copying build/lib/evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,687 copying build/lib/evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,689 copying build/lib/evalscope/benchmarks/live_code_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-05-28T10:08:32,691 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifbench 2026-05-28T10:08:32,692 copying build/lib/evalscope/benchmarks/ifbench/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,695 copying build/lib/evalscope/benchmarks/ifbench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,697 copying build/lib/evalscope/benchmarks/ifbench/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,701 copying build/lib/evalscope/benchmarks/ifbench/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,703 copying build/lib/evalscope/benchmarks/ifbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,705 copying build/lib/evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,707 copying build/lib/evalscope/benchmarks/ifbench/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-05-28T10:08:32,710 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chinese_simple_qa 2026-05-28T10:08:32,711 copying build/lib/evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-05-28T10:08:32,713 copying build/lib/evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-05-28T10:08:32,715 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/terminal_bench 2026-05-28T10:08:32,716 copying build/lib/evalscope/benchmarks/terminal_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-05-28T10:08:32,718 copying build/lib/evalscope/benchmarks/terminal_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-05-28T10:08:32,720 copying build/lib/evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-05-28T10:08:32,723 copying build/lib/evalscope/benchmarks/terminal_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-05-28T10:08:32,725 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/super_gpqa 2026-05-28T10:08:32,726 copying build/lib/evalscope/benchmarks/super_gpqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-05-28T10:08:32,728 copying build/lib/evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-05-28T10:08:32,730 copying build/lib/evalscope/benchmarks/super_gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-05-28T10:08:32,732 copying build/lib/evalscope/benchmarks/super_gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-05-28T10:08:32,734 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/fleurs 2026-05-28T10:08:32,735 copying build/lib/evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-05-28T10:08:32,738 copying build/lib/evalscope/benchmarks/fleurs/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-05-28T10:08:32,740 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/k2_verifier 2026-05-28T10:08:32,741 copying build/lib/evalscope/benchmarks/k2_verifier/k2_verifier_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/k2_verifier 2026-05-28T10:08:32,743 copying build/lib/evalscope/benchmarks/k2_verifier/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/k2_verifier 2026-05-28T10:08:32,745 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mvbench 2026-05-28T10:08:32,746 copying build/lib/evalscope/benchmarks/mvbench/mvbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mvbench 2026-05-28T10:08:32,749 copying build/lib/evalscope/benchmarks/mvbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mvbench 2026-05-28T10:08:32,751 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,752 copying build/lib/evalscope/benchmarks/swe_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,754 copying build/lib/evalscope/benchmarks/swe_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,756 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,759 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_agentic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,761 copying build/lib/evalscope/benchmarks/swe_bench/build_images.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,764 copying build/lib/evalscope/benchmarks/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-05-28T10:08:32,766 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drop 2026-05-28T10:08:32,767 copying build/lib/evalscope/benchmarks/drop/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-05-28T10:08:32,769 copying build/lib/evalscope/benchmarks/drop/drop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-05-28T10:08:32,771 copying build/lib/evalscope/benchmarks/drop/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-05-28T10:08:32,773 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ceval 2026-05-28T10:08:32,774 copying build/lib/evalscope/benchmarks/ceval/ceval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-05-28T10:08:32,777 copying build/lib/evalscope/benchmarks/ceval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-05-28T10:08:32,779 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh 2026-05-28T10:08:32,781 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,782 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,784 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,785 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,787 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,789 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,791 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,793 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,795 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,797 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,799 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,801 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,803 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,805 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,807 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,809 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,811 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,812 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,814 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,816 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,818 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,820 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,821 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,823 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,825 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,827 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,829 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,831 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-05-28T10:08:32,832 copying build/lib/evalscope/benchmarks/bbh/bbh_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-05-28T10:08:32,835 copying build/lib/evalscope/benchmarks/bbh/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-05-28T10:08:32,837 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_500 2026-05-28T10:08:32,838 copying build/lib/evalscope/benchmarks/math_500/math_500_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-05-28T10:08:32,840 copying build/lib/evalscope/benchmarks/math_500/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-05-28T10:08:32,842 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:32,843 copying build/lib/evalscope/benchmarks/openai_mrcr/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:32,845 copying build/lib/evalscope/benchmarks/openai_mrcr/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:32,847 copying build/lib/evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:32,849 copying build/lib/evalscope/benchmarks/openai_mrcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-05-28T10:08:32,851 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omni_bench 2026-05-28T10:08:32,852 copying build/lib/evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-05-28T10:08:32,854 copying build/lib/evalscope/benchmarks/omni_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-05-28T10:08:32,857 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/data_collection 2026-05-28T10:08:32,858 copying build/lib/evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-05-28T10:08:32,860 copying build/lib/evalscope/benchmarks/data_collection/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-05-28T10:08:32,862 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k 2026-05-28T10:08:32,863 copying build/lib/evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-05-28T10:08:32,865 copying build/lib/evalscope/benchmarks/gsm8k/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-05-28T10:08:32,867 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmmu 2026-05-28T10:08:32,868 copying build/lib/evalscope/benchmarks/cmmmu/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-05-28T10:08:32,870 copying build/lib/evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-05-28T10:08:32,873 copying build/lib/evalscope/benchmarks/cmmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-05-28T10:08:32,874 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hmmt25 2026-05-28T10:08:32,875 copying build/lib/evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hmmt25 2026-05-28T10:08:32,878 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbppplus 2026-05-28T10:08:32,879 copying build/lib/evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-05-28T10:08:32,882 copying build/lib/evalscope/benchmarks/mbppplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-05-28T10:08:32,884 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/blink 2026-05-28T10:08:32,885 copying build/lib/evalscope/benchmarks/blink/blink_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-05-28T10:08:32,887 copying build/lib/evalscope/benchmarks/blink/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-05-28T10:08:32,889 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/sciq 2026-05-28T10:08:32,890 copying build/lib/evalscope/benchmarks/sciq/sciq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-05-28T10:08:32,892 copying build/lib/evalscope/benchmarks/sciq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-05-28T10:08:32,894 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/eq_bench 2026-05-28T10:08:32,896 copying build/lib/evalscope/benchmarks/eq_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-05-28T10:08:32,897 copying build/lib/evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-05-28T10:08:32,900 copying build/lib/evalscope/benchmarks/eq_bench/answer_validation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-05-28T10:08:32,903 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/iquiz 2026-05-28T10:08:32,904 copying build/lib/evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-05-28T10:08:32,906 copying build/lib/evalscope/benchmarks/iquiz/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-05-28T10:08:32,908 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl 2026-05-28T10:08:32,909 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:32,911 copying build/lib/evalscope/benchmarks/bfcl/v4/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:32,913 copying build/lib/evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:32,916 copying build/lib/evalscope/benchmarks/bfcl/v4/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-05-28T10:08:32,917 copying build/lib/evalscope/benchmarks/bfcl/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-05-28T10:08:32,919 copying build/lib/evalscope/benchmarks/bfcl/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-05-28T10:08:32,921 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:32,922 copying build/lib/evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:32,924 copying build/lib/evalscope/benchmarks/bfcl/v3/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:32,926 copying build/lib/evalscope/benchmarks/bfcl/v3/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:32,928 copying build/lib/evalscope/benchmarks/bfcl/v3/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-05-28T10:08:32,930 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multipl_e 2026-05-28T10:08:32,931 copying build/lib/evalscope/benchmarks/multipl_e/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-05-28T10:08:32,934 copying build/lib/evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-05-28T10:08:32,936 copying build/lib/evalscope/benchmarks/multipl_e/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-05-28T10:08:32,937 copying build/lib/evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-05-28T10:08:32,940 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench 2026-05-28T10:08:32,941 copying build/lib/evalscope/benchmarks/ocr_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-05-28T10:08:32,944 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench 2026-05-28T10:08:32,945 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-05-28T10:08:32,946 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-05-28T10:08:32,949 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,951 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:32,952 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:32,954 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:32,956 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:32,959 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-05-28T10:08:32,961 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,963 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,964 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,967 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,969 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,971 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,974 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,976 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,978 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-05-28T10:08:32,980 copying build/lib/evalscope/benchmarks/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-05-28T10:08:32,982 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/halu_eval 2026-05-28T10:08:32,983 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-05-28T10:08:32,985 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-05-28T10:08:32,987 copying build/lib/evalscope/benchmarks/halu_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-05-28T10:08:32,989 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu_pro 2026-05-28T10:08:32,990 copying build/lib/evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-05-28T10:08:33,000 copying build/lib/evalscope/benchmarks/mmmu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-05-28T10:08:33,002 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_qa 2026-05-28T10:08:33,003 copying build/lib/evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-05-28T10:08:33,006 copying build/lib/evalscope/benchmarks/simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-05-28T10:08:33,008 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/med_mcqa 2026-05-28T10:08:33,009 copying build/lib/evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-05-28T10:08:33,011 copying build/lib/evalscope/benchmarks/med_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-05-28T10:08:33,013 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner 2026-05-28T10:08:33,014 copying build/lib/evalscope/benchmarks/ner/jnlpba_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,016 copying build/lib/evalscope/benchmarks/ner/conll2003_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,018 copying build/lib/evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,020 copying build/lib/evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,022 copying build/lib/evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,024 copying build/lib/evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,026 copying build/lib/evalscope/benchmarks/ner/conllpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,028 copying build/lib/evalscope/benchmarks/ner/bc2gm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,030 copying build/lib/evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,031 copying build/lib/evalscope/benchmarks/ner/genia_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,034 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,035 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,037 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,039 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,041 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,042 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,044 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-05-28T10:08:33,045 copying build/lib/evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,047 copying build/lib/evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,049 copying build/lib/evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,051 copying build/lib/evalscope/benchmarks/ner/cross_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,053 copying build/lib/evalscope/benchmarks/ner/copious_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,055 copying build/lib/evalscope/benchmarks/ner/anat_em_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,057 copying build/lib/evalscope/benchmarks/ner/fin_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,059 copying build/lib/evalscope/benchmarks/ner/wnut2017_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,061 copying build/lib/evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,063 copying build/lib/evalscope/benchmarks/ner/ncbi_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,065 copying build/lib/evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,067 copying build/lib/evalscope/benchmarks/ner/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,068 copying build/lib/evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-05-28T10:08:33,071 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docvqa 2026-05-28T10:08:33,072 copying build/lib/evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-05-28T10:08:33,074 copying build/lib/evalscope/benchmarks/docvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-05-28T10:08:33,076 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,077 copying build/lib/evalscope/benchmarks/omnidoc_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,081 copying build/lib/evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,083 copying build/lib/evalscope/benchmarks/omnidoc_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,085 copying build/lib/evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,087 copying build/lib/evalscope/benchmarks/omnidoc_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,088 copying build/lib/evalscope/benchmarks/omnidoc_bench/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-05-28T10:08:33,091 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humaneval 2026-05-28T10:08:33,092 copying build/lib/evalscope/benchmarks/humaneval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-05-28T10:08:33,095 copying build/lib/evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-05-28T10:08:33,097 copying build/lib/evalscope/benchmarks/humaneval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-05-28T10:08:33,100 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/real_world_qa 2026-05-28T10:08:33,101 copying build/lib/evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-05-28T10:08:33,103 copying build/lib/evalscope/benchmarks/real_world_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-05-28T10:08:33,105 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arena_hard 2026-05-28T10:08:33,107 copying build/lib/evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-05-28T10:08:33,109 copying build/lib/evalscope/benchmarks/arena_hard/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-05-28T10:08:33,111 copying build/lib/evalscope/benchmarks/arena_hard/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-05-28T10:08:33,113 copying build/lib/evalscope/benchmarks/arena_hard/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-05-28T10:08:33,115 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tool_bench 2026-05-28T10:08:33,116 copying build/lib/evalscope/benchmarks/tool_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-05-28T10:08:33,118 copying build/lib/evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-05-28T10:08:33,120 copying build/lib/evalscope/benchmarks/tool_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-05-28T10:08:33,122 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pumed_qa 2026-05-28T10:08:33,123 copying build/lib/evalscope/benchmarks/pumed_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-05-28T10:08:33,125 copying build/lib/evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-05-28T10:08:33,128 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/healthbench 2026-05-28T10:08:33,129 copying build/lib/evalscope/benchmarks/healthbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-05-28T10:08:33,131 copying build/lib/evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-05-28T10:08:33,133 copying build/lib/evalscope/benchmarks/healthbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-05-28T10:08:33,135 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/air_bench 2026-05-28T10:08:33,136 copying build/lib/evalscope/benchmarks/air_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/air_bench 2026-05-28T10:08:33,138 copying build/lib/evalscope/benchmarks/air_bench/air_bench_foundation_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/air_bench 2026-05-28T10:08:33,141 copying build/lib/evalscope/benchmarks/air_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/air_bench 2026-05-28T10:08:33,142 copying build/lib/evalscope/benchmarks/air_bench/air_bench_chat_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/air_bench 2026-05-28T10:08:33,145 copying build/lib/evalscope/benchmarks/air_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/air_bench 2026-05-28T10:08:33,147 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_redux 2026-05-28T10:08:33,148 copying build/lib/evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-05-28T10:08:33,150 copying build/lib/evalscope/benchmarks/mmlu_redux/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-05-28T10:08:33,152 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/biomix_qa 2026-05-28T10:08:33,153 copying build/lib/evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-05-28T10:08:33,155 copying build/lib/evalscope/benchmarks/biomix_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-05-28T10:08:33,157 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/seed_bench_2_plus 2026-05-28T10:08:33,158 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-05-28T10:08:33,160 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-05-28T10:08:33,162 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_qa 2026-05-28T10:08:33,163 copying build/lib/evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-05-28T10:08:33,166 copying build/lib/evalscope/benchmarks/general_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-05-28T10:08:33,168 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/videomme_v2 2026-05-28T10:08:33,169 copying build/lib/evalscope/benchmarks/videomme_v2/videomme_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/videomme_v2 2026-05-28T10:08:33,171 copying build/lib/evalscope/benchmarks/videomme_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/videomme_v2 2026-05-28T10:08:33,174 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vmcq 2026-05-28T10:08:33,175 copying build/lib/evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-05-28T10:08:33,177 copying build/lib/evalscope/benchmarks/general_vmcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-05-28T10:08:33,179 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/alpaca_eval 2026-05-28T10:08:33,180 copying build/lib/evalscope/benchmarks/alpaca_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-05-28T10:08:33,182 copying build/lib/evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-05-28T10:08:33,184 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vision 2026-05-28T10:08:33,185 copying build/lib/evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-05-28T10:08:33,187 copying build/lib/evalscope/benchmarks/math_vision/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-05-28T10:08:33,189 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_vqa 2026-05-28T10:08:33,190 copying build/lib/evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-05-28T10:08:33,193 copying build/lib/evalscope/benchmarks/simple_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-05-28T10:08:33,195 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/a_okvqa 2026-05-28T10:08:33,196 copying build/lib/evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-05-28T10:08:33,198 copying build/lib/evalscope/benchmarks/a_okvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-05-28T10:08:33,200 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/torgo 2026-05-28T10:08:33,201 copying build/lib/evalscope/benchmarks/torgo/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-05-28T10:08:33,203 copying build/lib/evalscope/benchmarks/torgo/torgo_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-05-28T10:08:33,205 copying build/lib/evalscope/benchmarks/torgo/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-05-28T10:08:33,207 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/librispeech 2026-05-28T10:08:33,208 copying build/lib/evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-05-28T10:08:33,211 copying build/lib/evalscope/benchmarks/librispeech/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-05-28T10:08:33,216 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/_meta 2026-05-28T10:08:33,217 copying build/lib/evalscope/benchmarks/_meta/zebralogicbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,220 copying build/lib/evalscope/benchmarks/_meta/cmmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,224 copying build/lib/evalscope/benchmarks/_meta/docmath.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,227 copying build/lib/evalscope/benchmarks/_meta/health_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,230 copying build/lib/evalscope/benchmarks/_meta/chartqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,232 copying build/lib/evalscope/benchmarks/_meta/torgo.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,235 copying build/lib/evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,238 copying build/lib/evalscope/benchmarks/_meta/aime26.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,241 copying build/lib/evalscope/benchmarks/_meta/amc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,243 copying build/lib/evalscope/benchmarks/_meta/genia_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,246 copying build/lib/evalscope/benchmarks/_meta/qasc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,248 copying build/lib/evalscope/benchmarks/_meta/ai2d.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,250 copying build/lib/evalscope/benchmarks/_meta/tweebank_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,253 copying build/lib/evalscope/benchmarks/_meta/tool_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,256 copying build/lib/evalscope/benchmarks/_meta/kimi_verifier.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,258 copying build/lib/evalscope/benchmarks/_meta/drivel_multilabel.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,261 copying build/lib/evalscope/benchmarks/_meta/mmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,263 copying build/lib/evalscope/benchmarks/_meta/ocr_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,266 copying build/lib/evalscope/benchmarks/_meta/humaneval_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,268 copying build/lib/evalscope/benchmarks/_meta/scicode.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,272 copying build/lib/evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,274 copying build/lib/evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,277 copying build/lib/evalscope/benchmarks/_meta/mit_restaurant.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,279 copying build/lib/evalscope/benchmarks/_meta/gedit.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,281 copying build/lib/evalscope/benchmarks/_meta/real_world_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,284 copying build/lib/evalscope/benchmarks/_meta/general_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,287 copying build/lib/evalscope/benchmarks/_meta/simple_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,289 copying build/lib/evalscope/benchmarks/_meta/cl_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,292 copying build/lib/evalscope/benchmarks/_meta/air_bench_foundation.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,295 copying build/lib/evalscope/benchmarks/_meta/tweet_ner_7.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,298 copying build/lib/evalscope/benchmarks/_meta/tau3_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,300 copying build/lib/evalscope/benchmarks/_meta/bfcl_v4.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,303 copying build/lib/evalscope/benchmarks/_meta/sciq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,305 copying build/lib/evalscope/benchmarks/_meta/needle_haystack.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,308 copying build/lib/evalscope/benchmarks/_meta/math_500.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,311 copying build/lib/evalscope/benchmarks/_meta/drivel_binary.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,313 copying build/lib/evalscope/benchmarks/_meta/arxivrollbench_full.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,317 copying build/lib/evalscope/benchmarks/_meta/anat_em.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,319 copying build/lib/evalscope/benchmarks/_meta/wmt24pp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,323 copying build/lib/evalscope/benchmarks/_meta/math_verse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,326 copying build/lib/evalscope/benchmarks/_meta/tir_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,329 copying build/lib/evalscope/benchmarks/_meta/pubmedqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,332 copying build/lib/evalscope/benchmarks/_meta/process_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,335 copying build/lib/evalscope/benchmarks/_meta/olympiad_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,339 copying build/lib/evalscope/benchmarks/_meta/harvey_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,341 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified_agentic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,345 copying build/lib/evalscope/benchmarks/_meta/mmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,348 copying build/lib/evalscope/benchmarks/_meta/gsm8k.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,351 copying build/lib/evalscope/benchmarks/_meta/micro_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,354 copying build/lib/evalscope/benchmarks/_meta/aa_lcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,357 copying build/lib/evalscope/benchmarks/_meta/wnut2017.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,360 copying build/lib/evalscope/benchmarks/_meta/mmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,363 copying build/lib/evalscope/benchmarks/_meta/swe_bench_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,366 copying build/lib/evalscope/benchmarks/_meta/bbh.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,370 copying build/lib/evalscope/benchmarks/_meta/arxivrollbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,373 copying build/lib/evalscope/benchmarks/_meta/simple_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,376 copying build/lib/evalscope/benchmarks/_meta/tau_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,379 copying build/lib/evalscope/benchmarks/_meta/mmmu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,383 copying build/lib/evalscope/benchmarks/_meta/blink.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,386 copying build/lib/evalscope/benchmarks/_meta/mri_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,389 copying build/lib/evalscope/benchmarks/_meta/fleurs.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,392 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,395 copying build/lib/evalscope/benchmarks/_meta/genai_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,398 copying build/lib/evalscope/benchmarks/_meta/tau2_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,401 copying build/lib/evalscope/benchmarks/_meta/arc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,404 copying build/lib/evalscope/benchmarks/_meta/maritime_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,406 copying build/lib/evalscope/benchmarks/_meta/multi_if.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,410 copying build/lib/evalscope/benchmarks/_meta/hmmt25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,412 copying build/lib/evalscope/benchmarks/_meta/general_arena.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,415 copying build/lib/evalscope/benchmarks/_meta/longbench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,418 copying build/lib/evalscope/benchmarks/_meta/omni_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,421 copying build/lib/evalscope/benchmarks/_meta/gpqa_diamond.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,424 copying build/lib/evalscope/benchmarks/_meta/hle.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,427 copying build/lib/evalscope/benchmarks/_meta/cc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,430 copying build/lib/evalscope/benchmarks/_meta/mgsm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,433 copying build/lib/evalscope/benchmarks/_meta/multiple_humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,436 copying build/lib/evalscope/benchmarks/_meta/zerobench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,439 copying build/lib/evalscope/benchmarks/_meta/terminal_bench_v2_1.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,442 copying build/lib/evalscope/benchmarks/_meta/a_okvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,444 copying build/lib/evalscope/benchmarks/_meta/copious.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,448 copying build/lib/evalscope/benchmarks/_meta/musr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,451 copying build/lib/evalscope/benchmarks/_meta/bfcl_v3.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,454 copying build/lib/evalscope/benchmarks/_meta/fin_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,457 copying build/lib/evalscope/benchmarks/_meta/commonsense_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,459 copying build/lib/evalscope/benchmarks/_meta/live_code_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,462 copying build/lib/evalscope/benchmarks/_meta/conllpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,465 copying build/lib/evalscope/benchmarks/_meta/ceval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,469 copying build/lib/evalscope/benchmarks/_meta/multi_nerd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,472 copying build/lib/evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,475 copying build/lib/evalscope/benchmarks/_meta/logi_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,479 copying build/lib/evalscope/benchmarks/_meta/biomix_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,483 copying build/lib/evalscope/benchmarks/_meta/general_vmcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,486 copying build/lib/evalscope/benchmarks/_meta/hallusion_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,490 copying build/lib/evalscope/benchmarks/_meta/ncbi.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,493 copying build/lib/evalscope/benchmarks/_meta/trivia_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,497 copying build/lib/evalscope/benchmarks/_meta/mm_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,501 copying build/lib/evalscope/benchmarks/_meta/gsm8k_v.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,504 copying build/lib/evalscope/benchmarks/_meta/infovqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,507 copying build/lib/evalscope/benchmarks/_meta/frames.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,511 copying build/lib/evalscope/benchmarks/_meta/bc5cdr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,515 copying build/lib/evalscope/benchmarks/_meta/math_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,519 copying build/lib/evalscope/benchmarks/_meta/halueval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,523 copying build/lib/evalscope/benchmarks/_meta/aime25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,527 copying build/lib/evalscope/benchmarks/_meta/bc4chemd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,530 copying build/lib/evalscope/benchmarks/_meta/science_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,533 copying build/lib/evalscope/benchmarks/_meta/ontonotes5.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,537 copying build/lib/evalscope/benchmarks/_meta/k2_verifier.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,540 copying build/lib/evalscope/benchmarks/_meta/minerva_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,544 copying build/lib/evalscope/benchmarks/_meta/super_gpqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,548 copying build/lib/evalscope/benchmarks/_meta/refcoco.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,551 copying build/lib/evalscope/benchmarks/_meta/multiple_mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,554 copying build/lib/evalscope/benchmarks/_meta/mmlu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,557 copying build/lib/evalscope/benchmarks/_meta/mm_star.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,560 copying build/lib/evalscope/benchmarks/_meta/omni_doc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,564 copying build/lib/evalscope/benchmarks/_meta/bc2gm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,567 copying build/lib/evalscope/benchmarks/_meta/conll2003.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,570 copying build/lib/evalscope/benchmarks/_meta/drivel_selection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,573 copying build/lib/evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,576 copying build/lib/evalscope/benchmarks/_meta/jnlpba.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,578 copying build/lib/evalscope/benchmarks/_meta/piqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,580 copying build/lib/evalscope/benchmarks/_meta/minimax_verifier.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,582 copying build/lib/evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,585 copying build/lib/evalscope/benchmarks/_meta/jnlpba_rare.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,588 copying build/lib/evalscope/benchmarks/_meta/data_collection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,591 copying build/lib/evalscope/benchmarks/_meta/mmlu_redux.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,594 copying build/lib/evalscope/benchmarks/_meta/math_vista.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,596 copying build/lib/evalscope/benchmarks/_meta/race.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,598 copying build/lib/evalscope/benchmarks/_meta/cross_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,601 copying build/lib/evalscope/benchmarks/_meta/air_bench_chat.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,603 copying build/lib/evalscope/benchmarks/_meta/med_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,605 copying build/lib/evalscope/benchmarks/_meta/hellaswag.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,608 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,610 copying build/lib/evalscope/benchmarks/_meta/competition_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,612 copying build/lib/evalscope/benchmarks/_meta/music_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,615 copying build/lib/evalscope/benchmarks/_meta/eq_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,617 copying build/lib/evalscope/benchmarks/_meta/humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,619 copying build/lib/evalscope/benchmarks/_meta/poly_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,622 copying build/lib/evalscope/benchmarks/_meta/aime24.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,624 copying build/lib/evalscope/benchmarks/_meta/gaia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,627 copying build/lib/evalscope/benchmarks/_meta/pope.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,629 copying build/lib/evalscope/benchmarks/_meta/siqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,632 copying build/lib/evalscope/benchmarks/_meta/math_vision.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,634 copying build/lib/evalscope/benchmarks/_meta/alpaca_eval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,636 copying build/lib/evalscope/benchmarks/_meta/swe_bench_lite.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,639 copying build/lib/evalscope/benchmarks/_meta/drivel_writing.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,641 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified_mini_agentic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,644 copying build/lib/evalscope/benchmarks/_meta/ifeval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,646 copying build/lib/evalscope/benchmarks/_meta/vstar_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,648 copying build/lib/evalscope/benchmarks/_meta/videomme_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,650 copying build/lib/evalscope/benchmarks/_meta/mvbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,653 copying build/lib/evalscope/benchmarks/_meta/truthful_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,655 copying build/lib/evalscope/benchmarks/_meta/openai_mrcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,657 copying build/lib/evalscope/benchmarks/_meta/general_fc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,660 copying build/lib/evalscope/benchmarks/_meta/cmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,662 copying build/lib/evalscope/benchmarks/_meta/mia_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,664 copying build/lib/evalscope/benchmarks/_meta/coin_flip.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,667 copying build/lib/evalscope/benchmarks/_meta/docvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,669 copying build/lib/evalscope/benchmarks/_meta/cmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,672 copying build/lib/evalscope/benchmarks/_meta/drop.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,674 copying build/lib/evalscope/benchmarks/_meta/ifbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,676 copying build/lib/evalscope/benchmarks/_meta/arena_hard.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,679 copying build/lib/evalscope/benchmarks/_meta/tifa160.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,681 copying build/lib/evalscope/benchmarks/_meta/iquiz.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,683 copying build/lib/evalscope/benchmarks/_meta/winogrande.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,685 copying build/lib/evalscope/benchmarks/_meta/mbpp_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,688 copying build/lib/evalscope/benchmarks/_meta/swe_bench_lite_agentic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,690 copying build/lib/evalscope/benchmarks/_meta/evalmuse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,693 copying build/lib/evalscope/benchmarks/_meta/librispeech.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,696 copying build/lib/evalscope/benchmarks/_meta/mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,698 copying build/lib/evalscope/benchmarks/_meta/general_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,700 copying build/lib/evalscope/benchmarks/_meta/general_mcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,703 copying build/lib/evalscope/benchmarks/_meta/hpdv2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,705 copying build/lib/evalscope/benchmarks/_meta/visulogic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,708 copying build/lib/evalscope/benchmarks/_meta/general_t2i.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-05-28T10:08:33,710 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gpqa 2026-05-28T10:08:33,711 copying build/lib/evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-05-28T10:08:33,713 copying build/lib/evalscope/benchmarks/gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-05-28T10:08:33,715 copying build/lib/evalscope/benchmarks/gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-05-28T10:08:33,717 copying build/lib/evalscope/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks 2026-05-28T10:08:33,719 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/vstar_bench 2026-05-28T10:08:33,720 copying build/lib/evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-05-28T10:08:33,722 copying build/lib/evalscope/benchmarks/vstar_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-05-28T10:08:33,723 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zerobench 2026-05-28T10:08:33,724 copying build/lib/evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-05-28T10:08:33,727 copying build/lib/evalscope/benchmarks/zerobench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-05-28T10:08:33,730 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minerva_math 2026-05-28T10:08:33,731 copying build/lib/evalscope/benchmarks/minerva_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-05-28T10:08:33,733 copying build/lib/evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-05-28T10:08:33,735 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docmath 2026-05-28T10:08:33,736 copying build/lib/evalscope/benchmarks/docmath/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-05-28T10:08:33,739 copying build/lib/evalscope/benchmarks/docmath/docmath_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-05-28T10:08:33,741 copying build/lib/evalscope/benchmarks/docmath/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-05-28T10:08:33,743 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math 2026-05-28T10:08:33,745 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math/utils 2026-05-28T10:08:33,746 copying build/lib/evalscope/benchmarks/poly_math/utils/instruction.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math/utils 2026-05-28T10:08:33,748 copying build/lib/evalscope/benchmarks/poly_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-05-28T10:08:33,750 copying build/lib/evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-05-28T10:08:33,752 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifeval 2026-05-28T10:08:33,753 copying build/lib/evalscope/benchmarks/ifeval/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,756 copying build/lib/evalscope/benchmarks/ifeval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,758 copying build/lib/evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,760 copying build/lib/evalscope/benchmarks/ifeval/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,762 copying build/lib/evalscope/benchmarks/ifeval/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,765 copying build/lib/evalscope/benchmarks/ifeval/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,767 copying build/lib/evalscope/benchmarks/ifeval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-05-28T10:08:33,769 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/coin_flip 2026-05-28T10:08:33,770 copying build/lib/evalscope/benchmarks/coin_flip/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-05-28T10:08:33,771 copying build/lib/evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-05-28T10:08:33,774 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/race 2026-05-28T10:08:33,775 copying build/lib/evalscope/benchmarks/race/race_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-05-28T10:08:33,777 copying build/lib/evalscope/benchmarks/race/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-05-28T10:08:33,779 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/visu_logic 2026-05-28T10:08:33,780 copying build/lib/evalscope/benchmarks/visu_logic/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-05-28T10:08:33,782 copying build/lib/evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-05-28T10:08:33,784 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hallusion_bench 2026-05-28T10:08:33,785 copying build/lib/evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-05-28T10:08:33,788 copying build/lib/evalscope/benchmarks/hallusion_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-05-28T10:08:33,789 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/truthful_qa 2026-05-28T10:08:33,790 copying build/lib/evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-05-28T10:08:33,793 copying build/lib/evalscope/benchmarks/truthful_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-05-28T10:08:33,795 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cl_bench 2026-05-28T10:08:33,796 copying build/lib/evalscope/benchmarks/cl_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-05-28T10:08:33,798 copying build/lib/evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-05-28T10:08:33,800 copying build/lib/evalscope/benchmarks/cl_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-05-28T10:08:33,802 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aime 2026-05-28T10:08:33,803 copying build/lib/evalscope/benchmarks/aime/math_normalize.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-05-28T10:08:33,806 copying build/lib/evalscope/benchmarks/aime/aime_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-05-28T10:08:33,808 copying build/lib/evalscope/benchmarks/aime/grader.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-05-28T10:08:33,810 copying build/lib/evalscope/benchmarks/aime/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-05-28T10:08:33,813 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/music_trivia 2026-05-28T10:08:33,814 copying build/lib/evalscope/benchmarks/music_trivia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-05-28T10:08:33,815 copying build/lib/evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-05-28T10:08:33,818 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drivelology 2026-05-28T10:08:33,819 copying build/lib/evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-05-28T10:08:33,821 copying build/lib/evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-05-28T10:08:33,823 copying build/lib/evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-05-28T10:08:33,825 copying build/lib/evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-05-28T10:08:33,827 copying build/lib/evalscope/benchmarks/drivelology/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-05-28T10:08:33,829 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mgsm 2026-05-28T10:08:33,830 copying build/lib/evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-05-28T10:08:33,832 copying build/lib/evalscope/benchmarks/mgsm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-05-28T10:08:33,834 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench 2026-05-28T10:08:33,836 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:33,837 copying build/lib/evalscope/benchmarks/tau_bench/tau3_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:33,839 copying build/lib/evalscope/benchmarks/tau_bench/tau3_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:33,841 copying build/lib/evalscope/benchmarks/tau_bench/tau3_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:33,843 copying build/lib/evalscope/benchmarks/tau_bench/tau3_bench/tau3_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau3_bench 2026-05-28T10:08:33,845 copying build/lib/evalscope/benchmarks/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench 2026-05-28T10:08:33,847 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:33,848 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:33,850 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:33,853 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:33,855 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-05-28T10:08:33,857 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:33,858 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:33,860 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:33,862 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:33,864 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-05-28T10:08:33,866 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_fc 2026-05-28T10:08:33,867 copying build/lib/evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-05-28T10:08:33,870 copying build/lib/evalscope/benchmarks/general_fc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-05-28T10:08:33,871 copying build/lib/evalscope/run.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-05-28T10:08:33,874 creating build/bdist.linux-armv7l/wheel/evalscope/filters 2026-05-28T10:08:33,875 copying build/lib/evalscope/filters/selection.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-05-28T10:08:33,877 copying build/lib/evalscope/filters/extraction.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-05-28T10:08:33,879 copying build/lib/evalscope/filters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-05-28T10:08:33,881 creating build/bdist.linux-armv7l/wheel/evalscope/utils 2026-05-28T10:08:33,882 copying build/lib/evalscope/utils/data_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,885 copying build/lib/evalscope/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,886 copying build/lib/evalscope/utils/ner.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,889 copying build/lib/evalscope/utils/url_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,891 copying build/lib/evalscope/utils/chat_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,893 copying build/lib/evalscope/utils/deprecation_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,895 copying build/lib/evalscope/utils/multi_choices.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,897 copying build/lib/evalscope/utils/json_schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,900 copying build/lib/evalscope/utils/argument_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,902 copying build/lib/evalscope/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,905 copying build/lib/evalscope/utils/code_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,908 copying build/lib/evalscope/utils/function_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,911 copying build/lib/evalscope/utils/import_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,913 copying build/lib/evalscope/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,916 creating build/bdist.linux-armv7l/wheel/evalscope/utils/tqdm_utils 2026-05-28T10:08:33,917 copying build/lib/evalscope/utils/tqdm_utils/tqdm_logging.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-05-28T10:08:33,920 copying build/lib/evalscope/utils/tqdm_utils/progress_tracker.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-05-28T10:08:33,922 copying build/lib/evalscope/utils/tqdm_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-05-28T10:08:33,925 copying build/lib/evalscope/utils/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,928 creating build/bdist.linux-armv7l/wheel/evalscope/utils/doc_utils 2026-05-28T10:08:33,929 copying build/lib/evalscope/utils/doc_utils/translate_description.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-05-28T10:08:33,932 copying build/lib/evalscope/utils/doc_utils/generate_dataset_md.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-05-28T10:08:33,935 copying build/lib/evalscope/utils/doc_utils/benchmark_stats.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-05-28T10:08:33,938 copying build/lib/evalscope/utils/doc_utils/readme_generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-05-28T10:08:33,941 copying build/lib/evalscope/utils/doc_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-05-28T10:08:33,943 copying build/lib/evalscope/utils/io_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-05-28T10:08:33,946 creating build/bdist.linux-armv7l/wheel/evalscope/summarizer 2026-05-28T10:08:33,948 copying build/lib/evalscope/summarizer/summarizer.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-05-28T10:08:33,950 copying build/lib/evalscope/summarizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-05-28T10:08:33,953 creating build/bdist.linux-armv7l/wheel/evalscope/evaluator 2026-05-28T10:08:33,954 copying build/lib/evalscope/evaluator/batch_reviewer.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-05-28T10:08:33,957 copying build/lib/evalscope/evaluator/perf_collector.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-05-28T10:08:33,960 copying build/lib/evalscope/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-05-28T10:08:33,963 copying build/lib/evalscope/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-05-28T10:08:33,966 creating build/bdist.linux-armv7l/wheel/evalscope/report 2026-05-28T10:08:33,967 copying build/lib/evalscope/report/report.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-05-28T10:08:33,970 copying build/lib/evalscope/report/combinator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-05-28T10:08:33,973 copying build/lib/evalscope/report/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-05-28T10:08:33,975 copying build/lib/evalscope/report/renderer.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-05-28T10:08:33,978 copying build/lib/evalscope/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-05-28T10:08:33,980 copying build/lib/evalscope/report/generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-05-28T10:08:33,983 creating build/bdist.linux-armv7l/wheel/evalscope/report/template 2026-05-28T10:08:33,985 copying build/lib/evalscope/report/template/perf_report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-05-28T10:08:33,988 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/js 2026-05-28T10:08:33,990 copying build/lib/evalscope/report/template/js/i18n_perf.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-05-28T10:08:33,993 copying build/lib/evalscope/report/template/js/perf_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-05-28T10:08:33,995 copying build/lib/evalscope/report/template/js/i18n_eval.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-05-28T10:08:33,997 copying build/lib/evalscope/report/template/js/eval_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-05-28T10:08:34,000 copying build/lib/evalscope/report/template/js/shared.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-05-28T10:08:34,003 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/css 2026-05-28T10:08:34,004 copying build/lib/evalscope/report/template/css/perf_extra.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-05-28T10:08:34,007 copying build/lib/evalscope/report/template/css/base.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-05-28T10:08:34,011 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/partials 2026-05-28T10:08:34,012 copying build/lib/evalscope/report/template/partials/toc_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-05-28T10:08:34,014 copying build/lib/evalscope/report/template/partials/brand_logo.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-05-28T10:08:34,017 copying build/lib/evalscope/report/template/partials/header_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-05-28T10:08:34,019 copying build/lib/evalscope/report/template/partials/toc_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-05-28T10:08:34,021 copying build/lib/evalscope/report/template/partials/footer.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-05-28T10:08:34,024 copying build/lib/evalscope/report/template/partials/header_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-05-28T10:08:34,026 copying build/lib/evalscope/report/template/report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-05-28T10:08:34,029 copying build/lib/evalscope/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-05-28T10:08:34,032 creating build/bdist.linux-armv7l/wheel/evalscope/collections 2026-05-28T10:08:34,033 copying build/lib/evalscope/collections/sampler.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-05-28T10:08:34,036 copying build/lib/evalscope/collections/schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-05-28T10:08:34,039 copying build/lib/evalscope/collections/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-05-28T10:08:34,041 creating build/bdist.linux-armv7l/wheel/evalscope/service 2026-05-28T10:08:34,043 creating build/bdist.linux-armv7l/wheel/evalscope/service/blueprints 2026-05-28T10:08:34,045 copying build/lib/evalscope/service/blueprints/reports.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-05-28T10:08:34,048 copying build/lib/evalscope/service/blueprints/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-05-28T10:08:34,051 copying build/lib/evalscope/service/blueprints/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-05-28T10:08:34,054 copying build/lib/evalscope/service/blueprints/perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-05-28T10:08:34,057 creating build/bdist.linux-armv7l/wheel/evalscope/service/utils 2026-05-28T10:08:34,058 copying build/lib/evalscope/service/utils/benchmarks.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-05-28T10:08:34,061 copying build/lib/evalscope/service/utils/process.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-05-28T10:08:34,064 copying build/lib/evalscope/service/utils/log.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-05-28T10:08:34,066 copying build/lib/evalscope/service/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-05-28T10:08:34,069 copying build/lib/evalscope/service/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-05-28T10:08:34,071 copying build/lib/evalscope/service/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-05-28T10:08:34,073 copying build/lib/evalscope/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-05-28T10:08:34,076 creating build/bdist.linux-armv7l/wheel/evalscope/web 2026-05-28T10:08:34,078 copying build/lib/evalscope/web/.gitignore -> build/bdist.linux-armv7l/wheel/./evalscope/web 2026-05-28T10:08:34,080 creating build/bdist.linux-armv7l/wheel/evalscope/web/public 2026-05-28T10:08:34,082 copying build/lib/evalscope/web/public/favicon.svg -> build/bdist.linux-armv7l/wheel/./evalscope/web/public 2026-05-28T10:08:34,085 creating build/bdist.linux-armv7l/wheel/evalscope/web/dist 2026-05-28T10:08:34,087 copying build/lib/evalscope/web/dist/favicon.svg -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist 2026-05-28T10:08:34,092 creating build/bdist.linux-armv7l/wheel/evalscope/web/dist/assets 2026-05-28T10:08:34,094 copying build/lib/evalscope/web/dist/assets/Breadcrumb-CXh_Hjqj.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,096 copying build/lib/evalscope/web/dist/assets/KaTeX_Size4-Regular-DWFBv043.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,098 copying build/lib/evalscope/web/dist/assets/DashboardPage-Dy_K4Uk2.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,101 copying build/lib/evalscope/web/dist/assets/SearchInput-Qu-jjIwJ.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,103 copying build/lib/evalscope/web/dist/assets/KaTeX_Math-Italic-flOr_0UB.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,107 copying build/lib/evalscope/web/dist/assets/KaTeX_AMS-Regular-DMm9YOAa.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,110 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Italic-YYjJ1zSn.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,113 copying build/lib/evalscope/web/dist/assets/KaTeX_Fraktur-Bold-BsDP51OF.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,116 copying build/lib/evalscope/web/dist/assets/Tabs-rdY0A_mU.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,138 copying build/lib/evalscope/web/dist/assets/LocaleContext-3RqCP440.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,140 copying build/lib/evalscope/web/dist/assets/KaTeX_Fraktur-Regular-Dxdc4cR9.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,143 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Regular-BNo7hRIc.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,145 copying build/lib/evalscope/web/dist/assets/FilterChip-GjoWt8ON.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,147 copying build/lib/evalscope/web/dist/assets/KaTeX_Script-Regular-D5yQViql.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,149 copying build/lib/evalscope/web/dist/assets/KaTeX_Math-BoldItalic-iY-2wyZ7.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,151 copying build/lib/evalscope/web/dist/assets/ComparePage-DIqHa7xH.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,154 copying build/lib/evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-Di6jR-x-.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,156 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Regular-B22Nviop.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,159 copying build/lib/evalscope/web/dist/assets/search-_j2hWq0v.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,160 copying build/lib/evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-BEiXGLvX.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,163 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Bold-DbIhKOiC.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,166 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Italic-NWA7e6Wa.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,168 copying build/lib/evalscope/web/dist/assets/KaTeX_Size4-Regular-Dl5lxZxV.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,170 copying build/lib/evalscope/web/dist/assets/BenchmarksPage-XDO3_q9P.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,173 copying build/lib/evalscope/web/dist/assets/KaTeX_Math-Italic-t53AETM-.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,175 copying build/lib/evalscope/web/dist/assets/PerfTaskPage-CeXqhvKL.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,178 copying build/lib/evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-ATXxdsX0.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,180 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Italic-BMLOBm91.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,183 copying build/lib/evalscope/web/dist/assets/KaTeX_Fraktur-Bold-BdnERNNW.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,185 copying build/lib/evalscope/web/dist/assets/KaTeX_Size1-Regular-mCD8mA8B.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,187 copying build/lib/evalscope/web/dist/assets/KaTeX_Math-Italic-DA0__PXp.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,190 copying build/lib/evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-Dq_IR9rO.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,192 copying build/lib/evalscope/web/dist/assets/database-OIFuo_-B.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,194 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Bold-Jm3AIy58.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,196 copying build/lib/evalscope/web/dist/assets/ReportViewerPage-DawiLh7a.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,198 copying build/lib/evalscope/web/dist/assets/KaTeX_Size2-Regular-oD1tc_U0.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,200 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Bold-D1sUS0GD.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,203 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Regular-ypZvNtVU.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,206 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Regular-DDBCnlJ7.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,208 copying build/lib/evalscope/web/dist/assets/KaTeX_Fraktur-Regular-CB_wures.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,211 copying build/lib/evalscope/web/dist/assets/KaTeX_Size3-Regular-CTq5MqoE.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,213 copying build/lib/evalscope/web/dist/assets/folder-open-C4sVuGAY.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,214 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Italic-3WenGoN9.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,217 copying build/lib/evalscope/web/dist/assets/KaTeX_Size1-Regular-Dbsnue_I.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,219 copying build/lib/evalscope/web/dist/assets/KaTeX_Size4-Regular-BF-4gkZK.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,221 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-BoldItalic-DxDJ3AOS.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,224 copying build/lib/evalscope/web/dist/assets/ReportDetailPage-CXjObGG4.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,227 copying build/lib/evalscope/web/dist/assets/KaTeX_AMS-Regular-BQhdFMY1.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,229 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-BoldItalic-SpSLRI95.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,232 copying build/lib/evalscope/web/dist/assets/external-link-Dk_VDyv-.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,233 copying build/lib/evalscope/web/dist/assets/KaTeX_Fraktur-Regular-CTYiF6lA.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,236 copying build/lib/evalscope/web/dist/assets/EvalTaskPage-CfhiPFIu.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,238 copying build/lib/evalscope/web/dist/assets/KaTeX_Typewriter-Regular-C0xS9mPB.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,240 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Regular-Dr94JaBh.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,243 copying build/lib/evalscope/web/dist/assets/KaTeX_Math-BoldItalic-B3XSjfu4.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,245 copying build/lib/evalscope/web/dist/assets/usePolling-9BwH1egZ.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,248 copying build/lib/evalscope/web/dist/assets/ChatView-CrwMded-.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,251 copying build/lib/evalscope/web/dist/assets/useQueryParams-B0TIRhYM.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,252 copying build/lib/evalscope/web/dist/assets/ScoreBadge-BKgzeJIw.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,254 copying build/lib/evalscope/web/dist/assets/KaTeX_Typewriter-Regular-CO6r4hn1.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,256 copying build/lib/evalscope/web/dist/assets/utils-Bt-jremC.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,259 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-BoldItalic-DzxPMmG6.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,261 copying build/lib/evalscope/web/dist/assets/KaTeX_Fraktur-Bold-CL6g_b3V.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,263 copying build/lib/evalscope/web/dist/assets/KaTeX_Script-Regular-C5JkGWo-.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,266 copying build/lib/evalscope/web/dist/assets/KaTeX_Size1-Regular-C195tn64.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,268 copying build/lib/evalscope/web/dist/assets/ReportsPage-CwZoFJCa.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,270 copying build/lib/evalscope/web/dist/assets/Button-Bw30ctK3.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,272 copying build/lib/evalscope/web/dist/assets/index-4AAMqHB4.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,278 copying build/lib/evalscope/web/dist/assets/Skeleton-DD8J4WTY.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,280 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Bold-CFMepnvq.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,283 copying build/lib/evalscope/web/dist/assets/KaTeX_Size3-Regular-DgpXs0kz.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,286 copying build/lib/evalscope/web/dist/assets/index-DXtIjaXa.css -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,291 copying build/lib/evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-wX97UBjC.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,294 copying build/lib/evalscope/web/dist/assets/Card-vB7q1dY3.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,296 copying build/lib/evalscope/web/dist/assets/chevron-up-BykVlddk.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,299 copying build/lib/evalscope/web/dist/assets/KaTeX_Typewriter-Regular-D3Ib7_Hf.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,302 copying build/lib/evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-CTRA-rTL.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,305 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Italic-DN2j7dab.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,308 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Bold-Cx986IdX.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,311 copying build/lib/evalscope/web/dist/assets/KaTeX_Math-BoldItalic-CZnvNsCZ.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,314 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Italic-C3H0VqGB.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,317 copying build/lib/evalscope/web/dist/assets/KaTeX_Size2-Regular-B7gKUWhC.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,320 copying build/lib/evalscope/web/dist/assets/Badge-BM4410Li.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,322 copying build/lib/evalscope/web/dist/assets/KaTeX_AMS-Regular-DRggAlZN.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,327 copying build/lib/evalscope/web/dist/assets/eval-CCjFVVv8.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,329 copying build/lib/evalscope/web/dist/assets/square-CF_-z8TO.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,331 copying build/lib/evalscope/web/dist/assets/KaTeX_Script-Regular-D3wIWfF6.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,334 copying build/lib/evalscope/web/dist/assets/KaTeX_SansSerif-Regular-CS6fqUqJ.woff -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,337 copying build/lib/evalscope/web/dist/assets/loader-circle-DukK1tEn.js -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,339 copying build/lib/evalscope/web/dist/assets/KaTeX_Main-Bold-waoOVXN0.ttf -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,343 copying build/lib/evalscope/web/dist/assets/KaTeX_Size2-Regular-Dy4dx90m.woff2 -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist/assets 2026-05-28T10:08:34,346 copying build/lib/evalscope/web/dist/index.html -> build/bdist.linux-armv7l/wheel/./evalscope/web/dist 2026-05-28T10:08:34,348 copying build/lib/evalscope/web/index.html -> build/bdist.linux-armv7l/wheel/./evalscope/web 2026-05-28T10:08:34,350 copying build/lib/evalscope/web/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/web 2026-05-28T10:08:34,352 copying build/lib/evalscope/web/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/web 2026-05-28T10:08:34,355 copying build/lib/evalscope/config.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-05-28T10:08:34,359 creating build/bdist.linux-armv7l/wheel/evalscope/models 2026-05-28T10:08:34,361 creating build/bdist.linux-armv7l/wheel/evalscope/models/utils 2026-05-28T10:08:34,362 copying build/lib/evalscope/models/utils/openai_responses.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-05-28T10:08:34,365 copying build/lib/evalscope/models/utils/openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-05-28T10:08:34,369 copying build/lib/evalscope/models/utils/anthropic.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-05-28T10:08:34,372 copying build/lib/evalscope/models/openai_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,375 copying build/lib/evalscope/models/model_apis.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,377 copying build/lib/evalscope/models/image_edit_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,380 copying build/lib/evalscope/models/mockllm.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,382 copying build/lib/evalscope/models/litellm_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,384 copying build/lib/evalscope/models/openai_responses.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,387 copying build/lib/evalscope/models/text2image_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,388 copying build/lib/evalscope/models/modelscope.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,391 copying build/lib/evalscope/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,393 copying build/lib/evalscope/models/anthropic_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-05-28T10:08:34,395 copying build/lib/evalscope/version.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-05-28T10:08:34,397 creating build/bdist.linux-armv7l/wheel/evalscope/backend 2026-05-28T10:08:34,398 creating build/bdist.linux-armv7l/wheel/evalscope/backend/vlm_eval_kit 2026-05-28T10:08:34,400 copying build/lib/evalscope/backend/vlm_eval_kit/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-05-28T10:08:34,402 copying build/lib/evalscope/backend/vlm_eval_kit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-05-28T10:08:34,404 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass 2026-05-28T10:08:34,405 copying build/lib/evalscope/backend/opencompass/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-05-28T10:08:34,408 copying build/lib/evalscope/backend/opencompass/api_meta_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-05-28T10:08:34,410 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass/tasks 2026-05-28T10:08:34,411 copying build/lib/evalscope/backend/opencompass/tasks/eval_datasets.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-05-28T10:08:34,413 copying build/lib/evalscope/backend/opencompass/tasks/eval_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-05-28T10:08:34,415 copying build/lib/evalscope/backend/opencompass/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-05-28T10:08:34,417 copying build/lib/evalscope/backend/opencompass/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-05-28T10:08:34,419 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval 2026-05-28T10:08:34,420 copying build/lib/evalscope/backend/rag_eval/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-05-28T10:08:34,423 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/utils 2026-05-28T10:08:34,424 copying build/lib/evalscope/backend/rag_eval/utils/tools.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-05-28T10:08:34,426 copying build/lib/evalscope/backend/rag_eval/utils/embedding.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-05-28T10:08:34,428 copying build/lib/evalscope/backend/rag_eval/utils/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-05-28T10:08:34,430 copying build/lib/evalscope/backend/rag_eval/utils/clip.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-05-28T10:08:34,432 copying build/lib/evalscope/backend/rag_eval/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-05-28T10:08:34,434 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:34,435 copying build/lib/evalscope/backend/rag_eval/cmteb/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:34,437 copying build/lib/evalscope/backend/rag_eval/cmteb/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:34,439 copying build/lib/evalscope/backend/rag_eval/cmteb/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:34,442 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,443 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,445 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,447 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,449 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,451 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,453 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,455 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,457 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-05-28T10:08:34,460 copying build/lib/evalscope/backend/rag_eval/cmteb/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-05-28T10:08:34,462 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:34,463 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/utils 2026-05-28T10:08:34,464 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-05-28T10:08:34,466 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-05-28T10:08:34,469 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:34,470 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:34,473 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:34,475 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:34,476 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:34,478 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:34,481 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:34,483 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-05-28T10:08:34,485 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-05-28T10:08:34,487 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas 2026-05-28T10:08:34,488 copying build/lib/evalscope/backend/rag_eval/ragas/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-05-28T10:08:34,491 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/prompts 2026-05-28T10:08:34,492 copying build/lib/evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/prompts 2026-05-28T10:08:34,494 copying build/lib/evalscope/backend/rag_eval/ragas/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-05-28T10:08:34,496 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:34,497 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:34,499 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:34,501 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:34,503 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:34,505 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-05-28T10:08:34,507 copying build/lib/evalscope/backend/rag_eval/ragas/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-05-28T10:08:34,509 copying build/lib/evalscope/backend/rag_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-05-28T10:08:34,510 copying build/lib/evalscope/backend/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-05-28T10:08:34,512 copying build/lib/evalscope/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-05-28T10:08:34,514 creating build/bdist.linux-armv7l/wheel/evalscope/third_party 2026-05-28T10:08:34,516 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static 2026-05-28T10:08:34,517 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static/llm 2026-05-28T10:08:34,518 copying build/lib/evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-05-28T10:08:34,520 copying build/lib/evalscope/third_party/toolbench_static/llm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-05-28T10:08:34,522 copying build/lib/evalscope/third_party/toolbench_static/config_default.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,524 copying build/lib/evalscope/third_party/toolbench_static/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,526 copying build/lib/evalscope/third_party/toolbench_static/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,528 copying build/lib/evalscope/third_party/toolbench_static/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,531 copying build/lib/evalscope/third_party/toolbench_static/toolbench_static.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,533 copying build/lib/evalscope/third_party/toolbench_static/config_default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,535 copying build/lib/evalscope/third_party/toolbench_static/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,537 copying build/lib/evalscope/third_party/toolbench_static/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-05-28T10:08:34,540 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write 2026-05-28T10:08:34,541 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/tools 2026-05-28T10:08:34,543 copying build/lib/evalscope/third_party/longbench_write/tools/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-05-28T10:08:34,545 copying build/lib/evalscope/third_party/longbench_write/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-05-28T10:08:34,547 copying build/lib/evalscope/third_party/longbench_write/tools/data_etl.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-05-28T10:08:34,550 copying build/lib/evalscope/third_party/longbench_write/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,552 copying build/lib/evalscope/third_party/longbench_write/default_task.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,554 copying build/lib/evalscope/third_party/longbench_write/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,556 copying build/lib/evalscope/third_party/longbench_write/default_task.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,559 copying build/lib/evalscope/third_party/longbench_write/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,562 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/resources 2026-05-28T10:08:34,563 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-05-28T10:08:34,566 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-05-28T10:08:34,569 copying build/lib/evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-05-28T10:08:34,572 copying build/lib/evalscope/third_party/longbench_write/resources/judge.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-05-28T10:08:34,574 copying build/lib/evalscope/third_party/longbench_write/resources/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-05-28T10:08:34,577 copying build/lib/evalscope/third_party/longbench_write/longbench_write.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,579 copying build/lib/evalscope/third_party/longbench_write/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,581 copying build/lib/evalscope/third_party/longbench_write/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-05-28T10:08:34,584 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench 2026-05-28T10:08:34,586 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/tools 2026-05-28T10:08:34,587 copying build/lib/evalscope/third_party/thinkbench/tools/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-05-28T10:08:34,589 copying build/lib/evalscope/third_party/thinkbench/tools/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-05-28T10:08:34,592 copying build/lib/evalscope/third_party/thinkbench/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-05-28T10:08:34,593 copying build/lib/evalscope/third_party/thinkbench/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-05-28T10:08:34,597 copying build/lib/evalscope/third_party/thinkbench/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-05-28T10:08:34,599 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/resources 2026-05-28T10:08:34,600 copying build/lib/evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-05-28T10:08:34,603 copying build/lib/evalscope/third_party/thinkbench/resources/critique_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-05-28T10:08:34,605 copying build/lib/evalscope/third_party/thinkbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-05-28T10:08:34,607 copying build/lib/evalscope/third_party/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party 2026-05-28T10:08:34,609 copying build/lib/evalscope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-05-28T10:08:34,612 creating build/bdist.linux-armv7l/wheel/evalscope/cli 2026-05-28T10:08:34,613 copying build/lib/evalscope/cli/start_perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,616 copying build/lib/evalscope/cli/start_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,618 copying build/lib/evalscope/cli/benchmark_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,621 copying build/lib/evalscope/cli/cli.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,623 copying build/lib/evalscope/cli/start_app.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,625 copying build/lib/evalscope/cli/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,627 copying build/lib/evalscope/cli/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,629 copying build/lib/evalscope/cli/start_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-05-28T10:08:34,632 creating build/bdist.linux-armv7l/wheel/evalscope/api 2026-05-28T10:08:34,634 creating build/bdist.linux-armv7l/wheel/evalscope/api/mixin 2026-05-28T10:08:34,636 copying build/lib/evalscope/api/mixin/sandbox_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-05-28T10:08:34,639 copying build/lib/evalscope/api/mixin/llm_judge_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-05-28T10:08:34,641 copying build/lib/evalscope/api/mixin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-05-28T10:08:34,644 creating build/bdist.linux-armv7l/wheel/evalscope/api/evaluator 2026-05-28T10:08:34,645 copying build/lib/evalscope/api/evaluator/inference_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-05-28T10:08:34,648 copying build/lib/evalscope/api/evaluator/cache.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-05-28T10:08:34,650 copying build/lib/evalscope/api/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-05-28T10:08:34,652 copying build/lib/evalscope/api/evaluator/state.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-05-28T10:08:34,655 copying build/lib/evalscope/api/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-05-28T10:08:34,657 copying build/lib/evalscope/api/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-05-28T10:08:34,661 creating build/bdist.linux-armv7l/wheel/evalscope/api/filter 2026-05-28T10:08:34,662 copying build/lib/evalscope/api/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-05-28T10:08:34,664 copying build/lib/evalscope/api/filter/filter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-05-28T10:08:34,667 creating build/bdist.linux-armv7l/wheel/evalscope/api/tool 2026-05-28T10:08:34,669 copying build/lib/evalscope/api/tool/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-05-28T10:08:34,671 copying build/lib/evalscope/api/tool/tool_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-05-28T10:08:34,674 copying build/lib/evalscope/api/tool/tool_call.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-05-28T10:08:34,677 copying build/lib/evalscope/api/tool/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-05-28T10:08:34,680 creating build/bdist.linux-armv7l/wheel/evalscope/api/dataset 2026-05-28T10:08:34,681 copying build/lib/evalscope/api/dataset/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-05-28T10:08:34,683 copying build/lib/evalscope/api/dataset/loader.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-05-28T10:08:34,686 copying build/lib/evalscope/api/dataset/hub.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-05-28T10:08:34,688 copying build/lib/evalscope/api/dataset/dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-05-28T10:08:34,691 copying build/lib/evalscope/api/dataset/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-05-28T10:08:34,693 creating build/bdist.linux-armv7l/wheel/evalscope/api/messages 2026-05-28T10:08:34,694 copying build/lib/evalscope/api/messages/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-05-28T10:08:34,696 copying build/lib/evalscope/api/messages/perf_metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-05-28T10:08:34,698 copying build/lib/evalscope/api/messages/chat_message.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-05-28T10:08:34,700 copying build/lib/evalscope/api/messages/content.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-05-28T10:08:34,702 copying build/lib/evalscope/api/messages/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-05-28T10:08:34,704 creating build/bdist.linux-armv7l/wheel/evalscope/api/model 2026-05-28T10:08:34,705 copying build/lib/evalscope/api/model/lazy_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-05-28T10:08:34,708 copying build/lib/evalscope/api/model/model_output.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-05-28T10:08:34,711 copying build/lib/evalscope/api/model/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-05-28T10:08:34,713 copying build/lib/evalscope/api/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-05-28T10:08:34,715 copying build/lib/evalscope/api/model/generate_config.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-05-28T10:08:34,718 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark 2026-05-28T10:08:34,719 copying build/lib/evalscope/api/benchmark/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-05-28T10:08:34,723 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark/adapters 2026-05-28T10:08:34,724 copying build/lib/evalscope/api/benchmark/adapters/ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,726 copying build/lib/evalscope/api/benchmark/adapters/vendor_verifier_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,729 copying build/lib/evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,731 copying build/lib/evalscope/api/benchmark/adapters/_agent_loop_runner.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,733 copying build/lib/evalscope/api/benchmark/adapters/agent_loop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,735 copying build/lib/evalscope/api/benchmark/adapters/text2image_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,738 copying build/lib/evalscope/api/benchmark/adapters/multi_turn_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,740 copying build/lib/evalscope/api/benchmark/adapters/agent_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,741 copying build/lib/evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,743 copying build/lib/evalscope/api/benchmark/adapters/default_data_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,746 copying build/lib/evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,748 copying build/lib/evalscope/api/benchmark/adapters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-05-28T10:08:34,749 copying build/lib/evalscope/api/benchmark/statistics.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-05-28T10:08:34,752 copying build/lib/evalscope/api/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-05-28T10:08:34,753 copying build/lib/evalscope/api/benchmark/meta.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-05-28T10:08:34,756 creating build/bdist.linux-armv7l/wheel/evalscope/api/sandbox 2026-05-28T10:08:34,757 copying build/lib/evalscope/api/sandbox/service.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/sandbox 2026-05-28T10:08:34,760 copying build/lib/evalscope/api/sandbox/engine.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/sandbox 2026-05-28T10:08:34,761 copying build/lib/evalscope/api/sandbox/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/sandbox 2026-05-28T10:08:34,763 copying build/lib/evalscope/api/sandbox/config_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/sandbox 2026-05-28T10:08:34,765 copying build/lib/evalscope/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-05-28T10:08:34,767 creating build/bdist.linux-armv7l/wheel/evalscope/api/metric 2026-05-28T10:08:34,768 copying build/lib/evalscope/api/metric/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-05-28T10:08:34,769 copying build/lib/evalscope/api/metric/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-05-28T10:08:34,771 copying build/lib/evalscope/api/metric/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-05-28T10:08:34,773 creating build/bdist.linux-armv7l/wheel/evalscope/api/agent 2026-05-28T10:08:34,775 copying build/lib/evalscope/api/agent/environment.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,777 copying build/lib/evalscope/api/agent/strategy.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,779 creating build/bdist.linux-armv7l/wheel/evalscope/api/agent/mcp 2026-05-28T10:08:34,780 copying build/lib/evalscope/api/agent/mcp/source.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent/mcp 2026-05-28T10:08:34,783 copying build/lib/evalscope/api/agent/mcp/types.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent/mcp 2026-05-28T10:08:34,785 copying build/lib/evalscope/api/agent/mcp/client.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent/mcp 2026-05-28T10:08:34,787 copying build/lib/evalscope/api/agent/mcp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent/mcp 2026-05-28T10:08:34,789 copying build/lib/evalscope/api/agent/tool_executor.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,790 copying build/lib/evalscope/api/agent/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,792 copying build/lib/evalscope/api/agent/types.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,795 copying build/lib/evalscope/api/agent/loop.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,797 copying build/lib/evalscope/api/agent/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,799 copying build/lib/evalscope/api/agent/trace.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/agent 2026-05-28T10:08:34,801 creating build/bdist.linux-armv7l/wheel/evalscope/agent 2026-05-28T10:08:34,803 creating build/bdist.linux-armv7l/wheel/evalscope/agent/tools 2026-05-28T10:08:34,804 copying build/lib/evalscope/agent/tools/python_exec.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/tools 2026-05-28T10:08:34,806 copying build/lib/evalscope/agent/tools/bash.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/tools 2026-05-28T10:08:34,807 copying build/lib/evalscope/agent/tools/submit.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/tools 2026-05-28T10:08:34,809 copying build/lib/evalscope/agent/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/tools 2026-05-28T10:08:34,812 creating build/bdist.linux-armv7l/wheel/evalscope/agent/environments 2026-05-28T10:08:34,813 copying build/lib/evalscope/agent/environments/local.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/environments 2026-05-28T10:08:34,815 copying build/lib/evalscope/agent/environments/enclave.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/environments 2026-05-28T10:08:34,817 copying build/lib/evalscope/agent/environments/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/environments 2026-05-28T10:08:34,819 creating build/bdist.linux-armv7l/wheel/evalscope/agent/strategies 2026-05-28T10:08:34,820 copying build/lib/evalscope/agent/strategies/react.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies 2026-05-28T10:08:34,823 creating build/bdist.linux-armv7l/wheel/evalscope/agent/strategies/swe_bench 2026-05-28T10:08:34,824 copying build/lib/evalscope/agent/strategies/swe_bench/swe_bench_toolcall.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies/swe_bench 2026-05-28T10:08:34,826 copying build/lib/evalscope/agent/strategies/swe_bench/_observation.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies/swe_bench 2026-05-28T10:08:34,828 copying build/lib/evalscope/agent/strategies/swe_bench/swe_bench_backticks.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies/swe_bench 2026-05-28T10:08:34,830 copying build/lib/evalscope/agent/strategies/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies/swe_bench 2026-05-28T10:08:34,832 copying build/lib/evalscope/agent/strategies/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies 2026-05-28T10:08:34,834 copying build/lib/evalscope/agent/strategies/function_calling.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/strategies 2026-05-28T10:08:34,837 creating build/bdist.linux-armv7l/wheel/evalscope/agent/external 2026-05-28T10:08:34,838 creating build/bdist.linux-armv7l/wheel/evalscope/agent/external/runners 2026-05-28T10:08:34,839 copying build/lib/evalscope/agent/external/runners/mock.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners 2026-05-28T10:08:34,842 copying build/lib/evalscope/agent/external/runners/claude_code.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners 2026-05-28T10:08:34,844 copying build/lib/evalscope/agent/external/runners/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners 2026-05-28T10:08:34,846 copying build/lib/evalscope/agent/external/runners/codex.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners 2026-05-28T10:08:34,849 creating build/bdist.linux-armv7l/wheel/evalscope/agent/external/runners/_assets 2026-05-28T10:08:34,850 copying build/lib/evalscope/agent/external/runners/_assets/claude_code.Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners/_assets 2026-05-28T10:08:34,852 copying build/lib/evalscope/agent/external/runners/_assets/build_claude_code_image.sh -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners/_assets 2026-05-28T10:08:34,854 copying build/lib/evalscope/agent/external/runners/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/runners 2026-05-28T10:08:34,856 creating build/bdist.linux-armv7l/wheel/evalscope/agent/external/helpers 2026-05-28T10:08:34,857 copying build/lib/evalscope/agent/external/helpers/patch.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/helpers 2026-05-28T10:08:34,859 copying build/lib/evalscope/agent/external/helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/helpers 2026-05-28T10:08:34,861 creating build/bdist.linux-armv7l/wheel/evalscope/agent/external/bridge 2026-05-28T10:08:34,862 copying build/lib/evalscope/agent/external/bridge/sse_anthropic.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,864 copying build/lib/evalscope/agent/external/bridge/translate_openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,867 copying build/lib/evalscope/agent/external/bridge/sse_responses.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,869 copying build/lib/evalscope/agent/external/bridge/trace_recorder.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,871 copying build/lib/evalscope/agent/external/bridge/_sse_common.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,873 copying build/lib/evalscope/agent/external/bridge/sse_openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,875 copying build/lib/evalscope/agent/external/bridge/translate_responses.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,877 copying build/lib/evalscope/agent/external/bridge/translate_anthropic.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,879 copying build/lib/evalscope/agent/external/bridge/server.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,882 copying build/lib/evalscope/agent/external/bridge/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external/bridge 2026-05-28T10:08:34,884 copying build/lib/evalscope/agent/external/adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external 2026-05-28T10:08:34,886 copying build/lib/evalscope/agent/external/config.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external 2026-05-28T10:08:34,888 copying build/lib/evalscope/agent/external/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent/external 2026-05-28T10:08:34,890 copying build/lib/evalscope/agent/runner.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent 2026-05-28T10:08:34,892 copying build/lib/evalscope/agent/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/agent 2026-05-28T10:08:34,894 creating build/bdist.linux-armv7l/wheel/evalscope/perf 2026-05-28T10:08:34,895 copying build/lib/evalscope/perf/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-05-28T10:08:34,898 creating build/bdist.linux-armv7l/wheel/evalscope/perf/core 2026-05-28T10:08:34,899 copying build/lib/evalscope/perf/core/http_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core 2026-05-28T10:08:34,902 creating build/bdist.linux-armv7l/wheel/evalscope/perf/core/strategies 2026-05-28T10:08:34,903 copying build/lib/evalscope/perf/core/strategies/multi_turn.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core/strategies 2026-05-28T10:08:34,906 copying build/lib/evalscope/perf/core/strategies/closed_loop.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core/strategies 2026-05-28T10:08:34,908 copying build/lib/evalscope/perf/core/strategies/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core/strategies 2026-05-28T10:08:34,910 copying build/lib/evalscope/perf/core/strategies/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core/strategies 2026-05-28T10:08:34,912 copying build/lib/evalscope/perf/core/strategies/open_loop.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core/strategies 2026-05-28T10:08:34,914 copying build/lib/evalscope/perf/core/metrics_consumer.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core 2026-05-28T10:08:34,916 copying build/lib/evalscope/perf/core/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/core 2026-05-28T10:08:34,919 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils 2026-05-28T10:08:34,920 copying build/lib/evalscope/perf/utils/log_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,922 copying build/lib/evalscope/perf/utils/rich_display.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,925 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils/report 2026-05-28T10:08:34,926 copying build/lib/evalscope/perf/utils/report/perf_charts.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-05-28T10:08:34,929 copying build/lib/evalscope/perf/utils/report/perf_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-05-28T10:08:34,931 copying build/lib/evalscope/perf/utils/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-05-28T10:08:34,933 copying build/lib/evalscope/perf/utils/report/generate_report.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-05-28T10:08:34,935 copying build/lib/evalscope/perf/utils/handler.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,937 copying build/lib/evalscope/perf/utils/perf_constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,939 copying build/lib/evalscope/perf/utils/benchmark_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,941 copying build/lib/evalscope/perf/utils/workload_timeline.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,944 copying build/lib/evalscope/perf/utils/analysis_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,946 copying build/lib/evalscope/perf/utils/local_server.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,948 copying build/lib/evalscope/perf/utils/db_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,950 copying build/lib/evalscope/perf/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,952 copying build/lib/evalscope/perf/utils/trace_metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,954 copying build/lib/evalscope/perf/utils/perf_models.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-05-28T10:08:34,956 copying build/lib/evalscope/perf/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-05-28T10:08:34,960 creating build/bdist.linux-armv7l/wheel/evalscope/perf/sla 2026-05-28T10:08:34,961 copying build/lib/evalscope/perf/sla/sla_criterion.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-05-28T10:08:34,963 copying build/lib/evalscope/perf/sla/sla_run.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-05-28T10:08:34,966 copying build/lib/evalscope/perf/sla/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-05-28T10:08:34,968 copying build/lib/evalscope/perf/multi_turn_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-05-28T10:08:34,971 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin 2026-05-28T10:08:34,972 copying build/lib/evalscope/perf/plugin/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-05-28T10:08:34,974 copying build/lib/evalscope/perf/plugin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-05-28T10:08:34,976 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/api 2026-05-28T10:08:34,977 copying build/lib/evalscope/perf/plugin/api/openai_rerank_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,979 copying build/lib/evalscope/perf/plugin/api/openai_embedding_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,981 copying build/lib/evalscope/perf/plugin/api/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,984 copying build/lib/evalscope/perf/plugin/api/dashscope_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,986 copying build/lib/evalscope/perf/plugin/api/openai_responses_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,988 copying build/lib/evalscope/perf/plugin/api/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,990 copying build/lib/evalscope/perf/plugin/api/custom_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,992 copying build/lib/evalscope/perf/plugin/api/default_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,994 copying build/lib/evalscope/perf/plugin/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-05-28T10:08:34,997 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/datasets 2026-05-28T10:08:34,998 copying build/lib/evalscope/perf/plugin/datasets/flickr8k.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,000 copying build/lib/evalscope/perf/plugin/datasets/random_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,002 copying build/lib/evalscope/perf/plugin/datasets/multi_turn.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,004 copying build/lib/evalscope/perf/plugin/datasets/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,007 copying build/lib/evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,008 copying build/lib/evalscope/perf/plugin/datasets/custom.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,011 copying build/lib/evalscope/perf/plugin/datasets/share_gpt.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,013 copying build/lib/evalscope/perf/plugin/datasets/swe_smith.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,015 copying build/lib/evalscope/perf/plugin/datasets/openqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,017 copying build/lib/evalscope/perf/plugin/datasets/rerank_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,019 copying build/lib/evalscope/perf/plugin/datasets/longalpaca.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,021 copying build/lib/evalscope/perf/plugin/datasets/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,024 copying build/lib/evalscope/perf/plugin/datasets/line_by_line.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,026 copying build/lib/evalscope/perf/plugin/datasets/trie.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,028 copying build/lib/evalscope/perf/plugin/datasets/speed_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,030 copying build/lib/evalscope/perf/plugin/datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,032 copying build/lib/evalscope/perf/plugin/datasets/kontext_bench.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,033 copying build/lib/evalscope/perf/plugin/datasets/embedding_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-05-28T10:08:35,035 copying build/lib/evalscope/perf/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-05-28T10:08:35,037 copying build/lib/evalscope/perf/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-05-28T10:08:35,039 copying build/lib/evalscope/perf/multi_turn_args.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-05-28T10:08:35,041 creating build/bdist.linux-armv7l/wheel/evalscope/metrics 2026-05-28T10:08:35,042 copying build/lib/evalscope/metrics/math_parser.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-05-28T10:08:35,045 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/sem_score 2026-05-28T10:08:35,047 copying build/lib/evalscope/metrics/sem_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-05-28T10:08:35,049 copying build/lib/evalscope/metrics/sem_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-05-28T10:08:35,051 copying build/lib/evalscope/metrics/rouge_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-05-28T10:08:35,053 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bundled_rouge_score 2026-05-28T10:08:35,054 copying build/lib/evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-05-28T10:08:35,057 copying build/lib/evalscope/metrics/bundled_rouge_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-05-28T10:08:35,059 copying build/lib/evalscope/metrics/llm_judge.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-05-28T10:08:35,061 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/text_normalizer 2026-05-28T10:08:35,063 copying build/lib/evalscope/metrics/text_normalizer/basic.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-05-28T10:08:35,065 copying build/lib/evalscope/metrics/text_normalizer/chinese.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-05-28T10:08:35,068 copying build/lib/evalscope/metrics/text_normalizer/wer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-05-28T10:08:35,070 copying build/lib/evalscope/metrics/text_normalizer/english.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-05-28T10:08:35,073 copying build/lib/evalscope/metrics/text_normalizer/english.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-05-28T10:08:35,076 copying build/lib/evalscope/metrics/text_normalizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-05-28T10:08:35,079 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,080 copying build/lib/evalscope/metrics/t2v_metrics/clipscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,082 copying build/lib/evalscope/metrics/t2v_metrics/itmscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,084 copying build/lib/evalscope/metrics/t2v_metrics/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,086 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:35,087 copying build/lib/evalscope/metrics/t2v_metrics/models/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:35,090 copying build/lib/evalscope/metrics/t2v_metrics/models/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:35,093 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:35,094 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-05-28T10:08:35,097 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-05-28T10:08:35,099 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-05-28T10:08:35,100 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-05-28T10:08:35,103 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-05-28T10:08:35,104 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-05-28T10:08:35,107 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-05-28T10:08:35,109 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-05-28T10:08:35,112 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-05-28T10:08:35,113 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-05-28T10:08:35,117 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-05-28T10:08:35,119 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:35,122 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:35,124 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:35,127 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-05-28T10:08:35,129 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,132 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,133 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,135 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,138 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,141 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,144 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,147 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,149 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,152 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,155 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,157 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,160 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-05-28T10:08:35,164 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,165 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,169 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,172 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,175 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,178 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,182 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,186 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,188 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,190 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,194 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-05-28T10:08:35,196 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,199 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,201 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,205 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,207 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,209 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-05-28T10:08:35,212 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,213 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,216 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,219 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:35,220 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:35,222 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:35,225 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-05-28T10:08:35,227 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,229 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,231 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,233 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,235 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-05-28T10:08:35,237 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-05-28T10:08:35,239 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-05-28T10:08:35,241 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:35,242 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:35,244 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:35,246 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,247 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,249 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,251 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,253 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,255 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,256 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,258 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,260 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,262 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,264 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,266 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,267 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,269 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,271 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,273 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,275 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,276 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,278 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,280 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-05-28T10:08:35,282 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-05-28T10:08:35,284 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-05-28T10:08:35,286 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:35,287 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:35,290 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:35,292 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:35,294 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-05-28T10:08:35,296 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:35,298 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-05-28T10:08:35,300 copying build/lib/evalscope/metrics/t2v_metrics/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-05-28T10:08:35,301 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:35,303 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:35,304 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:35,306 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:35,308 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-05-28T10:08:35,310 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:35,312 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:35,314 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:35,316 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-05-28T10:08:35,318 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:35,319 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:35,322 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:35,323 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:35,325 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:35,327 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:35,329 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-05-28T10:08:35,331 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:35,333 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:35,335 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:35,337 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-05-28T10:08:35,339 copying build/lib/evalscope/metrics/t2v_metrics/vqascore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,340 copying build/lib/evalscope/metrics/t2v_metrics/score.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,342 copying build/lib/evalscope/metrics/t2v_metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-05-28T10:08:35,344 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bert_score 2026-05-28T10:08:35,345 copying build/lib/evalscope/metrics/bert_score/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-05-28T10:08:35,348 copying build/lib/evalscope/metrics/bert_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-05-28T10:08:35,350 copying build/lib/evalscope/metrics/bert_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-05-28T10:08:35,352 copying build/lib/evalscope/metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-05-28T10:08:35,353 copying build/lib/evalscope/metrics/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-05-28T10:08:35,356 copying build/lib/evalscope/metrics/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-05-28T10:08:35,359 running install_egg_info 2026-05-28T10:08:35,364 Copying evalscope.egg-info to build/bdist.linux-armv7l/wheel/./evalscope-1.8.0-py3.11.egg-info 2026-05-28T10:08:35,377 running install_scripts 2026-05-28T10:08:35,390 creating build/bdist.linux-armv7l/wheel/evalscope-1.8.0.dist-info/WHEEL 2026-05-28T10:08:35,392 creating '/tmp/pip-wheel-p75lnt3d/.tmp-lfsrr_5i/evalscope-1.8.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-05-28T10:08:35,395 adding 'evalscope/__init__.py' 2026-05-28T10:08:35,396 adding 'evalscope/arguments.py' 2026-05-28T10:08:35,400 adding 'evalscope/config.py' 2026-05-28T10:08:35,402 adding 'evalscope/constants.py' 2026-05-28T10:08:35,403 adding 'evalscope/run.py' 2026-05-28T10:08:35,405 adding 'evalscope/version.py' 2026-05-28T10:08:35,407 adding 'evalscope/agent/__init__.py' 2026-05-28T10:08:35,408 adding 'evalscope/agent/runner.py' 2026-05-28T10:08:35,410 adding 'evalscope/agent/environments/__init__.py' 2026-05-28T10:08:35,412 adding 'evalscope/agent/environments/enclave.py' 2026-05-28T10:08:35,414 adding 'evalscope/agent/environments/local.py' 2026-05-28T10:08:35,415 adding 'evalscope/agent/external/__init__.py' 2026-05-28T10:08:35,417 adding 'evalscope/agent/external/adapter.py' 2026-05-28T10:08:35,419 adding 'evalscope/agent/external/config.py' 2026-05-28T10:08:35,421 adding 'evalscope/agent/external/bridge/__init__.py' 2026-05-28T10:08:35,422 adding 'evalscope/agent/external/bridge/_sse_common.py' 2026-05-28T10:08:35,426 adding 'evalscope/agent/external/bridge/server.py' 2026-05-28T10:08:35,428 adding 'evalscope/agent/external/bridge/sse_anthropic.py' 2026-05-28T10:08:35,430 adding 'evalscope/agent/external/bridge/sse_openai.py' 2026-05-28T10:08:35,431 adding 'evalscope/agent/external/bridge/sse_responses.py' 2026-05-28T10:08:35,434 adding 'evalscope/agent/external/bridge/trace_recorder.py' 2026-05-28T10:08:35,436 adding 'evalscope/agent/external/bridge/translate_anthropic.py' 2026-05-28T10:08:35,438 adding 'evalscope/agent/external/bridge/translate_openai.py' 2026-05-28T10:08:35,441 adding 'evalscope/agent/external/bridge/translate_responses.py' 2026-05-28T10:08:35,443 adding 'evalscope/agent/external/helpers/__init__.py' 2026-05-28T10:08:35,444 adding 'evalscope/agent/external/helpers/patch.py' 2026-05-28T10:08:35,446 adding 'evalscope/agent/external/runners/__init__.py' 2026-05-28T10:08:35,448 adding 'evalscope/agent/external/runners/base.py' 2026-05-28T10:08:35,450 adding 'evalscope/agent/external/runners/claude_code.py' 2026-05-28T10:08:35,453 adding 'evalscope/agent/external/runners/codex.py' 2026-05-28T10:08:35,456 adding 'evalscope/agent/external/runners/mock.py' 2026-05-28T10:08:35,460 adding 'evalscope/agent/external/runners/_assets/build_claude_code_image.sh' 2026-05-28T10:08:35,463 adding 'evalscope/agent/external/runners/_assets/claude_code.Dockerfile' 2026-05-28T10:08:35,467 adding 'evalscope/agent/strategies/__init__.py' 2026-05-28T10:08:35,470 adding 'evalscope/agent/strategies/function_calling.py' 2026-05-28T10:08:35,474 adding 'evalscope/agent/strategies/react.py' 2026-05-28T10:08:35,480 adding 'evalscope/agent/strategies/swe_bench/__init__.py' 2026-05-28T10:08:35,485 adding 'evalscope/agent/strategies/swe_bench/_observation.py' 2026-05-28T10:08:35,487 adding 'evalscope/agent/strategies/swe_bench/swe_bench_backticks.py' 2026-05-28T10:08:35,489 adding 'evalscope/agent/strategies/swe_bench/swe_bench_toolcall.py' 2026-05-28T10:08:35,490 adding 'evalscope/agent/tools/__init__.py' 2026-05-28T10:08:35,492 adding 'evalscope/agent/tools/bash.py' 2026-05-28T10:08:35,493 adding 'evalscope/agent/tools/python_exec.py' 2026-05-28T10:08:35,495 adding 'evalscope/agent/tools/submit.py' 2026-05-28T10:08:35,497 adding 'evalscope/api/__init__.py' 2026-05-28T10:08:35,500 adding 'evalscope/api/registry.py' 2026-05-28T10:08:35,502 adding 'evalscope/api/agent/__init__.py' 2026-05-28T10:08:35,504 adding 'evalscope/api/agent/constants.py' 2026-05-28T10:08:35,505 adding 'evalscope/api/agent/environment.py' 2026-05-28T10:08:35,507 adding 'evalscope/api/agent/loop.py' 2026-05-28T10:08:35,509 adding 'evalscope/api/agent/strategy.py' 2026-05-28T10:08:35,510 adding 'evalscope/api/agent/tool_executor.py' 2026-05-28T10:08:35,512 adding 'evalscope/api/agent/trace.py' 2026-05-28T10:08:35,513 adding 'evalscope/api/agent/types.py' 2026-05-28T10:08:35,515 adding 'evalscope/api/agent/mcp/__init__.py' 2026-05-28T10:08:35,516 adding 'evalscope/api/agent/mcp/client.py' 2026-05-28T10:08:35,518 adding 'evalscope/api/agent/mcp/source.py' 2026-05-28T10:08:35,519 adding 'evalscope/api/agent/mcp/types.py' 2026-05-28T10:08:35,521 adding 'evalscope/api/benchmark/__init__.py' 2026-05-28T10:08:35,523 adding 'evalscope/api/benchmark/benchmark.py' 2026-05-28T10:08:35,525 adding 'evalscope/api/benchmark/meta.py' 2026-05-28T10:08:35,527 adding 'evalscope/api/benchmark/statistics.py' 2026-05-28T10:08:35,529 adding 'evalscope/api/benchmark/adapters/__init__.py' 2026-05-28T10:08:35,531 adding 'evalscope/api/benchmark/adapters/_agent_loop_runner.py' 2026-05-28T10:08:35,532 adding 'evalscope/api/benchmark/adapters/agent_adapter.py' 2026-05-28T10:08:35,534 adding 'evalscope/api/benchmark/adapters/agent_loop_adapter.py' 2026-05-28T10:08:35,538 adding 'evalscope/api/benchmark/adapters/default_data_adapter.py' 2026-05-28T10:08:35,540 adding 'evalscope/api/benchmark/adapters/image_edit_adapter.py' 2026-05-28T10:08:35,541 adding 'evalscope/api/benchmark/adapters/multi_choice_adapter.py' 2026-05-28T10:08:35,543 adding 'evalscope/api/benchmark/adapters/multi_turn_adapter.py' 2026-05-28T10:08:35,544 adding 'evalscope/api/benchmark/adapters/ner_adapter.py' 2026-05-28T10:08:35,546 adding 'evalscope/api/benchmark/adapters/text2image_adapter.py' 2026-05-28T10:08:35,548 adding 'evalscope/api/benchmark/adapters/vendor_verifier_adapter.py' 2026-05-28T10:08:35,550 adding 'evalscope/api/benchmark/adapters/vision_language_adapter.py' 2026-05-28T10:08:35,552 adding 'evalscope/api/dataset/__init__.py' 2026-05-28T10:08:35,554 adding 'evalscope/api/dataset/dataset.py' 2026-05-28T10:08:35,555 adding 'evalscope/api/dataset/hub.py' 2026-05-28T10:08:35,557 adding 'evalscope/api/dataset/loader.py' 2026-05-28T10:08:35,559 adding 'evalscope/api/dataset/utils.py' 2026-05-28T10:08:35,561 adding 'evalscope/api/evaluator/__init__.py' 2026-05-28T10:08:35,563 adding 'evalscope/api/evaluator/cache.py' 2026-05-28T10:08:35,564 adding 'evalscope/api/evaluator/evaluator.py' 2026-05-28T10:08:35,565 adding 'evalscope/api/evaluator/inference_result.py' 2026-05-28T10:08:35,567 adding 'evalscope/api/evaluator/state.py' 2026-05-28T10:08:35,570 adding 'evalscope/api/filter/__init__.py' 2026-05-28T10:08:35,572 adding 'evalscope/api/filter/filter.py' 2026-05-28T10:08:35,575 adding 'evalscope/api/messages/__init__.py' 2026-05-28T10:08:35,577 adding 'evalscope/api/messages/chat_message.py' 2026-05-28T10:08:35,578 adding 'evalscope/api/messages/content.py' 2026-05-28T10:08:35,580 adding 'evalscope/api/messages/perf_metrics.py' 2026-05-28T10:08:35,582 adding 'evalscope/api/messages/utils.py' 2026-05-28T10:08:35,584 adding 'evalscope/api/metric/__init__.py' 2026-05-28T10:08:35,586 adding 'evalscope/api/metric/metric.py' 2026-05-28T10:08:35,588 adding 'evalscope/api/metric/scorer.py' 2026-05-28T10:08:35,590 adding 'evalscope/api/mixin/__init__.py' 2026-05-28T10:08:35,592 adding 'evalscope/api/mixin/llm_judge_mixin.py' 2026-05-28T10:08:35,594 adding 'evalscope/api/mixin/sandbox_mixin.py' 2026-05-28T10:08:35,596 adding 'evalscope/api/model/__init__.py' 2026-05-28T10:08:35,599 adding 'evalscope/api/model/generate_config.py' 2026-05-28T10:08:35,601 adding 'evalscope/api/model/lazy_model.py' 2026-05-28T10:08:35,603 adding 'evalscope/api/model/model.py' 2026-05-28T10:08:35,606 adding 'evalscope/api/model/model_output.py' 2026-05-28T10:08:35,609 adding 'evalscope/api/sandbox/__init__.py' 2026-05-28T10:08:35,611 adding 'evalscope/api/sandbox/config_builder.py' 2026-05-28T10:08:35,614 adding 'evalscope/api/sandbox/engine.py' 2026-05-28T10:08:35,616 adding 'evalscope/api/sandbox/service.py' 2026-05-28T10:08:35,619 adding 'evalscope/api/tool/__init__.py' 2026-05-28T10:08:35,621 adding 'evalscope/api/tool/tool_call.py' 2026-05-28T10:08:35,623 adding 'evalscope/api/tool/tool_info.py' 2026-05-28T10:08:35,625 adding 'evalscope/api/tool/utils.py' 2026-05-28T10:08:35,627 adding 'evalscope/backend/__init__.py' 2026-05-28T10:08:35,628 adding 'evalscope/backend/base.py' 2026-05-28T10:08:35,630 adding 'evalscope/backend/opencompass/__init__.py' 2026-05-28T10:08:35,632 adding 'evalscope/backend/opencompass/api_meta_template.py' 2026-05-28T10:08:35,634 adding 'evalscope/backend/opencompass/backend_manager.py' 2026-05-28T10:08:35,636 adding 'evalscope/backend/opencompass/tasks/__init__.py' 2026-05-28T10:08:35,638 adding 'evalscope/backend/opencompass/tasks/eval_api.py' 2026-05-28T10:08:35,640 adding 'evalscope/backend/opencompass/tasks/eval_datasets.py' 2026-05-28T10:08:35,642 adding 'evalscope/backend/rag_eval/__init__.py' 2026-05-28T10:08:35,644 adding 'evalscope/backend/rag_eval/backend_manager.py' 2026-05-28T10:08:35,646 adding 'evalscope/backend/rag_eval/clip_benchmark/__init__.py' 2026-05-28T10:08:35,647 adding 'evalscope/backend/rag_eval/clip_benchmark/arguments.py' 2026-05-28T10:08:35,650 adding 'evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py' 2026-05-28T10:08:35,652 adding 'evalscope/backend/rag_eval/clip_benchmark/task_template.py' 2026-05-28T10:08:35,653 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py' 2026-05-28T10:08:35,655 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py' 2026-05-28T10:08:35,657 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py' 2026-05-28T10:08:35,660 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py' 2026-05-28T10:08:35,662 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py' 2026-05-28T10:08:35,664 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt' 2026-05-28T10:08:35,666 adding 'evalscope/backend/rag_eval/cmteb/__init__.py' 2026-05-28T10:08:35,668 adding 'evalscope/backend/rag_eval/cmteb/arguments.py' 2026-05-28T10:08:35,670 adding 'evalscope/backend/rag_eval/cmteb/base.py' 2026-05-28T10:08:35,671 adding 'evalscope/backend/rag_eval/cmteb/task_template.py' 2026-05-28T10:08:35,674 adding 'evalscope/backend/rag_eval/cmteb/tasks/Classification.py' 2026-05-28T10:08:35,676 adding 'evalscope/backend/rag_eval/cmteb/tasks/Clustering.py' 2026-05-28T10:08:35,678 adding 'evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py' 2026-05-28T10:08:35,679 adding 'evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py' 2026-05-28T10:08:35,681 adding 'evalscope/backend/rag_eval/cmteb/tasks/Reranking.py' 2026-05-28T10:08:35,683 adding 'evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py' 2026-05-28T10:08:35,685 adding 'evalscope/backend/rag_eval/cmteb/tasks/STS.py' 2026-05-28T10:08:35,687 adding 'evalscope/backend/rag_eval/cmteb/tasks/__init__.py' 2026-05-28T10:08:35,689 adding 'evalscope/backend/rag_eval/ragas/__init__.py' 2026-05-28T10:08:35,691 adding 'evalscope/backend/rag_eval/ragas/arguments.py' 2026-05-28T10:08:35,692 adding 'evalscope/backend/rag_eval/ragas/task_template.py' 2026-05-28T10:08:35,694 adding 'evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py' 2026-05-28T10:08:35,696 adding 'evalscope/backend/rag_eval/ragas/tasks/__init__.py' 2026-05-28T10:08:35,698 adding 'evalscope/backend/rag_eval/ragas/tasks/build_distribution.py' 2026-05-28T10:08:35,700 adding 'evalscope/backend/rag_eval/ragas/tasks/build_transform.py' 2026-05-28T10:08:35,702 adding 'evalscope/backend/rag_eval/ragas/tasks/testset_generation.py' 2026-05-28T10:08:35,704 adding 'evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py' 2026-05-28T10:08:35,706 adding 'evalscope/backend/rag_eval/utils/__init__.py' 2026-05-28T10:08:35,708 adding 'evalscope/backend/rag_eval/utils/clip.py' 2026-05-28T10:08:35,710 adding 'evalscope/backend/rag_eval/utils/embedding.py' 2026-05-28T10:08:35,712 adding 'evalscope/backend/rag_eval/utils/llm.py' 2026-05-28T10:08:35,714 adding 'evalscope/backend/rag_eval/utils/tools.py' 2026-05-28T10:08:35,715 adding 'evalscope/backend/vlm_eval_kit/__init__.py' 2026-05-28T10:08:35,717 adding 'evalscope/backend/vlm_eval_kit/backend_manager.py' 2026-05-28T10:08:35,720 adding 'evalscope/benchmarks/__init__.py' 2026-05-28T10:08:35,725 adding 'evalscope/benchmarks/_meta/a_okvqa.json' 2026-05-28T10:08:35,727 adding 'evalscope/benchmarks/_meta/aa_lcr.json' 2026-05-28T10:08:35,729 adding 'evalscope/benchmarks/_meta/ai2d.json' 2026-05-28T10:08:35,731 adding 'evalscope/benchmarks/_meta/aime24.json' 2026-05-28T10:08:35,733 adding 'evalscope/benchmarks/_meta/aime25.json' 2026-05-28T10:08:35,735 adding 'evalscope/benchmarks/_meta/aime26.json' 2026-05-28T10:08:35,737 adding 'evalscope/benchmarks/_meta/air_bench_chat.json' 2026-05-28T10:08:35,740 adding 'evalscope/benchmarks/_meta/air_bench_foundation.json' 2026-05-28T10:08:35,742 adding 'evalscope/benchmarks/_meta/alpaca_eval.json' 2026-05-28T10:08:35,744 adding 'evalscope/benchmarks/_meta/amc.json' 2026-05-28T10:08:35,746 adding 'evalscope/benchmarks/_meta/anat_em.json' 2026-05-28T10:08:35,748 adding 'evalscope/benchmarks/_meta/arc.json' 2026-05-28T10:08:35,750 adding 'evalscope/benchmarks/_meta/arena_hard.json' 2026-05-28T10:08:35,753 adding 'evalscope/benchmarks/_meta/arxivrollbench.json' 2026-05-28T10:08:35,756 adding 'evalscope/benchmarks/_meta/arxivrollbench_full.json' 2026-05-28T10:08:35,759 adding 'evalscope/benchmarks/_meta/bbh.json' 2026-05-28T10:08:35,761 adding 'evalscope/benchmarks/_meta/bc2gm.json' 2026-05-28T10:08:35,763 adding 'evalscope/benchmarks/_meta/bc4chemd.json' 2026-05-28T10:08:35,765 adding 'evalscope/benchmarks/_meta/bc5cdr.json' 2026-05-28T10:08:35,768 adding 'evalscope/benchmarks/_meta/bfcl_v3.json' 2026-05-28T10:08:35,770 adding 'evalscope/benchmarks/_meta/bfcl_v4.json' 2026-05-28T10:08:35,772 adding 'evalscope/benchmarks/_meta/biomix_qa.json' 2026-05-28T10:08:35,775 adding 'evalscope/benchmarks/_meta/blink.json' 2026-05-28T10:08:35,777 adding 'evalscope/benchmarks/_meta/broad_twitter_corpus.json' 2026-05-28T10:08:35,779 adding 'evalscope/benchmarks/_meta/cc_bench.json' 2026-05-28T10:08:35,782 adding 'evalscope/benchmarks/_meta/ceval.json' 2026-05-28T10:08:35,784 adding 'evalscope/benchmarks/_meta/chartqa.json' 2026-05-28T10:08:35,786 adding 'evalscope/benchmarks/_meta/chinese_simpleqa.json' 2026-05-28T10:08:35,789 adding 'evalscope/benchmarks/_meta/cl_bench.json' 2026-05-28T10:08:35,792 adding 'evalscope/benchmarks/_meta/cmmlu.json' 2026-05-28T10:08:35,796 adding 'evalscope/benchmarks/_meta/cmmmu.json' 2026-05-28T10:08:35,798 adding 'evalscope/benchmarks/_meta/cmmu.json' 2026-05-28T10:08:35,800 adding 'evalscope/benchmarks/_meta/coin_flip.json' 2026-05-28T10:08:35,802 adding 'evalscope/benchmarks/_meta/commonsense_qa.json' 2026-05-28T10:08:35,804 adding 'evalscope/benchmarks/_meta/competition_math.json' 2026-05-28T10:08:35,806 adding 'evalscope/benchmarks/_meta/conll2003.json' 2026-05-28T10:08:35,808 adding 'evalscope/benchmarks/_meta/conllpp.json' 2026-05-28T10:08:35,812 adding 'evalscope/benchmarks/_meta/copious.json' 2026-05-28T10:08:35,814 adding 'evalscope/benchmarks/_meta/cross_ner.json' 2026-05-28T10:08:35,816 adding 'evalscope/benchmarks/_meta/data_collection.json' 2026-05-28T10:08:35,818 adding 'evalscope/benchmarks/_meta/docmath.json' 2026-05-28T10:08:35,820 adding 'evalscope/benchmarks/_meta/docvqa.json' 2026-05-28T10:08:35,821 adding 'evalscope/benchmarks/_meta/drivel_binary.json' 2026-05-28T10:08:35,823 adding 'evalscope/benchmarks/_meta/drivel_multilabel.json' 2026-05-28T10:08:35,825 adding 'evalscope/benchmarks/_meta/drivel_selection.json' 2026-05-28T10:08:35,827 adding 'evalscope/benchmarks/_meta/drivel_writing.json' 2026-05-28T10:08:35,829 adding 'evalscope/benchmarks/_meta/drop.json' 2026-05-28T10:08:35,831 adding 'evalscope/benchmarks/_meta/eq_bench.json' 2026-05-28T10:08:35,833 adding 'evalscope/benchmarks/_meta/evalmuse.json' 2026-05-28T10:08:35,835 adding 'evalscope/benchmarks/_meta/fin_ner.json' 2026-05-28T10:08:35,837 adding 'evalscope/benchmarks/_meta/fleurs.json' 2026-05-28T10:08:35,839 adding 'evalscope/benchmarks/_meta/frames.json' 2026-05-28T10:08:35,841 adding 'evalscope/benchmarks/_meta/gaia.json' 2026-05-28T10:08:35,844 adding 'evalscope/benchmarks/_meta/gedit.json' 2026-05-28T10:08:35,846 adding 'evalscope/benchmarks/_meta/genai_bench.json' 2026-05-28T10:08:35,848 adding 'evalscope/benchmarks/_meta/general_arena.json' 2026-05-28T10:08:35,851 adding 'evalscope/benchmarks/_meta/general_fc.json' 2026-05-28T10:08:35,853 adding 'evalscope/benchmarks/_meta/general_mcq.json' 2026-05-28T10:08:35,854 adding 'evalscope/benchmarks/_meta/general_qa.json' 2026-05-28T10:08:35,856 adding 'evalscope/benchmarks/_meta/general_t2i.json' 2026-05-28T10:08:35,858 adding 'evalscope/benchmarks/_meta/general_vmcq.json' 2026-05-28T10:08:35,859 adding 'evalscope/benchmarks/_meta/general_vqa.json' 2026-05-28T10:08:35,861 adding 'evalscope/benchmarks/_meta/genia_ner.json' 2026-05-28T10:08:35,863 adding 'evalscope/benchmarks/_meta/gpqa_diamond.json' 2026-05-28T10:08:35,865 adding 'evalscope/benchmarks/_meta/gsm8k.json' 2026-05-28T10:08:35,867 adding 'evalscope/benchmarks/_meta/gsm8k_v.json' 2026-05-28T10:08:35,869 adding 'evalscope/benchmarks/_meta/hallusion_bench.json' 2026-05-28T10:08:35,871 adding 'evalscope/benchmarks/_meta/halueval.json' 2026-05-28T10:08:35,873 adding 'evalscope/benchmarks/_meta/harvey_ner.json' 2026-05-28T10:08:35,875 adding 'evalscope/benchmarks/_meta/health_bench.json' 2026-05-28T10:08:35,877 adding 'evalscope/benchmarks/_meta/hellaswag.json' 2026-05-28T10:08:35,880 adding 'evalscope/benchmarks/_meta/hle.json' 2026-05-28T10:08:35,882 adding 'evalscope/benchmarks/_meta/hmmt25.json' 2026-05-28T10:08:35,883 adding 'evalscope/benchmarks/_meta/hpdv2.json' 2026-05-28T10:08:35,885 adding 'evalscope/benchmarks/_meta/humaneval.json' 2026-05-28T10:08:35,888 adding 'evalscope/benchmarks/_meta/humaneval_plus.json' 2026-05-28T10:08:35,890 adding 'evalscope/benchmarks/_meta/ifbench.json' 2026-05-28T10:08:35,892 adding 'evalscope/benchmarks/_meta/ifeval.json' 2026-05-28T10:08:35,894 adding 'evalscope/benchmarks/_meta/infovqa.json' 2026-05-28T10:08:35,897 adding 'evalscope/benchmarks/_meta/iquiz.json' 2026-05-28T10:08:35,900 adding 'evalscope/benchmarks/_meta/jnlpba.json' 2026-05-28T10:08:35,903 adding 'evalscope/benchmarks/_meta/jnlpba_rare.json' 2026-05-28T10:08:35,907 adding 'evalscope/benchmarks/_meta/k2_verifier.json' 2026-05-28T10:08:35,910 adding 'evalscope/benchmarks/_meta/kimi_verifier.json' 2026-05-28T10:08:35,913 adding 'evalscope/benchmarks/_meta/librispeech.json' 2026-05-28T10:08:35,916 adding 'evalscope/benchmarks/_meta/live_code_bench.json' 2026-05-28T10:08:35,919 adding 'evalscope/benchmarks/_meta/logi_qa.json' 2026-05-28T10:08:35,922 adding 'evalscope/benchmarks/_meta/longbench_v2.json' 2026-05-28T10:08:35,924 adding 'evalscope/benchmarks/_meta/maritime_bench.json' 2026-05-28T10:08:35,927 adding 'evalscope/benchmarks/_meta/math_500.json' 2026-05-28T10:08:35,930 adding 'evalscope/benchmarks/_meta/math_qa.json' 2026-05-28T10:08:35,933 adding 'evalscope/benchmarks/_meta/math_verse.json' 2026-05-28T10:08:35,936 adding 'evalscope/benchmarks/_meta/math_vision.json' 2026-05-28T10:08:35,939 adding 'evalscope/benchmarks/_meta/math_vista.json' 2026-05-28T10:08:35,942 adding 'evalscope/benchmarks/_meta/mbpp.json' 2026-05-28T10:08:35,945 adding 'evalscope/benchmarks/_meta/mbpp_plus.json' 2026-05-28T10:08:35,948 adding 'evalscope/benchmarks/_meta/med_mcqa.json' 2026-05-28T10:08:35,951 adding 'evalscope/benchmarks/_meta/mgsm.json' 2026-05-28T10:08:35,953 adding 'evalscope/benchmarks/_meta/mia_bench.json' 2026-05-28T10:08:35,957 adding 'evalscope/benchmarks/_meta/micro_vqa.json' 2026-05-28T10:08:35,960 adding 'evalscope/benchmarks/_meta/minerva_math.json' 2026-05-28T10:08:35,963 adding 'evalscope/benchmarks/_meta/minimax_verifier.json' 2026-05-28T10:08:35,966 adding 'evalscope/benchmarks/_meta/mit_movie_trivia.json' 2026-05-28T10:08:35,969 adding 'evalscope/benchmarks/_meta/mit_restaurant.json' 2026-05-28T10:08:35,972 adding 'evalscope/benchmarks/_meta/mm_bench.json' 2026-05-28T10:08:35,975 adding 'evalscope/benchmarks/_meta/mm_star.json' 2026-05-28T10:08:35,980 adding 'evalscope/benchmarks/_meta/mmlu.json' 2026-05-28T10:08:35,984 adding 'evalscope/benchmarks/_meta/mmlu_pro.json' 2026-05-28T10:08:35,988 adding 'evalscope/benchmarks/_meta/mmlu_redux.json' 2026-05-28T10:08:35,991 adding 'evalscope/benchmarks/_meta/mmmlu.json' 2026-05-28T10:08:35,996 adding 'evalscope/benchmarks/_meta/mmmu.json' 2026-05-28T10:08:36,000 adding 'evalscope/benchmarks/_meta/mmmu_pro.json' 2026-05-28T10:08:36,002 adding 'evalscope/benchmarks/_meta/mri_mcqa.json' 2026-05-28T10:08:36,004 adding 'evalscope/benchmarks/_meta/multi_if.json' 2026-05-28T10:08:36,006 adding 'evalscope/benchmarks/_meta/multi_nerd.json' 2026-05-28T10:08:36,009 adding 'evalscope/benchmarks/_meta/multiple_humaneval.json' 2026-05-28T10:08:36,011 adding 'evalscope/benchmarks/_meta/multiple_mbpp.json' 2026-05-28T10:08:36,013 adding 'evalscope/benchmarks/_meta/music_trivia.json' 2026-05-28T10:08:36,015 adding 'evalscope/benchmarks/_meta/musr.json' 2026-05-28T10:08:36,017 adding 'evalscope/benchmarks/_meta/mvbench.json' 2026-05-28T10:08:36,019 adding 'evalscope/benchmarks/_meta/ncbi.json' 2026-05-28T10:08:36,021 adding 'evalscope/benchmarks/_meta/needle_haystack.json' 2026-05-28T10:08:36,024 adding 'evalscope/benchmarks/_meta/ocr_bench.json' 2026-05-28T10:08:36,027 adding 'evalscope/benchmarks/_meta/ocr_bench_v2.json' 2026-05-28T10:08:36,030 adding 'evalscope/benchmarks/_meta/olympiad_bench.json' 2026-05-28T10:08:36,033 adding 'evalscope/benchmarks/_meta/omni_bench.json' 2026-05-28T10:08:36,036 adding 'evalscope/benchmarks/_meta/omni_doc_bench.json' 2026-05-28T10:08:36,039 adding 'evalscope/benchmarks/_meta/ontonotes5.json' 2026-05-28T10:08:36,041 adding 'evalscope/benchmarks/_meta/openai_mrcr.json' 2026-05-28T10:08:36,043 adding 'evalscope/benchmarks/_meta/piqa.json' 2026-05-28T10:08:36,046 adding 'evalscope/benchmarks/_meta/poly_math.json' 2026-05-28T10:08:36,048 adding 'evalscope/benchmarks/_meta/pope.json' 2026-05-28T10:08:36,050 adding 'evalscope/benchmarks/_meta/process_bench.json' 2026-05-28T10:08:36,052 adding 'evalscope/benchmarks/_meta/pubmedqa.json' 2026-05-28T10:08:36,054 adding 'evalscope/benchmarks/_meta/qasc.json' 2026-05-28T10:08:36,055 adding 'evalscope/benchmarks/_meta/race.json' 2026-05-28T10:08:36,057 adding 'evalscope/benchmarks/_meta/real_world_qa.json' 2026-05-28T10:08:36,060 adding 'evalscope/benchmarks/_meta/refcoco.json' 2026-05-28T10:08:36,066 adding 'evalscope/benchmarks/_meta/scicode.json' 2026-05-28T10:08:36,069 adding 'evalscope/benchmarks/_meta/science_qa.json' 2026-05-28T10:08:36,070 adding 'evalscope/benchmarks/_meta/sciq.json' 2026-05-28T10:08:36,072 adding 'evalscope/benchmarks/_meta/seed_bench_2_plus.json' 2026-05-28T10:08:36,074 adding 'evalscope/benchmarks/_meta/simple_qa.json' 2026-05-28T10:08:36,076 adding 'evalscope/benchmarks/_meta/simple_vqa.json' 2026-05-28T10:08:36,078 adding 'evalscope/benchmarks/_meta/siqa.json' 2026-05-28T10:08:36,081 adding 'evalscope/benchmarks/_meta/super_gpqa.json' 2026-05-28T10:08:36,083 adding 'evalscope/benchmarks/_meta/swe_bench_lite.json' 2026-05-28T10:08:36,086 adding 'evalscope/benchmarks/_meta/swe_bench_lite_agentic.json' 2026-05-28T10:08:36,089 adding 'evalscope/benchmarks/_meta/swe_bench_pro.json' 2026-05-28T10:08:36,091 adding 'evalscope/benchmarks/_meta/swe_bench_verified.json' 2026-05-28T10:08:36,093 adding 'evalscope/benchmarks/_meta/swe_bench_verified_agentic.json' 2026-05-28T10:08:36,095 adding 'evalscope/benchmarks/_meta/swe_bench_verified_mini.json' 2026-05-28T10:08:36,098 adding 'evalscope/benchmarks/_meta/swe_bench_verified_mini_agentic.json' 2026-05-28T10:08:36,100 adding 'evalscope/benchmarks/_meta/tau2_bench.json' 2026-05-28T10:08:36,102 adding 'evalscope/benchmarks/_meta/tau3_bench.json' 2026-05-28T10:08:36,104 adding 'evalscope/benchmarks/_meta/tau_bench.json' 2026-05-28T10:08:36,106 adding 'evalscope/benchmarks/_meta/terminal_bench_v2.json' 2026-05-28T10:08:36,108 adding 'evalscope/benchmarks/_meta/terminal_bench_v2_1.json' 2026-05-28T10:08:36,109 adding 'evalscope/benchmarks/_meta/tifa160.json' 2026-05-28T10:08:36,112 adding 'evalscope/benchmarks/_meta/tir_bench.json' 2026-05-28T10:08:36,115 adding 'evalscope/benchmarks/_meta/tool_bench.json' 2026-05-28T10:08:36,117 adding 'evalscope/benchmarks/_meta/torgo.json' 2026-05-28T10:08:36,119 adding 'evalscope/benchmarks/_meta/trivia_qa.json' 2026-05-28T10:08:36,120 adding 'evalscope/benchmarks/_meta/truthful_qa.json' 2026-05-28T10:08:36,122 adding 'evalscope/benchmarks/_meta/tweebank_ner.json' 2026-05-28T10:08:36,125 adding 'evalscope/benchmarks/_meta/tweet_ner_7.json' 2026-05-28T10:08:36,127 adding 'evalscope/benchmarks/_meta/videomme_v2.json' 2026-05-28T10:08:36,129 adding 'evalscope/benchmarks/_meta/visulogic.json' 2026-05-28T10:08:36,131 adding 'evalscope/benchmarks/_meta/vstar_bench.json' 2026-05-28T10:08:36,133 adding 'evalscope/benchmarks/_meta/winogrande.json' 2026-05-28T10:08:36,135 adding 'evalscope/benchmarks/_meta/wmt24pp.json' 2026-05-28T10:08:36,138 adding 'evalscope/benchmarks/_meta/wnut2017.json' 2026-05-28T10:08:36,140 adding 'evalscope/benchmarks/_meta/zebralogicbench.json' 2026-05-28T10:08:36,142 adding 'evalscope/benchmarks/_meta/zerobench.json' 2026-05-28T10:08:36,143 adding 'evalscope/benchmarks/a_okvqa/__init__.py' 2026-05-28T10:08:36,145 adding 'evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py' 2026-05-28T10:08:36,146 adding 'evalscope/benchmarks/aa_lcr/__init__.py' 2026-05-28T10:08:36,148 adding 'evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py' 2026-05-28T10:08:36,150 adding 'evalscope/benchmarks/ai2d/__init__.py' 2026-05-28T10:08:36,151 adding 'evalscope/benchmarks/ai2d/ai2d_adapter.py' 2026-05-28T10:08:36,153 adding 'evalscope/benchmarks/aime/__init__.py' 2026-05-28T10:08:36,154 adding 'evalscope/benchmarks/aime/aime_adapter.py' 2026-05-28T10:08:36,156 adding 'evalscope/benchmarks/aime/grader.py' 2026-05-28T10:08:36,158 adding 'evalscope/benchmarks/aime/math_normalize.py' 2026-05-28T10:08:36,159 adding 'evalscope/benchmarks/air_bench/__init__.py' 2026-05-28T10:08:36,162 adding 'evalscope/benchmarks/air_bench/air_bench_chat_adapter.py' 2026-05-28T10:08:36,164 adding 'evalscope/benchmarks/air_bench/air_bench_foundation_adapter.py' 2026-05-28T10:08:36,165 adding 'evalscope/benchmarks/air_bench/requirements.txt' 2026-05-28T10:08:36,167 adding 'evalscope/benchmarks/air_bench/utils.py' 2026-05-28T10:08:36,168 adding 'evalscope/benchmarks/alpaca_eval/__init__.py' 2026-05-28T10:08:36,170 adding 'evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py' 2026-05-28T10:08:36,171 adding 'evalscope/benchmarks/amc/__init__.py' 2026-05-28T10:08:36,172 adding 'evalscope/benchmarks/amc/amc_adapter.py' 2026-05-28T10:08:36,174 adding 'evalscope/benchmarks/arc/__init__.py' 2026-05-28T10:08:36,175 adding 'evalscope/benchmarks/arc/arc_adapter.py' 2026-05-28T10:08:36,177 adding 'evalscope/benchmarks/arena_hard/__init__.py' 2026-05-28T10:08:36,179 adding 'evalscope/benchmarks/arena_hard/arena_hard_adapter.py' 2026-05-28T10:08:36,180 adding 'evalscope/benchmarks/arena_hard/requirements.txt' 2026-05-28T10:08:36,181 adding 'evalscope/benchmarks/arena_hard/utils.py' 2026-05-28T10:08:36,183 adding 'evalscope/benchmarks/arxivrollbench/__init__.py' 2026-05-28T10:08:36,184 adding 'evalscope/benchmarks/arxivrollbench/arxivrollbench_adapter.py' 2026-05-28T10:08:36,186 adding 'evalscope/benchmarks/bbh/__init__.py' 2026-05-28T10:08:36,188 adding 'evalscope/benchmarks/bbh/bbh_adapter.py' 2026-05-28T10:08:36,190 adding 'evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt' 2026-05-28T10:08:36,191 adding 'evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt' 2026-05-28T10:08:36,193 adding 'evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt' 2026-05-28T10:08:36,194 adding 'evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt' 2026-05-28T10:08:36,195 adding 'evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt' 2026-05-28T10:08:36,197 adding 'evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt' 2026-05-28T10:08:36,198 adding 'evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt' 2026-05-28T10:08:36,199 adding 'evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt' 2026-05-28T10:08:36,201 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt' 2026-05-28T10:08:36,202 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt' 2026-05-28T10:08:36,203 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt' 2026-05-28T10:08:36,204 adding 'evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt' 2026-05-28T10:08:36,206 adding 'evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt' 2026-05-28T10:08:36,207 adding 'evalscope/benchmarks/bbh/cot_prompts/navigate.txt' 2026-05-28T10:08:36,208 adding 'evalscope/benchmarks/bbh/cot_prompts/object_counting.txt' 2026-05-28T10:08:36,210 adding 'evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt' 2026-05-28T10:08:36,211 adding 'evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt' 2026-05-28T10:08:36,212 adding 'evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt' 2026-05-28T10:08:36,214 adding 'evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt' 2026-05-28T10:08:36,215 adding 'evalscope/benchmarks/bbh/cot_prompts/snarks.txt' 2026-05-28T10:08:36,216 adding 'evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt' 2026-05-28T10:08:36,218 adding 'evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt' 2026-05-28T10:08:36,219 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt' 2026-05-28T10:08:36,220 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt' 2026-05-28T10:08:36,222 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt' 2026-05-28T10:08:36,223 adding 'evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt' 2026-05-28T10:08:36,224 adding 'evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt' 2026-05-28T10:08:36,226 adding 'evalscope/benchmarks/bfcl/__init__.py' 2026-05-28T10:08:36,227 adding 'evalscope/benchmarks/bfcl/requirements.txt' 2026-05-28T10:08:36,228 adding 'evalscope/benchmarks/bfcl/v3/__init__.py' 2026-05-28T10:08:36,230 adding 'evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py' 2026-05-28T10:08:36,232 adding 'evalscope/benchmarks/bfcl/v3/generation.py' 2026-05-28T10:08:36,233 adding 'evalscope/benchmarks/bfcl/v3/utils.py' 2026-05-28T10:08:36,234 adding 'evalscope/benchmarks/bfcl/v4/__init__.py' 2026-05-28T10:08:36,236 adding 'evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py' 2026-05-28T10:08:36,239 adding 'evalscope/benchmarks/bfcl/v4/utils.py' 2026-05-28T10:08:36,241 adding 'evalscope/benchmarks/biomix_qa/__init__.py' 2026-05-28T10:08:36,242 adding 'evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py' 2026-05-28T10:08:36,243 adding 'evalscope/benchmarks/blink/__init__.py' 2026-05-28T10:08:36,245 adding 'evalscope/benchmarks/blink/blink_adapter.py' 2026-05-28T10:08:36,246 adding 'evalscope/benchmarks/ceval/__init__.py' 2026-05-28T10:08:36,248 adding 'evalscope/benchmarks/ceval/ceval_adapter.py' 2026-05-28T10:08:36,250 adding 'evalscope/benchmarks/chartqa/__init__.py' 2026-05-28T10:08:36,251 adding 'evalscope/benchmarks/chartqa/chartqa_adapter.py' 2026-05-28T10:08:36,252 adding 'evalscope/benchmarks/chartqa/utils.py' 2026-05-28T10:08:36,254 adding 'evalscope/benchmarks/chinese_simple_qa/__init__.py' 2026-05-28T10:08:36,256 adding 'evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py' 2026-05-28T10:08:36,257 adding 'evalscope/benchmarks/cl_bench/__init__.py' 2026-05-28T10:08:36,259 adding 'evalscope/benchmarks/cl_bench/cl_bench_adapter.py' 2026-05-28T10:08:36,260 adding 'evalscope/benchmarks/cl_bench/utils.py' 2026-05-28T10:08:36,262 adding 'evalscope/benchmarks/cmmlu/__init__.py' 2026-05-28T10:08:36,264 adding 'evalscope/benchmarks/cmmlu/cmmlu_adapter.py' 2026-05-28T10:08:36,265 adding 'evalscope/benchmarks/cmmmu/__init__.py' 2026-05-28T10:08:36,267 adding 'evalscope/benchmarks/cmmmu/cmmmu_adapter.py' 2026-05-28T10:08:36,269 adding 'evalscope/benchmarks/cmmmu/utils.py' 2026-05-28T10:08:36,271 adding 'evalscope/benchmarks/cmmu/__init__.py' 2026-05-28T10:08:36,272 adding 'evalscope/benchmarks/cmmu/cmmu_adapter.py' 2026-05-28T10:08:36,274 adding 'evalscope/benchmarks/cmmu/prompt.py' 2026-05-28T10:08:36,275 adding 'evalscope/benchmarks/coin_flip/__init__.py' 2026-05-28T10:08:36,277 adding 'evalscope/benchmarks/coin_flip/coin_flip_adapter.py' 2026-05-28T10:08:36,279 adding 'evalscope/benchmarks/commonsense_qa/__init__.py' 2026-05-28T10:08:36,280 adding 'evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py' 2026-05-28T10:08:36,281 adding 'evalscope/benchmarks/competition_math/__init__.py' 2026-05-28T10:08:36,283 adding 'evalscope/benchmarks/competition_math/competition_math_adapter.py' 2026-05-28T10:08:36,285 adding 'evalscope/benchmarks/data_collection/__init__.py' 2026-05-28T10:08:36,287 adding 'evalscope/benchmarks/data_collection/data_collection_adapter.py' 2026-05-28T10:08:36,288 adding 'evalscope/benchmarks/docmath/__init__.py' 2026-05-28T10:08:36,290 adding 'evalscope/benchmarks/docmath/docmath_adapter.py' 2026-05-28T10:08:36,292 adding 'evalscope/benchmarks/docmath/utils.py' 2026-05-28T10:08:36,293 adding 'evalscope/benchmarks/docvqa/__init__.py' 2026-05-28T10:08:36,295 adding 'evalscope/benchmarks/docvqa/docvqa_adapter.py' 2026-05-28T10:08:36,297 adding 'evalscope/benchmarks/drivelology/__init__.py' 2026-05-28T10:08:36,298 adding 'evalscope/benchmarks/drivelology/drivelology_binary_adapter.py' 2026-05-28T10:08:36,300 adding 'evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py' 2026-05-28T10:08:36,302 adding 'evalscope/benchmarks/drivelology/drivelology_selection_adapter.py' 2026-05-28T10:08:36,303 adding 'evalscope/benchmarks/drivelology/drivelology_writing_adapter.py' 2026-05-28T10:08:36,305 adding 'evalscope/benchmarks/drop/__init__.py' 2026-05-28T10:08:36,307 adding 'evalscope/benchmarks/drop/drop_adapter.py' 2026-05-28T10:08:36,308 adding 'evalscope/benchmarks/drop/utils.py' 2026-05-28T10:08:36,310 adding 'evalscope/benchmarks/eq_bench/__init__.py' 2026-05-28T10:08:36,312 adding 'evalscope/benchmarks/eq_bench/answer_validation.py' 2026-05-28T10:08:36,314 adding 'evalscope/benchmarks/eq_bench/eq_bench_adapter.py' 2026-05-28T10:08:36,315 adding 'evalscope/benchmarks/fleurs/__init__.py' 2026-05-28T10:08:36,317 adding 'evalscope/benchmarks/fleurs/fleurs_adapter.py' 2026-05-28T10:08:36,318 adding 'evalscope/benchmarks/frames/__init__.py' 2026-05-28T10:08:36,320 adding 'evalscope/benchmarks/frames/frames_adapter.py' 2026-05-28T10:08:36,321 adding 'evalscope/benchmarks/frames/utils.py' 2026-05-28T10:08:36,322 adding 'evalscope/benchmarks/gaia/__init__.py' 2026-05-28T10:08:36,324 adding 'evalscope/benchmarks/gaia/gaia_adapter.py' 2026-05-28T10:08:36,326 adding 'evalscope/benchmarks/gaia/scorer.py' 2026-05-28T10:08:36,327 adding 'evalscope/benchmarks/general_arena/__init__.py' 2026-05-28T10:08:36,330 adding 'evalscope/benchmarks/general_arena/general_arena_adapter.py' 2026-05-28T10:08:36,331 adding 'evalscope/benchmarks/general_arena/requirements.txt' 2026-05-28T10:08:36,333 adding 'evalscope/benchmarks/general_arena/utils.py' 2026-05-28T10:08:36,334 adding 'evalscope/benchmarks/general_fc/__init__.py' 2026-05-28T10:08:36,336 adding 'evalscope/benchmarks/general_fc/general_fc_adapter.py' 2026-05-28T10:08:36,338 adding 'evalscope/benchmarks/general_mcq/__init__.py' 2026-05-28T10:08:36,339 adding 'evalscope/benchmarks/general_mcq/general_mcq_adapter.py' 2026-05-28T10:08:36,341 adding 'evalscope/benchmarks/general_qa/__init__.py' 2026-05-28T10:08:36,342 adding 'evalscope/benchmarks/general_qa/general_qa_adapter.py' 2026-05-28T10:08:36,344 adding 'evalscope/benchmarks/general_vmcq/__init__.py' 2026-05-28T10:08:36,345 adding 'evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py' 2026-05-28T10:08:36,348 adding 'evalscope/benchmarks/general_vqa/__init__.py' 2026-05-28T10:08:36,349 adding 'evalscope/benchmarks/general_vqa/general_vqa_adapter.py' 2026-05-28T10:08:36,351 adding 'evalscope/benchmarks/gpqa/__init__.py' 2026-05-28T10:08:36,352 adding 'evalscope/benchmarks/gpqa/gpqa_adapter.py' 2026-05-28T10:08:36,354 adding 'evalscope/benchmarks/gpqa/prompt.py' 2026-05-28T10:08:36,356 adding 'evalscope/benchmarks/gsm8k/__init__.py' 2026-05-28T10:08:36,357 adding 'evalscope/benchmarks/gsm8k/gsm8k_adapter.py' 2026-05-28T10:08:36,359 adding 'evalscope/benchmarks/gsm8k_v/__init__.py' 2026-05-28T10:08:36,360 adding 'evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py' 2026-05-28T10:08:36,362 adding 'evalscope/benchmarks/hallusion_bench/__init__.py' 2026-05-28T10:08:36,363 adding 'evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py' 2026-05-28T10:08:36,365 adding 'evalscope/benchmarks/halu_eval/__init__.py' 2026-05-28T10:08:36,367 adding 'evalscope/benchmarks/halu_eval/halu_eval_adapter.py' 2026-05-28T10:08:36,369 adding 'evalscope/benchmarks/halu_eval/halu_eval_instructions.py' 2026-05-28T10:08:36,370 adding 'evalscope/benchmarks/healthbench/__init__.py' 2026-05-28T10:08:36,372 adding 'evalscope/benchmarks/healthbench/healthbench_adapter.py' 2026-05-28T10:08:36,374 adding 'evalscope/benchmarks/healthbench/utils.py' 2026-05-28T10:08:36,375 adding 'evalscope/benchmarks/hellaswag/__init__.py' 2026-05-28T10:08:36,377 adding 'evalscope/benchmarks/hellaswag/hellaswag_adapter.py' 2026-05-28T10:08:36,379 adding 'evalscope/benchmarks/hle/__init__.py' 2026-05-28T10:08:36,380 adding 'evalscope/benchmarks/hle/hle_adapter.py' 2026-05-28T10:08:36,382 adding 'evalscope/benchmarks/hmmt25/hmmt25_adapter.py' 2026-05-28T10:08:36,384 adding 'evalscope/benchmarks/humaneval/__init__.py' 2026-05-28T10:08:36,385 adding 'evalscope/benchmarks/humaneval/humaneval_adapter.py' 2026-05-28T10:08:36,387 adding 'evalscope/benchmarks/humaneval/utils.py' 2026-05-28T10:08:36,388 adding 'evalscope/benchmarks/humanevalplus/__init__.py' 2026-05-28T10:08:36,390 adding 'evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py' 2026-05-28T10:08:36,391 adding 'evalscope/benchmarks/humanevalplus/docker/Dockerfile' 2026-05-28T10:08:36,393 adding 'evalscope/benchmarks/ifbench/__init__.py' 2026-05-28T10:08:36,394 adding 'evalscope/benchmarks/ifbench/evaluation_lib.py' 2026-05-28T10:08:36,396 adding 'evalscope/benchmarks/ifbench/ifbench_adapter.py' 2026-05-28T10:08:36,403 adding 'evalscope/benchmarks/ifbench/instructions.py' 2026-05-28T10:08:36,405 adding 'evalscope/benchmarks/ifbench/instructions_registry.py' 2026-05-28T10:08:36,408 adding 'evalscope/benchmarks/ifbench/instructions_util.py' 2026-05-28T10:08:36,409 adding 'evalscope/benchmarks/ifbench/requirements.txt' 2026-05-28T10:08:36,411 adding 'evalscope/benchmarks/ifeval/__init__.py' 2026-05-28T10:08:36,412 adding 'evalscope/benchmarks/ifeval/ifeval_adapter.py' 2026-05-28T10:08:36,417 adding 'evalscope/benchmarks/ifeval/instructions.py' 2026-05-28T10:08:36,419 adding 'evalscope/benchmarks/ifeval/instructions_registry.py' 2026-05-28T10:08:36,422 adding 'evalscope/benchmarks/ifeval/instructions_util.py' 2026-05-28T10:08:36,423 adding 'evalscope/benchmarks/ifeval/requirements.txt' 2026-05-28T10:08:36,425 adding 'evalscope/benchmarks/ifeval/utils.py' 2026-05-28T10:08:36,426 adding 'evalscope/benchmarks/image_edit/__init__.py' 2026-05-28T10:08:36,428 adding 'evalscope/benchmarks/image_edit/gedit/__init__.py' 2026-05-28T10:08:36,429 adding 'evalscope/benchmarks/image_edit/gedit/gedit_adapter.py' 2026-05-28T10:08:36,431 adding 'evalscope/benchmarks/image_edit/gedit/utils.py' 2026-05-28T10:08:36,434 adding 'evalscope/benchmarks/image_edit/gedit/vie_prompts.py' 2026-05-28T10:08:36,435 adding 'evalscope/benchmarks/infovqa/__init__.py' 2026-05-28T10:08:36,437 adding 'evalscope/benchmarks/infovqa/infovqa_adapter.py' 2026-05-28T10:08:36,438 adding 'evalscope/benchmarks/iquiz/__init__.py' 2026-05-28T10:08:36,440 adding 'evalscope/benchmarks/iquiz/iquiz_adapter.py' 2026-05-28T10:08:36,441 adding 'evalscope/benchmarks/k2_verifier/__init__.py' 2026-05-28T10:08:36,443 adding 'evalscope/benchmarks/k2_verifier/k2_verifier_adapter.py' 2026-05-28T10:08:36,445 adding 'evalscope/benchmarks/kimi_verifier/__init__.py' 2026-05-28T10:08:36,447 adding 'evalscope/benchmarks/kimi_verifier/kimi_verifier_adapter.py' 2026-05-28T10:08:36,448 adding 'evalscope/benchmarks/kimi_verifier/param_spec.py' 2026-05-28T10:08:36,450 adding 'evalscope/benchmarks/librispeech/__init__.py' 2026-05-28T10:08:36,452 adding 'evalscope/benchmarks/librispeech/librispeech_adapter.py' 2026-05-28T10:08:36,453 adding 'evalscope/benchmarks/live_code_bench/__init__.py' 2026-05-28T10:08:36,455 adding 'evalscope/benchmarks/live_code_bench/evaluate_utils.py' 2026-05-28T10:08:36,456 adding 'evalscope/benchmarks/live_code_bench/extract_utils.py' 2026-05-28T10:08:36,458 adding 'evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py' 2026-05-28T10:08:36,459 adding 'evalscope/benchmarks/live_code_bench/load_utils.py' 2026-05-28T10:08:36,461 adding 'evalscope/benchmarks/live_code_bench/pass_k_utils.py' 2026-05-28T10:08:36,463 adding 'evalscope/benchmarks/live_code_bench/prompts.py' 2026-05-28T10:08:36,464 adding 'evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py' 2026-05-28T10:08:36,467 adding 'evalscope/benchmarks/live_code_bench/testing_util.py' 2026-05-28T10:08:36,468 adding 'evalscope/benchmarks/logi_qa/__int__.py' 2026-05-28T10:08:36,470 adding 'evalscope/benchmarks/logi_qa/logi_qa_adapter.py' 2026-05-28T10:08:36,471 adding 'evalscope/benchmarks/longbench_v2/__init__.py' 2026-05-28T10:08:36,473 adding 'evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py' 2026-05-28T10:08:36,474 adding 'evalscope/benchmarks/maritime_bench/__init__.py' 2026-05-28T10:08:36,476 adding 'evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py' 2026-05-28T10:08:36,477 adding 'evalscope/benchmarks/math_500/__init__.py' 2026-05-28T10:08:36,479 adding 'evalscope/benchmarks/math_500/math_500_adapter.py' 2026-05-28T10:08:36,480 adding 'evalscope/benchmarks/math_qa/__init__.py' 2026-05-28T10:08:36,481 adding 'evalscope/benchmarks/math_qa/math_qa_adapter.py' 2026-05-28T10:08:36,483 adding 'evalscope/benchmarks/math_verse/__init__.py' 2026-05-28T10:08:36,484 adding 'evalscope/benchmarks/math_verse/math_verse_adapter.py' 2026-05-28T10:08:36,486 adding 'evalscope/benchmarks/math_vision/__init__.py' 2026-05-28T10:08:36,487 adding 'evalscope/benchmarks/math_vision/math_vision_adapter.py' 2026-05-28T10:08:36,489 adding 'evalscope/benchmarks/math_vista/__init__.py' 2026-05-28T10:08:36,490 adding 'evalscope/benchmarks/math_vista/math_vista_adapter.py' 2026-05-28T10:08:36,492 adding 'evalscope/benchmarks/mbpp/__init__.py' 2026-05-28T10:08:36,493 adding 'evalscope/benchmarks/mbpp/mbpp_adapter.py' 2026-05-28T10:08:36,495 adding 'evalscope/benchmarks/mbppplus/__init__.py' 2026-05-28T10:08:36,496 adding 'evalscope/benchmarks/mbppplus/mbppplus_adapter.py' 2026-05-28T10:08:36,498 adding 'evalscope/benchmarks/med_mcqa/__init__.py' 2026-05-28T10:08:36,499 adding 'evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py' 2026-05-28T10:08:36,501 adding 'evalscope/benchmarks/mgsm/__init__.py' 2026-05-28T10:08:36,502 adding 'evalscope/benchmarks/mgsm/mgsm_adapter.py' 2026-05-28T10:08:36,504 adding 'evalscope/benchmarks/mia_bench/__init__.py' 2026-05-28T10:08:36,505 adding 'evalscope/benchmarks/mia_bench/mia_bench_adapter.py' 2026-05-28T10:08:36,507 adding 'evalscope/benchmarks/mia_bench/utils.py' 2026-05-28T10:08:36,508 adding 'evalscope/benchmarks/micro_vqa/__init__.py' 2026-05-28T10:08:36,510 adding 'evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py' 2026-05-28T10:08:36,512 adding 'evalscope/benchmarks/minerva_math/__init__.py' 2026-05-28T10:08:36,513 adding 'evalscope/benchmarks/minerva_math/minerva_math_adapter.py' 2026-05-28T10:08:36,515 adding 'evalscope/benchmarks/minimax_verifier/__init__.py' 2026-05-28T10:08:36,517 adding 'evalscope/benchmarks/minimax_verifier/_validators.py' 2026-05-28T10:08:36,519 adding 'evalscope/benchmarks/minimax_verifier/minimax_verifier_adapter.py' 2026-05-28T10:08:36,521 adding 'evalscope/benchmarks/mm_bench/__init__.py' 2026-05-28T10:08:36,522 adding 'evalscope/benchmarks/mm_bench/mm_bench_adapter.py' 2026-05-28T10:08:36,524 adding 'evalscope/benchmarks/mm_star/__init__.py' 2026-05-28T10:08:36,525 adding 'evalscope/benchmarks/mm_star/mm_star_adapter.py' 2026-05-28T10:08:36,527 adding 'evalscope/benchmarks/mmlu/__init__.py' 2026-05-28T10:08:36,528 adding 'evalscope/benchmarks/mmlu/mmlu_adapter.py' 2026-05-28T10:08:36,530 adding 'evalscope/benchmarks/mmlu_pro/__init__.py' 2026-05-28T10:08:36,532 adding 'evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py' 2026-05-28T10:08:36,533 adding 'evalscope/benchmarks/mmlu_redux/__init__.py' 2026-05-28T10:08:36,535 adding 'evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py' 2026-05-28T10:08:36,537 adding 'evalscope/benchmarks/mmmlu/__init__.py' 2026-05-28T10:08:36,538 adding 'evalscope/benchmarks/mmmlu/mmmlu_adapter.py' 2026-05-28T10:08:36,540 adding 'evalscope/benchmarks/mmmlu/prompt.py' 2026-05-28T10:08:36,541 adding 'evalscope/benchmarks/mmmu/__init__.py' 2026-05-28T10:08:36,543 adding 'evalscope/benchmarks/mmmu/mmmu_adapter.py' 2026-05-28T10:08:36,545 adding 'evalscope/benchmarks/mmmu_pro/__init__.py' 2026-05-28T10:08:36,546 adding 'evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py' 2026-05-28T10:08:36,548 adding 'evalscope/benchmarks/mri_mcqa/__init__.py' 2026-05-28T10:08:36,549 adding 'evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py' 2026-05-28T10:08:36,551 adding 'evalscope/benchmarks/multi_if/__init__.py' 2026-05-28T10:08:36,559 adding 'evalscope/benchmarks/multi_if/ifeval.py' 2026-05-28T10:08:36,561 adding 'evalscope/benchmarks/multi_if/metrics.py' 2026-05-28T10:08:36,563 adding 'evalscope/benchmarks/multi_if/multi_if_adapter.py' 2026-05-28T10:08:36,564 adding 'evalscope/benchmarks/multi_if/requirements.txt' 2026-05-28T10:08:36,565 adding 'evalscope/benchmarks/multipl_e/__init__.py' 2026-05-28T10:08:36,567 adding 'evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py' 2026-05-28T10:08:36,568 adding 'evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py' 2026-05-28T10:08:36,570 adding 'evalscope/benchmarks/multipl_e/utils.py' 2026-05-28T10:08:36,571 adding 'evalscope/benchmarks/music_trivia/__init__.py' 2026-05-28T10:08:36,573 adding 'evalscope/benchmarks/music_trivia/music_trivia_adapter.py' 2026-05-28T10:08:36,574 adding 'evalscope/benchmarks/musr/__init__.py' 2026-05-28T10:08:36,575 adding 'evalscope/benchmarks/musr/musr_adapter.py' 2026-05-28T10:08:36,577 adding 'evalscope/benchmarks/mvbench/__init__.py' 2026-05-28T10:08:36,579 adding 'evalscope/benchmarks/mvbench/mvbench_adapter.py' 2026-05-28T10:08:36,580 adding 'evalscope/benchmarks/needle_haystack/__init__.py' 2026-05-28T10:08:36,583 adding 'evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py' 2026-05-28T10:08:36,584 adding 'evalscope/benchmarks/needle_haystack/requirements.txt' 2026-05-28T10:08:36,585 adding 'evalscope/benchmarks/needle_haystack/utils.py' 2026-05-28T10:08:36,587 adding 'evalscope/benchmarks/ner/__init__.py' 2026-05-28T10:08:36,588 adding 'evalscope/benchmarks/ner/anat_em_adapter.py' 2026-05-28T10:08:36,589 adding 'evalscope/benchmarks/ner/bc2gm_adapter.py' 2026-05-28T10:08:36,591 adding 'evalscope/benchmarks/ner/bc4chemd_adapter.py' 2026-05-28T10:08:36,592 adding 'evalscope/benchmarks/ner/bc5cdr_adapter.py' 2026-05-28T10:08:36,593 adding 'evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py' 2026-05-28T10:08:36,595 adding 'evalscope/benchmarks/ner/conll2003_adapter.py' 2026-05-28T10:08:36,596 adding 'evalscope/benchmarks/ner/conllpp_adapter.py' 2026-05-28T10:08:36,598 adding 'evalscope/benchmarks/ner/copious_adapter.py' 2026-05-28T10:08:36,599 adding 'evalscope/benchmarks/ner/cross_ner_adapter.py' 2026-05-28T10:08:36,601 adding 'evalscope/benchmarks/ner/fin_ner_adapter.py' 2026-05-28T10:08:36,602 adding 'evalscope/benchmarks/ner/genia_ner_adapter.py' 2026-05-28T10:08:36,603 adding 'evalscope/benchmarks/ner/harvey_ner_adapter.py' 2026-05-28T10:08:36,605 adding 'evalscope/benchmarks/ner/jnlpba_adapter.py' 2026-05-28T10:08:36,606 adding 'evalscope/benchmarks/ner/jnlpba_rare_adapter.py' 2026-05-28T10:08:36,607 adding 'evalscope/benchmarks/ner/mit_movie_trivia_adapter.py' 2026-05-28T10:08:36,609 adding 'evalscope/benchmarks/ner/mit_restaurant_adapter.py' 2026-05-28T10:08:36,610 adding 'evalscope/benchmarks/ner/multi_nerd_adapter.py' 2026-05-28T10:08:36,612 adding 'evalscope/benchmarks/ner/ncbi_adapter.py' 2026-05-28T10:08:36,613 adding 'evalscope/benchmarks/ner/ontonotes5_adapter.py' 2026-05-28T10:08:36,615 adding 'evalscope/benchmarks/ner/tweebank_ner_adapter.py' 2026-05-28T10:08:36,616 adding 'evalscope/benchmarks/ner/tweet_ner_7_adapter.py' 2026-05-28T10:08:36,617 adding 'evalscope/benchmarks/ner/wnut2017_adapter.py' 2026-05-28T10:08:36,619 adding 'evalscope/benchmarks/ner/cross_ner_entities/__init__.py' 2026-05-28T10:08:36,620 adding 'evalscope/benchmarks/ner/cross_ner_entities/ai.py' 2026-05-28T10:08:36,622 adding 'evalscope/benchmarks/ner/cross_ner_entities/literature.py' 2026-05-28T10:08:36,623 adding 'evalscope/benchmarks/ner/cross_ner_entities/music.py' 2026-05-28T10:08:36,624 adding 'evalscope/benchmarks/ner/cross_ner_entities/politics.py' 2026-05-28T10:08:36,626 adding 'evalscope/benchmarks/ner/cross_ner_entities/science.py' 2026-05-28T10:08:36,627 adding 'evalscope/benchmarks/ocr_bench/__init__.py' 2026-05-28T10:08:36,629 adding 'evalscope/benchmarks/ocr_bench/requirements.txt' 2026-05-28T10:08:36,630 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py' 2026-05-28T10:08:36,632 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py' 2026-05-28T10:08:36,633 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py' 2026-05-28T10:08:36,637 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py' 2026-05-28T10:08:36,638 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py' 2026-05-28T10:08:36,640 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py' 2026-05-28T10:08:36,641 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py' 2026-05-28T10:08:36,643 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py' 2026-05-28T10:08:36,644 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py' 2026-05-28T10:08:36,646 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py' 2026-05-28T10:08:36,648 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py' 2026-05-28T10:08:36,649 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py' 2026-05-28T10:08:36,651 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt' 2026-05-28T10:08:36,653 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py' 2026-05-28T10:08:36,655 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py' 2026-05-28T10:08:36,657 adding 'evalscope/benchmarks/olympiad_bench/__init__.py' 2026-05-28T10:08:36,659 adding 'evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py' 2026-05-28T10:08:36,660 adding 'evalscope/benchmarks/olympiad_bench/requirements.txt' 2026-05-28T10:08:36,663 adding 'evalscope/benchmarks/olympiad_bench/utils.py' 2026-05-28T10:08:36,664 adding 'evalscope/benchmarks/omni_bench/__init__.py' 2026-05-28T10:08:36,666 adding 'evalscope/benchmarks/omni_bench/omni_bench_adapter.py' 2026-05-28T10:08:36,668 adding 'evalscope/benchmarks/omnidoc_bench/__init__.py' 2026-05-28T10:08:36,670 adding 'evalscope/benchmarks/omnidoc_bench/end2end_eval.py' 2026-05-28T10:08:36,672 adding 'evalscope/benchmarks/omnidoc_bench/metrics.py' 2026-05-28T10:08:36,674 adding 'evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py' 2026-05-28T10:08:36,675 adding 'evalscope/benchmarks/omnidoc_bench/requirements.txt' 2026-05-28T10:08:36,683 adding 'evalscope/benchmarks/omnidoc_bench/utils.py' 2026-05-28T10:08:36,685 adding 'evalscope/benchmarks/openai_mrcr/__init__.py' 2026-05-28T10:08:36,687 adding 'evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py' 2026-05-28T10:08:36,689 adding 'evalscope/benchmarks/openai_mrcr/requirements.txt' 2026-05-28T10:08:36,690 adding 'evalscope/benchmarks/openai_mrcr/utils.py' 2026-05-28T10:08:36,692 adding 'evalscope/benchmarks/piqa/__init__.py' 2026-05-28T10:08:36,693 adding 'evalscope/benchmarks/piqa/piqa_adapter.py' 2026-05-28T10:08:36,695 adding 'evalscope/benchmarks/poly_math/__init__.py' 2026-05-28T10:08:36,696 adding 'evalscope/benchmarks/poly_math/poly_math_adapter.py' 2026-05-28T10:08:36,698 adding 'evalscope/benchmarks/poly_math/utils/instruction.py' 2026-05-28T10:08:36,700 adding 'evalscope/benchmarks/pope/__init__.py' 2026-05-28T10:08:36,702 adding 'evalscope/benchmarks/pope/pope_adapter.py' 2026-05-28T10:08:36,703 adding 'evalscope/benchmarks/process_bench/__init__.py' 2026-05-28T10:08:36,705 adding 'evalscope/benchmarks/process_bench/process_bench_adapter.py' 2026-05-28T10:08:36,707 adding 'evalscope/benchmarks/pumed_qa/__init__.py' 2026-05-28T10:08:36,708 adding 'evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py' 2026-05-28T10:08:36,710 adding 'evalscope/benchmarks/qasc/__init__.py' 2026-05-28T10:08:36,711 adding 'evalscope/benchmarks/qasc/qasc_adapter.py' 2026-05-28T10:08:36,713 adding 'evalscope/benchmarks/race/__init__.py' 2026-05-28T10:08:36,714 adding 'evalscope/benchmarks/race/race_adapter.py' 2026-05-28T10:08:36,716 adding 'evalscope/benchmarks/real_world_qa/__init__.py' 2026-05-28T10:08:36,717 adding 'evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py' 2026-05-28T10:08:36,719 adding 'evalscope/benchmarks/refcoco/__init__.py' 2026-05-28T10:08:36,720 adding 'evalscope/benchmarks/refcoco/evaluation_lib.py' 2026-05-28T10:08:36,722 adding 'evalscope/benchmarks/refcoco/refcoco_adapter.py' 2026-05-28T10:08:36,723 adding 'evalscope/benchmarks/refcoco/requirements.txt' 2026-05-28T10:08:36,724 adding 'evalscope/benchmarks/refcoco/utils.py' 2026-05-28T10:08:36,726 adding 'evalscope/benchmarks/scicode/__init__.py' 2026-05-28T10:08:36,727 adding 'evalscope/benchmarks/scicode/prompt_templates.py' 2026-05-28T10:08:36,729 adding 'evalscope/benchmarks/scicode/scicode_adapter.py' 2026-05-28T10:08:36,730 adding 'evalscope/benchmarks/scicode/util.py' 2026-05-28T10:08:36,732 adding 'evalscope/benchmarks/scicode/docker/Dockerfile' 2026-05-28T10:08:36,733 adding 'evalscope/benchmarks/scicode/docker/docker_requirements.txt' 2026-05-28T10:08:36,734 adding 'evalscope/benchmarks/scicode/docker/process_data.py' 2026-05-28T10:08:36,736 adding 'evalscope/benchmarks/scicode/docker/test_util.py' 2026-05-28T10:08:36,738 adding 'evalscope/benchmarks/science_qa/__init__.py' 2026-05-28T10:08:36,739 adding 'evalscope/benchmarks/science_qa/science_qa_adapter.py' 2026-05-28T10:08:36,740 adding 'evalscope/benchmarks/sciq/__init__.py' 2026-05-28T10:08:36,742 adding 'evalscope/benchmarks/sciq/sciq_adapter.py' 2026-05-28T10:08:36,743 adding 'evalscope/benchmarks/seed_bench_2_plus/__init__.py' 2026-05-28T10:08:36,745 adding 'evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py' 2026-05-28T10:08:36,746 adding 'evalscope/benchmarks/simple_qa/__init__.py' 2026-05-28T10:08:36,748 adding 'evalscope/benchmarks/simple_qa/simple_qa_adapter.py' 2026-05-28T10:08:36,750 adding 'evalscope/benchmarks/simple_vqa/__init__.py' 2026-05-28T10:08:36,752 adding 'evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py' 2026-05-28T10:08:36,753 adding 'evalscope/benchmarks/siqa/__init__.py' 2026-05-28T10:08:36,755 adding 'evalscope/benchmarks/siqa/siqa_adapter.py' 2026-05-28T10:08:36,757 adding 'evalscope/benchmarks/super_gpqa/__init__.py' 2026-05-28T10:08:36,758 adding 'evalscope/benchmarks/super_gpqa/prompt.py' 2026-05-28T10:08:36,760 adding 'evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py' 2026-05-28T10:08:36,761 adding 'evalscope/benchmarks/super_gpqa/utils.py' 2026-05-28T10:08:36,763 adding 'evalscope/benchmarks/swe_bench/__init__.py' 2026-05-28T10:08:36,764 adding 'evalscope/benchmarks/swe_bench/build_images.py' 2026-05-28T10:08:36,766 adding 'evalscope/benchmarks/swe_bench/requirements.txt' 2026-05-28T10:08:36,767 adding 'evalscope/benchmarks/swe_bench/swe_bench_adapter.py' 2026-05-28T10:08:36,770 adding 'evalscope/benchmarks/swe_bench/swe_bench_agentic_adapter.py' 2026-05-28T10:08:36,772 adding 'evalscope/benchmarks/swe_bench/utils.py' 2026-05-28T10:08:36,774 adding 'evalscope/benchmarks/swe_bench_pro/__init__.py' 2026-05-28T10:08:36,776 adding 'evalscope/benchmarks/swe_bench_pro/swe_bench_pro_agentic_adapter.py' 2026-05-28T10:08:36,779 adding 'evalscope/benchmarks/swe_bench_pro/utils.py' 2026-05-28T10:08:36,780 adding 'evalscope/benchmarks/tau_bench/__init__.py' 2026-05-28T10:08:36,782 adding 'evalscope/benchmarks/tau_bench/tau2_bench/__init__.py' 2026-05-28T10:08:36,784 adding 'evalscope/benchmarks/tau_bench/tau2_bench/generation.py' 2026-05-28T10:08:36,785 adding 'evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt' 2026-05-28T10:08:36,786 adding 'evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py' 2026-05-28T10:08:36,788 adding 'evalscope/benchmarks/tau_bench/tau3_bench/__init__.py' 2026-05-28T10:08:36,790 adding 'evalscope/benchmarks/tau_bench/tau3_bench/generation.py' 2026-05-28T10:08:36,791 adding 'evalscope/benchmarks/tau_bench/tau3_bench/requirements.txt' 2026-05-28T10:08:36,793 adding 'evalscope/benchmarks/tau_bench/tau3_bench/tau3_bench_adapter.py' 2026-05-28T10:08:36,795 adding 'evalscope/benchmarks/tau_bench/tau_bench/__init__.py' 2026-05-28T10:08:36,796 adding 'evalscope/benchmarks/tau_bench/tau_bench/generation.py' 2026-05-28T10:08:36,797 adding 'evalscope/benchmarks/tau_bench/tau_bench/requirements.txt' 2026-05-28T10:08:36,799 adding 'evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py' 2026-05-28T10:08:36,801 adding 'evalscope/benchmarks/terminal_bench/__init__.py' 2026-05-28T10:08:36,802 adding 'evalscope/benchmarks/terminal_bench/requirements.txt' 2026-05-28T10:08:36,804 adding 'evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py' 2026-05-28T10:08:36,805 adding 'evalscope/benchmarks/terminal_bench/utils.py' 2026-05-28T10:08:36,807 adding 'evalscope/benchmarks/text2image/__init__.py' 2026-05-28T10:08:36,808 adding 'evalscope/benchmarks/text2image/evalmuse_adapter.py' 2026-05-28T10:08:36,810 adding 'evalscope/benchmarks/text2image/genai_bench_adapter.py' 2026-05-28T10:08:36,811 adding 'evalscope/benchmarks/text2image/general_t2i_adapter.py' 2026-05-28T10:08:36,812 adding 'evalscope/benchmarks/text2image/hpdv2_adapter.py' 2026-05-28T10:08:36,814 adding 'evalscope/benchmarks/text2image/tifa_adapter.py' 2026-05-28T10:08:36,815 adding 'evalscope/benchmarks/tir_bench/__init__.py' 2026-05-28T10:08:36,817 adding 'evalscope/benchmarks/tir_bench/tir_bench_adapter.py' 2026-05-28T10:08:36,819 adding 'evalscope/benchmarks/tir_bench/utils.py' 2026-05-28T10:08:36,821 adding 'evalscope/benchmarks/tool_bench/__init__.py' 2026-05-28T10:08:36,822 adding 'evalscope/benchmarks/tool_bench/tool_bench_adapter.py' 2026-05-28T10:08:36,824 adding 'evalscope/benchmarks/tool_bench/utils.py' 2026-05-28T10:08:36,826 adding 'evalscope/benchmarks/torgo/__init__.py' 2026-05-28T10:08:36,827 adding 'evalscope/benchmarks/torgo/requirements.txt' 2026-05-28T10:08:36,828 adding 'evalscope/benchmarks/torgo/torgo_adapter.py' 2026-05-28T10:08:36,830 adding 'evalscope/benchmarks/trivia_qa/__init__.py' 2026-05-28T10:08:36,831 adding 'evalscope/benchmarks/trivia_qa/samples.jsonl' 2026-05-28T10:08:36,833 adding 'evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py' 2026-05-28T10:08:36,835 adding 'evalscope/benchmarks/truthful_qa/__init__.py' 2026-05-28T10:08:36,836 adding 'evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py' 2026-05-28T10:08:36,838 adding 'evalscope/benchmarks/videomme_v2/__init__.py' 2026-05-28T10:08:36,840 adding 'evalscope/benchmarks/videomme_v2/videomme_v2_adapter.py' 2026-05-28T10:08:36,841 adding 'evalscope/benchmarks/visu_logic/__init__.py' 2026-05-28T10:08:36,843 adding 'evalscope/benchmarks/visu_logic/visu_logic_adapter.py' 2026-05-28T10:08:36,844 adding 'evalscope/benchmarks/vstar_bench/__init__.py' 2026-05-28T10:08:36,846 adding 'evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py' 2026-05-28T10:08:36,848 adding 'evalscope/benchmarks/winogrande/__init__.py' 2026-05-28T10:08:36,849 adding 'evalscope/benchmarks/winogrande/winogrande_adapter.py' 2026-05-28T10:08:36,850 adding 'evalscope/benchmarks/wmt/__init__.py' 2026-05-28T10:08:36,852 adding 'evalscope/benchmarks/wmt/requirements.txt' 2026-05-28T10:08:36,853 adding 'evalscope/benchmarks/wmt/wmt24_adapter.py' 2026-05-28T10:08:36,855 adding 'evalscope/benchmarks/zebralogicbench/__init__.py' 2026-05-28T10:08:36,857 adding 'evalscope/benchmarks/zebralogicbench/utils.py' 2026-05-28T10:08:36,859 adding 'evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py' 2026-05-28T10:08:36,860 adding 'evalscope/benchmarks/zerobench/__init__.py' 2026-05-28T10:08:36,862 adding 'evalscope/benchmarks/zerobench/zerobench_adapter.py' 2026-05-28T10:08:36,864 adding 'evalscope/cli/__init__.py' 2026-05-28T10:08:36,865 adding 'evalscope/cli/base.py' 2026-05-28T10:08:36,867 adding 'evalscope/cli/benchmark_info.py' 2026-05-28T10:08:36,868 adding 'evalscope/cli/cli.py' 2026-05-28T10:08:36,870 adding 'evalscope/cli/start_app.py' 2026-05-28T10:08:36,871 adding 'evalscope/cli/start_eval.py' 2026-05-28T10:08:36,872 adding 'evalscope/cli/start_perf.py' 2026-05-28T10:08:36,873 adding 'evalscope/cli/start_service.py' 2026-05-28T10:08:36,875 adding 'evalscope/collections/__init__.py' 2026-05-28T10:08:36,877 adding 'evalscope/collections/sampler.py' 2026-05-28T10:08:36,878 adding 'evalscope/collections/schema.py' 2026-05-28T10:08:36,880 adding 'evalscope/evaluator/__init__.py' 2026-05-28T10:08:36,881 adding 'evalscope/evaluator/batch_reviewer.py' 2026-05-28T10:08:36,884 adding 'evalscope/evaluator/evaluator.py' 2026-05-28T10:08:36,886 adding 'evalscope/evaluator/perf_collector.py' 2026-05-28T10:08:36,888 adding 'evalscope/filters/__init__.py' 2026-05-28T10:08:36,889 adding 'evalscope/filters/extraction.py' 2026-05-28T10:08:36,890 adding 'evalscope/filters/selection.py' 2026-05-28T10:08:36,892 adding 'evalscope/metrics/__init__.py' 2026-05-28T10:08:36,894 adding 'evalscope/metrics/llm_judge.py' 2026-05-28T10:08:36,896 adding 'evalscope/metrics/math_parser.py' 2026-05-28T10:08:36,899 adding 'evalscope/metrics/metric.py' 2026-05-28T10:08:36,902 adding 'evalscope/metrics/metrics.py' 2026-05-28T10:08:36,903 adding 'evalscope/metrics/rouge_metric.py' 2026-05-28T10:08:36,905 adding 'evalscope/metrics/bert_score/__init__.py' 2026-05-28T10:08:36,907 adding 'evalscope/metrics/bert_score/scorer.py' 2026-05-28T10:08:36,910 adding 'evalscope/metrics/bert_score/utils.py' 2026-05-28T10:08:36,912 adding 'evalscope/metrics/bundled_rouge_score/__init__.py' 2026-05-28T10:08:36,914 adding 'evalscope/metrics/bundled_rouge_score/rouge_scorer.py' 2026-05-28T10:08:36,915 adding 'evalscope/metrics/sem_score/__init__.py' 2026-05-28T10:08:36,917 adding 'evalscope/metrics/sem_score/scorer.py' 2026-05-28T10:08:36,919 adding 'evalscope/metrics/t2v_metrics/__init__.py' 2026-05-28T10:08:36,920 adding 'evalscope/metrics/t2v_metrics/clipscore.py' 2026-05-28T10:08:36,921 adding 'evalscope/metrics/t2v_metrics/constants.py' 2026-05-28T10:08:36,922 adding 'evalscope/metrics/t2v_metrics/itmscore.py' 2026-05-28T10:08:36,924 adding 'evalscope/metrics/t2v_metrics/score.py' 2026-05-28T10:08:36,925 adding 'evalscope/metrics/t2v_metrics/vqascore.py' 2026-05-28T10:08:36,926 adding 'evalscope/metrics/t2v_metrics/models/__init__.py' 2026-05-28T10:08:36,928 adding 'evalscope/metrics/t2v_metrics/models/model.py' 2026-05-28T10:08:36,929 adding 'evalscope/metrics/t2v_metrics/models/utils.py' 2026-05-28T10:08:36,931 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py' 2026-05-28T10:08:36,932 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py' 2026-05-28T10:08:36,934 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py' 2026-05-28T10:08:36,935 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py' 2026-05-28T10:08:36,936 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py' 2026-05-28T10:08:36,938 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py' 2026-05-28T10:08:36,939 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py' 2026-05-28T10:08:36,941 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py' 2026-05-28T10:08:36,942 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py' 2026-05-28T10:08:36,944 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py' 2026-05-28T10:08:36,946 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py' 2026-05-28T10:08:36,947 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py' 2026-05-28T10:08:36,948 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py' 2026-05-28T10:08:36,951 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py' 2026-05-28T10:08:36,952 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py' 2026-05-28T10:08:36,953 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py' 2026-05-28T10:08:36,955 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py' 2026-05-28T10:08:36,957 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py' 2026-05-28T10:08:36,958 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py' 2026-05-28T10:08:36,959 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py' 2026-05-28T10:08:36,961 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py' 2026-05-28T10:08:36,962 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py' 2026-05-28T10:08:36,963 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py' 2026-05-28T10:08:36,966 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py' 2026-05-28T10:08:36,967 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py' 2026-05-28T10:08:36,969 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py' 2026-05-28T10:08:36,971 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py' 2026-05-28T10:08:36,972 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py' 2026-05-28T10:08:36,975 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py' 2026-05-28T10:08:36,977 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py' 2026-05-28T10:08:36,978 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py' 2026-05-28T10:08:36,979 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py' 2026-05-28T10:08:36,981 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py' 2026-05-28T10:08:36,982 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py' 2026-05-28T10:08:36,984 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py' 2026-05-28T10:08:36,986 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py' 2026-05-28T10:08:36,988 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py' 2026-05-28T10:08:36,990 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py' 2026-05-28T10:08:36,991 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml' 2026-05-28T10:08:36,993 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json' 2026-05-28T10:08:36,994 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json' 2026-05-28T10:08:36,995 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json' 2026-05-28T10:08:36,997 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml' 2026-05-28T10:08:36,998 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml' 2026-05-28T10:08:37,000 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml' 2026-05-28T10:08:37,001 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml' 2026-05-28T10:08:37,002 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml' 2026-05-28T10:08:37,003 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml' 2026-05-28T10:08:37,005 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml' 2026-05-28T10:08:37,006 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml' 2026-05-28T10:08:37,007 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml' 2026-05-28T10:08:37,008 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml' 2026-05-28T10:08:37,010 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml' 2026-05-28T10:08:37,011 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml' 2026-05-28T10:08:37,012 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml' 2026-05-28T10:08:37,013 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml' 2026-05-28T10:08:37,014 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml' 2026-05-28T10:08:37,016 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml' 2026-05-28T10:08:37,017 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml' 2026-05-28T10:08:37,018 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml' 2026-05-28T10:08:37,020 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml' 2026-05-28T10:08:37,022 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py' 2026-05-28T10:08:37,023 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py' 2026-05-28T10:08:37,025 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py' 2026-05-28T10:08:37,028 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py' 2026-05-28T10:08:37,034 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py' 2026-05-28T10:08:37,037 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py' 2026-05-28T10:08:37,042 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py' 2026-05-28T10:08:37,044 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py' 2026-05-28T10:08:37,045 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py' 2026-05-28T10:08:37,047 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py' 2026-05-28T10:08:37,049 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py' 2026-05-28T10:08:37,051 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py' 2026-05-28T10:08:37,054 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py' 2026-05-28T10:08:37,056 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py' 2026-05-28T10:08:37,061 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py' 2026-05-28T10:08:37,069 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py' 2026-05-28T10:08:37,071 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py' 2026-05-28T10:08:37,073 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py' 2026-05-28T10:08:37,074 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py' 2026-05-28T10:08:37,076 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py' 2026-05-28T10:08:37,078 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py' 2026-05-28T10:08:37,080 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py' 2026-05-28T10:08:37,082 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py' 2026-05-28T10:08:37,083 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py' 2026-05-28T10:08:37,085 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py' 2026-05-28T10:08:37,087 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py' 2026-05-28T10:08:37,092 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py' 2026-05-28T10:08:37,094 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py' 2026-05-28T10:08:37,095 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py' 2026-05-28T10:08:37,097 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py' 2026-05-28T10:08:37,098 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py' 2026-05-28T10:08:37,100 adding 'evalscope/metrics/text_normalizer/__init__.py' 2026-05-28T10:08:37,102 adding 'evalscope/metrics/text_normalizer/basic.py' 2026-05-28T10:08:37,108 adding 'evalscope/metrics/text_normalizer/chinese.py' 2026-05-28T10:08:37,117 adding 'evalscope/metrics/text_normalizer/english.json' 2026-05-28T10:08:37,120 adding 'evalscope/metrics/text_normalizer/english.py' 2026-05-28T10:08:37,121 adding 'evalscope/metrics/text_normalizer/wer.py' 2026-05-28T10:08:37,123 adding 'evalscope/models/__init__.py' 2026-05-28T10:08:37,125 adding 'evalscope/models/anthropic_compatible.py' 2026-05-28T10:08:37,126 adding 'evalscope/models/image_edit_model.py' 2026-05-28T10:08:37,128 adding 'evalscope/models/litellm_compatible.py' 2026-05-28T10:08:37,129 adding 'evalscope/models/mockllm.py' 2026-05-28T10:08:37,131 adding 'evalscope/models/model_apis.py' 2026-05-28T10:08:37,133 adding 'evalscope/models/modelscope.py' 2026-05-28T10:08:37,135 adding 'evalscope/models/openai_compatible.py' 2026-05-28T10:08:37,137 adding 'evalscope/models/openai_responses.py' 2026-05-28T10:08:37,138 adding 'evalscope/models/text2image_model.py' 2026-05-28T10:08:37,142 adding 'evalscope/models/utils/anthropic.py' 2026-05-28T10:08:37,146 adding 'evalscope/models/utils/openai.py' 2026-05-28T10:08:37,149 adding 'evalscope/models/utils/openai_responses.py' 2026-05-28T10:08:37,150 adding 'evalscope/perf/__init__.py' 2026-05-28T10:08:37,154 adding 'evalscope/perf/arguments.py' 2026-05-28T10:08:37,156 adding 'evalscope/perf/benchmark.py' 2026-05-28T10:08:37,157 adding 'evalscope/perf/main.py' 2026-05-28T10:08:37,159 adding 'evalscope/perf/multi_turn_args.py' 2026-05-28T10:08:37,160 adding 'evalscope/perf/multi_turn_benchmark.py' 2026-05-28T10:08:37,162 adding 'evalscope/perf/core/__init__.py' 2026-05-28T10:08:37,164 adding 'evalscope/perf/core/http_client.py' 2026-05-28T10:08:37,165 adding 'evalscope/perf/core/metrics_consumer.py' 2026-05-28T10:08:37,167 adding 'evalscope/perf/core/strategies/__init__.py' 2026-05-28T10:08:37,168 adding 'evalscope/perf/core/strategies/base.py' 2026-05-28T10:08:37,170 adding 'evalscope/perf/core/strategies/closed_loop.py' 2026-05-28T10:08:37,172 adding 'evalscope/perf/core/strategies/multi_turn.py' 2026-05-28T10:08:37,174 adding 'evalscope/perf/core/strategies/open_loop.py' 2026-05-28T10:08:37,175 adding 'evalscope/perf/plugin/__init__.py' 2026-05-28T10:08:37,177 adding 'evalscope/perf/plugin/registry.py' 2026-05-28T10:08:37,178 adding 'evalscope/perf/plugin/api/__init__.py' 2026-05-28T10:08:37,180 adding 'evalscope/perf/plugin/api/base.py' 2026-05-28T10:08:37,181 adding 'evalscope/perf/plugin/api/custom_api.py' 2026-05-28T10:08:37,182 adding 'evalscope/perf/plugin/api/dashscope_api.py' 2026-05-28T10:08:37,184 adding 'evalscope/perf/plugin/api/default_api.py' 2026-05-28T10:08:37,186 adding 'evalscope/perf/plugin/api/openai_api.py' 2026-05-28T10:08:37,188 adding 'evalscope/perf/plugin/api/openai_embedding_api.py' 2026-05-28T10:08:37,189 adding 'evalscope/perf/plugin/api/openai_rerank_api.py' 2026-05-28T10:08:37,191 adding 'evalscope/perf/plugin/api/openai_responses_api.py' 2026-05-28T10:08:37,193 adding 'evalscope/perf/plugin/datasets/__init__.py' 2026-05-28T10:08:37,194 adding 'evalscope/perf/plugin/datasets/base.py' 2026-05-28T10:08:37,196 adding 'evalscope/perf/plugin/datasets/custom.py' 2026-05-28T10:08:37,198 adding 'evalscope/perf/plugin/datasets/embedding_dataset.py' 2026-05-28T10:08:37,199 adding 'evalscope/perf/plugin/datasets/flickr8k.py' 2026-05-28T10:08:37,200 adding 'evalscope/perf/plugin/datasets/kontext_bench.py' 2026-05-28T10:08:37,201 adding 'evalscope/perf/plugin/datasets/line_by_line.py' 2026-05-28T10:08:37,203 adding 'evalscope/perf/plugin/datasets/longalpaca.py' 2026-05-28T10:08:37,204 adding 'evalscope/perf/plugin/datasets/multi_turn.py' 2026-05-28T10:08:37,206 adding 'evalscope/perf/plugin/datasets/openqa.py' 2026-05-28T10:08:37,207 adding 'evalscope/perf/plugin/datasets/random_dataset.py' 2026-05-28T10:08:37,209 adding 'evalscope/perf/plugin/datasets/random_vl_dataset.py' 2026-05-28T10:08:37,210 adding 'evalscope/perf/plugin/datasets/rerank_dataset.py' 2026-05-28T10:08:37,212 adding 'evalscope/perf/plugin/datasets/share_gpt.py' 2026-05-28T10:08:37,213 adding 'evalscope/perf/plugin/datasets/speed_benchmark.py' 2026-05-28T10:08:37,216 adding 'evalscope/perf/plugin/datasets/swe_smith.py' 2026-05-28T10:08:37,217 adding 'evalscope/perf/plugin/datasets/trie.py' 2026-05-28T10:08:37,219 adding 'evalscope/perf/plugin/datasets/utils.py' 2026-05-28T10:08:37,221 adding 'evalscope/perf/sla/__init__.py' 2026-05-28T10:08:37,222 adding 'evalscope/perf/sla/sla_criterion.py' 2026-05-28T10:08:37,224 adding 'evalscope/perf/sla/sla_run.py' 2026-05-28T10:08:37,226 adding 'evalscope/perf/utils/__init__.py' 2026-05-28T10:08:37,228 adding 'evalscope/perf/utils/analysis_result.py' 2026-05-28T10:08:37,230 adding 'evalscope/perf/utils/benchmark_util.py' 2026-05-28T10:08:37,233 adding 'evalscope/perf/utils/db_util.py' 2026-05-28T10:08:37,234 adding 'evalscope/perf/utils/handler.py' 2026-05-28T10:08:37,236 adding 'evalscope/perf/utils/local_server.py' 2026-05-28T10:08:37,237 adding 'evalscope/perf/utils/log_utils.py' 2026-05-28T10:08:37,238 adding 'evalscope/perf/utils/perf_constants.py' 2026-05-28T10:08:37,241 adding 'evalscope/perf/utils/perf_models.py' 2026-05-28T10:08:37,244 adding 'evalscope/perf/utils/rich_display.py' 2026-05-28T10:08:37,246 adding 'evalscope/perf/utils/trace_metrics.py' 2026-05-28T10:08:37,248 adding 'evalscope/perf/utils/workload_timeline.py' 2026-05-28T10:08:37,249 adding 'evalscope/perf/utils/report/__init__.py' 2026-05-28T10:08:37,251 adding 'evalscope/perf/utils/report/generate_report.py' 2026-05-28T10:08:37,254 adding 'evalscope/perf/utils/report/perf_charts.py' 2026-05-28T10:08:37,255 adding 'evalscope/perf/utils/report/perf_data.py' 2026-05-28T10:08:37,257 adding 'evalscope/report/__init__.py' 2026-05-28T10:08:37,259 adding 'evalscope/report/combinator.py' 2026-05-28T10:08:37,260 adding 'evalscope/report/generator.py' 2026-05-28T10:08:37,263 adding 'evalscope/report/renderer.py' 2026-05-28T10:08:37,264 adding 'evalscope/report/report.py' 2026-05-28T10:08:37,266 adding 'evalscope/report/visualization.py' 2026-05-28T10:08:37,268 adding 'evalscope/report/template/perf_report.html.j2' 2026-05-28T10:08:37,270 adding 'evalscope/report/template/report.html.j2' 2026-05-28T10:08:37,273 adding 'evalscope/report/template/css/base.css' 2026-05-28T10:08:37,274 adding 'evalscope/report/template/css/perf_extra.css' 2026-05-28T10:08:37,276 adding 'evalscope/report/template/js/eval_extra.js' 2026-05-28T10:08:37,277 adding 'evalscope/report/template/js/i18n_eval.js' 2026-05-28T10:08:37,279 adding 'evalscope/report/template/js/i18n_perf.js' 2026-05-28T10:08:37,280 adding 'evalscope/report/template/js/perf_extra.js' 2026-05-28T10:08:37,281 adding 'evalscope/report/template/js/shared.js' 2026-05-28T10:08:37,283 adding 'evalscope/report/template/partials/brand_logo.html' 2026-05-28T10:08:37,284 adding 'evalscope/report/template/partials/footer.html' 2026-05-28T10:08:37,285 adding 'evalscope/report/template/partials/header_eval.html' 2026-05-28T10:08:37,286 adding 'evalscope/report/template/partials/header_perf.html' 2026-05-28T10:08:37,288 adding 'evalscope/report/template/partials/toc_eval.html' 2026-05-28T10:08:37,289 adding 'evalscope/report/template/partials/toc_perf.html' 2026-05-28T10:08:37,290 adding 'evalscope/service/__init__.py' 2026-05-28T10:08:37,292 adding 'evalscope/service/app.py' 2026-05-28T10:08:37,294 adding 'evalscope/service/blueprints/__init__.py' 2026-05-28T10:08:37,296 adding 'evalscope/service/blueprints/eval.py' 2026-05-28T10:08:37,297 adding 'evalscope/service/blueprints/perf.py' 2026-05-28T10:08:37,300 adding 'evalscope/service/blueprints/reports.py' 2026-05-28T10:08:37,301 adding 'evalscope/service/utils/__init__.py' 2026-05-28T10:08:37,303 adding 'evalscope/service/utils/benchmarks.py' 2026-05-28T10:08:37,305 adding 'evalscope/service/utils/log.py' 2026-05-28T10:08:37,306 adding 'evalscope/service/utils/process.py' 2026-05-28T10:08:37,308 adding 'evalscope/summarizer/__init__.py' 2026-05-28T10:08:37,309 adding 'evalscope/summarizer/summarizer.py' 2026-05-28T10:08:37,311 adding 'evalscope/third_party/__init__.py' 2026-05-28T10:08:37,313 adding 'evalscope/third_party/longbench_write/README.md' 2026-05-28T10:08:37,314 adding 'evalscope/third_party/longbench_write/__init__.py' 2026-05-28T10:08:37,315 adding 'evalscope/third_party/longbench_write/default_task.json' 2026-05-28T10:08:37,317 adding 'evalscope/third_party/longbench_write/default_task.yaml' 2026-05-28T10:08:37,318 adding 'evalscope/third_party/longbench_write/eval.py' 2026-05-28T10:08:37,320 adding 'evalscope/third_party/longbench_write/infer.py' 2026-05-28T10:08:37,321 adding 'evalscope/third_party/longbench_write/longbench_write.py' 2026-05-28T10:08:37,323 adding 'evalscope/third_party/longbench_write/utils.py' 2026-05-28T10:08:37,324 adding 'evalscope/third_party/longbench_write/resources/__init__.py' 2026-05-28T10:08:37,325 adding 'evalscope/third_party/longbench_write/resources/judge.txt' 2026-05-28T10:08:37,333 adding 'evalscope/third_party/longbench_write/resources/longbench_write.jsonl' 2026-05-28T10:08:37,336 adding 'evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl' 2026-05-28T10:08:37,338 adding 'evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl' 2026-05-28T10:08:37,340 adding 'evalscope/third_party/longbench_write/tools/__init__.py' 2026-05-28T10:08:37,341 adding 'evalscope/third_party/longbench_write/tools/data_etl.py' 2026-05-28T10:08:37,343 adding 'evalscope/third_party/longbench_write/tools/openai_api.py' 2026-05-28T10:08:37,344 adding 'evalscope/third_party/thinkbench/__init__.py' 2026-05-28T10:08:37,347 adding 'evalscope/third_party/thinkbench/eval.py' 2026-05-28T10:08:37,348 adding 'evalscope/third_party/thinkbench/infer.py' 2026-05-28T10:08:37,350 adding 'evalscope/third_party/thinkbench/resources/critique_template.txt' 2026-05-28T10:08:37,351 adding 'evalscope/third_party/thinkbench/resources/reformat_template.txt' 2026-05-28T10:08:37,353 adding 'evalscope/third_party/thinkbench/tools/__init__.py' 2026-05-28T10:08:37,354 adding 'evalscope/third_party/thinkbench/tools/llm.py' 2026-05-28T10:08:37,355 adding 'evalscope/third_party/thinkbench/tools/utils.py' 2026-05-28T10:08:37,357 adding 'evalscope/third_party/toolbench_static/README.md' 2026-05-28T10:08:37,359 adding 'evalscope/third_party/toolbench_static/__init__.py' 2026-05-28T10:08:37,360 adding 'evalscope/third_party/toolbench_static/config_default.json' 2026-05-28T10:08:37,361 adding 'evalscope/third_party/toolbench_static/config_default.yaml' 2026-05-28T10:08:37,362 adding 'evalscope/third_party/toolbench_static/eval.py' 2026-05-28T10:08:37,364 adding 'evalscope/third_party/toolbench_static/infer.py' 2026-05-28T10:08:37,365 adding 'evalscope/third_party/toolbench_static/requirements.txt' 2026-05-28T10:08:37,366 adding 'evalscope/third_party/toolbench_static/toolbench_static.py' 2026-05-28T10:08:37,368 adding 'evalscope/third_party/toolbench_static/llm/__init__.py' 2026-05-28T10:08:37,369 adding 'evalscope/third_party/toolbench_static/llm/swift_infer.py' 2026-05-28T10:08:37,371 adding 'evalscope/utils/__init__.py' 2026-05-28T10:08:37,372 adding 'evalscope/utils/argument_utils.py' 2026-05-28T10:08:37,374 adding 'evalscope/utils/chat_service.py' 2026-05-28T10:08:37,377 adding 'evalscope/utils/code_utils.py' 2026-05-28T10:08:37,379 adding 'evalscope/utils/data_utils.py' 2026-05-28T10:08:37,380 adding 'evalscope/utils/deprecation_utils.py' 2026-05-28T10:08:37,383 adding 'evalscope/utils/function_utils.py' 2026-05-28T10:08:37,384 adding 'evalscope/utils/import_utils.py' 2026-05-28T10:08:37,388 adding 'evalscope/utils/io_utils.py' 2026-05-28T10:08:37,390 adding 'evalscope/utils/json_schema.py' 2026-05-28T10:08:37,392 adding 'evalscope/utils/logger.py' 2026-05-28T10:08:37,393 adding 'evalscope/utils/model_utils.py' 2026-05-28T10:08:37,395 adding 'evalscope/utils/multi_choices.py' 2026-05-28T10:08:37,397 adding 'evalscope/utils/ner.py' 2026-05-28T10:08:37,399 adding 'evalscope/utils/resource_utils.py' 2026-05-28T10:08:37,400 adding 'evalscope/utils/url_utils.py' 2026-05-28T10:08:37,402 adding 'evalscope/utils/doc_utils/__init__.py' 2026-05-28T10:08:37,406 adding 'evalscope/utils/doc_utils/benchmark_stats.py' 2026-05-28T10:08:37,408 adding 'evalscope/utils/doc_utils/generate_dataset_md.py' 2026-05-28T10:08:37,410 adding 'evalscope/utils/doc_utils/readme_generator.py' 2026-05-28T10:08:37,412 adding 'evalscope/utils/doc_utils/translate_description.py' 2026-05-28T10:08:37,414 adding 'evalscope/utils/tqdm_utils/__init__.py' 2026-05-28T10:08:37,416 adding 'evalscope/utils/tqdm_utils/progress_tracker.py' 2026-05-28T10:08:37,417 adding 'evalscope/utils/tqdm_utils/tqdm_logging.py' 2026-05-28T10:08:37,419 adding 'evalscope/web/.gitignore' 2026-05-28T10:08:37,421 adding 'evalscope/web/README.md' 2026-05-28T10:08:37,422 adding 'evalscope/web/__init__.py' 2026-05-28T10:08:37,423 adding 'evalscope/web/index.html' 2026-05-28T10:08:37,425 adding 'evalscope/web/dist/favicon.svg' 2026-05-28T10:08:37,427 adding 'evalscope/web/dist/index.html' 2026-05-28T10:08:37,430 adding 'evalscope/web/dist/assets/Badge-BM4410Li.js' 2026-05-28T10:08:37,431 adding 'evalscope/web/dist/assets/BenchmarksPage-XDO3_q9P.js' 2026-05-28T10:08:37,433 adding 'evalscope/web/dist/assets/Breadcrumb-CXh_Hjqj.js' 2026-05-28T10:08:37,434 adding 'evalscope/web/dist/assets/Button-Bw30ctK3.js' 2026-05-28T10:08:37,435 adding 'evalscope/web/dist/assets/Card-vB7q1dY3.js' 2026-05-28T10:08:37,442 adding 'evalscope/web/dist/assets/ChatView-CrwMded-.js' 2026-05-28T10:08:37,444 adding 'evalscope/web/dist/assets/ComparePage-DIqHa7xH.js' 2026-05-28T10:08:37,446 adding 'evalscope/web/dist/assets/DashboardPage-Dy_K4Uk2.js' 2026-05-28T10:08:37,448 adding 'evalscope/web/dist/assets/EvalTaskPage-CfhiPFIu.js' 2026-05-28T10:08:37,449 adding 'evalscope/web/dist/assets/FilterChip-GjoWt8ON.js' 2026-05-28T10:08:37,452 adding 'evalscope/web/dist/assets/KaTeX_AMS-Regular-BQhdFMY1.woff2' 2026-05-28T10:08:37,457 adding 'evalscope/web/dist/assets/KaTeX_AMS-Regular-DMm9YOAa.woff' 2026-05-28T10:08:37,465 adding 'evalscope/web/dist/assets/KaTeX_AMS-Regular-DRggAlZN.ttf' 2026-05-28T10:08:37,468 adding 'evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-ATXxdsX0.ttf' 2026-05-28T10:08:37,470 adding 'evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-BEiXGLvX.woff' 2026-05-28T10:08:37,471 adding 'evalscope/web/dist/assets/KaTeX_Caligraphic-Bold-Dq_IR9rO.woff2' 2026-05-28T10:08:37,473 adding 'evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-CTRA-rTL.woff' 2026-05-28T10:08:37,475 adding 'evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-Di6jR-x-.woff2' 2026-05-28T10:08:37,477 adding 'evalscope/web/dist/assets/KaTeX_Caligraphic-Regular-wX97UBjC.ttf' 2026-05-28T10:08:37,480 adding 'evalscope/web/dist/assets/KaTeX_Fraktur-Bold-BdnERNNW.ttf' 2026-05-28T10:08:37,482 adding 'evalscope/web/dist/assets/KaTeX_Fraktur-Bold-BsDP51OF.woff' 2026-05-28T10:08:37,484 adding 'evalscope/web/dist/assets/KaTeX_Fraktur-Bold-CL6g_b3V.woff2' 2026-05-28T10:08:37,486 adding 'evalscope/web/dist/assets/KaTeX_Fraktur-Regular-CB_wures.ttf' 2026-05-28T10:08:37,488 adding 'evalscope/web/dist/assets/KaTeX_Fraktur-Regular-CTYiF6lA.woff2' 2026-05-28T10:08:37,490 adding 'evalscope/web/dist/assets/KaTeX_Fraktur-Regular-Dxdc4cR9.woff' 2026-05-28T10:08:37,493 adding 'evalscope/web/dist/assets/KaTeX_Main-Bold-Cx986IdX.woff2' 2026-05-28T10:08:37,496 adding 'evalscope/web/dist/assets/KaTeX_Main-Bold-Jm3AIy58.woff' 2026-05-28T10:08:37,504 adding 'evalscope/web/dist/assets/KaTeX_Main-Bold-waoOVXN0.ttf' 2026-05-28T10:08:37,507 adding 'evalscope/web/dist/assets/KaTeX_Main-BoldItalic-DxDJ3AOS.woff2' 2026-05-28T10:08:37,511 adding 'evalscope/web/dist/assets/KaTeX_Main-BoldItalic-DzxPMmG6.ttf' 2026-05-28T10:08:37,514 adding 'evalscope/web/dist/assets/KaTeX_Main-BoldItalic-SpSLRI95.woff' 2026-05-28T10:08:37,519 adding 'evalscope/web/dist/assets/KaTeX_Main-Italic-3WenGoN9.ttf' 2026-05-28T10:08:37,522 adding 'evalscope/web/dist/assets/KaTeX_Main-Italic-BMLOBm91.woff' 2026-05-28T10:08:37,524 adding 'evalscope/web/dist/assets/KaTeX_Main-Italic-NWA7e6Wa.woff2' 2026-05-28T10:08:37,527 adding 'evalscope/web/dist/assets/KaTeX_Main-Regular-B22Nviop.woff2' 2026-05-28T10:08:37,531 adding 'evalscope/web/dist/assets/KaTeX_Main-Regular-Dr94JaBh.woff' 2026-05-28T10:08:37,539 adding 'evalscope/web/dist/assets/KaTeX_Main-Regular-ypZvNtVU.ttf' 2026-05-28T10:08:37,543 adding 'evalscope/web/dist/assets/KaTeX_Math-BoldItalic-B3XSjfu4.ttf' 2026-05-28T10:08:37,546 adding 'evalscope/web/dist/assets/KaTeX_Math-BoldItalic-CZnvNsCZ.woff2' 2026-05-28T10:08:37,548 adding 'evalscope/web/dist/assets/KaTeX_Math-BoldItalic-iY-2wyZ7.woff' 2026-05-28T10:08:37,551 adding 'evalscope/web/dist/assets/KaTeX_Math-Italic-DA0__PXp.woff' 2026-05-28T10:08:37,556 adding 'evalscope/web/dist/assets/KaTeX_Math-Italic-flOr_0UB.ttf' 2026-05-28T10:08:37,558 adding 'evalscope/web/dist/assets/KaTeX_Math-Italic-t53AETM-.woff2' 2026-05-28T10:08:37,561 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Bold-CFMepnvq.ttf' 2026-05-28T10:08:37,563 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Bold-D1sUS0GD.woff2' 2026-05-28T10:08:37,566 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Bold-DbIhKOiC.woff' 2026-05-28T10:08:37,568 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Italic-C3H0VqGB.woff2' 2026-05-28T10:08:37,570 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Italic-DN2j7dab.woff' 2026-05-28T10:08:37,573 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Italic-YYjJ1zSn.ttf' 2026-05-28T10:08:37,576 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Regular-BNo7hRIc.ttf' 2026-05-28T10:08:37,579 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Regular-CS6fqUqJ.woff' 2026-05-28T10:08:37,580 adding 'evalscope/web/dist/assets/KaTeX_SansSerif-Regular-DDBCnlJ7.woff2' 2026-05-28T10:08:37,583 adding 'evalscope/web/dist/assets/KaTeX_Script-Regular-C5JkGWo-.ttf' 2026-05-28T10:08:37,585 adding 'evalscope/web/dist/assets/KaTeX_Script-Regular-D3wIWfF6.woff2' 2026-05-28T10:08:37,587 adding 'evalscope/web/dist/assets/KaTeX_Script-Regular-D5yQViql.woff' 2026-05-28T10:08:37,588 adding 'evalscope/web/dist/assets/KaTeX_Size1-Regular-C195tn64.woff' 2026-05-28T10:08:37,591 adding 'evalscope/web/dist/assets/KaTeX_Size1-Regular-Dbsnue_I.ttf' 2026-05-28T10:08:37,592 adding 'evalscope/web/dist/assets/KaTeX_Size1-Regular-mCD8mA8B.woff2' 2026-05-28T10:08:37,594 adding 'evalscope/web/dist/assets/KaTeX_Size2-Regular-B7gKUWhC.ttf' 2026-05-28T10:08:37,596 adding 'evalscope/web/dist/assets/KaTeX_Size2-Regular-Dy4dx90m.woff2' 2026-05-28T10:08:37,597 adding 'evalscope/web/dist/assets/KaTeX_Size2-Regular-oD1tc_U0.woff' 2026-05-28T10:08:37,599 adding 'evalscope/web/dist/assets/KaTeX_Size3-Regular-CTq5MqoE.woff' 2026-05-28T10:08:37,600 adding 'evalscope/web/dist/assets/KaTeX_Size3-Regular-DgpXs0kz.ttf' 2026-05-28T10:08:37,602 adding 'evalscope/web/dist/assets/KaTeX_Size4-Regular-BF-4gkZK.woff' 2026-05-28T10:08:37,604 adding 'evalscope/web/dist/assets/KaTeX_Size4-Regular-DWFBv043.ttf' 2026-05-28T10:08:37,605 adding 'evalscope/web/dist/assets/KaTeX_Size4-Regular-Dl5lxZxV.woff2' 2026-05-28T10:08:37,607 adding 'evalscope/web/dist/assets/KaTeX_Typewriter-Regular-C0xS9mPB.woff' 2026-05-28T10:08:37,609 adding 'evalscope/web/dist/assets/KaTeX_Typewriter-Regular-CO6r4hn1.woff2' 2026-05-28T10:08:37,613 adding 'evalscope/web/dist/assets/KaTeX_Typewriter-Regular-D3Ib7_Hf.ttf' 2026-05-28T10:08:37,616 adding 'evalscope/web/dist/assets/LocaleContext-3RqCP440.js' 2026-05-28T10:08:37,618 adding 'evalscope/web/dist/assets/PerfTaskPage-CeXqhvKL.js' 2026-05-28T10:08:37,621 adding 'evalscope/web/dist/assets/ReportDetailPage-CXjObGG4.js' 2026-05-28T10:08:37,622 adding 'evalscope/web/dist/assets/ReportViewerPage-DawiLh7a.js' 2026-05-28T10:08:37,624 adding 'evalscope/web/dist/assets/ReportsPage-CwZoFJCa.js' 2026-05-28T10:08:37,625 adding 'evalscope/web/dist/assets/ScoreBadge-BKgzeJIw.js' 2026-05-28T10:08:37,627 adding 'evalscope/web/dist/assets/SearchInput-Qu-jjIwJ.js' 2026-05-28T10:08:37,628 adding 'evalscope/web/dist/assets/Skeleton-DD8J4WTY.js' 2026-05-28T10:08:37,755 adding 'evalscope/web/dist/assets/Tabs-rdY0A_mU.js' 2026-05-28T10:08:37,763 adding 'evalscope/web/dist/assets/chevron-up-BykVlddk.js' 2026-05-28T10:08:37,764 adding 'evalscope/web/dist/assets/database-OIFuo_-B.js' 2026-05-28T10:08:37,765 adding 'evalscope/web/dist/assets/eval-CCjFVVv8.js' 2026-05-28T10:08:37,767 adding 'evalscope/web/dist/assets/external-link-Dk_VDyv-.js' 2026-05-28T10:08:37,768 adding 'evalscope/web/dist/assets/folder-open-C4sVuGAY.js' 2026-05-28T10:08:37,801 adding 'evalscope/web/dist/assets/index-4AAMqHB4.js' 2026-05-28T10:08:37,810 adding 'evalscope/web/dist/assets/index-DXtIjaXa.css' 2026-05-28T10:08:37,812 adding 'evalscope/web/dist/assets/loader-circle-DukK1tEn.js' 2026-05-28T10:08:37,813 adding 'evalscope/web/dist/assets/search-_j2hWq0v.js' 2026-05-28T10:08:37,814 adding 'evalscope/web/dist/assets/square-CF_-z8TO.js' 2026-05-28T10:08:37,816 adding 'evalscope/web/dist/assets/usePolling-9BwH1egZ.js' 2026-05-28T10:08:37,817 adding 'evalscope/web/dist/assets/useQueryParams-B0TIRhYM.js' 2026-05-28T10:08:37,820 adding 'evalscope/web/dist/assets/utils-Bt-jremC.js' 2026-05-28T10:08:37,823 adding 'evalscope/web/public/favicon.svg' 2026-05-28T10:08:37,826 adding 'evalscope-1.8.0.dist-info/licenses/LICENSE' 2026-05-28T10:08:37,830 adding 'evalscope-1.8.0.dist-info/METADATA' 2026-05-28T10:08:37,831 adding 'evalscope-1.8.0.dist-info/WHEEL' 2026-05-28T10:08:37,832 adding 'evalscope-1.8.0.dist-info/entry_points.txt' 2026-05-28T10:08:37,833 adding 'evalscope-1.8.0.dist-info/top_level.txt' 2026-05-28T10:08:37,852 adding 'evalscope-1.8.0.dist-info/RECORD' 2026-05-28T10:08:37,904 removing build/bdist.linux-armv7l/wheel 2026-05-28T10:08:38,326 Building wheel for evalscope (pyproject.toml): finished with status 'done' 2026-05-28T10:08:38,405 Created wheel for evalscope: filename=evalscope-1.8.0-py3-none-any.whl size=3917687 sha256=b3b96e5b495f6525cf2199c4f45a3faf007b55746f6ccd9dcf11b4ea841a8325 2026-05-28T10:08:38,407 Stored in directory: /tmp/pip-ephem-wheel-cache-8lzx138m/wheels/e1/7d/21/e1a240afbaa0a88efbb9b20e72a05005453d4a35649a022015 2026-05-28T10:08:38,461 Successfully built evalscope 2026-05-28T10:08:38,567 Removed build tracker: '/tmp/pip-build-tracker-f2nuu9mq'