2026-04-24T06:57:12,116 Created temporary directory: /tmp/pip-ephem-wheel-cache-0q7zrspc
2026-04-24T06:57:12,118 Created temporary directory: /tmp/pip-build-tracker-hudghgmg
2026-04-24T06:57:12,118 Initialized build tracking at /tmp/pip-build-tracker-hudghgmg
2026-04-24T06:57:12,119 Created build tracker: /tmp/pip-build-tracker-hudghgmg
2026-04-24T06:57:12,119 Entered build tracker: /tmp/pip-build-tracker-hudghgmg
2026-04-24T06:57:12,120 Created temporary directory: /tmp/pip-wheel-nciwmr4a
2026-04-24T06:57:12,123 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453
2026-04-24T06:57:12,126 Created temporary directory: /tmp/pip-ephem-wheel-cache-qtchmhm0
2026-04-24T06:57:12,148 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2026-04-24T06:57:12,152 2 location(s) to search for versions of evalscope:
2026-04-24T06:57:12,152 * https://pypi.org/simple/evalscope/
2026-04-24T06:57:12,152 * https://www.piwheels.org/simple/evalscope/
2026-04-24T06:57:12,152 Fetching project page and analyzing links: https://pypi.org/simple/evalscope/
2026-04-24T06:57:12,153 Getting page https://pypi.org/simple/evalscope/
2026-04-24T06:57:12,155 Found index url https://pypi.org/simple
2026-04-24T06:57:12,387 Fetched page https://pypi.org/simple/evalscope/ as application/vnd.pypi.simple.v1+json
2026-04-24T06:57:12,405 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/94/d2/dc5e929802776bf4e662a46d794b765876bb93e2300189cafd113cac74d6/evalscope-0.5.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8)
2026-04-24T06:57:12,406
Found link https://files.pythonhosted.org/packages/e5/70/45a5dad24b1fa535bff194b99a4668e7f5f328be972b51b3b91eafb4cdbb/evalscope-0.5.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0rc0 2026-04-24T06:57:12,407 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ac/eb/341fe367df2bc9a0ae7ef5eb2037a5d549d9bb8c0d7ad84844c9926e0947/evalscope-0.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,408 Found link https://files.pythonhosted.org/packages/3b/3f/585f7f1cf2ce90b234c1cfd654bb26977be4d889c0e5eed0122cb3024c45/evalscope-0.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0 2026-04-24T06:57:12,409 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ad/58/c0ce004159cfac6df9b5736c011576d52bdceb778943de8a022a419d86eb/evalscope-0.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,410 Found link https://files.pythonhosted.org/packages/94/33/7ad2f2285f5b68953ad4466d23dc5de1a2e57e7cc63d5924ab0e84d156ba/evalscope-0.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.2 2026-04-24T06:57:12,411 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/59/a9e1c4cf88018ece1fdd8d8b7fa976e28f9b4b181ef7ceb74a5e2db533ab/evalscope-0.5.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,412 Found link https://files.pythonhosted.org/packages/04/57/9ca7b1fd68f2acc32802b22236c83b597a5690a483d5938d38183b549d22/evalscope-0.5.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.3 2026-04-24T06:57:12,413 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/fa/c8/fcbaf01b7486c3b29b7790167c2cda560f00a04d100cec808ee9a3349ca0/evalscope-0.5.4-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,414 Found link https://files.pythonhosted.org/packages/8c/cc/abd412bad714c0266be1f0159b49a817d45db099c3bd031134d223589e93/evalscope-0.5.4.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.4 2026-04-24T06:57:12,415 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4d/da/d705d457683223f289e8c5d6cadbaad15d15e098692c53ea7e6196a94373/evalscope-0.5.5rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,415 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7e/4c/414dd545a1833245a53797d70a35a78ae1b9cfbcc81b35a4e1763e678437/evalscope-0.5.5rc1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,416 Found link https://files.pythonhosted.org/packages/56/5f/aa7fcf62102694dd66b69e88cb2523094bf04b53f785854e17ee6b7234a7/evalscope-0.5.5rc1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5rc1 2026-04-24T06:57:12,417 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3e/d0/91a7a1f95f3fa19dd8d4e434dc711768abe5c006f32a514a8602b429e049/evalscope-0.5.5-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,418 Found link https://files.pythonhosted.org/packages/04/7e/a7f065d6ebac15fe172d3b0906ff5b26a71df5a9975c0f14978044211cf1/evalscope-0.5.5.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5 2026-04-24T06:57:12,418 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/11/89/b812b01a5ed91fc079dea052e1341860cd65d25d463c75c90e5a30ab6ae8/evalscope-0.6.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,419 Found link https://files.pythonhosted.org/packages/d5/ad/57a2d5f33c5b7d5066f8a5dcb1d34f14bf246112f7900228b9f2fb41b21b/evalscope-0.6.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0rc0 2026-04-24T06:57:12,420 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/25/3f03d9d924f1b65610724c9f10727b48ef952afdfee8687a461949c88c78/evalscope-0.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,421 Found link https://files.pythonhosted.org/packages/9d/ac/1f432bcc46ccb8348b869b80d2aaabde5e583b370418ba48714083e31068/evalscope-0.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0 2026-04-24T06:57:12,422 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7b/e1/42c9e58b4690f23ef48bce841fc95cbf0744c1579cdc80fa6f33b0453344/evalscope-0.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,423 Found link https://files.pythonhosted.org/packages/cd/d6/1d9d2db9acda6e61d4210074f51e7e3dee4d0212fabdd94999105db23eed/evalscope-0.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.1 2026-04-24T06:57:12,424 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/92/6fff1dfd12a4f73489c451dab56351b6a3c1095a92bb55025c2934fc625d/evalscope-0.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,425 Found link https://files.pythonhosted.org/packages/fe/3c/500d655a27ca80e1aba3fb2b1e8886951942732b869ad1516422d9e6ac97/evalscope-0.7.0.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.0 2026-04-24T06:57:12,425 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a5/3a/ae22d4d9a44ad37ac887da21b948bf6e784307001a09802d36c5bf04018d/evalscope-0.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,426 Found link https://files.pythonhosted.org/packages/88/47/69d067f0d3d784a7975cc4ea067fdc55c8f785ece6fbe86e5e21edc8b36f/evalscope-0.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.1 2026-04-24T06:57:12,427 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/23/acbcbc2ed6f00d3bb81220054651e6a2b1b02714d42b1aeb018a2f5574c4/evalscope-0.7.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,428 Found link https://files.pythonhosted.org/packages/a4/38/9126b9329cd2ad6ccfd4a73f04402bf71b65921564c3c12cb0d62b3b421d/evalscope-0.7.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.2 2026-04-24T06:57:12,429 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/1f/9598183da3026696adc19a6727a19d57f9abbf4fd7aeb20cdf12faee7693/evalscope-0.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,430 Found link https://files.pythonhosted.org/packages/63/7e/fc44d30e3a83dbc3070396d78279c9e3e8716cbc4cc05811a70f1b463bfb/evalscope-0.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.0 2026-04-24T06:57:12,430 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/37/a9/a9c6cda95a6b837c9303e9a1598999f2c4e605abb507365c6ff70b372a5a/evalscope-0.8.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,431 Found link 
https://files.pythonhosted.org/packages/63/5a/bbc230bb06a7bc40dd3985dde4615a8a71f111bf95761e40f6f0f8a7e1a6/evalscope-0.8.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.1 2026-04-24T06:57:12,432 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/07/39/a1eb2efed77d21e8253daa37f082669a004c4d288813a2ee9e15398f2e80/evalscope-0.8.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,433 Found link https://files.pythonhosted.org/packages/57/bc/5ff6d538e459d8b3c567577c991eec72ab6adbc19497dc164d96cd634d2f/evalscope-0.8.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.2 2026-04-24T06:57:12,433 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f6/29/81a188c03b272bf7def0c9bb556b9e9465adbc68ecb18907b636f1e8cbd7/evalscope-0.9.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,434 Found link https://files.pythonhosted.org/packages/c0/c2/19da3be1fbd6b548ecdc877d47269e92503518de53acbbfe96120c5c9753/evalscope-0.9.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.9.0 2026-04-24T06:57:12,435 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/6d/45d7407f31d6878494c3b493d7e49a8318b1839161839293c1a2e66aadcf/evalscope-0.10.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,436 Found link https://files.pythonhosted.org/packages/16/b1/b6cef37a0dd0acfa5873ca4763ac6b4ac4b19a0b15ca6bdc8f30d4443682/evalscope-0.10.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.0 2026-04-24T06:57:12,437 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/a7/8f/907045290d359b4e07e7acc96ec60173380748b49e9f3c91b7ddd8e8342d/evalscope-0.10.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,438 Found link https://files.pythonhosted.org/packages/e3/d3/dda2ac0513904bff8aa0c2efe77bc851d3acc6d514707db36648e4a903d2/evalscope-0.10.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.1 2026-04-24T06:57:12,439 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d6/e5/852326943d86c85b5ca6b548a5f3753c217b771d93968c61ec2ca46ee0b1/evalscope-0.11.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,440 Found link https://files.pythonhosted.org/packages/5f/3f/e2816b99487b4ead257453a242b5282a85881f9d26fbb5efb21cc5cf88fa/evalscope-0.11.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.11.0 2026-04-24T06:57:12,441 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7d/8e/f9eceecf8bc7d740603f915eef7fab3e9d657a01f5de2c523e531445299c/evalscope-0.12.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,442 Found link https://files.pythonhosted.org/packages/33/82/7765517ff80a73eac7465369767aa45a5be3d5e0fb7c4f4a3ff743811f8c/evalscope-0.12.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.0 2026-04-24T06:57:12,442 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ee/4c/84a5e18985e149eb4283fef2b58a81ff2cec2d099c017684938ec3a3935f/evalscope-0.12.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,443 Found link https://files.pythonhosted.org/packages/b2/24/83b5530319bdb02142289e04640c6008dda1b988043c42feb7f0a5eab3b7/evalscope-0.12.1.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.1 2026-04-24T06:57:12,444 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/a5/65faf0660cd8ae2660354b002b2b3a586b9419bc894120fea97efd506cb6/evalscope-0.13.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,444 Found link https://files.pythonhosted.org/packages/d0/39/d5eb469a94191760c61d1bfcb235e28be1d2a080d88b44792f53d76c45d1/evalscope-0.13.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.0 2026-04-24T06:57:12,445 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f7/fc/91b7b4379131d2e15ca1f575f533daea589357293e492ef2c93e0aac6b55/evalscope-0.13.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,446 Found link https://files.pythonhosted.org/packages/9e/a3/33b4ce270d5500fe7c8f32fa2160749b607f248141328d4785b6032c8f2a/evalscope-0.13.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.1 2026-04-24T06:57:12,447 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e6/66/83752c305879cf3dea54170398839ab08a046485bc18c41a34f41aca11ab/evalscope-0.13.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,448 Found link https://files.pythonhosted.org/packages/b7/8d/711ae30b80329e2dd7da760c001d9a5b45e4d8e5292f317f1ea10c744c29/evalscope-0.13.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.2 2026-04-24T06:57:12,449 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a8/b4/b22c7e52e6a7381333bdfa0bf92fae0258e7812064b1e208cbab56a62d08/evalscope-0.14.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,450 Found link 
https://files.pythonhosted.org/packages/40/44/7db2cb90e6ca0c9db92124f10c0273d7c6ef4b81523e1c98c34a88e67faa/evalscope-0.14.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.14.0 2026-04-24T06:57:12,450 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b2/47/976633e0f29b58b8c9f3faacf7373a2da734771a7915ac45d721d96e0ad7/evalscope-0.15.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,451 Found link https://files.pythonhosted.org/packages/b8/e3/bd534d69328afa98bdd497b5eaaf4b7416da9e8f56109d045c332b17d016/evalscope-0.15.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.0 2026-04-24T06:57:12,452 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5a/54/237aee5f0317fa04450c9ad67c3bf28b730460b2e6dc1e65b74b4bf2cd67/evalscope-0.15.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,453 Found link https://files.pythonhosted.org/packages/7a/d4/8b87e83a3a08f87ce5b4325f0cd5ab9bc54d296dc3f3492a1d3216a97a6d/evalscope-0.15.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.1 2026-04-24T06:57:12,454 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/1e/b5/6fec1cbb02a41ab79430eb3fb51eea7709525df0f9753ae2c54fbc4633f1/evalscope-0.16.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,455 Found link https://files.pythonhosted.org/packages/99/69/63997bedfa6fd33af671b539f77c375b111017eff23313f76693e24b8872/evalscope-0.16.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.0 2026-04-24T06:57:12,455 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/ea/76/639641578bdc92d25211eb8dae24d1fae19e40cd1649e17946f6ad8a5dc3/evalscope-0.16.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,456 Found link https://files.pythonhosted.org/packages/94/13/616bf9c33b0769db44a2bae32b54d33cf7129874392682aa76326a51e085/evalscope-0.16.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.1 2026-04-24T06:57:12,457 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f1/d3/9e83cc1b5a132342a05ef6ee79018bd7561f90a6406dc9db5c85fb0a281f/evalscope-0.16.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,458 Found link https://files.pythonhosted.org/packages/94/47/edd3faaddd321e464ae72db6e7bf82246dd4ca2f0f67127ca8c427cac664/evalscope-0.16.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.2 2026-04-24T06:57:12,458 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5d/4e/34c56086cdfe7d1ad912241af819b3e67f20373f382016e33ac89dd43dde/evalscope-0.16.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,459 Found link https://files.pythonhosted.org/packages/22/07/14603c038a8019472881f574f1c47bd4193481f256b6dc702c65d8b8f984/evalscope-0.16.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.3 2026-04-24T06:57:12,460 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f3/e2/104156f74779cf2849f53566bba585015492d7320c36a7cb76c7196b0ef5/evalscope-0.17.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,461 Found link https://files.pythonhosted.org/packages/c5/d0/b66d1b97ec67d65b6df54f18682e30cfeb6401604b93c9b1bdd1e97b8d79/evalscope-0.17.0.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.17.0 2026-04-24T06:57:12,462 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a9/b5/630d2c5dc5c32e9fbad5034e04d8aba6f6461dc08f255df77dd8d463857f/evalscope-0.17.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,463 Found link https://files.pythonhosted.org/packages/38/88/326e48929bb9577a6a36e07afa65bbf6bd870c1b644f82e5713874ae3238/evalscope-0.17.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 0.17.1 2026-04-24T06:57:12,464 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/21/18/9a6c208d2bc119ac67b5537b60851c6bccc99f25229eaa96cbe6e38721bd/evalscope-1.0.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,464 Found link https://files.pythonhosted.org/packages/d8/44/3d727dd28fcc50317c95d12e5bb850ed2863105f812373ac120877875434/evalscope-1.0.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.0 2026-04-24T06:57:12,465 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e1/8a/50456fa7dd77be4c3a0ea0b3d96cb7ae5b2557454fdd35cbf0009a9d792f/evalscope-1.0.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,466 Found link https://files.pythonhosted.org/packages/8d/52/93569134b3d8dea2a0d0bc2134c03056f0ffee1840f7299eb83d475457df/evalscope-1.0.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.1 2026-04-24T06:57:12,467 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/22/f6/32fb0fef08a6c881ac840117455a5697a0d63226db8a24cce5208b720829/evalscope-1.0.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,468 Found link 
https://files.pythonhosted.org/packages/3d/f5/025baefe432d9af1ed845ab5738b638b7b97f2dd3767e9478b8eee10966f/evalscope-1.0.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.2 2026-04-24T06:57:12,469 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/6e/12/f31dbb18daa7e3c6cfecf856ddc323a303e115271166c100e06af58ea6b6/evalscope-1.1.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,470 Found link https://files.pythonhosted.org/packages/b4/98/7449040e89beaa4556bf35ba1e171b2d4955ff15b2b4c43f2ff55b048aeb/evalscope-1.1.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.1.0 2026-04-24T06:57:12,470 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/79/d1/afc8b23345ad8f11a5e1f8c6c3112a8679604d833bdbc02aa06787952fd1/evalscope-1.1.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,471 Found link https://files.pythonhosted.org/packages/e8/3f/d67b73ce19789e914d6a78740fa7bfd0c07f161bc239b92cd3c26541f2fb/evalscope-1.1.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.1.1 2026-04-24T06:57:12,472 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/82/0a/d984751e4b5751f064209da3745b6aff6cc0f1d9d93f13cab0a1017a8639/evalscope-1.2.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,473 Found link https://files.pythonhosted.org/packages/7f/f9/0a2a069ee4500666ec5c3d10b302fc71d176c17bbe70447336e610953e1a/evalscope-1.2.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.2.0 2026-04-24T06:57:12,473 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/d9/58/823d009dfa49cfdc750ac257e744eb97456d69528a09ac108ee8cab15316/evalscope-1.3.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,474 Found link https://files.pythonhosted.org/packages/42/5a/a309f7ce1fbe2b39e4b0a1f26cfcd7864eaa90e4792a5290e8cdd2ce3b4f/evalscope-1.3.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.3.0 2026-04-24T06:57:12,475 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c6/57/005d3ef07ecdd5163e1bbed3413537b653b637d9c8b62a2bcdd97546607b/evalscope-1.4.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,476 Found link https://files.pythonhosted.org/packages/11/d5/268f610ac7db9c5c2109936f65ac4df8b4bc52106ed4369509d3b3c4f127/evalscope-1.4.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.0 2026-04-24T06:57:12,476 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b8/96/a2b4fabf6fa6cf09ac6669aa01dc483ac53576a8fd7c2c4be21ea281840f/evalscope-1.4.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,477 Found link https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.1 2026-04-24T06:57:12,478 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/bf/0f/97e68e89f7925160df49ea1dbbcef7f3f8e808a51756c199aaaadc75f5a5/evalscope-1.4.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,479 Found link https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.2 2026-04-24T06:57:12,480 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c0/fb/e6b1a396bad204e38591a6d6de1172dac2ce3e0d15b87e812d57e22d0e4f/evalscope-1.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,481 Found link https://files.pythonhosted.org/packages/a7/32/518a920ac8a73c4c6e39f7e443df6da6ea9a3be6567c4a425def866b8f5e/evalscope-1.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.0 2026-04-24T06:57:12,481 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/eb/68/0c870a84e38d5a8d3e7c9df918739a4ba6a45c3ddb624d2792a41a8d3293/evalscope-1.5.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,482 Found link https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.1 2026-04-24T06:57:12,483 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/55/1f/b1b087b1d646f635e9e225c8b610f80b1e6e2590802228c15d1d58ae026e/evalscope-1.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,484 Found link https://files.pythonhosted.org/packages/ff/13/56b351e22964e93e6d74dbfdb71a4d5e2f96b4ae716f76d5b5ed4d88bae7/evalscope-1.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.2 2026-04-24T06:57:12,484 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/53/5b/3d5e1067f98e08cc2aac5c45f4fa67e6a183d62471439e61528469bc5e61/evalscope-1.5.2.post1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,485 Found link 
https://files.pythonhosted.org/packages/64/d7/ab3fad322268613661de3c4451df7f236475ef4aef7645619e6998b3199d/evalscope-1.5.2.post1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.2.post1 2026-04-24T06:57:12,486 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/02/f3/d38eb7c4488f85d92caf2a6c4b9af852e72ac4d23f5132715d6d5062a82a/evalscope-1.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,487 Found link https://files.pythonhosted.org/packages/ba/13/b96ac9df484c1ff1f3e202bd103bc715c87c338c465ac9dff585109bee98/evalscope-1.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.6.0 2026-04-24T06:57:12,487 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/57/898c44800b1a77a2c4e31afad151f237dad2004666db5ce69a1fec2f654e/evalscope-1.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,488 Found link https://files.pythonhosted.org/packages/e6/6e/9ebaec0d2d4b09ca3fa6aed3d9ed4b61e52824c8cc8f5b9ff2dfa58d6ea4/evalscope-1.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.6.1 2026-04-24T06:57:12,489 Fetching project page and analyzing links: https://www.piwheels.org/simple/evalscope/ 2026-04-24T06:57:12,490 Getting page https://www.piwheels.org/simple/evalscope/ 2026-04-24T06:57:12,491 Found index url https://www.piwheels.org/simple 2026-04-24T06:57:12,666 Fetched page https://www.piwheels.org/simple/evalscope/ as text/html 2026-04-24T06:57:12,677 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.6.0-py3-none-any.whl#sha256=5f6a31b463e6844867fdddb6488a17d304a770d4a30ad0a3615f853d4650f220 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,678 Skipping link: No binaries 
permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.2.post1-py3-none-any.whl#sha256=76e17bb53fe492f1648148d0cebe8d500169cef934d52fee6e0e3edbf5351b90 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,678 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.2-py3-none-any.whl#sha256=db9427c4cca0bcaa951a6faac7db6c94854d1384ab0914a0ba0f9d377c947f66 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,679 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.1-py3-none-any.whl#sha256=a95eabf175191595bfeebe4e6face613a6c137a65067e8fd7dca613567bba440 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,679 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.0-py3-none-any.whl#sha256=933f1aa9915ed658bc3ae6901e0b96efbdbf80db96eca40c2d42453b26530d9b (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,680 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.2-py3-none-any.whl#sha256=222e938fe502394b9935f3c00677cf1372892caa530b6fb48476ae909a91399a (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,680 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.1-py3-none-any.whl#sha256=d74ddb7150b19de1eb995026c697b598cdfc0b75fee3a2d110219256c4241688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,681 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.0-py3-none-any.whl#sha256=307c9ed70f562ba776fdf9b0136b35fe4e361b1b974bb0f8ca39e425f4738e6e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,682 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.3.0-py3-none-any.whl#sha256=d9949bacf6c08b5ab341f872c6ee4b31995d724fe963f64ddf3c129e0e39145e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,682 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.2.0-py3-none-any.whl#sha256=5969cf4a3132f6a29f9ef39aa35a8a3be24f114bbcd7a77a19c145bbec432be9 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,683 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.1-py3-none-any.whl#sha256=5bfb8c55f45e1bcd5df5cb0cd4ecceae2ccb93cd09c9477b0b0e6c097ecb1d1f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-24T06:57:12,683 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.0-py3-none-any.whl#sha256=ca7c951fa316bb7ec6fb0e38ad6503632067f07f469e37828fe9e39e51591994 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,684 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.2-py3-none-any.whl#sha256=e14016040022bcd666c05ffa806f3713775e9de19a98290bd2a0a36e5c435409 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,685 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.1-py3-none-any.whl#sha256=61b9c14f3409804d84ddf31b749b587c7c7441f9d1aa0d453991b1bd0bbda74c (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,685 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.0-py3-none-any.whl#sha256=ea1044755177db8d9e94cc4ccafbd56e35b173a8425db7d1653dd9f66e1463ad (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,686 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.1-py3-none-any.whl#sha256=aa0054b8aed77684e56d0836d4568080fa4827799a16d62fef6ec13802cd4050 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-24T06:57:12,686 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.0-py3-none-any.whl#sha256=e7a2549f9f5ac5b0061d01f8ca99900e06f6340b91d3e546163423b896287862 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,687 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.3-py3-none-any.whl#sha256=75dcbd7cc0a0336f68d407a3925fe065ecaaea0fa9030ceebfaa3f22f0f3b417 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,687 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.2-py3-none-any.whl#sha256=14e00f16b506b723a359799e3cb271370a3da768da3667563612e458982d6847 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,688 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.1-py3-none-any.whl#sha256=296b90c06a33f69c9e7049a768f16929acd1279af0830f47f18fc598560d0e13 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,688 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.0-py3-none-any.whl#sha256=0a6f11a7a4d564d4a1dca5fe424cda19608cf947b9ab739079f5b60842651c7e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,689 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.1-py3-none-any.whl#sha256=504717cfc96a8fdbb0d4bc080d0292e80423891b658d61283385737794c4e95f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,690 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.0-py3-none-any.whl#sha256=263020a7b7f7788e17515c738604ea64904ca0da34d09664d2b7ee16d3522a00 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,690 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.14.0-py3-none-any.whl#sha256=cdf26beb4b188e1dd5e09feeb1832ccf909c9d2a535eb76e702fb3c66fc65688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,691 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.2-py3-none-any.whl#sha256=7d360083cf9dd960996847cde2140085d0830ddc8b12aa8007b4f72d395c5211 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,691 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.1-py3-none-any.whl#sha256=6b26f11daca05d6b56da3cb1b78ea0a8f28de94901b7520171c66e2a16b1c638 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,692 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.0-py3-none-any.whl#sha256=8ca93c4011f04e35df239b5486248e03100a9c9e265664e01330ef4b1cc691f7 (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,693 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.1-py3-none-any.whl#sha256=72e5815923789c6cab6c32425477393bf645f670edcc493f8e4dec6e93f3da23 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,693 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.0-py3-none-any.whl#sha256=f26c7317a5bacac8527806723d45bac74563a2608ed761d822ace03be3a5e45f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,694 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.11.0-py3-none-any.whl#sha256=6e7d3242c5bf97b54d24644a1300575fdf41c1d90eef7c1344c20bf0d1518671 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,694 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.1-py3-none-any.whl#sha256=a7e645410333c1aaec5a024be4c02166dfb0d6b4635b020c181ce672d31102d2 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,695 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.0-py3-none-any.whl#sha256=494f1742178def5f86e552c004562c877cd8b8b6f5d4267d29f961f20e8bf569 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,695 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.9.0-py3-none-any.whl#sha256=26fd59fd387850f3e84f995fd38357994460e71d0124bc09954ca8837027ec52 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,696 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.2-py3-none-any.whl#sha256=3c4ed25b5c39d7a706927607e552e6ced6d50532bc442bba74979f70720f4894 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,696 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.1-py3-none-any.whl#sha256=64e9306453082a95b0c0507d6fd1dbf50ed0ab210ebcfebdcb52c534769d1856 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,697 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.0-py3-none-any.whl#sha256=7758550ef3406d9c6096de05909cc1c97ed0c2f7be6edaf3fe5344244e34f233 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,698 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.2-py3-none-any.whl#sha256=21a1f4448ffca926853b7516b6375f801ef6a8067501dde546e7d794fb759f20 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,698 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.1-py3-none-any.whl#sha256=b959f2d9850544f2d96ac765b06ed112cf579a11899618fcc5245407f4e33843 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,699 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.0-py3-none-any.whl#sha256=2deddeb89bfb6fa02844b72a2be24627526136cbd45315e8e4dadb672d53c9fc (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-24T06:57:12,699 Skipping link: not a file: https://www.piwheels.org/simple/evalscope/ 2026-04-24T06:57:12,700 Skipping link: not a file: https://pypi.org/simple/evalscope/ 2026-04-24T06:57:12,727 Given no hashes to check 1 links for project 'evalscope': 
discarding no candidates 2026-04-24T06:57:12,746 Collecting evalscope==1.6.1 2026-04-24T06:57:12,749 Created temporary directory: /tmp/pip-unpack-hqsf3w6t 2026-04-24T06:57:12,984 Downloading evalscope-1.6.1.tar.gz (1.5 MB) 2026-04-24T06:57:15,426 Added evalscope==1.6.1 from https://files.pythonhosted.org/packages/e6/6e/9ebaec0d2d4b09ca3fa6aed3d9ed4b61e52824c8cc8f5b9ff2dfa58d6ea4/evalscope-1.6.1.tar.gz to build tracker '/tmp/pip-build-tracker-hudghgmg' 2026-04-24T06:57:15,433 Created temporary directory: /tmp/pip-build-env-lg0vhxpk 2026-04-24T06:57:15,437 Installing build dependencies: started 2026-04-24T06:57:15,438 Running command pip subprocess to install build dependencies 2026-04-24T06:57:16,693 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-24T06:57:17,143 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. 
Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-24T06:57:17,171 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-24T06:57:18,919 Collecting setuptools>=69 2026-04-24T06:57:19,004 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-24T06:57:19,272 Collecting wheel 2026-04-24T06:57:19,287 Using cached https://www.piwheels.org/simple/wheel/wheel-0.47.0-py3-none-any.whl (32 kB) 2026-04-24T06:57:19,465 Collecting packaging>=24.0 2026-04-24T06:57:19,482 Using cached https://www.piwheels.org/simple/packaging/packaging-26.1-py3-none-any.whl (95 kB) 2026-04-24T06:57:22,456 Installing collected packages: setuptools, packaging, wheel 2026-04-24T06:57:25,931 Creating /tmp/pip-build-env-lg0vhxpk/overlay/local/bin 2026-04-24T06:57:25,933 changing mode of /tmp/pip-build-env-lg0vhxpk/overlay/local/bin/wheel to 755 2026-04-24T06:57:25,955 Successfully installed packaging-26.1 setuptools-82.0.1 wheel-0.47.0 2026-04-24T06:57:26,229 Installing build dependencies: finished with status 'done' 2026-04-24T06:57:26,236 Getting requirements to build wheel: started 2026-04-24T06:57:26,237 Running command Getting requirements to build wheel 2026-04-24T06:57:26,980 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-04-24T06:57:26,980 !! 2026-04-24T06:57:26,981 ******************************************************************************** 2026-04-24T06:57:26,982 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-04-24T06:57:26,983 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-24T06:57:26,983 or your builds will no longer be supported. 
2026-04-24T06:57:26,984 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:26,985 ******************************************************************************** 2026-04-24T06:57:26,986 !! 2026-04-24T06:57:26,987 corresp(dist, value, root_dir) 2026-04-24T06:57:27,068 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-24T06:57:27,069 !! 2026-04-24T06:57:27,070 ******************************************************************************** 2026-04-24T06:57:27,070 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-24T06:57:27,072 License :: OSI Approved :: Apache Software License 2026-04-24T06:57:27,073 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:27,073 ******************************************************************************** 2026-04-24T06:57:27,074 !! 2026-04-24T06:57:27,074 dist._finalize_license_expression() 2026-04-24T06:57:27,075 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-24T06:57:27,075 !! 2026-04-24T06:57:27,076 ******************************************************************************** 2026-04-24T06:57:27,077 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-24T06:57:27,078 License :: OSI Approved :: Apache Software License 2026-04-24T06:57:27,079 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:27,079 ******************************************************************************** 2026-04-24T06:57:27,080 !! 
2026-04-24T06:57:27,081 self._finalize_license_expression() 2026-04-24T06:57:27,081 running egg_info 2026-04-24T06:57:27,087 writing evalscope.egg-info/PKG-INFO 2026-04-24T06:57:27,113 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-04-24T06:57:27,115 writing entry points to evalscope.egg-info/entry_points.txt 2026-04-24T06:57:27,129 writing requirements to evalscope.egg-info/requires.txt 2026-04-24T06:57:27,131 writing top-level names to evalscope.egg-info/top_level.txt 2026-04-24T06:57:27,396 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:27,449 reading manifest template 'MANIFEST.in' 2026-04-24T06:57:27,907 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-04-24T06:57:27,912 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-04-24T06:57:27,918 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-04-24T06:57:27,924 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-04-24T06:57:27,925 adding license file 'LICENSE' 2026-04-24T06:57:27,981 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:28,082 Getting requirements to build wheel: finished with status 'done' 2026-04-24T06:57:28,086 Created temporary directory: /tmp/pip-modern-metadata-ylf29bb6 2026-04-24T06:57:28,089 Preparing metadata (pyproject.toml): started 2026-04-24T06:57:28,090 Running command Preparing metadata (pyproject.toml) 2026-04-24T06:57:28,736 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-04-24T06:57:28,736 !! 
2026-04-24T06:57:28,738 ******************************************************************************** 2026-04-24T06:57:28,738 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-04-24T06:57:28,739 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-24T06:57:28,740 or your builds will no longer be supported. 2026-04-24T06:57:28,741 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:28,741 ******************************************************************************** 2026-04-24T06:57:28,743 !! 2026-04-24T06:57:28,743 corresp(dist, value, root_dir) 2026-04-24T06:57:28,819 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-24T06:57:28,820 !! 2026-04-24T06:57:28,821 ******************************************************************************** 2026-04-24T06:57:28,821 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-24T06:57:28,822 License :: OSI Approved :: Apache Software License 2026-04-24T06:57:28,823 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:28,824 ******************************************************************************** 2026-04-24T06:57:28,825 !! 2026-04-24T06:57:28,825 dist._finalize_license_expression() 2026-04-24T06:57:28,827 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-24T06:57:28,828 !! 
2026-04-24T06:57:28,829 ******************************************************************************** 2026-04-24T06:57:28,830 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-24T06:57:28,831 License :: OSI Approved :: Apache Software License 2026-04-24T06:57:28,831 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:28,832 ******************************************************************************** 2026-04-24T06:57:28,833 !! 2026-04-24T06:57:28,833 self._finalize_license_expression() 2026-04-24T06:57:28,834 running dist_info 2026-04-24T06:57:28,844 creating /tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info 2026-04-24T06:57:28,845 writing /tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/PKG-INFO 2026-04-24T06:57:28,872 writing dependency_links to /tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/dependency_links.txt 2026-04-24T06:57:28,874 writing entry points to /tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/entry_points.txt 2026-04-24T06:57:28,887 writing requirements to /tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/requires.txt 2026-04-24T06:57:28,889 writing top-level names to /tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/top_level.txt 2026-04-24T06:57:28,890 writing manifest file '/tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:29,112 reading manifest file '/tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:29,114 reading manifest template 'MANIFEST.in' 2026-04-24T06:57:29,575 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-04-24T06:57:29,578 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-04-24T06:57:29,582 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-04-24T06:57:29,586 warning: no previously-included files 
matching '*.dylib' found anywhere in distribution 2026-04-24T06:57:29,587 adding license file 'LICENSE' 2026-04-24T06:57:29,630 writing manifest file '/tmp/pip-modern-metadata-ylf29bb6/evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:29,632 creating '/tmp/pip-modern-metadata-ylf29bb6/evalscope-1.6.1.dist-info' 2026-04-24T06:57:29,762 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-24T06:57:29,772 Source in /tmp/pip-wheel-nciwmr4a/evalscope_8ab21f4e6a5d424fa2180dfcf124f05f has version 1.6.1, which satisfies requirement evalscope==1.6.1 from https://files.pythonhosted.org/packages/e6/6e/9ebaec0d2d4b09ca3fa6aed3d9ed4b61e52824c8cc8f5b9ff2dfa58d6ea4/evalscope-1.6.1.tar.gz 2026-04-24T06:57:29,773 Removed evalscope==1.6.1 from https://files.pythonhosted.org/packages/e6/6e/9ebaec0d2d4b09ca3fa6aed3d9ed4b61e52824c8cc8f5b9ff2dfa58d6ea4/evalscope-1.6.1.tar.gz from build tracker '/tmp/pip-build-tracker-hudghgmg' 2026-04-24T06:57:29,786 Created temporary directory: /tmp/pip-unpack-awjnf100 2026-04-24T06:57:29,787 Building wheels for collected packages: evalscope 2026-04-24T06:57:29,794 Created temporary directory: /tmp/pip-wheel-06f9xic9 2026-04-24T06:57:29,794 Destination directory: /tmp/pip-wheel-06f9xic9 2026-04-24T06:57:29,797 Building wheel for evalscope (pyproject.toml): started 2026-04-24T06:57:29,799 Running command Building wheel for evalscope (pyproject.toml) 2026-04-24T06:57:30,438 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-04-24T06:57:30,438 !! 2026-04-24T06:57:30,439 ******************************************************************************** 2026-04-24T06:57:30,440 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 
2026-04-24T06:57:30,441 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-24T06:57:30,441 or your builds will no longer be supported. 2026-04-24T06:57:30,442 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:30,443 ******************************************************************************** 2026-04-24T06:57:30,444 !! 2026-04-24T06:57:30,444 corresp(dist, value, root_dir) 2026-04-24T06:57:30,518 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-24T06:57:30,519 !! 2026-04-24T06:57:30,520 ******************************************************************************** 2026-04-24T06:57:30,521 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-24T06:57:30,522 License :: OSI Approved :: Apache Software License 2026-04-24T06:57:30,523 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-24T06:57:30,523 ******************************************************************************** 2026-04-24T06:57:30,524 !! 2026-04-24T06:57:30,525 dist._finalize_license_expression() 2026-04-24T06:57:30,527 /tmp/pip-build-env-lg0vhxpk/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-24T06:57:30,527 !! 2026-04-24T06:57:30,528 ******************************************************************************** 2026-04-24T06:57:30,529 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-24T06:57:30,530 License :: OSI Approved :: Apache Software License 2026-04-24T06:57:30,531 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 
2026-04-24T06:57:30,532 ******************************************************************************** 2026-04-24T06:57:30,533 !! 2026-04-24T06:57:30,533 self._finalize_license_expression() 2026-04-24T06:57:30,534 running bdist_wheel 2026-04-24T06:57:30,546 running build 2026-04-24T06:57:30,547 running build_py 2026-04-24T06:57:30,553 creating build/lib/evalscope 2026-04-24T06:57:30,556 copying evalscope/run.py -> build/lib/evalscope 2026-04-24T06:57:30,558 copying evalscope/__init__.py -> build/lib/evalscope 2026-04-24T06:57:30,560 copying evalscope/config.py -> build/lib/evalscope 2026-04-24T06:57:30,563 copying evalscope/version.py -> build/lib/evalscope 2026-04-24T06:57:30,564 copying evalscope/constants.py -> build/lib/evalscope 2026-04-24T06:57:30,567 copying evalscope/arguments.py -> build/lib/evalscope 2026-04-24T06:57:30,569 creating build/lib/evalscope/sandbox 2026-04-24T06:57:30,570 copying evalscope/sandbox/__init__.py -> build/lib/evalscope/sandbox 2026-04-24T06:57:30,572 copying evalscope/sandbox/volcengine.py -> build/lib/evalscope/sandbox 2026-04-24T06:57:30,575 creating build/lib/evalscope/perf 2026-04-24T06:57:30,576 copying evalscope/perf/__init__.py -> build/lib/evalscope/perf 2026-04-24T06:57:30,578 copying evalscope/perf/http_client.py -> build/lib/evalscope/perf 2026-04-24T06:57:30,580 copying evalscope/perf/main.py -> build/lib/evalscope/perf 2026-04-24T06:57:30,582 copying evalscope/perf/benchmark.py -> build/lib/evalscope/perf 2026-04-24T06:57:30,585 copying evalscope/perf/multi_turn_benchmark.py -> build/lib/evalscope/perf 2026-04-24T06:57:30,588 copying evalscope/perf/arguments.py -> build/lib/evalscope/perf 2026-04-24T06:57:30,592 creating build/lib/evalscope/utils 2026-04-24T06:57:30,593 copying evalscope/utils/__init__.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,596 copying evalscope/utils/argument_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,598 copying evalscope/utils/json_schema.py -> 
build/lib/evalscope/utils 2026-04-24T06:57:30,602 copying evalscope/utils/io_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,608 copying evalscope/utils/function_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,612 copying evalscope/utils/model_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,616 copying evalscope/utils/url_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,620 copying evalscope/utils/chat_service.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,624 copying evalscope/utils/resource_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,628 copying evalscope/utils/code_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,631 copying evalscope/utils/multi_choices.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,633 copying evalscope/utils/ner.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,635 copying evalscope/utils/deprecation_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,637 copying evalscope/utils/logger.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,639 copying evalscope/utils/import_utils.py -> build/lib/evalscope/utils 2026-04-24T06:57:30,642 creating build/lib/evalscope/backend 2026-04-24T06:57:30,643 copying evalscope/backend/__init__.py -> build/lib/evalscope/backend 2026-04-24T06:57:30,645 copying evalscope/backend/base.py -> build/lib/evalscope/backend 2026-04-24T06:57:30,647 creating build/lib/evalscope/summarizer 2026-04-24T06:57:30,648 copying evalscope/summarizer/__init__.py -> build/lib/evalscope/summarizer 2026-04-24T06:57:30,650 copying evalscope/summarizer/summarizer.py -> build/lib/evalscope/summarizer 2026-04-24T06:57:30,652 creating build/lib/evalscope/cli 2026-04-24T06:57:30,653 copying evalscope/cli/__init__.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,655 copying evalscope/cli/base.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,657 copying evalscope/cli/start_service.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,659 copying 
evalscope/cli/benchmark_info.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,661 copying evalscope/cli/start_app.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,663 copying evalscope/cli/start_eval.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,665 copying evalscope/cli/cli.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,667 copying evalscope/cli/start_perf.py -> build/lib/evalscope/cli 2026-04-24T06:57:30,669 creating build/lib/evalscope/collections 2026-04-24T06:57:30,670 copying evalscope/collections/__init__.py -> build/lib/evalscope/collections 2026-04-24T06:57:30,672 copying evalscope/collections/schema.py -> build/lib/evalscope/collections 2026-04-24T06:57:30,674 copying evalscope/collections/sampler.py -> build/lib/evalscope/collections 2026-04-24T06:57:30,677 creating build/lib/evalscope/api 2026-04-24T06:57:30,678 copying evalscope/api/__init__.py -> build/lib/evalscope/api 2026-04-24T06:57:30,680 copying evalscope/api/registry.py -> build/lib/evalscope/api 2026-04-24T06:57:30,682 creating build/lib/evalscope/filters 2026-04-24T06:57:30,683 copying evalscope/filters/__init__.py -> build/lib/evalscope/filters 2026-04-24T06:57:30,685 copying evalscope/filters/selection.py -> build/lib/evalscope/filters 2026-04-24T06:57:30,686 copying evalscope/filters/extraction.py -> build/lib/evalscope/filters 2026-04-24T06:57:30,689 creating build/lib/evalscope/metrics 2026-04-24T06:57:30,690 copying evalscope/metrics/__init__.py -> build/lib/evalscope/metrics 2026-04-24T06:57:30,692 copying evalscope/metrics/metrics.py -> build/lib/evalscope/metrics 2026-04-24T06:57:30,694 copying evalscope/metrics/math_parser.py -> build/lib/evalscope/metrics 2026-04-24T06:57:30,696 copying evalscope/metrics/metric.py -> build/lib/evalscope/metrics 2026-04-24T06:57:30,699 copying evalscope/metrics/rouge_metric.py -> build/lib/evalscope/metrics 2026-04-24T06:57:30,701 copying evalscope/metrics/llm_judge.py -> build/lib/evalscope/metrics 2026-04-24T06:57:30,704 creating 
build/lib/evalscope/benchmarks 2026-04-24T06:57:30,705 copying evalscope/benchmarks/__init__.py -> build/lib/evalscope/benchmarks 2026-04-24T06:57:30,707 creating build/lib/evalscope/app 2026-04-24T06:57:30,708 copying evalscope/app/app.py -> build/lib/evalscope/app 2026-04-24T06:57:30,710 copying evalscope/app/__init__.py -> build/lib/evalscope/app 2026-04-24T06:57:30,711 copying evalscope/app/constants.py -> build/lib/evalscope/app 2026-04-24T06:57:30,713 copying evalscope/app/arguments.py -> build/lib/evalscope/app 2026-04-24T06:57:30,715 creating build/lib/evalscope/third_party 2026-04-24T06:57:30,716 copying evalscope/third_party/__init__.py -> build/lib/evalscope/third_party 2026-04-24T06:57:30,719 creating build/lib/evalscope/report 2026-04-24T06:57:30,720 copying evalscope/report/generator.py -> build/lib/evalscope/report 2026-04-24T06:57:30,722 copying evalscope/report/__init__.py -> build/lib/evalscope/report 2026-04-24T06:57:30,724 copying evalscope/report/combinator.py -> build/lib/evalscope/report 2026-04-24T06:57:30,726 copying evalscope/report/renderer.py -> build/lib/evalscope/report 2026-04-24T06:57:30,728 copying evalscope/report/report.py -> build/lib/evalscope/report 2026-04-24T06:57:30,731 creating build/lib/evalscope/service 2026-04-24T06:57:30,732 copying evalscope/service/app.py -> build/lib/evalscope/service 2026-04-24T06:57:30,734 copying evalscope/service/__init__.py -> build/lib/evalscope/service 2026-04-24T06:57:30,736 creating build/lib/evalscope/evaluator 2026-04-24T06:57:30,737 copying evalscope/evaluator/__init__.py -> build/lib/evalscope/evaluator 2026-04-24T06:57:30,739 copying evalscope/evaluator/evaluator.py -> build/lib/evalscope/evaluator 2026-04-24T06:57:30,742 copying evalscope/evaluator/perf_collector.py -> build/lib/evalscope/evaluator 2026-04-24T06:57:30,744 copying evalscope/evaluator/batch_reviewer.py -> build/lib/evalscope/evaluator 2026-04-24T06:57:30,747 creating build/lib/evalscope/models 2026-04-24T06:57:30,748 
copying evalscope/models/model_apis.py -> build/lib/evalscope/models 2026-04-24T06:57:30,750 copying evalscope/models/modelscope.py -> build/lib/evalscope/models 2026-04-24T06:57:30,752 copying evalscope/models/__init__.py -> build/lib/evalscope/models 2026-04-24T06:57:30,758 copying evalscope/models/mockllm.py -> build/lib/evalscope/models 2026-04-24T06:57:30,771 copying evalscope/models/text2image_model.py -> build/lib/evalscope/models 2026-04-24T06:57:30,773 copying evalscope/models/anthropic_compatible.py -> build/lib/evalscope/models 2026-04-24T06:57:30,775 copying evalscope/models/openai_compatible.py -> build/lib/evalscope/models 2026-04-24T06:57:30,778 copying evalscope/models/image_edit_model.py -> build/lib/evalscope/models 2026-04-24T06:57:30,780 creating build/lib/evalscope/perf/utils 2026-04-24T06:57:30,781 copying evalscope/perf/utils/__init__.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,783 copying evalscope/perf/utils/db_util.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,786 copying evalscope/perf/utils/rich_display.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,788 copying evalscope/perf/utils/handler.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,790 copying evalscope/perf/utils/local_server.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,792 copying evalscope/perf/utils/log_utils.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,794 copying evalscope/perf/utils/benchmark_util.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,797 copying evalscope/perf/utils/analysis_result.py -> build/lib/evalscope/perf/utils 2026-04-24T06:57:30,799 creating build/lib/evalscope/perf/plugin 2026-04-24T06:57:30,800 copying evalscope/perf/plugin/__init__.py -> build/lib/evalscope/perf/plugin 2026-04-24T06:57:30,801 copying evalscope/perf/plugin/registry.py -> build/lib/evalscope/perf/plugin 2026-04-24T06:57:30,804 creating build/lib/evalscope/perf/sla 2026-04-24T06:57:30,805 copying 
evalscope/perf/sla/sla_run.py -> build/lib/evalscope/perf/sla 2026-04-24T06:57:30,807 copying evalscope/perf/sla/__init__.py -> build/lib/evalscope/perf/sla 2026-04-24T06:57:30,809 copying evalscope/perf/sla/sla_criterion.py -> build/lib/evalscope/perf/sla 2026-04-24T06:57:30,811 creating build/lib/evalscope/perf/utils/report 2026-04-24T06:57:30,812 copying evalscope/perf/utils/report/__init__.py -> build/lib/evalscope/perf/utils/report 2026-04-24T06:57:30,814 copying evalscope/perf/utils/report/perf_charts.py -> build/lib/evalscope/perf/utils/report 2026-04-24T06:57:30,816 copying evalscope/perf/utils/report/perf_data.py -> build/lib/evalscope/perf/utils/report 2026-04-24T06:57:30,818 copying evalscope/perf/utils/report/generate_report.py -> build/lib/evalscope/perf/utils/report 2026-04-24T06:57:30,821 creating build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,822 copying evalscope/perf/plugin/datasets/custom.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,825 copying evalscope/perf/plugin/datasets/__init__.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,827 copying evalscope/perf/plugin/datasets/base.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,829 copying evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,831 copying evalscope/perf/plugin/datasets/speed_benchmark.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,833 copying evalscope/perf/plugin/datasets/kontext_bench.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,834 copying evalscope/perf/plugin/datasets/embedding_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,837 copying evalscope/perf/plugin/datasets/openqa.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,838 copying evalscope/perf/plugin/datasets/multi_turn.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,841 copying 
evalscope/perf/plugin/datasets/share_gpt.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,843 copying evalscope/perf/plugin/datasets/longalpaca.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,845 copying evalscope/perf/plugin/datasets/random_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,847 copying evalscope/perf/plugin/datasets/rerank_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,849 copying evalscope/perf/plugin/datasets/flickr8k.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,852 copying evalscope/perf/plugin/datasets/line_by_line.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,853 copying evalscope/perf/plugin/datasets/utils.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-24T06:57:30,856 creating build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,857 copying evalscope/perf/plugin/api/__init__.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,859 copying evalscope/perf/plugin/api/base.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,861 copying evalscope/perf/plugin/api/dashscope_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,863 copying evalscope/perf/plugin/api/openai_rerank_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,866 copying evalscope/perf/plugin/api/openai_embedding_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,868 copying evalscope/perf/plugin/api/custom_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,870 copying evalscope/perf/plugin/api/default_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,872 copying evalscope/perf/plugin/api/openai_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-24T06:57:30,875 creating build/lib/evalscope/utils/doc_utils 2026-04-24T06:57:30,876 copying evalscope/utils/doc_utils/benchmark_stats.py -> build/lib/evalscope/utils/doc_utils 2026-04-24T06:57:30,879 
copying evalscope/utils/doc_utils/__init__.py -> build/lib/evalscope/utils/doc_utils 2026-04-24T06:57:30,881 copying evalscope/utils/doc_utils/readme_generator.py -> build/lib/evalscope/utils/doc_utils 2026-04-24T06:57:30,883 copying evalscope/utils/doc_utils/generate_dataset_md.py -> build/lib/evalscope/utils/doc_utils 2026-04-24T06:57:30,885 copying evalscope/utils/doc_utils/translate_description.py -> build/lib/evalscope/utils/doc_utils 2026-04-24T06:57:30,888 creating build/lib/evalscope/utils/tqdm_utils 2026-04-24T06:57:30,889 copying evalscope/utils/tqdm_utils/tqdm_logging.py -> build/lib/evalscope/utils/tqdm_utils 2026-04-24T06:57:30,891 copying evalscope/utils/tqdm_utils/__init__.py -> build/lib/evalscope/utils/tqdm_utils 2026-04-24T06:57:30,893 copying evalscope/utils/tqdm_utils/progress_tracker.py -> build/lib/evalscope/utils/tqdm_utils 2026-04-24T06:57:30,895 creating build/lib/evalscope/backend/opencompass 2026-04-24T06:57:30,896 copying evalscope/backend/opencompass/api_meta_template.py -> build/lib/evalscope/backend/opencompass 2026-04-24T06:57:30,898 copying evalscope/backend/opencompass/__init__.py -> build/lib/evalscope/backend/opencompass 2026-04-24T06:57:30,900 copying evalscope/backend/opencompass/backend_manager.py -> build/lib/evalscope/backend/opencompass 2026-04-24T06:57:30,903 creating build/lib/evalscope/backend/vlm_eval_kit 2026-04-24T06:57:30,904 copying evalscope/backend/vlm_eval_kit/__init__.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-04-24T06:57:30,906 copying evalscope/backend/vlm_eval_kit/backend_manager.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-04-24T06:57:30,909 creating build/lib/evalscope/backend/rag_eval 2026-04-24T06:57:30,910 copying evalscope/backend/rag_eval/__init__.py -> build/lib/evalscope/backend/rag_eval 2026-04-24T06:57:30,912 copying evalscope/backend/rag_eval/backend_manager.py -> build/lib/evalscope/backend/rag_eval 2026-04-24T06:57:30,914 creating build/lib/evalscope/backend/opencompass/tasks 
2026-04-24T06:57:30,915 copying evalscope/backend/opencompass/tasks/__init__.py -> build/lib/evalscope/backend/opencompass/tasks 2026-04-24T06:57:30,917 copying evalscope/backend/opencompass/tasks/eval_api.py -> build/lib/evalscope/backend/opencompass/tasks 2026-04-24T06:57:30,919 copying evalscope/backend/opencompass/tasks/eval_datasets.py -> build/lib/evalscope/backend/opencompass/tasks 2026-04-24T06:57:30,921 creating build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:30,922 copying evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:30,924 copying evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:30,927 copying evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:30,929 copying evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:30,931 creating build/lib/evalscope/backend/rag_eval/utils 2026-04-24T06:57:30,932 copying evalscope/backend/rag_eval/utils/clip.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-24T06:57:30,935 copying evalscope/backend/rag_eval/utils/__init__.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-24T06:57:30,936 copying evalscope/backend/rag_eval/utils/embedding.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-24T06:57:30,938 copying evalscope/backend/rag_eval/utils/tools.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-24T06:57:30,940 copying evalscope/backend/rag_eval/utils/llm.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-24T06:57:30,943 creating build/lib/evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:30,944 copying evalscope/backend/rag_eval/cmteb/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:30,946 copying 
evalscope/backend/rag_eval/cmteb/base.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:30,948 copying evalscope/backend/rag_eval/cmteb/arguments.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:30,949 copying evalscope/backend/rag_eval/cmteb/task_template.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:30,952 creating build/lib/evalscope/backend/rag_eval/ragas 2026-04-24T06:57:30,952 copying evalscope/backend/rag_eval/ragas/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-04-24T06:57:30,954 copying evalscope/backend/rag_eval/ragas/arguments.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-04-24T06:57:30,956 copying evalscope/backend/rag_eval/ragas/task_template.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-04-24T06:57:30,959 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-24T06:57:30,959 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-24T06:57:30,962 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:30,963 copying evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:30,965 copying evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:30,967 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:30,969 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:30,972 creating build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,973 copying evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 
2026-04-24T06:57:30,975 copying evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,977 copying evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,979 copying evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,981 copying evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,984 copying evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,986 copying evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,989 copying evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:30,991 creating build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-04-24T06:57:30,992 copying evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-04-24T06:57:30,995 creating build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:30,996 copying evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:30,998 copying evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:31,000 copying evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:31,002 copying evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:31,004 copying evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 
2026-04-24T06:57:31,006 creating build/lib/evalscope/api/benchmark 2026-04-24T06:57:31,007 copying evalscope/api/benchmark/meta.py -> build/lib/evalscope/api/benchmark 2026-04-24T06:57:31,010 copying evalscope/api/benchmark/__init__.py -> build/lib/evalscope/api/benchmark 2026-04-24T06:57:31,012 copying evalscope/api/benchmark/benchmark.py -> build/lib/evalscope/api/benchmark 2026-04-24T06:57:31,015 copying evalscope/api/benchmark/statistics.py -> build/lib/evalscope/api/benchmark 2026-04-24T06:57:31,018 creating build/lib/evalscope/api/tool 2026-04-24T06:57:31,019 copying evalscope/api/tool/__init__.py -> build/lib/evalscope/api/tool 2026-04-24T06:57:31,021 copying evalscope/api/tool/tool_call.py -> build/lib/evalscope/api/tool 2026-04-24T06:57:31,024 copying evalscope/api/tool/tool_info.py -> build/lib/evalscope/api/tool 2026-04-24T06:57:31,026 copying evalscope/api/tool/utils.py -> build/lib/evalscope/api/tool 2026-04-24T06:57:31,030 creating build/lib/evalscope/api/messages 2026-04-24T06:57:31,031 copying evalscope/api/messages/__init__.py -> build/lib/evalscope/api/messages 2026-04-24T06:57:31,034 copying evalscope/api/messages/content.py -> build/lib/evalscope/api/messages 2026-04-24T06:57:31,036 copying evalscope/api/messages/chat_message.py -> build/lib/evalscope/api/messages 2026-04-24T06:57:31,039 copying evalscope/api/messages/utils.py -> build/lib/evalscope/api/messages 2026-04-24T06:57:31,042 creating build/lib/evalscope/api/metric 2026-04-24T06:57:31,043 copying evalscope/api/metric/__init__.py -> build/lib/evalscope/api/metric 2026-04-24T06:57:31,045 copying evalscope/api/metric/scorer.py -> build/lib/evalscope/api/metric 2026-04-24T06:57:31,048 copying evalscope/api/metric/metric.py -> build/lib/evalscope/api/metric 2026-04-24T06:57:31,051 creating build/lib/evalscope/api/model 2026-04-24T06:57:31,052 copying evalscope/api/model/__init__.py -> build/lib/evalscope/api/model 2026-04-24T06:57:31,054 copying evalscope/api/model/model_output.py -> 
build/lib/evalscope/api/model 2026-04-24T06:57:31,057 copying evalscope/api/model/generate_config.py -> build/lib/evalscope/api/model 2026-04-24T06:57:31,059 copying evalscope/api/model/model.py -> build/lib/evalscope/api/model 2026-04-24T06:57:31,062 copying evalscope/api/model/perf_metrics.py -> build/lib/evalscope/api/model 2026-04-24T06:57:31,065 copying evalscope/api/model/lazy_model.py -> build/lib/evalscope/api/model 2026-04-24T06:57:31,069 creating build/lib/evalscope/api/filter 2026-04-24T06:57:31,070 copying evalscope/api/filter/__init__.py -> build/lib/evalscope/api/filter 2026-04-24T06:57:31,072 copying evalscope/api/filter/filter.py -> build/lib/evalscope/api/filter 2026-04-24T06:57:31,075 creating build/lib/evalscope/api/mixin 2026-04-24T06:57:31,076 copying evalscope/api/mixin/__init__.py -> build/lib/evalscope/api/mixin 2026-04-24T06:57:31,078 copying evalscope/api/mixin/sandbox_mixin.py -> build/lib/evalscope/api/mixin 2026-04-24T06:57:31,080 copying evalscope/api/mixin/llm_judge_mixin.py -> build/lib/evalscope/api/mixin 2026-04-24T06:57:31,083 creating build/lib/evalscope/api/evaluator 2026-04-24T06:57:31,084 copying evalscope/api/evaluator/__init__.py -> build/lib/evalscope/api/evaluator 2026-04-24T06:57:31,086 copying evalscope/api/evaluator/evaluator.py -> build/lib/evalscope/api/evaluator 2026-04-24T06:57:31,088 copying evalscope/api/evaluator/state.py -> build/lib/evalscope/api/evaluator 2026-04-24T06:57:31,090 copying evalscope/api/evaluator/cache.py -> build/lib/evalscope/api/evaluator 2026-04-24T06:57:31,093 creating build/lib/evalscope/api/dataset 2026-04-24T06:57:31,094 copying evalscope/api/dataset/__init__.py -> build/lib/evalscope/api/dataset 2026-04-24T06:57:31,096 copying evalscope/api/dataset/dataset.py -> build/lib/evalscope/api/dataset 2026-04-24T06:57:31,098 copying evalscope/api/dataset/loader.py -> build/lib/evalscope/api/dataset 2026-04-24T06:57:31,101 copying evalscope/api/dataset/utils.py -> build/lib/evalscope/api/dataset 
2026-04-24T06:57:31,103 creating build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,104 copying evalscope/api/benchmark/adapters/ner_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,107 copying evalscope/api/benchmark/adapters/__init__.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,109 copying evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,110 copying evalscope/api/benchmark/adapters/default_data_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,113 copying evalscope/api/benchmark/adapters/agent_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,115 copying evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,117 copying evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,119 copying evalscope/api/benchmark/adapters/text2image_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-24T06:57:31,121 creating build/lib/evalscope/metrics/bundled_rouge_score 2026-04-24T06:57:31,122 copying evalscope/metrics/bundled_rouge_score/__init__.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-04-24T06:57:31,125 copying evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-04-24T06:57:31,128 creating build/lib/evalscope/metrics/bert_score 2026-04-24T06:57:31,129 copying evalscope/metrics/bert_score/__init__.py -> build/lib/evalscope/metrics/bert_score 2026-04-24T06:57:31,130 copying evalscope/metrics/bert_score/scorer.py -> build/lib/evalscope/metrics/bert_score 2026-04-24T06:57:31,133 copying evalscope/metrics/bert_score/utils.py -> build/lib/evalscope/metrics/bert_score 2026-04-24T06:57:31,136 creating build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,137 copying 
evalscope/metrics/t2v_metrics/score.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,140 copying evalscope/metrics/t2v_metrics/itmscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,141 copying evalscope/metrics/t2v_metrics/__init__.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,143 copying evalscope/metrics/t2v_metrics/vqascore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,145 copying evalscope/metrics/t2v_metrics/clipscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,147 copying evalscope/metrics/t2v_metrics/constants.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-24T06:57:31,149 creating build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:31,151 copying evalscope/metrics/text_normalizer/chinese.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:31,154 copying evalscope/metrics/text_normalizer/__init__.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:31,156 copying evalscope/metrics/text_normalizer/wer.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:31,158 copying evalscope/metrics/text_normalizer/basic.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:31,160 copying evalscope/metrics/text_normalizer/english.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:31,163 creating build/lib/evalscope/metrics/sem_score 2026-04-24T06:57:31,164 copying evalscope/metrics/sem_score/__init__.py -> build/lib/evalscope/metrics/sem_score 2026-04-24T06:57:31,166 copying evalscope/metrics/sem_score/scorer.py -> build/lib/evalscope/metrics/sem_score 2026-04-24T06:57:31,168 creating build/lib/evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:31,169 copying evalscope/metrics/t2v_metrics/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:31,171 copying evalscope/metrics/t2v_metrics/models/model.py -> build/lib/evalscope/metrics/t2v_metrics/models 
2026-04-24T06:57:31,173 copying evalscope/metrics/t2v_metrics/models/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:31,175 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:31,176 copying evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:31,178 copying evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:31,180 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:31,182 copying evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:31,184 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:31,185 copying evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:31,187 copying evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:31,189 copying evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:31,191 copying evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:31,193 copying evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:31,195 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:31,196 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:31,198 copying evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:31,200 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:31,203 copying evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:31,205 copying evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:31,207 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:31,208 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:31,209 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:31,211 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:31,214 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:31,215 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:31,217 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 
2026-04-24T06:57:31,219 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:31,221 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:31,223 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-24T06:57:31,224 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-24T06:57:31,226 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-24T06:57:31,227 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-24T06:57:31,230 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-24T06:57:31,231 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-24T06:57:31,233 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-24T06:57:31,234 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-24T06:57:31,237 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-24T06:57:31,238 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 
2026-04-24T06:57:31,240 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-24T06:57:31,243 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-24T06:57:31,245 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-24T06:57:31,248 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:31,250 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:31,253 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:31,255 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:31,258 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:31,262 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,263 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,266 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,269 
copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,271 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,274 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,276 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,279 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:31,283 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,284 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,287 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,290 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,293 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,296 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,300 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:31,304 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:31,306 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:31,308 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:31,311 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:31,315 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,317 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,320 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,322 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,325 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,327 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,331 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,334 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,338 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,341 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,345 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:31,348 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,349 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,351 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,353 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,355 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,358 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,360 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,363 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,365 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,367 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,369 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,371 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:31,374 creating build/lib/evalscope/benchmarks/ocr_bench 2026-04-24T06:57:31,375 copying evalscope/benchmarks/ocr_bench/__init__.py -> 
build/lib/evalscope/benchmarks/ocr_bench 2026-04-24T06:57:31,377 creating build/lib/evalscope/benchmarks/general_vmcq 2026-04-24T06:57:31,378 copying evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-04-24T06:57:31,381 copying evalscope/benchmarks/general_vmcq/__init__.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-04-24T06:57:31,382 creating build/lib/evalscope/benchmarks/mmmlu 2026-04-24T06:57:31,383 copying evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/lib/evalscope/benchmarks/mmmlu 2026-04-24T06:57:31,385 copying evalscope/benchmarks/mmmlu/__init__.py -> build/lib/evalscope/benchmarks/mmmlu 2026-04-24T06:57:31,387 copying evalscope/benchmarks/mmmlu/prompt.py -> build/lib/evalscope/benchmarks/mmmlu 2026-04-24T06:57:31,389 creating build/lib/evalscope/benchmarks/mmmu_pro 2026-04-24T06:57:31,390 copying evalscope/benchmarks/mmmu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-04-24T06:57:31,392 copying evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-04-24T06:57:31,394 creating build/lib/evalscope/benchmarks/chinese_simple_qa 2026-04-24T06:57:31,396 copying evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-04-24T06:57:31,397 copying evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-04-24T06:57:31,400 creating build/lib/evalscope/benchmarks/simple_qa 2026-04-24T06:57:31,401 copying evalscope/benchmarks/simple_qa/__init__.py -> build/lib/evalscope/benchmarks/simple_qa 2026-04-24T06:57:31,403 copying evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/lib/evalscope/benchmarks/simple_qa 2026-04-24T06:57:31,406 creating build/lib/evalscope/benchmarks/text2image 2026-04-24T06:57:31,407 copying evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/lib/evalscope/benchmarks/text2image 
2026-04-24T06:57:31,410 copying evalscope/benchmarks/text2image/tifa_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-24T06:57:31,412 copying evalscope/benchmarks/text2image/__init__.py -> build/lib/evalscope/benchmarks/text2image 2026-04-24T06:57:31,414 copying evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-24T06:57:31,416 copying evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-24T06:57:31,418 copying evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-24T06:57:31,421 creating build/lib/evalscope/benchmarks/minerva_math 2026-04-24T06:57:31,422 copying evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/lib/evalscope/benchmarks/minerva_math 2026-04-24T06:57:31,425 copying evalscope/benchmarks/minerva_math/__init__.py -> build/lib/evalscope/benchmarks/minerva_math 2026-04-24T06:57:31,427 creating build/lib/evalscope/benchmarks/chartqa 2026-04-24T06:57:31,429 copying evalscope/benchmarks/chartqa/__init__.py -> build/lib/evalscope/benchmarks/chartqa 2026-04-24T06:57:31,430 copying evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/lib/evalscope/benchmarks/chartqa 2026-04-24T06:57:31,433 copying evalscope/benchmarks/chartqa/utils.py -> build/lib/evalscope/benchmarks/chartqa 2026-04-24T06:57:31,436 creating build/lib/evalscope/benchmarks/qasc 2026-04-24T06:57:31,437 copying evalscope/benchmarks/qasc/__init__.py -> build/lib/evalscope/benchmarks/qasc 2026-04-24T06:57:31,439 copying evalscope/benchmarks/qasc/qasc_adapter.py -> build/lib/evalscope/benchmarks/qasc 2026-04-24T06:57:31,443 creating build/lib/evalscope/benchmarks/arena_hard 2026-04-24T06:57:31,444 copying evalscope/benchmarks/arena_hard/__init__.py -> build/lib/evalscope/benchmarks/arena_hard 2026-04-24T06:57:31,446 copying evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> 
build/lib/evalscope/benchmarks/arena_hard 2026-04-24T06:57:31,449 copying evalscope/benchmarks/arena_hard/utils.py -> build/lib/evalscope/benchmarks/arena_hard 2026-04-24T06:57:31,452 creating build/lib/evalscope/benchmarks/logi_qa 2026-04-24T06:57:31,454 copying evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/lib/evalscope/benchmarks/logi_qa 2026-04-24T06:57:31,457 copying evalscope/benchmarks/logi_qa/__int__.py -> build/lib/evalscope/benchmarks/logi_qa 2026-04-24T06:57:31,460 creating build/lib/evalscope/benchmarks/truthful_qa 2026-04-24T06:57:31,462 copying evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-04-24T06:57:31,466 copying evalscope/benchmarks/truthful_qa/__init__.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-04-24T06:57:31,470 creating build/lib/evalscope/benchmarks/docvqa 2026-04-24T06:57:31,471 copying evalscope/benchmarks/docvqa/__init__.py -> build/lib/evalscope/benchmarks/docvqa 2026-04-24T06:57:31,475 copying evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/lib/evalscope/benchmarks/docvqa 2026-04-24T06:57:31,480 creating build/lib/evalscope/benchmarks/multi_if 2026-04-24T06:57:31,482 copying evalscope/benchmarks/multi_if/__init__.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-24T06:57:31,484 copying evalscope/benchmarks/multi_if/metrics.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-24T06:57:31,487 copying evalscope/benchmarks/multi_if/ifeval.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-24T06:57:31,491 copying evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-24T06:57:31,494 creating build/lib/evalscope/benchmarks/simple_vqa 2026-04-24T06:57:31,495 copying evalscope/benchmarks/simple_vqa/__init__.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-04-24T06:57:31,497 copying evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/lib/evalscope/benchmarks/simple_vqa 
2026-04-24T06:57:31,500 creating build/lib/evalscope/benchmarks/humaneval 2026-04-24T06:57:31,502 copying evalscope/benchmarks/humaneval/__init__.py -> build/lib/evalscope/benchmarks/humaneval 2026-04-24T06:57:31,504 copying evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/lib/evalscope/benchmarks/humaneval 2026-04-24T06:57:31,507 copying evalscope/benchmarks/humaneval/utils.py -> build/lib/evalscope/benchmarks/humaneval 2026-04-24T06:57:31,510 creating build/lib/evalscope/benchmarks/cmmlu 2026-04-24T06:57:31,511 copying evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/lib/evalscope/benchmarks/cmmlu 2026-04-24T06:57:31,514 copying evalscope/benchmarks/cmmlu/__init__.py -> build/lib/evalscope/benchmarks/cmmlu 2026-04-24T06:57:31,517 creating build/lib/evalscope/benchmarks/sciq 2026-04-24T06:57:31,520 copying evalscope/benchmarks/sciq/sciq_adapter.py -> build/lib/evalscope/benchmarks/sciq 2026-04-24T06:57:31,522 copying evalscope/benchmarks/sciq/__init__.py -> build/lib/evalscope/benchmarks/sciq 2026-04-24T06:57:31,524 creating build/lib/evalscope/benchmarks/general_fc 2026-04-24T06:57:31,525 copying evalscope/benchmarks/general_fc/__init__.py -> build/lib/evalscope/benchmarks/general_fc 2026-04-24T06:57:31,527 copying evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/lib/evalscope/benchmarks/general_fc 2026-04-24T06:57:31,530 creating build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:31,531 copying evalscope/benchmarks/omnidoc_bench/__init__.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:31,533 copying evalscope/benchmarks/omnidoc_bench/metrics.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:31,536 copying evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:31,539 copying evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:31,541 copying 
evalscope/benchmarks/omnidoc_bench/utils.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:31,544 creating build/lib/evalscope/benchmarks/a_okvqa 2026-04-24T06:57:31,545 copying evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-04-24T06:57:31,547 copying evalscope/benchmarks/a_okvqa/__init__.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-04-24T06:57:31,549 creating build/lib/evalscope/benchmarks/eq_bench 2026-04-24T06:57:31,550 copying evalscope/benchmarks/eq_bench/__init__.py -> build/lib/evalscope/benchmarks/eq_bench 2026-04-24T06:57:31,553 copying evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/lib/evalscope/benchmarks/eq_bench 2026-04-24T06:57:31,556 copying evalscope/benchmarks/eq_bench/answer_validation.py -> build/lib/evalscope/benchmarks/eq_bench 2026-04-24T06:57:31,559 creating build/lib/evalscope/benchmarks/mia_bench 2026-04-24T06:57:31,560 copying evalscope/benchmarks/mia_bench/__init__.py -> build/lib/evalscope/benchmarks/mia_bench 2026-04-24T06:57:31,561 copying evalscope/benchmarks/mia_bench/mia_bench_adapter.py -> build/lib/evalscope/benchmarks/mia_bench 2026-04-24T06:57:31,564 copying evalscope/benchmarks/mia_bench/utils.py -> build/lib/evalscope/benchmarks/mia_bench 2026-04-24T06:57:31,566 creating build/lib/evalscope/benchmarks/gsm8k_v 2026-04-24T06:57:31,568 copying evalscope/benchmarks/gsm8k_v/__init__.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-04-24T06:57:31,569 copying evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-04-24T06:57:31,572 creating build/lib/evalscope/benchmarks/hellaswag 2026-04-24T06:57:31,573 copying evalscope/benchmarks/hellaswag/__init__.py -> build/lib/evalscope/benchmarks/hellaswag 2026-04-24T06:57:31,575 copying evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/lib/evalscope/benchmarks/hellaswag 2026-04-24T06:57:31,578 creating build/lib/evalscope/benchmarks/siqa 
2026-04-24T06:57:31,579 copying evalscope/benchmarks/siqa/__init__.py -> build/lib/evalscope/benchmarks/siqa 2026-04-24T06:57:31,580 copying evalscope/benchmarks/siqa/siqa_adapter.py -> build/lib/evalscope/benchmarks/siqa 2026-04-24T06:57:31,583 creating build/lib/evalscope/benchmarks/biomix_qa 2026-04-24T06:57:31,584 copying evalscope/benchmarks/biomix_qa/__init__.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-04-24T06:57:31,586 copying evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-04-24T06:57:31,589 creating build/lib/evalscope/benchmarks/piqa 2026-04-24T06:57:31,590 copying evalscope/benchmarks/piqa/__init__.py -> build/lib/evalscope/benchmarks/piqa 2026-04-24T06:57:31,591 copying evalscope/benchmarks/piqa/piqa_adapter.py -> build/lib/evalscope/benchmarks/piqa 2026-04-24T06:57:31,594 creating build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,595 copying evalscope/benchmarks/ifeval/instructions.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,599 copying evalscope/benchmarks/ifeval/instructions_util.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,602 copying evalscope/benchmarks/ifeval/__init__.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,603 copying evalscope/benchmarks/ifeval/instructions_registry.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,606 copying evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,608 copying evalscope/benchmarks/ifeval/utils.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:31,611 creating build/lib/evalscope/benchmarks/mmlu 2026-04-24T06:57:31,612 copying evalscope/benchmarks/mmlu/__init__.py -> build/lib/evalscope/benchmarks/mmlu 2026-04-24T06:57:31,615 copying evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/lib/evalscope/benchmarks/mmlu 2026-04-24T06:57:31,617 creating build/lib/evalscope/benchmarks/swe_bench 
2026-04-24T06:57:31,618 copying evalscope/benchmarks/swe_bench/build_images.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-24T06:57:31,621 copying evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-24T06:57:31,623 copying evalscope/benchmarks/swe_bench/__init__.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-24T06:57:31,625 copying evalscope/benchmarks/swe_bench/utils.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-24T06:57:31,628 creating build/lib/evalscope/benchmarks/math_qa 2026-04-24T06:57:31,629 copying evalscope/benchmarks/math_qa/__init__.py -> build/lib/evalscope/benchmarks/math_qa 2026-04-24T06:57:31,631 copying evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/lib/evalscope/benchmarks/math_qa 2026-04-24T06:57:31,633 creating build/lib/evalscope/benchmarks/math_vision 2026-04-24T06:57:31,634 copying evalscope/benchmarks/math_vision/__init__.py -> build/lib/evalscope/benchmarks/math_vision 2026-04-24T06:57:31,636 copying evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/lib/evalscope/benchmarks/math_vision 2026-04-24T06:57:31,639 creating build/lib/evalscope/benchmarks/docmath 2026-04-24T06:57:31,640 copying evalscope/benchmarks/docmath/__init__.py -> build/lib/evalscope/benchmarks/docmath 2026-04-24T06:57:31,641 copying evalscope/benchmarks/docmath/docmath_adapter.py -> build/lib/evalscope/benchmarks/docmath 2026-04-24T06:57:31,644 copying evalscope/benchmarks/docmath/utils.py -> build/lib/evalscope/benchmarks/docmath 2026-04-24T06:57:31,647 creating build/lib/evalscope/benchmarks/omni_bench 2026-04-24T06:57:31,648 copying evalscope/benchmarks/omni_bench/__init__.py -> build/lib/evalscope/benchmarks/omni_bench 2026-04-24T06:57:31,649 copying evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/lib/evalscope/benchmarks/omni_bench 2026-04-24T06:57:31,652 creating build/lib/evalscope/benchmarks/fleurs 2026-04-24T06:57:31,653 copying 
evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/lib/evalscope/benchmarks/fleurs 2026-04-24T06:57:31,656 copying evalscope/benchmarks/fleurs/__init__.py -> build/lib/evalscope/benchmarks/fleurs 2026-04-24T06:57:31,658 creating build/lib/evalscope/benchmarks/mm_star 2026-04-24T06:57:31,659 copying evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/lib/evalscope/benchmarks/mm_star 2026-04-24T06:57:31,661 copying evalscope/benchmarks/mm_star/__init__.py -> build/lib/evalscope/benchmarks/mm_star 2026-04-24T06:57:31,663 creating build/lib/evalscope/benchmarks/pope 2026-04-24T06:57:31,664 copying evalscope/benchmarks/pope/__init__.py -> build/lib/evalscope/benchmarks/pope 2026-04-24T06:57:31,665 copying evalscope/benchmarks/pope/pope_adapter.py -> build/lib/evalscope/benchmarks/pope 2026-04-24T06:57:31,668 creating build/lib/evalscope/benchmarks/scicode 2026-04-24T06:57:31,669 copying evalscope/benchmarks/scicode/__init__.py -> build/lib/evalscope/benchmarks/scicode 2026-04-24T06:57:31,671 copying evalscope/benchmarks/scicode/util.py -> build/lib/evalscope/benchmarks/scicode 2026-04-24T06:57:31,673 copying evalscope/benchmarks/scicode/scicode_adapter.py -> build/lib/evalscope/benchmarks/scicode 2026-04-24T06:57:31,675 copying evalscope/benchmarks/scicode/prompt_templates.py -> build/lib/evalscope/benchmarks/scicode 2026-04-24T06:57:31,678 creating build/lib/evalscope/benchmarks/mm_bench 2026-04-24T06:57:31,679 copying evalscope/benchmarks/mm_bench/__init__.py -> build/lib/evalscope/benchmarks/mm_bench 2026-04-24T06:57:31,681 copying evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/lib/evalscope/benchmarks/mm_bench 2026-04-24T06:57:31,684 creating build/lib/evalscope/benchmarks/drop 2026-04-24T06:57:31,685 copying evalscope/benchmarks/drop/drop_adapter.py -> build/lib/evalscope/benchmarks/drop 2026-04-24T06:57:31,688 copying evalscope/benchmarks/drop/__init__.py -> build/lib/evalscope/benchmarks/drop 2026-04-24T06:57:31,690 copying 
evalscope/benchmarks/drop/utils.py -> build/lib/evalscope/benchmarks/drop 2026-04-24T06:57:31,693 creating build/lib/evalscope/benchmarks/general_arena 2026-04-24T06:57:31,694 copying evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/lib/evalscope/benchmarks/general_arena 2026-04-24T06:57:31,697 copying evalscope/benchmarks/general_arena/__init__.py -> build/lib/evalscope/benchmarks/general_arena 2026-04-24T06:57:31,698 copying evalscope/benchmarks/general_arena/utils.py -> build/lib/evalscope/benchmarks/general_arena 2026-04-24T06:57:31,701 creating build/lib/evalscope/benchmarks/tir_bench 2026-04-24T06:57:31,702 copying evalscope/benchmarks/tir_bench/__init__.py -> build/lib/evalscope/benchmarks/tir_bench 2026-04-24T06:57:31,704 copying evalscope/benchmarks/tir_bench/tir_bench_adapter.py -> build/lib/evalscope/benchmarks/tir_bench 2026-04-24T06:57:31,706 copying evalscope/benchmarks/tir_bench/utils.py -> build/lib/evalscope/benchmarks/tir_bench 2026-04-24T06:57:31,709 creating build/lib/evalscope/benchmarks/tau_bench 2026-04-24T06:57:31,710 copying evalscope/benchmarks/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench 2026-04-24T06:57:31,713 creating build/lib/evalscope/benchmarks/visu_logic 2026-04-24T06:57:31,714 copying evalscope/benchmarks/visu_logic/__init__.py -> build/lib/evalscope/benchmarks/visu_logic 2026-04-24T06:57:31,715 copying evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/lib/evalscope/benchmarks/visu_logic 2026-04-24T06:57:31,718 creating build/lib/evalscope/benchmarks/arc 2026-04-24T06:57:31,719 copying evalscope/benchmarks/arc/arc_adapter.py -> build/lib/evalscope/benchmarks/arc 2026-04-24T06:57:31,721 copying evalscope/benchmarks/arc/__init__.py -> build/lib/evalscope/benchmarks/arc 2026-04-24T06:57:31,724 creating build/lib/evalscope/benchmarks/maritime_bench 2026-04-24T06:57:31,725 copying evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> 
build/lib/evalscope/benchmarks/maritime_bench 2026-04-24T06:57:31,727 copying evalscope/benchmarks/maritime_bench/__init__.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-04-24T06:57:31,729 creating build/lib/evalscope/benchmarks/musr 2026-04-24T06:57:31,730 copying evalscope/benchmarks/musr/__init__.py -> build/lib/evalscope/benchmarks/musr 2026-04-24T06:57:31,732 copying evalscope/benchmarks/musr/musr_adapter.py -> build/lib/evalscope/benchmarks/musr 2026-04-24T06:57:31,735 creating build/lib/evalscope/benchmarks/drivelology 2026-04-24T06:57:31,736 copying evalscope/benchmarks/drivelology/__init__.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-24T06:57:31,738 copying evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-24T06:57:31,740 copying evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-24T06:57:31,742 copying evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-24T06:57:31,744 copying evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-24T06:57:31,748 creating build/lib/evalscope/benchmarks/blink 2026-04-24T06:57:31,749 copying evalscope/benchmarks/blink/__init__.py -> build/lib/evalscope/benchmarks/blink 2026-04-24T06:57:31,750 copying evalscope/benchmarks/blink/blink_adapter.py -> build/lib/evalscope/benchmarks/blink 2026-04-24T06:57:31,753 creating build/lib/evalscope/benchmarks/commonsense_qa 2026-04-24T06:57:31,754 copying evalscope/benchmarks/commonsense_qa/__init__.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-04-24T06:57:31,756 copying evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-04-24T06:57:31,759 creating build/lib/evalscope/benchmarks/process_bench 2026-04-24T06:57:31,760 copying 
evalscope/benchmarks/process_bench/__init__.py -> build/lib/evalscope/benchmarks/process_bench 2026-04-24T06:57:31,762 copying evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/lib/evalscope/benchmarks/process_bench 2026-04-24T06:57:31,765 creating build/lib/evalscope/benchmarks/mgsm 2026-04-24T06:57:31,766 copying evalscope/benchmarks/mgsm/__init__.py -> build/lib/evalscope/benchmarks/mgsm 2026-04-24T06:57:31,767 copying evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/lib/evalscope/benchmarks/mgsm 2026-04-24T06:57:31,770 creating build/lib/evalscope/benchmarks/pumed_qa 2026-04-24T06:57:31,771 copying evalscope/benchmarks/pumed_qa/__init__.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-04-24T06:57:31,772 copying evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-04-24T06:57:31,776 creating build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,777 copying evalscope/benchmarks/ner/conll2003_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,779 copying evalscope/benchmarks/ner/genia_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,781 copying evalscope/benchmarks/ner/__init__.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,783 copying evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,784 copying evalscope/benchmarks/ner/wnut2017_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,786 copying evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,788 copying evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,790 copying evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,792 copying evalscope/benchmarks/ner/copious_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,795 copying 
evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,797 copying evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,799 copying evalscope/benchmarks/ner/cross_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,801 copying evalscope/benchmarks/ner/ncbi_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,803 copying evalscope/benchmarks/ner/fin_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,805 copying evalscope/benchmarks/ner/anat_em_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,807 copying evalscope/benchmarks/ner/conllpp_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,809 copying evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,811 copying evalscope/benchmarks/ner/bc2gm_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,813 copying evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,816 copying evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,818 copying evalscope/benchmarks/ner/jnlpba_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,820 copying evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,822 copying evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-24T06:57:31,824 creating build/lib/evalscope/benchmarks/tool_bench 2026-04-24T06:57:31,825 copying evalscope/benchmarks/tool_bench/__init__.py -> build/lib/evalscope/benchmarks/tool_bench 2026-04-24T06:57:31,827 copying evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/lib/evalscope/benchmarks/tool_bench 2026-04-24T06:57:31,829 copying evalscope/benchmarks/tool_bench/utils.py -> 
build/lib/evalscope/benchmarks/tool_bench 2026-04-24T06:57:31,832 creating build/lib/evalscope/benchmarks/amc 2026-04-24T06:57:31,833 copying evalscope/benchmarks/amc/__init__.py -> build/lib/evalscope/benchmarks/amc 2026-04-24T06:57:31,835 copying evalscope/benchmarks/amc/amc_adapter.py -> build/lib/evalscope/benchmarks/amc 2026-04-24T06:57:31,837 creating build/lib/evalscope/benchmarks/science_qa 2026-04-24T06:57:31,838 copying evalscope/benchmarks/science_qa/__init__.py -> build/lib/evalscope/benchmarks/science_qa 2026-04-24T06:57:31,841 copying evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/lib/evalscope/benchmarks/science_qa 2026-04-24T06:57:31,843 creating build/lib/evalscope/benchmarks/competition_math 2026-04-24T06:57:31,844 copying evalscope/benchmarks/competition_math/__init__.py -> build/lib/evalscope/benchmarks/competition_math 2026-04-24T06:57:31,846 copying evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/lib/evalscope/benchmarks/competition_math 2026-04-24T06:57:31,849 creating build/lib/evalscope/benchmarks/mbpp 2026-04-24T06:57:31,850 copying evalscope/benchmarks/mbpp/__init__.py -> build/lib/evalscope/benchmarks/mbpp 2026-04-24T06:57:31,852 copying evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/lib/evalscope/benchmarks/mbpp 2026-04-24T06:57:31,855 creating build/lib/evalscope/benchmarks/infovqa 2026-04-24T06:57:31,856 copying evalscope/benchmarks/infovqa/__init__.py -> build/lib/evalscope/benchmarks/infovqa 2026-04-24T06:57:31,857 copying evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/lib/evalscope/benchmarks/infovqa 2026-04-24T06:57:31,860 creating build/lib/evalscope/benchmarks/aime 2026-04-24T06:57:31,861 copying evalscope/benchmarks/aime/math_normalize.py -> build/lib/evalscope/benchmarks/aime 2026-04-24T06:57:31,863 copying evalscope/benchmarks/aime/__init__.py -> build/lib/evalscope/benchmarks/aime 2026-04-24T06:57:31,865 copying evalscope/benchmarks/aime/grader.py -> 
build/lib/evalscope/benchmarks/aime 2026-04-24T06:57:31,867 copying evalscope/benchmarks/aime/aime_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-04-24T06:57:31,870 creating build/lib/evalscope/benchmarks/super_gpqa 2026-04-24T06:57:31,871 copying evalscope/benchmarks/super_gpqa/__init__.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-24T06:57:31,873 copying evalscope/benchmarks/super_gpqa/prompt.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-24T06:57:31,876 copying evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-24T06:57:31,878 copying evalscope/benchmarks/super_gpqa/utils.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-24T06:57:31,880 creating build/lib/evalscope/benchmarks/music_trivia 2026-04-24T06:57:31,881 copying evalscope/benchmarks/music_trivia/__init__.py -> build/lib/evalscope/benchmarks/music_trivia 2026-04-24T06:57:31,883 copying evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/lib/evalscope/benchmarks/music_trivia 2026-04-24T06:57:31,886 creating build/lib/evalscope/benchmarks/mmmu 2026-04-24T06:57:31,887 copying evalscope/benchmarks/mmmu/__init__.py -> build/lib/evalscope/benchmarks/mmmu 2026-04-24T06:57:31,889 copying evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/lib/evalscope/benchmarks/mmmu 2026-04-24T06:57:31,891 creating build/lib/evalscope/benchmarks/longbench_v2 2026-04-24T06:57:31,893 copying evalscope/benchmarks/longbench_v2/__init__.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-04-24T06:57:31,894 copying evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-04-24T06:57:31,897 creating build/lib/evalscope/benchmarks/zerobench 2026-04-24T06:57:31,898 copying evalscope/benchmarks/zerobench/__init__.py -> build/lib/evalscope/benchmarks/zerobench 2026-04-24T06:57:31,900 copying evalscope/benchmarks/zerobench/zerobench_adapter.py -> 
build/lib/evalscope/benchmarks/zerobench 2026-04-24T06:57:31,903 creating build/lib/evalscope/benchmarks/hallusion_bench 2026-04-24T06:57:31,904 copying evalscope/benchmarks/hallusion_bench/__init__.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-04-24T06:57:31,906 copying evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-04-24T06:57:31,909 creating build/lib/evalscope/benchmarks/math_verse 2026-04-24T06:57:31,910 copying evalscope/benchmarks/math_verse/__init__.py -> build/lib/evalscope/benchmarks/math_verse 2026-04-24T06:57:31,912 copying evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/lib/evalscope/benchmarks/math_verse 2026-04-24T06:57:31,915 creating build/lib/evalscope/benchmarks/librispeech 2026-04-24T06:57:31,916 copying evalscope/benchmarks/librispeech/__init__.py -> build/lib/evalscope/benchmarks/librispeech 2026-04-24T06:57:31,919 copying evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/lib/evalscope/benchmarks/librispeech 2026-04-24T06:57:31,922 creating build/lib/evalscope/benchmarks/math_vista 2026-04-24T06:57:31,923 copying evalscope/benchmarks/math_vista/__init__.py -> build/lib/evalscope/benchmarks/math_vista 2026-04-24T06:57:31,925 copying evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/lib/evalscope/benchmarks/math_vista 2026-04-24T06:57:31,929 creating build/lib/evalscope/benchmarks/general_mcq 2026-04-24T06:57:31,930 copying evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/lib/evalscope/benchmarks/general_mcq 2026-04-24T06:57:31,933 copying evalscope/benchmarks/general_mcq/__init__.py -> build/lib/evalscope/benchmarks/general_mcq 2026-04-24T06:57:31,935 creating build/lib/evalscope/benchmarks/med_mcqa 2026-04-24T06:57:31,937 copying evalscope/benchmarks/med_mcqa/__init__.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-04-24T06:57:31,939 copying evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> 
build/lib/evalscope/benchmarks/med_mcqa 2026-04-24T06:57:31,942 creating build/lib/evalscope/benchmarks/hle 2026-04-24T06:57:31,944 copying evalscope/benchmarks/hle/hle_adapter.py -> build/lib/evalscope/benchmarks/hle 2026-04-24T06:57:31,948 copying evalscope/benchmarks/hle/__init__.py -> build/lib/evalscope/benchmarks/hle 2026-04-24T06:57:31,950 creating build/lib/evalscope/benchmarks/cmmmu 2026-04-24T06:57:31,951 copying evalscope/benchmarks/cmmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmmu 2026-04-24T06:57:31,954 copying evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmmu 2026-04-24T06:57:31,957 copying evalscope/benchmarks/cmmmu/utils.py -> build/lib/evalscope/benchmarks/cmmmu 2026-04-24T06:57:31,961 creating build/lib/evalscope/benchmarks/hmmt25 2026-04-24T06:57:31,963 copying evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/lib/evalscope/benchmarks/hmmt25 2026-04-24T06:57:31,966 creating build/lib/evalscope/benchmarks/vstar_bench 2026-04-24T06:57:31,967 copying evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-04-24T06:57:31,970 copying evalscope/benchmarks/vstar_bench/__init__.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-04-24T06:57:31,972 creating build/lib/evalscope/benchmarks/bbh 2026-04-24T06:57:31,973 copying evalscope/benchmarks/bbh/__init__.py -> build/lib/evalscope/benchmarks/bbh 2026-04-24T06:57:31,976 copying evalscope/benchmarks/bbh/bbh_adapter.py -> build/lib/evalscope/benchmarks/bbh 2026-04-24T06:57:31,979 creating build/lib/evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:31,980 copying evalscope/benchmarks/zebralogicbench/__init__.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:31,982 copying evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:31,984 copying evalscope/benchmarks/zebralogicbench/utils.py -> 
build/lib/evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:31,987 creating build/lib/evalscope/benchmarks/needle_haystack 2026-04-24T06:57:31,988 copying evalscope/benchmarks/needle_haystack/__init__.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-24T06:57:31,990 copying evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-24T06:57:31,993 copying evalscope/benchmarks/needle_haystack/utils.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-24T06:57:31,995 creating build/lib/evalscope/benchmarks/coin_flip 2026-04-24T06:57:31,996 copying evalscope/benchmarks/coin_flip/__init__.py -> build/lib/evalscope/benchmarks/coin_flip 2026-04-24T06:57:31,998 copying evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/lib/evalscope/benchmarks/coin_flip 2026-04-24T06:57:32,001 creating build/lib/evalscope/benchmarks/gsm8k 2026-04-24T06:57:32,002 copying evalscope/benchmarks/gsm8k/__init__.py -> build/lib/evalscope/benchmarks/gsm8k 2026-04-24T06:57:32,004 copying evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/lib/evalscope/benchmarks/gsm8k 2026-04-24T06:57:32,006 creating build/lib/evalscope/benchmarks/cl_bench 2026-04-24T06:57:32,007 copying evalscope/benchmarks/cl_bench/__init__.py -> build/lib/evalscope/benchmarks/cl_bench 2026-04-24T06:57:32,009 copying evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/lib/evalscope/benchmarks/cl_bench 2026-04-24T06:57:32,011 copying evalscope/benchmarks/cl_bench/utils.py -> build/lib/evalscope/benchmarks/cl_bench 2026-04-24T06:57:32,013 creating build/lib/evalscope/benchmarks/poly_math 2026-04-24T06:57:32,014 copying evalscope/benchmarks/poly_math/__init__.py -> build/lib/evalscope/benchmarks/poly_math 2026-04-24T06:57:32,016 copying evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/lib/evalscope/benchmarks/poly_math 2026-04-24T06:57:32,019 creating build/lib/evalscope/benchmarks/image_edit 2026-04-24T06:57:32,020 
copying evalscope/benchmarks/image_edit/__init__.py -> build/lib/evalscope/benchmarks/image_edit 2026-04-24T06:57:32,022 creating build/lib/evalscope/benchmarks/real_world_qa 2026-04-24T06:57:32,023 copying evalscope/benchmarks/real_world_qa/__init__.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-04-24T06:57:32,025 copying evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-04-24T06:57:32,027 creating build/lib/evalscope/benchmarks/mbppplus 2026-04-24T06:57:32,028 copying evalscope/benchmarks/mbppplus/__init__.py -> build/lib/evalscope/benchmarks/mbppplus 2026-04-24T06:57:32,030 copying evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/lib/evalscope/benchmarks/mbppplus 2026-04-24T06:57:32,033 creating build/lib/evalscope/benchmarks/refcoco 2026-04-24T06:57:32,034 copying evalscope/benchmarks/refcoco/__init__.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-24T06:57:32,036 copying evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-24T06:57:32,038 copying evalscope/benchmarks/refcoco/evaluation_lib.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-24T06:57:32,040 copying evalscope/benchmarks/refcoco/utils.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-24T06:57:32,043 creating build/lib/evalscope/benchmarks/general_vqa 2026-04-24T06:57:32,045 copying evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/lib/evalscope/benchmarks/general_vqa 2026-04-24T06:57:32,048 copying evalscope/benchmarks/general_vqa/__init__.py -> build/lib/evalscope/benchmarks/general_vqa 2026-04-24T06:57:32,050 creating build/lib/evalscope/benchmarks/multipl_e 2026-04-24T06:57:32,052 copying evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-24T06:57:32,055 copying evalscope/benchmarks/multipl_e/__init__.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-24T06:57:32,057 copying 
evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-24T06:57:32,060 copying evalscope/benchmarks/multipl_e/utils.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-24T06:57:32,064 creating build/lib/evalscope/benchmarks/ai2d 2026-04-24T06:57:32,065 copying evalscope/benchmarks/ai2d/__init__.py -> build/lib/evalscope/benchmarks/ai2d 2026-04-24T06:57:32,067 copying evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/lib/evalscope/benchmarks/ai2d 2026-04-24T06:57:32,070 creating build/lib/evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:32,072 copying evalscope/benchmarks/openai_mrcr/__init__.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:32,074 copying evalscope/benchmarks/openai_mrcr/utils.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:32,077 copying evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:32,080 creating build/lib/evalscope/benchmarks/general_qa 2026-04-24T06:57:32,082 copying evalscope/benchmarks/general_qa/__init__.py -> build/lib/evalscope/benchmarks/general_qa 2026-04-24T06:57:32,084 copying evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/lib/evalscope/benchmarks/general_qa 2026-04-24T06:57:32,088 creating build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,089 copying evalscope/benchmarks/ifbench/instructions.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,094 copying evalscope/benchmarks/ifbench/instructions_util.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,098 copying evalscope/benchmarks/ifbench/__init__.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,100 copying evalscope/benchmarks/ifbench/instructions_registry.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,102 copying evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,105 
copying evalscope/benchmarks/ifbench/evaluation_lib.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:32,108 creating build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,110 copying evalscope/benchmarks/live_code_bench/extract_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,112 copying evalscope/benchmarks/live_code_bench/__init__.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,115 copying evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,117 copying evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,120 copying evalscope/benchmarks/live_code_bench/load_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,123 copying evalscope/benchmarks/live_code_bench/testing_util.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,126 copying evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,129 copying evalscope/benchmarks/live_code_bench/prompts.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,132 copying evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:32,136 creating build/lib/evalscope/benchmarks/trivia_qa 2026-04-24T06:57:32,137 copying evalscope/benchmarks/trivia_qa/__init__.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-04-24T06:57:32,140 copying evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-04-24T06:57:32,143 creating build/lib/evalscope/benchmarks/halu_eval 2026-04-24T06:57:32,144 copying evalscope/benchmarks/halu_eval/__init__.py -> build/lib/evalscope/benchmarks/halu_eval 2026-04-24T06:57:32,146 copying 
evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/lib/evalscope/benchmarks/halu_eval 2026-04-24T06:57:32,148 copying evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/lib/evalscope/benchmarks/halu_eval 2026-04-24T06:57:32,151 creating build/lib/evalscope/benchmarks/winogrande 2026-04-24T06:57:32,152 copying evalscope/benchmarks/winogrande/__init__.py -> build/lib/evalscope/benchmarks/winogrande 2026-04-24T06:57:32,154 copying evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/lib/evalscope/benchmarks/winogrande 2026-04-24T06:57:32,157 creating build/lib/evalscope/benchmarks/micro_vqa 2026-04-24T06:57:32,158 copying evalscope/benchmarks/micro_vqa/__init__.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-04-24T06:57:32,159 copying evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-04-24T06:57:32,162 creating build/lib/evalscope/benchmarks/terminal_bench 2026-04-24T06:57:32,163 copying evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-24T06:57:32,166 copying evalscope/benchmarks/terminal_bench/__init__.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-24T06:57:32,168 copying evalscope/benchmarks/terminal_bench/utils.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-24T06:57:32,170 creating build/lib/evalscope/benchmarks/iquiz 2026-04-24T06:57:32,171 copying evalscope/benchmarks/iquiz/__init__.py -> build/lib/evalscope/benchmarks/iquiz 2026-04-24T06:57:32,173 copying evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/lib/evalscope/benchmarks/iquiz 2026-04-24T06:57:32,175 creating build/lib/evalscope/benchmarks/torgo 2026-04-24T06:57:32,176 copying evalscope/benchmarks/torgo/__init__.py -> build/lib/evalscope/benchmarks/torgo 2026-04-24T06:57:32,178 copying evalscope/benchmarks/torgo/torgo_adapter.py -> build/lib/evalscope/benchmarks/torgo 2026-04-24T06:57:32,181 creating 
build/lib/evalscope/benchmarks/mmlu_redux 2026-04-24T06:57:32,182 copying evalscope/benchmarks/mmlu_redux/__init__.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-04-24T06:57:32,184 copying evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-04-24T06:57:32,187 creating build/lib/evalscope/benchmarks/math_500 2026-04-24T06:57:32,188 copying evalscope/benchmarks/math_500/__init__.py -> build/lib/evalscope/benchmarks/math_500 2026-04-24T06:57:32,190 copying evalscope/benchmarks/math_500/math_500_adapter.py -> build/lib/evalscope/benchmarks/math_500 2026-04-24T06:57:32,192 creating build/lib/evalscope/benchmarks/humanevalplus 2026-04-24T06:57:32,193 copying evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-04-24T06:57:32,195 copying evalscope/benchmarks/humanevalplus/__init__.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-04-24T06:57:32,198 creating build/lib/evalscope/benchmarks/aa_lcr 2026-04-24T06:57:32,199 copying evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-04-24T06:57:32,201 copying evalscope/benchmarks/aa_lcr/__init__.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-04-24T06:57:32,203 creating build/lib/evalscope/benchmarks/wmt 2026-04-24T06:57:32,204 copying evalscope/benchmarks/wmt/__init__.py -> build/lib/evalscope/benchmarks/wmt 2026-04-24T06:57:32,206 copying evalscope/benchmarks/wmt/wmt24_adapter.py -> build/lib/evalscope/benchmarks/wmt 2026-04-24T06:57:32,209 creating build/lib/evalscope/benchmarks/race 2026-04-24T06:57:32,210 copying evalscope/benchmarks/race/__init__.py -> build/lib/evalscope/benchmarks/race 2026-04-24T06:57:32,212 copying evalscope/benchmarks/race/race_adapter.py -> build/lib/evalscope/benchmarks/race 2026-04-24T06:57:32,214 creating build/lib/evalscope/benchmarks/data_collection 2026-04-24T06:57:32,215 copying evalscope/benchmarks/data_collection/__init__.py -> 
build/lib/evalscope/benchmarks/data_collection 2026-04-24T06:57:32,217 copying evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/lib/evalscope/benchmarks/data_collection 2026-04-24T06:57:32,220 creating build/lib/evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:32,221 copying evalscope/benchmarks/olympiad_bench/__init__.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:32,223 copying evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:32,226 copying evalscope/benchmarks/olympiad_bench/utils.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:32,229 creating build/lib/evalscope/benchmarks/cmmu 2026-04-24T06:57:32,230 copying evalscope/benchmarks/cmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmu 2026-04-24T06:57:32,231 copying evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmu 2026-04-24T06:57:32,234 copying evalscope/benchmarks/cmmu/prompt.py -> build/lib/evalscope/benchmarks/cmmu 2026-04-24T06:57:32,236 creating build/lib/evalscope/benchmarks/gpqa 2026-04-24T06:57:32,237 copying evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/lib/evalscope/benchmarks/gpqa 2026-04-24T06:57:32,240 copying evalscope/benchmarks/gpqa/__init__.py -> build/lib/evalscope/benchmarks/gpqa 2026-04-24T06:57:32,241 copying evalscope/benchmarks/gpqa/prompt.py -> build/lib/evalscope/benchmarks/gpqa 2026-04-24T06:57:32,245 creating build/lib/evalscope/benchmarks/bfcl 2026-04-24T06:57:32,246 copying evalscope/benchmarks/bfcl/__init__.py -> build/lib/evalscope/benchmarks/bfcl 2026-04-24T06:57:32,248 creating build/lib/evalscope/benchmarks/alpaca_eval 2026-04-24T06:57:32,249 copying evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-04-24T06:57:32,252 copying evalscope/benchmarks/alpaca_eval/__init__.py -> build/lib/evalscope/benchmarks/alpaca_eval 
2026-04-24T06:57:32,254 creating build/lib/evalscope/benchmarks/mri_mcqa 2026-04-24T06:57:32,255 copying evalscope/benchmarks/mri_mcqa/__init__.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-04-24T06:57:32,257 copying evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-04-24T06:57:32,259 creating build/lib/evalscope/benchmarks/ceval 2026-04-24T06:57:32,261 copying evalscope/benchmarks/ceval/__init__.py -> build/lib/evalscope/benchmarks/ceval 2026-04-24T06:57:32,263 copying evalscope/benchmarks/ceval/ceval_adapter.py -> build/lib/evalscope/benchmarks/ceval 2026-04-24T06:57:32,266 creating build/lib/evalscope/benchmarks/frames 2026-04-24T06:57:32,267 copying evalscope/benchmarks/frames/__init__.py -> build/lib/evalscope/benchmarks/frames 2026-04-24T06:57:32,269 copying evalscope/benchmarks/frames/frames_adapter.py -> build/lib/evalscope/benchmarks/frames 2026-04-24T06:57:32,271 copying evalscope/benchmarks/frames/utils.py -> build/lib/evalscope/benchmarks/frames 2026-04-24T06:57:32,273 creating build/lib/evalscope/benchmarks/mmlu_pro 2026-04-24T06:57:32,274 copying evalscope/benchmarks/mmlu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-04-24T06:57:32,276 copying evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-04-24T06:57:32,280 creating build/lib/evalscope/benchmarks/healthbench 2026-04-24T06:57:32,281 copying evalscope/benchmarks/healthbench/__init__.py -> build/lib/evalscope/benchmarks/healthbench 2026-04-24T06:57:32,283 copying evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/lib/evalscope/benchmarks/healthbench 2026-04-24T06:57:32,286 copying evalscope/benchmarks/healthbench/utils.py -> build/lib/evalscope/benchmarks/healthbench 2026-04-24T06:57:32,289 creating build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-04-24T06:57:32,290 copying evalscope/benchmarks/seed_bench_2_plus/__init__.py -> 
build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-04-24T06:57:32,292 copying evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-04-24T06:57:32,294 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-24T06:57:32,295 copying evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-24T06:57:32,298 copying evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-24T06:57:32,300 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,301 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,302 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,304 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,306 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,308 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,311 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,313 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,315 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:32,317 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 
2026-04-24T06:57:32,321 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:32,322 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:32,323 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:32,326 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:32,329 creating build/lib/evalscope/benchmarks/scicode/docker 2026-04-24T06:57:32,330 copying evalscope/benchmarks/scicode/docker/process_data.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-24T06:57:32,333 copying evalscope/benchmarks/scicode/docker/test_util.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-24T06:57:32,335 creating build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:32,336 copying evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:32,338 copying evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:32,340 copying evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:32,343 creating build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:32,344 copying evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:32,347 copying evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:32,349 copying evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> 
build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:32,352 creating build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,354 copying evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,357 copying evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,359 copying evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,361 copying evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,363 copying evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,366 copying evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:32,370 creating build/lib/evalscope/benchmarks/poly_math/utils 2026-04-24T06:57:32,371 copying evalscope/benchmarks/poly_math/utils/instruction.py -> build/lib/evalscope/benchmarks/poly_math/utils 2026-04-24T06:57:32,375 creating build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:32,376 copying evalscope/benchmarks/image_edit/gedit/__init__.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:32,381 copying evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:32,387 copying evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:32,393 copying evalscope/benchmarks/image_edit/gedit/utils.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:32,400 creating build/lib/evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:32,401 copying evalscope/benchmarks/bfcl/v4/__init__.py -> 
build/lib/evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:32,403 copying evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:32,405 copying evalscope/benchmarks/bfcl/v4/utils.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:32,409 creating build/lib/evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:32,410 copying evalscope/benchmarks/bfcl/v3/generation.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:32,412 copying evalscope/benchmarks/bfcl/v3/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:32,414 copying evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:32,416 copying evalscope/benchmarks/bfcl/v3/utils.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:32,419 creating build/lib/evalscope/app/utils 2026-04-24T06:57:32,420 copying evalscope/app/utils/data_utils.py -> build/lib/evalscope/app/utils 2026-04-24T06:57:32,423 copying evalscope/app/utils/text_utils.py -> build/lib/evalscope/app/utils 2026-04-24T06:57:32,425 copying evalscope/app/utils/localization.py -> build/lib/evalscope/app/utils 2026-04-24T06:57:32,427 copying evalscope/app/utils/visualization.py -> build/lib/evalscope/app/utils 2026-04-24T06:57:32,430 copying evalscope/app/utils/env_utils.py -> build/lib/evalscope/app/utils 2026-04-24T06:57:32,432 creating build/lib/evalscope/app/ui 2026-04-24T06:57:32,433 copying evalscope/app/ui/__init__.py -> build/lib/evalscope/app/ui 2026-04-24T06:57:32,435 copying evalscope/app/ui/multi_model.py -> build/lib/evalscope/app/ui 2026-04-24T06:57:32,438 copying evalscope/app/ui/app_ui.py -> build/lib/evalscope/app/ui 2026-04-24T06:57:32,440 copying evalscope/app/ui/sidebar.py -> build/lib/evalscope/app/ui 2026-04-24T06:57:32,442 copying evalscope/app/ui/visualization.py -> build/lib/evalscope/app/ui 2026-04-24T06:57:32,444 copying evalscope/app/ui/single_model.py -> build/lib/evalscope/app/ui 
2026-04-24T06:57:32,447 creating build/lib/evalscope/third_party/thinkbench 2026-04-24T06:57:32,448 copying evalscope/third_party/thinkbench/eval.py -> build/lib/evalscope/third_party/thinkbench 2026-04-24T06:57:32,451 copying evalscope/third_party/thinkbench/__init__.py -> build/lib/evalscope/third_party/thinkbench 2026-04-24T06:57:32,453 copying evalscope/third_party/thinkbench/infer.py -> build/lib/evalscope/third_party/thinkbench 2026-04-24T06:57:32,456 creating build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:32,457 copying evalscope/third_party/longbench_write/eval.py -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:32,460 copying evalscope/third_party/longbench_write/__init__.py -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:32,462 copying evalscope/third_party/longbench_write/infer.py -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:32,464 copying evalscope/third_party/longbench_write/longbench_write.py -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:32,466 copying evalscope/third_party/longbench_write/utils.py -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:32,469 creating build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:32,470 copying evalscope/third_party/toolbench_static/eval.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:32,472 copying evalscope/third_party/toolbench_static/__init__.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:32,474 copying evalscope/third_party/toolbench_static/toolbench_static.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:32,476 copying evalscope/third_party/toolbench_static/infer.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:32,480 creating build/lib/evalscope/third_party/thinkbench/tools 2026-04-24T06:57:32,481 copying evalscope/third_party/thinkbench/tools/__init__.py -> 
build/lib/evalscope/third_party/thinkbench/tools 2026-04-24T06:57:32,483 copying evalscope/third_party/thinkbench/tools/llm.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-04-24T06:57:32,485 copying evalscope/third_party/thinkbench/tools/utils.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-04-24T06:57:32,487 creating build/lib/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:32,489 copying evalscope/third_party/longbench_write/resources/__init__.py -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:32,491 creating build/lib/evalscope/third_party/longbench_write/tools 2026-04-24T06:57:32,492 copying evalscope/third_party/longbench_write/tools/data_etl.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-04-24T06:57:32,495 copying evalscope/third_party/longbench_write/tools/__init__.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-04-24T06:57:32,497 copying evalscope/third_party/longbench_write/tools/openai_api.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-04-24T06:57:32,500 creating build/lib/evalscope/third_party/toolbench_static/llm 2026-04-24T06:57:32,501 copying evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-04-24T06:57:32,503 copying evalscope/third_party/toolbench_static/llm/__init__.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-04-24T06:57:32,508 creating build/lib/evalscope/service/blueprints 2026-04-24T06:57:32,509 copying evalscope/service/blueprints/eval.py -> build/lib/evalscope/service/blueprints 2026-04-24T06:57:32,512 copying evalscope/service/blueprints/perf.py -> build/lib/evalscope/service/blueprints 2026-04-24T06:57:32,514 copying evalscope/service/blueprints/__init__.py -> build/lib/evalscope/service/blueprints 2026-04-24T06:57:32,516 creating build/lib/evalscope/service/utils 2026-04-24T06:57:32,517 copying evalscope/service/utils/__init__.py 
-> build/lib/evalscope/service/utils 2026-04-24T06:57:32,519 copying evalscope/service/utils/benchmarks.py -> build/lib/evalscope/service/utils 2026-04-24T06:57:32,522 copying evalscope/service/utils/process.py -> build/lib/evalscope/service/utils 2026-04-24T06:57:32,524 copying evalscope/service/utils/log.py -> build/lib/evalscope/service/utils 2026-04-24T06:57:32,527 creating build/lib/evalscope/service/frontend 2026-04-24T06:57:32,528 copying evalscope/service/frontend/async_client.py -> build/lib/evalscope/service/frontend 2026-04-24T06:57:32,531 copying evalscope/service/frontend/__init__.py -> build/lib/evalscope/service/frontend 2026-04-24T06:57:32,532 copying evalscope/service/frontend/main.py -> build/lib/evalscope/service/frontend 2026-04-24T06:57:32,535 copying evalscope/service/frontend/utils.py -> build/lib/evalscope/service/frontend 2026-04-24T06:57:32,537 creating build/lib/evalscope/models/utils 2026-04-24T06:57:32,538 copying evalscope/models/utils/anthropic.py -> build/lib/evalscope/models/utils 2026-04-24T06:57:32,541 copying evalscope/models/utils/openai.py -> build/lib/evalscope/models/utils 2026-04-24T06:57:32,546 running egg_info 2026-04-24T06:57:32,556 writing evalscope.egg-info/PKG-INFO 2026-04-24T06:57:32,582 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-04-24T06:57:32,584 writing entry points to evalscope.egg-info/entry_points.txt 2026-04-24T06:57:32,597 writing requirements to evalscope.egg-info/requires.txt 2026-04-24T06:57:32,599 writing top-level names to evalscope.egg-info/top_level.txt 2026-04-24T06:57:33,050 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:33,107 reading manifest template 'MANIFEST.in' 2026-04-24T06:57:33,584 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-04-24T06:57:33,589 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-04-24T06:57:33,595 warning: no previously-included 
files matching '*.so' found anywhere in distribution 2026-04-24T06:57:33,601 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-04-24T06:57:33,602 adding license file 'LICENSE' 2026-04-24T06:57:33,657 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-24T06:57:33,777 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-24T06:57:33,779 copying evalscope/metrics/text_normalizer/english.json -> build/lib/evalscope/metrics/text_normalizer 2026-04-24T06:57:33,783 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-24T06:57:33,784 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-24T06:57:33,787 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:33,788 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:33,790 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:33,792 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:33,795 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,796 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 
2026-04-24T06:57:33,798 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,801 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,803 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,806 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,808 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,810 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,813 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,815 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,817 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,819 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,821 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,824 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,826 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,828 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,831 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,833 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,835 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,838 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:33,840 copying evalscope/benchmarks/ocr_bench/requirements.txt -> build/lib/evalscope/benchmarks/ocr_bench 2026-04-24T06:57:33,842 copying evalscope/benchmarks/arena_hard/requirements.txt -> build/lib/evalscope/benchmarks/arena_hard 2026-04-24T06:57:33,845 copying evalscope/benchmarks/multi_if/requirements.txt -> build/lib/evalscope/benchmarks/multi_if 2026-04-24T06:57:33,847 copying evalscope/benchmarks/omnidoc_bench/requirements.txt -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:33,849 copying evalscope/benchmarks/ifeval/requirements.txt -> build/lib/evalscope/benchmarks/ifeval 2026-04-24T06:57:33,851 copying evalscope/benchmarks/swe_bench/requirements.txt -> build/lib/evalscope/benchmarks/swe_bench 2026-04-24T06:57:33,853 copying evalscope/benchmarks/general_arena/requirements.txt -> build/lib/evalscope/benchmarks/general_arena 2026-04-24T06:57:33,855 copying evalscope/benchmarks/needle_haystack/requirements.txt -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-24T06:57:33,857 copying evalscope/benchmarks/refcoco/requirements.txt -> build/lib/evalscope/benchmarks/refcoco 2026-04-24T06:57:33,859 copying evalscope/benchmarks/openai_mrcr/requirements.txt -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:33,861 copying evalscope/benchmarks/ifbench/requirements.txt -> build/lib/evalscope/benchmarks/ifbench 2026-04-24T06:57:33,863 copying evalscope/benchmarks/trivia_qa/samples.jsonl -> build/lib/evalscope/benchmarks/trivia_qa 2026-04-24T06:57:33,865 copying evalscope/benchmarks/terminal_bench/requirements.txt -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-24T06:57:33,867 copying 
evalscope/benchmarks/torgo/requirements.txt -> build/lib/evalscope/benchmarks/torgo 2026-04-24T06:57:33,869 copying evalscope/benchmarks/wmt/requirements.txt -> build/lib/evalscope/benchmarks/wmt 2026-04-24T06:57:33,871 copying evalscope/benchmarks/olympiad_bench/requirements.txt -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:33,873 creating build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,874 copying evalscope/benchmarks/_meta/a_okvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,877 copying evalscope/benchmarks/_meta/aa_lcr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,880 copying evalscope/benchmarks/_meta/ai2d.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,883 copying evalscope/benchmarks/_meta/aime24.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,886 copying evalscope/benchmarks/_meta/aime25.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,890 copying evalscope/benchmarks/_meta/aime26.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,892 copying evalscope/benchmarks/_meta/alpaca_eval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,895 copying evalscope/benchmarks/_meta/amc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,898 copying evalscope/benchmarks/_meta/anat_em.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,901 copying evalscope/benchmarks/_meta/arc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,904 copying evalscope/benchmarks/_meta/arena_hard.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,906 copying evalscope/benchmarks/_meta/bbh.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,910 copying evalscope/benchmarks/_meta/bc2gm.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,913 copying evalscope/benchmarks/_meta/bc4chemd.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,916 copying 
evalscope/benchmarks/_meta/bc5cdr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,919 copying evalscope/benchmarks/_meta/bfcl_v3.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,922 copying evalscope/benchmarks/_meta/bfcl_v4.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,925 copying evalscope/benchmarks/_meta/biomix_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,927 copying evalscope/benchmarks/_meta/blink.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,930 copying evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,934 copying evalscope/benchmarks/_meta/cc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,937 copying evalscope/benchmarks/_meta/ceval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,940 copying evalscope/benchmarks/_meta/chartqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,943 copying evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,946 copying evalscope/benchmarks/_meta/cl_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,950 copying evalscope/benchmarks/_meta/cmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,954 copying evalscope/benchmarks/_meta/cmmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,958 copying evalscope/benchmarks/_meta/cmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,961 copying evalscope/benchmarks/_meta/coin_flip.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,965 copying evalscope/benchmarks/_meta/commonsense_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,969 copying evalscope/benchmarks/_meta/competition_math.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,973 copying evalscope/benchmarks/_meta/conll2003.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,977 copying evalscope/benchmarks/_meta/conllpp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:33,984 copying evalscope/benchmarks/_meta/copious.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,026 copying evalscope/benchmarks/_meta/cross_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,029 copying evalscope/benchmarks/_meta/data_collection.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,032 copying evalscope/benchmarks/_meta/docmath.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,035 copying evalscope/benchmarks/_meta/docvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,038 copying evalscope/benchmarks/_meta/drivel_binary.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,041 copying evalscope/benchmarks/_meta/drivel_multilabel.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,044 copying evalscope/benchmarks/_meta/drivel_selection.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,047 copying evalscope/benchmarks/_meta/drivel_writing.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,050 copying evalscope/benchmarks/_meta/drop.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,053 copying evalscope/benchmarks/_meta/eq_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,056 copying evalscope/benchmarks/_meta/evalmuse.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,059 copying evalscope/benchmarks/_meta/fin_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,062 copying evalscope/benchmarks/_meta/fleurs.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,065 copying evalscope/benchmarks/_meta/frames.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,069 copying evalscope/benchmarks/_meta/gedit.json -> build/lib/evalscope/benchmarks/_meta 
2026-04-24T06:57:34,072 copying evalscope/benchmarks/_meta/genai_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,075 copying evalscope/benchmarks/_meta/general_arena.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,079 copying evalscope/benchmarks/_meta/general_fc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,082 copying evalscope/benchmarks/_meta/general_mcq.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,086 copying evalscope/benchmarks/_meta/general_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,089 copying evalscope/benchmarks/_meta/general_t2i.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,091 copying evalscope/benchmarks/_meta/general_vmcq.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,095 copying evalscope/benchmarks/_meta/general_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,098 copying evalscope/benchmarks/_meta/genia_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,101 copying evalscope/benchmarks/_meta/gpqa_diamond.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,104 copying evalscope/benchmarks/_meta/gsm8k.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,107 copying evalscope/benchmarks/_meta/gsm8k_v.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,111 copying evalscope/benchmarks/_meta/hallusion_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,114 copying evalscope/benchmarks/_meta/halueval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,117 copying evalscope/benchmarks/_meta/harvey_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,120 copying evalscope/benchmarks/_meta/health_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,124 copying evalscope/benchmarks/_meta/hellaswag.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,127 copying 
evalscope/benchmarks/_meta/hle.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,131 copying evalscope/benchmarks/_meta/hmmt25.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,134 copying evalscope/benchmarks/_meta/hpdv2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,137 copying evalscope/benchmarks/_meta/humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,141 copying evalscope/benchmarks/_meta/humaneval_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,145 copying evalscope/benchmarks/_meta/ifbench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,148 copying evalscope/benchmarks/_meta/ifeval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,152 copying evalscope/benchmarks/_meta/infovqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,155 copying evalscope/benchmarks/_meta/iquiz.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,158 copying evalscope/benchmarks/_meta/jnlpba.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,162 copying evalscope/benchmarks/_meta/jnlpba_rare.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,165 copying evalscope/benchmarks/_meta/librispeech.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,168 copying evalscope/benchmarks/_meta/live_code_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,171 copying evalscope/benchmarks/_meta/logi_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,175 copying evalscope/benchmarks/_meta/longbench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,178 copying evalscope/benchmarks/_meta/maritime_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,181 copying evalscope/benchmarks/_meta/math_500.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,185 copying evalscope/benchmarks/_meta/math_qa.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,188 copying evalscope/benchmarks/_meta/math_verse.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,192 copying evalscope/benchmarks/_meta/math_vision.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,195 copying evalscope/benchmarks/_meta/math_vista.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,198 copying evalscope/benchmarks/_meta/mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,202 copying evalscope/benchmarks/_meta/mbpp_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,205 copying evalscope/benchmarks/_meta/med_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,208 copying evalscope/benchmarks/_meta/mgsm.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,212 copying evalscope/benchmarks/_meta/mia_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,215 copying evalscope/benchmarks/_meta/micro_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,219 copying evalscope/benchmarks/_meta/minerva_math.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,222 copying evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,226 copying evalscope/benchmarks/_meta/mit_restaurant.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,229 copying evalscope/benchmarks/_meta/mm_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,232 copying evalscope/benchmarks/_meta/mm_star.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,236 copying evalscope/benchmarks/_meta/mmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,240 copying evalscope/benchmarks/_meta/mmlu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,243 copying evalscope/benchmarks/_meta/mmlu_redux.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,247 
copying evalscope/benchmarks/_meta/mmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,250 copying evalscope/benchmarks/_meta/mmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,254 copying evalscope/benchmarks/_meta/mmmu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,260 copying evalscope/benchmarks/_meta/mri_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,263 copying evalscope/benchmarks/_meta/multi_if.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,267 copying evalscope/benchmarks/_meta/multi_nerd.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,271 copying evalscope/benchmarks/_meta/multiple_humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,274 copying evalscope/benchmarks/_meta/multiple_mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,278 copying evalscope/benchmarks/_meta/music_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,281 copying evalscope/benchmarks/_meta/musr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,284 copying evalscope/benchmarks/_meta/ncbi.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,288 copying evalscope/benchmarks/_meta/needle_haystack.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,291 copying evalscope/benchmarks/_meta/ocr_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,295 copying evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,299 copying evalscope/benchmarks/_meta/olympiad_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,302 copying evalscope/benchmarks/_meta/omni_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,306 copying evalscope/benchmarks/_meta/omni_doc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,310 copying 
evalscope/benchmarks/_meta/ontonotes5.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,313 copying evalscope/benchmarks/_meta/openai_mrcr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,317 copying evalscope/benchmarks/_meta/piqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,320 copying evalscope/benchmarks/_meta/poly_math.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,324 copying evalscope/benchmarks/_meta/pope.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,327 copying evalscope/benchmarks/_meta/process_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,330 copying evalscope/benchmarks/_meta/pubmedqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,334 copying evalscope/benchmarks/_meta/qasc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,337 copying evalscope/benchmarks/_meta/race.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,340 copying evalscope/benchmarks/_meta/real_world_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,344 copying evalscope/benchmarks/_meta/refcoco.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,347 copying evalscope/benchmarks/_meta/scicode.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,352 copying evalscope/benchmarks/_meta/science_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,355 copying evalscope/benchmarks/_meta/sciq.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,358 copying evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,361 copying evalscope/benchmarks/_meta/simple_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,365 copying evalscope/benchmarks/_meta/simple_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,369 copying evalscope/benchmarks/_meta/siqa.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,372 copying evalscope/benchmarks/_meta/super_gpqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,377 copying evalscope/benchmarks/_meta/swe_bench_lite.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,381 copying evalscope/benchmarks/_meta/swe_bench_verified.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,385 copying evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,389 copying evalscope/benchmarks/_meta/tau2_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,393 copying evalscope/benchmarks/_meta/tau_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,397 copying evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,401 copying evalscope/benchmarks/_meta/tifa160.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,405 copying evalscope/benchmarks/_meta/tir_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,410 copying evalscope/benchmarks/_meta/tool_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,414 copying evalscope/benchmarks/_meta/torgo.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,419 copying evalscope/benchmarks/_meta/trivia_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,424 copying evalscope/benchmarks/_meta/truthful_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,428 copying evalscope/benchmarks/_meta/tweebank_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,431 copying evalscope/benchmarks/_meta/tweet_ner_7.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,435 copying evalscope/benchmarks/_meta/visulogic.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,438 copying evalscope/benchmarks/_meta/vstar_bench.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,441 copying evalscope/benchmarks/_meta/winogrande.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,445 copying evalscope/benchmarks/_meta/wmt24pp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,448 copying evalscope/benchmarks/_meta/wnut2017.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,452 copying evalscope/benchmarks/_meta/zebralogicbench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,455 copying evalscope/benchmarks/_meta/zerobench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-24T06:57:34,458 copying evalscope/benchmarks/bfcl/requirements.txt -> build/lib/evalscope/benchmarks/bfcl 2026-04-24T06:57:34,461 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:34,464 copying evalscope/benchmarks/scicode/docker/Dockerfile -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-24T06:57:34,466 copying evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-24T06:57:34,469 copying evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:34,472 copying evalscope/benchmarks/tau_bench/tau_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:34,475 creating build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,477 copying evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,480 copying evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,484 copying evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,487 copying 
evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,491 copying evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,494 copying evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,498 copying evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,501 copying evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,505 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,508 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,511 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,515 copying evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,519 copying evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,522 copying evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,525 copying evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,529 copying evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,532 copying evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 
2026-04-24T06:57:34,535 copying evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,539 copying evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,543 copying evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,546 copying evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,550 copying evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,553 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,557 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,560 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,564 copying evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,568 copying evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:34,571 creating build/lib/evalscope/benchmarks/humanevalplus/docker 2026-04-24T06:57:34,573 copying evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/lib/evalscope/benchmarks/humanevalplus/docker 2026-04-24T06:57:34,575 copying evalscope/third_party/longbench_write/README.md -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:34,578 copying evalscope/third_party/longbench_write/default_task.json -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:34,581 copying 
evalscope/third_party/longbench_write/default_task.yaml -> build/lib/evalscope/third_party/longbench_write 2026-04-24T06:57:34,584 copying evalscope/third_party/toolbench_static/README.md -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:34,587 copying evalscope/third_party/toolbench_static/config_default.json -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:34,590 copying evalscope/third_party/toolbench_static/config_default.yaml -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:34,592 copying evalscope/third_party/toolbench_static/requirements.txt -> build/lib/evalscope/third_party/toolbench_static 2026-04-24T06:57:34,595 creating build/lib/evalscope/third_party/thinkbench/resources 2026-04-24T06:57:34,597 copying evalscope/third_party/thinkbench/resources/critique_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-04-24T06:57:34,600 copying evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-04-24T06:57:34,602 copying evalscope/third_party/longbench_write/resources/judge.txt -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:34,605 copying evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:34,609 copying evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:34,613 copying evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:34,616 creating build/lib/evalscope/report/template 2026-04-24T06:57:34,617 copying evalscope/report/template/perf_report.html.j2 -> build/lib/evalscope/report/template 2026-04-24T06:57:34,621 copying evalscope/report/template/report.html.j2 -> build/lib/evalscope/report/template 
2026-04-24T06:57:34,624 creating build/lib/evalscope/report/template/css 2026-04-24T06:57:34,625 copying evalscope/report/template/css/base.css -> build/lib/evalscope/report/template/css 2026-04-24T06:57:34,629 copying evalscope/report/template/css/perf_extra.css -> build/lib/evalscope/report/template/css 2026-04-24T06:57:34,631 creating build/lib/evalscope/report/template/js 2026-04-24T06:57:34,633 copying evalscope/report/template/js/eval_extra.js -> build/lib/evalscope/report/template/js 2026-04-24T06:57:34,636 copying evalscope/report/template/js/i18n_eval.js -> build/lib/evalscope/report/template/js 2026-04-24T06:57:34,639 copying evalscope/report/template/js/i18n_perf.js -> build/lib/evalscope/report/template/js 2026-04-24T06:57:34,642 copying evalscope/report/template/js/perf_extra.js -> build/lib/evalscope/report/template/js 2026-04-24T06:57:34,644 copying evalscope/report/template/js/shared.js -> build/lib/evalscope/report/template/js 2026-04-24T06:57:34,647 creating build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,649 copying evalscope/report/template/partials/brand_logo.html -> build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,652 copying evalscope/report/template/partials/footer.html -> build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,655 copying evalscope/report/template/partials/header_eval.html -> build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,657 copying evalscope/report/template/partials/header_perf.html -> build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,660 copying evalscope/report/template/partials/toc_eval.html -> build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,663 copying evalscope/report/template/partials/toc_perf.html -> build/lib/evalscope/report/template/partials 2026-04-24T06:57:34,770 installing to build/bdist.linux-armv7l/wheel 2026-04-24T06:57:34,771 running install 2026-04-24T06:57:34,795 running install_lib 2026-04-24T06:57:34,801 
creating build/bdist.linux-armv7l/wheel 2026-04-24T06:57:34,804 creating build/bdist.linux-armv7l/wheel/evalscope 2026-04-24T06:57:34,806 creating build/bdist.linux-armv7l/wheel/evalscope/sandbox 2026-04-24T06:57:34,808 copying build/lib/evalscope/sandbox/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/sandbox 2026-04-24T06:57:34,810 copying build/lib/evalscope/sandbox/volcengine.py -> build/bdist.linux-armv7l/wheel/./evalscope/sandbox 2026-04-24T06:57:34,814 creating build/bdist.linux-armv7l/wheel/evalscope/perf 2026-04-24T06:57:34,816 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils 2026-04-24T06:57:34,817 copying build/lib/evalscope/perf/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,819 copying build/lib/evalscope/perf/utils/db_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,822 copying build/lib/evalscope/perf/utils/rich_display.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,825 copying build/lib/evalscope/perf/utils/handler.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,828 copying build/lib/evalscope/perf/utils/local_server.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,830 copying build/lib/evalscope/perf/utils/log_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,833 copying build/lib/evalscope/perf/utils/benchmark_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,837 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils/report 2026-04-24T06:57:34,838 copying build/lib/evalscope/perf/utils/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-24T06:57:34,840 copying build/lib/evalscope/perf/utils/report/perf_charts.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-24T06:57:34,843 copying 
build/lib/evalscope/perf/utils/report/perf_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-24T06:57:34,846 copying build/lib/evalscope/perf/utils/report/generate_report.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-24T06:57:34,849 copying build/lib/evalscope/perf/utils/analysis_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-24T06:57:34,851 copying build/lib/evalscope/perf/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-24T06:57:34,854 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin 2026-04-24T06:57:34,856 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/datasets 2026-04-24T06:57:34,857 copying build/lib/evalscope/perf/plugin/datasets/custom.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,860 copying build/lib/evalscope/perf/plugin/datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,862 copying build/lib/evalscope/perf/plugin/datasets/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,865 copying build/lib/evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,868 copying build/lib/evalscope/perf/plugin/datasets/speed_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,870 copying build/lib/evalscope/perf/plugin/datasets/kontext_bench.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,872 copying build/lib/evalscope/perf/plugin/datasets/embedding_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,875 copying build/lib/evalscope/perf/plugin/datasets/openqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,877 copying 
build/lib/evalscope/perf/plugin/datasets/multi_turn.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,880 copying build/lib/evalscope/perf/plugin/datasets/share_gpt.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,882 copying build/lib/evalscope/perf/plugin/datasets/longalpaca.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,884 copying build/lib/evalscope/perf/plugin/datasets/random_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,887 copying build/lib/evalscope/perf/plugin/datasets/rerank_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,890 copying build/lib/evalscope/perf/plugin/datasets/flickr8k.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,892 copying build/lib/evalscope/perf/plugin/datasets/line_by_line.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,894 copying build/lib/evalscope/perf/plugin/datasets/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-24T06:57:34,897 copying build/lib/evalscope/perf/plugin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-04-24T06:57:34,899 copying build/lib/evalscope/perf/plugin/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-04-24T06:57:34,902 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/api 2026-04-24T06:57:34,903 copying build/lib/evalscope/perf/plugin/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,906 copying build/lib/evalscope/perf/plugin/api/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,908 copying build/lib/evalscope/perf/plugin/api/dashscope_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 
2026-04-24T06:57:34,910 copying build/lib/evalscope/perf/plugin/api/openai_rerank_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,913 copying build/lib/evalscope/perf/plugin/api/openai_embedding_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,916 copying build/lib/evalscope/perf/plugin/api/custom_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,919 copying build/lib/evalscope/perf/plugin/api/default_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,922 copying build/lib/evalscope/perf/plugin/api/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-24T06:57:34,925 copying build/lib/evalscope/perf/http_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-24T06:57:34,928 creating build/bdist.linux-armv7l/wheel/evalscope/perf/sla 2026-04-24T06:57:34,929 copying build/lib/evalscope/perf/sla/sla_run.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-04-24T06:57:34,932 copying build/lib/evalscope/perf/sla/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-04-24T06:57:34,934 copying build/lib/evalscope/perf/sla/sla_criterion.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-04-24T06:57:34,937 copying build/lib/evalscope/perf/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-24T06:57:34,939 copying build/lib/evalscope/perf/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-24T06:57:34,942 copying build/lib/evalscope/perf/multi_turn_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-24T06:57:34,945 copying build/lib/evalscope/perf/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-24T06:57:34,948 copying build/lib/evalscope/run.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-24T06:57:34,951 creating 
build/bdist.linux-armv7l/wheel/evalscope/utils 2026-04-24T06:57:34,953 copying build/lib/evalscope/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,955 copying build/lib/evalscope/utils/argument_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,958 copying build/lib/evalscope/utils/json_schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,960 copying build/lib/evalscope/utils/io_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,963 copying build/lib/evalscope/utils/function_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,966 copying build/lib/evalscope/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,969 copying build/lib/evalscope/utils/url_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,971 copying build/lib/evalscope/utils/chat_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,974 copying build/lib/evalscope/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,977 creating build/bdist.linux-armv7l/wheel/evalscope/utils/doc_utils 2026-04-24T06:57:34,979 copying build/lib/evalscope/utils/doc_utils/benchmark_stats.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-24T06:57:34,982 copying build/lib/evalscope/utils/doc_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-24T06:57:34,984 copying build/lib/evalscope/utils/doc_utils/readme_generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-24T06:57:34,987 copying build/lib/evalscope/utils/doc_utils/generate_dataset_md.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-24T06:57:34,990 copying build/lib/evalscope/utils/doc_utils/translate_description.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-24T06:57:34,993 copying build/lib/evalscope/utils/code_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,996 copying build/lib/evalscope/utils/multi_choices.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:34,999 copying build/lib/evalscope/utils/ner.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:35,002 copying build/lib/evalscope/utils/deprecation_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:35,005 copying build/lib/evalscope/utils/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:35,008 creating build/bdist.linux-armv7l/wheel/evalscope/utils/tqdm_utils 2026-04-24T06:57:35,009 copying build/lib/evalscope/utils/tqdm_utils/tqdm_logging.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-04-24T06:57:35,011 copying build/lib/evalscope/utils/tqdm_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-04-24T06:57:35,013 copying build/lib/evalscope/utils/tqdm_utils/progress_tracker.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-04-24T06:57:35,016 copying build/lib/evalscope/utils/import_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-24T06:57:35,018 copying build/lib/evalscope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-24T06:57:35,021 creating build/bdist.linux-armv7l/wheel/evalscope/backend 2026-04-24T06:57:35,022 copying build/lib/evalscope/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-04-24T06:57:35,024 copying build/lib/evalscope/backend/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-04-24T06:57:35,026 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass 2026-04-24T06:57:35,027 copying build/lib/evalscope/backend/opencompass/api_meta_template.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-04-24T06:57:35,029 copying build/lib/evalscope/backend/opencompass/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-04-24T06:57:35,031 copying build/lib/evalscope/backend/opencompass/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-04-24T06:57:35,034 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass/tasks 2026-04-24T06:57:35,035 copying build/lib/evalscope/backend/opencompass/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-04-24T06:57:35,038 copying build/lib/evalscope/backend/opencompass/tasks/eval_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-04-24T06:57:35,039 copying build/lib/evalscope/backend/opencompass/tasks/eval_datasets.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-04-24T06:57:35,043 creating build/bdist.linux-armv7l/wheel/evalscope/backend/vlm_eval_kit 2026-04-24T06:57:35,044 copying build/lib/evalscope/backend/vlm_eval_kit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-04-24T06:57:35,046 copying build/lib/evalscope/backend/vlm_eval_kit/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-04-24T06:57:35,049 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval 2026-04-24T06:57:35,051 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:35,052 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-24T06:57:35,054 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-24T06:57:35,056 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-24T06:57:35,058 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:35,060 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:35,063 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:35,064 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:35,066 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:35,068 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:35,070 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-24T06:57:35,073 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:35,075 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-24T06:57:35,077 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/utils 2026-04-24T06:57:35,079 copying build/lib/evalscope/backend/rag_eval/utils/clip.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-24T06:57:35,081 copying 
build/lib/evalscope/backend/rag_eval/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-24T06:57:35,082 copying build/lib/evalscope/backend/rag_eval/utils/embedding.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-24T06:57:35,085 copying build/lib/evalscope/backend/rag_eval/utils/tools.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-24T06:57:35,087 copying build/lib/evalscope/backend/rag_eval/utils/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-24T06:57:35,089 copying build/lib/evalscope/backend/rag_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-04-24T06:57:35,092 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:35,093 copying build/lib/evalscope/backend/rag_eval/cmteb/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:35,095 copying build/lib/evalscope/backend/rag_eval/cmteb/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:35,098 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,099 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,101 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,103 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,105 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,107 copying 
build/lib/evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,110 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,113 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,115 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-24T06:57:35,117 copying build/lib/evalscope/backend/rag_eval/cmteb/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:35,120 copying build/lib/evalscope/backend/rag_eval/cmteb/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-24T06:57:35,122 copying build/lib/evalscope/backend/rag_eval/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-04-24T06:57:35,124 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas 2026-04-24T06:57:35,125 copying build/lib/evalscope/backend/rag_eval/ragas/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-04-24T06:57:35,128 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/prompts 2026-04-24T06:57:35,129 copying build/lib/evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/prompts 2026-04-24T06:57:35,132 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:35,133 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:35,135 copying 
build/lib/evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:35,137 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:35,140 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:35,141 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-24T06:57:35,143 copying build/lib/evalscope/backend/rag_eval/ragas/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-04-24T06:57:35,146 copying build/lib/evalscope/backend/rag_eval/ragas/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-04-24T06:57:35,147 copying build/lib/evalscope/config.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-24T06:57:35,151 creating build/bdist.linux-armv7l/wheel/evalscope/summarizer 2026-04-24T06:57:35,152 copying build/lib/evalscope/summarizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-04-24T06:57:35,154 copying build/lib/evalscope/summarizer/summarizer.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-04-24T06:57:35,156 copying build/lib/evalscope/version.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-24T06:57:35,159 creating build/bdist.linux-armv7l/wheel/evalscope/cli 2026-04-24T06:57:35,160 copying build/lib/evalscope/cli/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,162 copying build/lib/evalscope/cli/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,164 copying build/lib/evalscope/cli/start_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 
2026-04-24T06:57:35,166 copying build/lib/evalscope/cli/benchmark_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,169 copying build/lib/evalscope/cli/start_app.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,170 copying build/lib/evalscope/cli/start_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,172 copying build/lib/evalscope/cli/cli.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,174 copying build/lib/evalscope/cli/start_perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-24T06:57:35,177 creating build/bdist.linux-armv7l/wheel/evalscope/collections 2026-04-24T06:57:35,178 copying build/lib/evalscope/collections/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-04-24T06:57:35,180 copying build/lib/evalscope/collections/schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-04-24T06:57:35,183 copying build/lib/evalscope/collections/sampler.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-04-24T06:57:35,186 creating build/bdist.linux-armv7l/wheel/evalscope/api 2026-04-24T06:57:35,187 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark 2026-04-24T06:57:35,189 copying build/lib/evalscope/api/benchmark/meta.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-24T06:57:35,191 copying build/lib/evalscope/api/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-24T06:57:35,193 copying build/lib/evalscope/api/benchmark/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-24T06:57:35,196 copying build/lib/evalscope/api/benchmark/statistics.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-24T06:57:35,200 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark/adapters 2026-04-24T06:57:35,201 copying build/lib/evalscope/api/benchmark/adapters/ner_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,203 copying build/lib/evalscope/api/benchmark/adapters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,206 copying build/lib/evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,208 copying build/lib/evalscope/api/benchmark/adapters/default_data_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,211 copying build/lib/evalscope/api/benchmark/adapters/agent_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,212 copying build/lib/evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,214 copying build/lib/evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,216 copying build/lib/evalscope/api/benchmark/adapters/text2image_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-24T06:57:35,219 creating build/bdist.linux-armv7l/wheel/evalscope/api/tool 2026-04-24T06:57:35,220 copying build/lib/evalscope/api/tool/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-24T06:57:35,222 copying build/lib/evalscope/api/tool/tool_call.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-24T06:57:35,224 copying build/lib/evalscope/api/tool/tool_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-24T06:57:35,226 copying build/lib/evalscope/api/tool/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-24T06:57:35,228 copying build/lib/evalscope/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-04-24T06:57:35,230 copying 
build/lib/evalscope/api/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-04-24T06:57:35,232 creating build/bdist.linux-armv7l/wheel/evalscope/api/messages 2026-04-24T06:57:35,234 copying build/lib/evalscope/api/messages/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-24T06:57:35,235 copying build/lib/evalscope/api/messages/content.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-24T06:57:35,237 copying build/lib/evalscope/api/messages/chat_message.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-24T06:57:35,239 copying build/lib/evalscope/api/messages/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-24T06:57:35,242 creating build/bdist.linux-armv7l/wheel/evalscope/api/metric 2026-04-24T06:57:35,243 copying build/lib/evalscope/api/metric/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-04-24T06:57:35,245 copying build/lib/evalscope/api/metric/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-04-24T06:57:35,247 copying build/lib/evalscope/api/metric/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-04-24T06:57:35,249 creating build/bdist.linux-armv7l/wheel/evalscope/api/model 2026-04-24T06:57:35,250 copying build/lib/evalscope/api/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-24T06:57:35,252 copying build/lib/evalscope/api/model/model_output.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-24T06:57:35,255 copying build/lib/evalscope/api/model/generate_config.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-24T06:57:35,257 copying build/lib/evalscope/api/model/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-24T06:57:35,259 copying build/lib/evalscope/api/model/perf_metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-24T06:57:35,261 copying 
build/lib/evalscope/api/model/lazy_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-24T06:57:35,264 creating build/bdist.linux-armv7l/wheel/evalscope/api/filter 2026-04-24T06:57:35,265 copying build/lib/evalscope/api/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-04-24T06:57:35,267 copying build/lib/evalscope/api/filter/filter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-04-24T06:57:35,269 creating build/bdist.linux-armv7l/wheel/evalscope/api/mixin 2026-04-24T06:57:35,270 copying build/lib/evalscope/api/mixin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-04-24T06:57:35,272 copying build/lib/evalscope/api/mixin/sandbox_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-04-24T06:57:35,274 copying build/lib/evalscope/api/mixin/llm_judge_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-04-24T06:57:35,277 creating build/bdist.linux-armv7l/wheel/evalscope/api/evaluator 2026-04-24T06:57:35,278 copying build/lib/evalscope/api/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-24T06:57:35,280 copying build/lib/evalscope/api/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-24T06:57:35,282 copying build/lib/evalscope/api/evaluator/state.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-24T06:57:35,285 copying build/lib/evalscope/api/evaluator/cache.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-24T06:57:35,288 creating build/bdist.linux-armv7l/wheel/evalscope/api/dataset 2026-04-24T06:57:35,289 copying build/lib/evalscope/api/dataset/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-24T06:57:35,291 copying build/lib/evalscope/api/dataset/dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-24T06:57:35,293 copying build/lib/evalscope/api/dataset/loader.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-24T06:57:35,296 copying build/lib/evalscope/api/dataset/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-24T06:57:35,298 creating build/bdist.linux-armv7l/wheel/evalscope/filters 2026-04-24T06:57:35,299 copying build/lib/evalscope/filters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-04-24T06:57:35,301 copying build/lib/evalscope/filters/selection.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-04-24T06:57:35,303 copying build/lib/evalscope/filters/extraction.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-04-24T06:57:35,306 creating build/bdist.linux-armv7l/wheel/evalscope/metrics 2026-04-24T06:57:35,308 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bundled_rouge_score 2026-04-24T06:57:35,309 copying build/lib/evalscope/metrics/bundled_rouge_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-04-24T06:57:35,311 copying build/lib/evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-04-24T06:57:35,313 copying build/lib/evalscope/metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-24T06:57:35,315 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bert_score 2026-04-24T06:57:35,316 copying build/lib/evalscope/metrics/bert_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-04-24T06:57:35,318 copying build/lib/evalscope/metrics/bert_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-04-24T06:57:35,321 copying build/lib/evalscope/metrics/bert_score/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-04-24T06:57:35,323 copying build/lib/evalscope/metrics/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-24T06:57:35,326 creating 
build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,327 copying build/lib/evalscope/metrics/t2v_metrics/score.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,329 copying build/lib/evalscope/metrics/t2v_metrics/itmscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,331 copying build/lib/evalscope/metrics/t2v_metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,332 copying build/lib/evalscope/metrics/t2v_metrics/vqascore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,334 copying build/lib/evalscope/metrics/t2v_metrics/clipscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,335 copying build/lib/evalscope/metrics/t2v_metrics/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-24T06:57:35,338 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:35,339 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:35,340 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:35,343 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:35,344 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:35,346 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:35,347 copying 
build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-24T06:57:35,350 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:35,352 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:35,354 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-24T06:57:35,356 copying build/lib/evalscope/metrics/t2v_metrics/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:35,358 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:35,358 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:35,360 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:35,362 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:35,365 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:35,366 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:35,368 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:35,370 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:35,371 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-24T06:57:35,373 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:35,375 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-24T06:57:35,377 copying build/lib/evalscope/metrics/t2v_metrics/models/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:35,379 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:35,380 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:35,383 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-24T06:57:35,384 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-24T06:57:35,386 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-24T06:57:35,387 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-24T06:57:35,389 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-24T06:57:35,390 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-24T06:57:35,393 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-24T06:57:35,394 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-24T06:57:35,396 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-24T06:57:35,398 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-24T06:57:35,399 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-24T06:57:35,402 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:35,405 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:35,407 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:35,410 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-24T06:57:35,412 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:35,413 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:35,415 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:35,417 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:35,419 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-24T06:57:35,421 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-24T06:57:35,424 creating 
build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-24T06:57:35,425 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-24T06:57:35,428 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:35,429 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:35,431 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,432 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,435 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,437 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,438 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,440 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,442 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,444 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,446 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,448 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,450 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,452 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,454 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,456 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,458 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,459 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,461 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,463 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,465 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,467 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-24T06:57:35,469 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:35,471 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-24T06:57:35,474 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,475 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,477 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,479 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,481 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,484 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,486 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,489 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:35,490 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:35,492 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:35,495 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-24T06:57:35,497 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-24T06:57:35,500 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,502 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,503 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,506 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,508 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,510 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,513 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,516 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,518 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,521 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,524 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-24T06:57:35,528 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 
2026-04-24T06:57:35,530 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,532 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,536 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,537 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,539 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,541 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,544 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,546 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,549 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,552 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,554 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,557 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,559 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,562 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-24T06:57:35,564 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,567 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,569 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,572 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-24T06:57:35,575 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-24T06:57:35,576 copying build/lib/evalscope/metrics/t2v_metrics/models/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-04-24T06:57:35,578 copying build/lib/evalscope/metrics/math_parser.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-24T06:57:35,581 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/text_normalizer 2026-04-24T06:57:35,583 copying build/lib/evalscope/metrics/text_normalizer/chinese.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-24T06:57:35,586 copying build/lib/evalscope/metrics/text_normalizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-24T06:57:35,588 copying build/lib/evalscope/metrics/text_normalizer/english.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-24T06:57:35,592 copying build/lib/evalscope/metrics/text_normalizer/wer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-24T06:57:35,594 copying build/lib/evalscope/metrics/text_normalizer/basic.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-24T06:57:35,596 copying build/lib/evalscope/metrics/text_normalizer/english.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-24T06:57:35,599 copying build/lib/evalscope/metrics/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-24T06:57:35,602 
creating build/bdist.linux-armv7l/wheel/evalscope/metrics/sem_score 2026-04-24T06:57:35,604 copying build/lib/evalscope/metrics/sem_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-04-24T06:57:35,605 copying build/lib/evalscope/metrics/sem_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-04-24T06:57:35,608 copying build/lib/evalscope/metrics/rouge_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-24T06:57:35,610 copying build/lib/evalscope/metrics/llm_judge.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-24T06:57:35,615 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks 2026-04-24T06:57:35,616 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench 2026-04-24T06:57:35,618 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-24T06:57:35,619 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-24T06:57:35,621 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-24T06:57:35,623 copying build/lib/evalscope/benchmarks/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-04-24T06:57:35,625 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,626 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,627 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,629 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,631 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,633 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,635 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,637 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,640 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,643 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:35,644 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:35,646 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:35,649 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:35,651 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-24T06:57:35,654 copying 
build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-24T06:57:35,656 copying build/lib/evalscope/benchmarks/ocr_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-04-24T06:57:35,659 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vmcq 2026-04-24T06:57:35,660 copying build/lib/evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-04-24T06:57:35,663 copying build/lib/evalscope/benchmarks/general_vmcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-04-24T06:57:35,665 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmlu 2026-04-24T06:57:35,666 copying build/lib/evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-04-24T06:57:35,669 copying build/lib/evalscope/benchmarks/mmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-04-24T06:57:35,670 copying build/lib/evalscope/benchmarks/mmmlu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-04-24T06:57:35,673 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu_pro 2026-04-24T06:57:35,674 copying build/lib/evalscope/benchmarks/mmmu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-04-24T06:57:35,676 copying build/lib/evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-04-24T06:57:35,678 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chinese_simple_qa 2026-04-24T06:57:35,679 copying build/lib/evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-04-24T06:57:35,681 copying 
build/lib/evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-04-24T06:57:35,684 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_qa 2026-04-24T06:57:35,685 copying build/lib/evalscope/benchmarks/simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-04-24T06:57:35,687 copying build/lib/evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-04-24T06:57:35,690 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/text2image 2026-04-24T06:57:35,691 copying build/lib/evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-24T06:57:35,693 copying build/lib/evalscope/benchmarks/text2image/tifa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-24T06:57:35,695 copying build/lib/evalscope/benchmarks/text2image/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-24T06:57:35,696 copying build/lib/evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-24T06:57:35,699 copying build/lib/evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-24T06:57:35,701 copying build/lib/evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-24T06:57:35,703 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minerva_math 2026-04-24T06:57:35,704 copying build/lib/evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-04-24T06:57:35,706 copying build/lib/evalscope/benchmarks/minerva_math/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-04-24T06:57:35,709 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chartqa 2026-04-24T06:57:35,710 copying build/lib/evalscope/benchmarks/chartqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-04-24T06:57:35,711 copying build/lib/evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-04-24T06:57:35,713 copying build/lib/evalscope/benchmarks/chartqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-04-24T06:57:35,716 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/qasc 2026-04-24T06:57:35,717 copying build/lib/evalscope/benchmarks/qasc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-04-24T06:57:35,719 copying build/lib/evalscope/benchmarks/qasc/qasc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-04-24T06:57:35,722 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arena_hard 2026-04-24T06:57:35,723 copying build/lib/evalscope/benchmarks/arena_hard/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-24T06:57:35,725 copying build/lib/evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-24T06:57:35,727 copying build/lib/evalscope/benchmarks/arena_hard/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-24T06:57:35,729 copying build/lib/evalscope/benchmarks/arena_hard/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-24T06:57:35,732 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/logi_qa 2026-04-24T06:57:35,733 copying build/lib/evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-04-24T06:57:35,735 copying 
build/lib/evalscope/benchmarks/logi_qa/__int__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-04-24T06:57:35,737 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/truthful_qa 2026-04-24T06:57:35,738 copying build/lib/evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-04-24T06:57:35,740 copying build/lib/evalscope/benchmarks/truthful_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-04-24T06:57:35,743 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docvqa 2026-04-24T06:57:35,744 copying build/lib/evalscope/benchmarks/docvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-04-24T06:57:35,746 copying build/lib/evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-04-24T06:57:35,748 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multi_if 2026-04-24T06:57:35,750 copying build/lib/evalscope/benchmarks/multi_if/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-24T06:57:35,752 copying build/lib/evalscope/benchmarks/multi_if/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-24T06:57:35,754 copying build/lib/evalscope/benchmarks/multi_if/ifeval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-24T06:57:35,758 copying build/lib/evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-24T06:57:35,760 copying build/lib/evalscope/benchmarks/multi_if/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-24T06:57:35,763 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_vqa 2026-04-24T06:57:35,764 copying build/lib/evalscope/benchmarks/simple_vqa/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-04-24T06:57:35,765 copying build/lib/evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-04-24T06:57:35,768 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humaneval 2026-04-24T06:57:35,769 copying build/lib/evalscope/benchmarks/humaneval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-04-24T06:57:35,772 copying build/lib/evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-04-24T06:57:35,774 copying build/lib/evalscope/benchmarks/humaneval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-04-24T06:57:35,777 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmlu 2026-04-24T06:57:35,778 copying build/lib/evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-04-24T06:57:35,780 copying build/lib/evalscope/benchmarks/cmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-04-24T06:57:35,783 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/sciq 2026-04-24T06:57:35,784 copying build/lib/evalscope/benchmarks/sciq/sciq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-04-24T06:57:35,786 copying build/lib/evalscope/benchmarks/sciq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-04-24T06:57:35,788 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_fc 2026-04-24T06:57:35,789 copying build/lib/evalscope/benchmarks/general_fc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-04-24T06:57:35,791 copying build/lib/evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-04-24T06:57:35,794 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,795 copying build/lib/evalscope/benchmarks/omnidoc_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,797 copying build/lib/evalscope/benchmarks/omnidoc_bench/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,800 copying build/lib/evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,802 copying build/lib/evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,805 copying build/lib/evalscope/benchmarks/omnidoc_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,807 copying build/lib/evalscope/benchmarks/omnidoc_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-24T06:57:35,812 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/a_okvqa 2026-04-24T06:57:35,813 copying build/lib/evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-04-24T06:57:35,815 copying build/lib/evalscope/benchmarks/a_okvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-04-24T06:57:35,817 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/eq_bench 2026-04-24T06:57:35,819 copying build/lib/evalscope/benchmarks/eq_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-04-24T06:57:35,821 copying build/lib/evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-04-24T06:57:35,823 copying build/lib/evalscope/benchmarks/eq_bench/answer_validation.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-04-24T06:57:35,827 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mia_bench 2026-04-24T06:57:35,828 copying build/lib/evalscope/benchmarks/mia_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-04-24T06:57:35,829 copying build/lib/evalscope/benchmarks/mia_bench/mia_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-04-24T06:57:35,832 copying build/lib/evalscope/benchmarks/mia_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-04-24T06:57:35,834 copying build/lib/evalscope/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks 2026-04-24T06:57:35,837 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k_v 2026-04-24T06:57:35,838 copying build/lib/evalscope/benchmarks/gsm8k_v/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-04-24T06:57:35,840 copying build/lib/evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-04-24T06:57:35,843 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hellaswag 2026-04-24T06:57:35,844 copying build/lib/evalscope/benchmarks/hellaswag/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-04-24T06:57:35,846 copying build/lib/evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-04-24T06:57:35,848 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/siqa 2026-04-24T06:57:35,849 copying build/lib/evalscope/benchmarks/siqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-04-24T06:57:35,851 copying build/lib/evalscope/benchmarks/siqa/siqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-04-24T06:57:35,854 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/biomix_qa 2026-04-24T06:57:35,855 copying build/lib/evalscope/benchmarks/biomix_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-04-24T06:57:35,857 copying build/lib/evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-04-24T06:57:35,859 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/piqa 2026-04-24T06:57:35,860 copying build/lib/evalscope/benchmarks/piqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-04-24T06:57:35,862 copying build/lib/evalscope/benchmarks/piqa/piqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-04-24T06:57:35,865 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifeval 2026-04-24T06:57:35,866 copying build/lib/evalscope/benchmarks/ifeval/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,869 copying build/lib/evalscope/benchmarks/ifeval/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,872 copying build/lib/evalscope/benchmarks/ifeval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,873 copying build/lib/evalscope/benchmarks/ifeval/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,876 copying build/lib/evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,878 copying build/lib/evalscope/benchmarks/ifeval/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,880 copying build/lib/evalscope/benchmarks/ifeval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-24T06:57:35,883 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu 
2026-04-24T06:57:35,884 copying build/lib/evalscope/benchmarks/mmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-04-24T06:57:35,888 copying build/lib/evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-04-24T06:57:35,892 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench 2026-04-24T06:57:35,893 copying build/lib/evalscope/benchmarks/swe_bench/build_images.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-24T06:57:35,896 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-24T06:57:35,898 copying build/lib/evalscope/benchmarks/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-24T06:57:35,900 copying build/lib/evalscope/benchmarks/swe_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-24T06:57:35,902 copying build/lib/evalscope/benchmarks/swe_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-24T06:57:35,905 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_qa 2026-04-24T06:57:35,906 copying build/lib/evalscope/benchmarks/math_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-04-24T06:57:35,908 copying build/lib/evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-04-24T06:57:35,910 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vision 2026-04-24T06:57:35,911 copying build/lib/evalscope/benchmarks/math_vision/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-04-24T06:57:35,913 copying build/lib/evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-04-24T06:57:35,916 
creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docmath 2026-04-24T06:57:35,917 copying build/lib/evalscope/benchmarks/docmath/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-04-24T06:57:35,919 copying build/lib/evalscope/benchmarks/docmath/docmath_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-04-24T06:57:35,921 copying build/lib/evalscope/benchmarks/docmath/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-04-24T06:57:35,924 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omni_bench 2026-04-24T06:57:35,925 copying build/lib/evalscope/benchmarks/omni_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-04-24T06:57:35,927 copying build/lib/evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-04-24T06:57:35,930 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/fleurs 2026-04-24T06:57:35,931 copying build/lib/evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-04-24T06:57:35,933 copying build/lib/evalscope/benchmarks/fleurs/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-04-24T06:57:35,935 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_star 2026-04-24T06:57:35,936 copying build/lib/evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-04-24T06:57:35,938 copying build/lib/evalscope/benchmarks/mm_star/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-04-24T06:57:35,940 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pope 2026-04-24T06:57:35,941 copying build/lib/evalscope/benchmarks/pope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-04-24T06:57:35,943 copying 
build/lib/evalscope/benchmarks/pope/pope_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-04-24T06:57:35,945 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode 2026-04-24T06:57:35,946 copying build/lib/evalscope/benchmarks/scicode/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-24T06:57:35,948 copying build/lib/evalscope/benchmarks/scicode/util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-24T06:57:35,950 copying build/lib/evalscope/benchmarks/scicode/scicode_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-24T06:57:35,953 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode/docker 2026-04-24T06:57:35,954 copying build/lib/evalscope/benchmarks/scicode/docker/process_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-24T06:57:35,956 copying build/lib/evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-24T06:57:35,958 copying build/lib/evalscope/benchmarks/scicode/docker/test_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-24T06:57:35,960 copying build/lib/evalscope/benchmarks/scicode/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-24T06:57:35,961 copying build/lib/evalscope/benchmarks/scicode/prompt_templates.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-24T06:57:35,964 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_bench 2026-04-24T06:57:35,964 copying build/lib/evalscope/benchmarks/mm_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-04-24T06:57:35,966 copying build/lib/evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-04-24T06:57:35,968 
creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drop 2026-04-24T06:57:35,969 copying build/lib/evalscope/benchmarks/drop/drop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-04-24T06:57:35,972 copying build/lib/evalscope/benchmarks/drop/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-04-24T06:57:35,973 copying build/lib/evalscope/benchmarks/drop/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-04-24T06:57:35,976 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_arena 2026-04-24T06:57:35,977 copying build/lib/evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-24T06:57:35,979 copying build/lib/evalscope/benchmarks/general_arena/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-24T06:57:35,981 copying build/lib/evalscope/benchmarks/general_arena/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-24T06:57:35,983 copying build/lib/evalscope/benchmarks/general_arena/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-24T06:57:35,986 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tir_bench 2026-04-24T06:57:35,987 copying build/lib/evalscope/benchmarks/tir_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tir_bench 2026-04-24T06:57:35,990 copying build/lib/evalscope/benchmarks/tir_bench/tir_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tir_bench 2026-04-24T06:57:35,993 copying build/lib/evalscope/benchmarks/tir_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tir_bench 2026-04-24T06:57:35,997 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench 2026-04-24T06:57:35,999 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:36,001 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:36,004 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:36,006 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:36,009 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-24T06:57:36,012 copying build/lib/evalscope/benchmarks/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench 2026-04-24T06:57:36,015 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:36,016 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:36,019 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:36,021 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:36,025 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-24T06:57:36,028 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/visu_logic 2026-04-24T06:57:36,029 copying build/lib/evalscope/benchmarks/visu_logic/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 
2026-04-24T06:57:36,031 copying build/lib/evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-04-24T06:57:36,034 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arc 2026-04-24T06:57:36,036 copying build/lib/evalscope/benchmarks/arc/arc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-04-24T06:57:36,039 copying build/lib/evalscope/benchmarks/arc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-04-24T06:57:36,042 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/maritime_bench 2026-04-24T06:57:36,044 copying build/lib/evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-04-24T06:57:36,046 copying build/lib/evalscope/benchmarks/maritime_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-04-24T06:57:36,049 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/musr 2026-04-24T06:57:36,051 copying build/lib/evalscope/benchmarks/musr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-04-24T06:57:36,053 copying build/lib/evalscope/benchmarks/musr/musr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-04-24T06:57:36,056 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drivelology 2026-04-24T06:57:36,058 copying build/lib/evalscope/benchmarks/drivelology/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-24T06:57:36,060 copying build/lib/evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-24T06:57:36,063 copying build/lib/evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-24T06:57:36,066 copying 
build/lib/evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-24T06:57:36,068 copying build/lib/evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-24T06:57:36,072 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/blink 2026-04-24T06:57:36,074 copying build/lib/evalscope/benchmarks/blink/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-04-24T06:57:36,076 copying build/lib/evalscope/benchmarks/blink/blink_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-04-24T06:57:36,079 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/commonsense_qa 2026-04-24T06:57:36,081 copying build/lib/evalscope/benchmarks/commonsense_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-04-24T06:57:36,083 copying build/lib/evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-04-24T06:57:36,086 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/process_bench 2026-04-24T06:57:36,087 copying build/lib/evalscope/benchmarks/process_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-04-24T06:57:36,089 copying build/lib/evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-04-24T06:57:36,091 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mgsm 2026-04-24T06:57:36,092 copying build/lib/evalscope/benchmarks/mgsm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-04-24T06:57:36,095 copying build/lib/evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-04-24T06:57:36,097 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pumed_qa 2026-04-24T06:57:36,098 copying build/lib/evalscope/benchmarks/pumed_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-04-24T06:57:36,100 copying build/lib/evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-04-24T06:57:36,103 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner 2026-04-24T06:57:36,104 copying build/lib/evalscope/benchmarks/ner/conll2003_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,106 copying build/lib/evalscope/benchmarks/ner/genia_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,108 copying build/lib/evalscope/benchmarks/ner/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,110 copying build/lib/evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,112 copying build/lib/evalscope/benchmarks/ner/wnut2017_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,114 copying build/lib/evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,116 copying build/lib/evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,118 copying build/lib/evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,120 copying build/lib/evalscope/benchmarks/ner/copious_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,122 copying build/lib/evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,124 copying 
build/lib/evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,126 copying build/lib/evalscope/benchmarks/ner/cross_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,129 copying build/lib/evalscope/benchmarks/ner/ncbi_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,131 copying build/lib/evalscope/benchmarks/ner/fin_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,133 copying build/lib/evalscope/benchmarks/ner/anat_em_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,135 copying build/lib/evalscope/benchmarks/ner/conllpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,137 copying build/lib/evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,140 copying build/lib/evalscope/benchmarks/ner/bc2gm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,142 copying build/lib/evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,145 copying build/lib/evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,147 copying build/lib/evalscope/benchmarks/ner/jnlpba_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,149 copying build/lib/evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,151 copying build/lib/evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-24T06:57:36,153 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,154 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,156 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,158 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,160 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,162 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,164 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-24T06:57:36,166 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tool_bench 2026-04-24T06:57:36,167 copying build/lib/evalscope/benchmarks/tool_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-04-24T06:57:36,169 copying build/lib/evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-04-24T06:57:36,171 copying build/lib/evalscope/benchmarks/tool_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-04-24T06:57:36,174 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/amc 2026-04-24T06:57:36,175 copying build/lib/evalscope/benchmarks/amc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-04-24T06:57:36,176 copying 
build/lib/evalscope/benchmarks/amc/amc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-04-24T06:57:36,179 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/science_qa 2026-04-24T06:57:36,180 copying build/lib/evalscope/benchmarks/science_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-04-24T06:57:36,182 copying build/lib/evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-04-24T06:57:36,184 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/competition_math 2026-04-24T06:57:36,185 copying build/lib/evalscope/benchmarks/competition_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-04-24T06:57:36,188 copying build/lib/evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-04-24T06:57:36,190 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbpp 2026-04-24T06:57:36,191 copying build/lib/evalscope/benchmarks/mbpp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-04-24T06:57:36,193 copying build/lib/evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-04-24T06:57:36,196 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/infovqa 2026-04-24T06:57:36,197 copying build/lib/evalscope/benchmarks/infovqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-04-24T06:57:36,199 copying build/lib/evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-04-24T06:57:36,201 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aime 2026-04-24T06:57:36,202 copying build/lib/evalscope/benchmarks/aime/math_normalize.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 
2026-04-24T06:57:36,205 copying build/lib/evalscope/benchmarks/aime/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-24T06:57:36,206 copying build/lib/evalscope/benchmarks/aime/grader.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-24T06:57:36,209 copying build/lib/evalscope/benchmarks/aime/aime_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-24T06:57:36,212 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/super_gpqa 2026-04-24T06:57:36,213 copying build/lib/evalscope/benchmarks/super_gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-24T06:57:36,214 copying build/lib/evalscope/benchmarks/super_gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-24T06:57:36,216 copying build/lib/evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-24T06:57:36,218 copying build/lib/evalscope/benchmarks/super_gpqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-24T06:57:36,221 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/music_trivia 2026-04-24T06:57:36,222 copying build/lib/evalscope/benchmarks/music_trivia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-04-24T06:57:36,224 copying build/lib/evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-04-24T06:57:36,226 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu 2026-04-24T06:57:36,227 copying build/lib/evalscope/benchmarks/mmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-04-24T06:57:36,229 copying build/lib/evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-04-24T06:57:36,232 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/longbench_v2 2026-04-24T06:57:36,233 copying build/lib/evalscope/benchmarks/longbench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-04-24T06:57:36,235 copying build/lib/evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-04-24T06:57:36,237 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zerobench 2026-04-24T06:57:36,238 copying build/lib/evalscope/benchmarks/zerobench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-04-24T06:57:36,240 copying build/lib/evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-04-24T06:57:36,243 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hallusion_bench 2026-04-24T06:57:36,244 copying build/lib/evalscope/benchmarks/hallusion_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-04-24T06:57:36,246 copying build/lib/evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-04-24T06:57:36,249 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_verse 2026-04-24T06:57:36,250 copying build/lib/evalscope/benchmarks/math_verse/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-04-24T06:57:36,252 copying build/lib/evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-04-24T06:57:36,254 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/librispeech 2026-04-24T06:57:36,256 copying build/lib/evalscope/benchmarks/librispeech/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-04-24T06:57:36,257 copying 
build/lib/evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-04-24T06:57:36,260 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vista 2026-04-24T06:57:36,261 copying build/lib/evalscope/benchmarks/math_vista/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-04-24T06:57:36,263 copying build/lib/evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-04-24T06:57:36,266 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_mcq 2026-04-24T06:57:36,267 copying build/lib/evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-04-24T06:57:36,269 copying build/lib/evalscope/benchmarks/general_mcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-04-24T06:57:36,271 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/med_mcqa 2026-04-24T06:57:36,272 copying build/lib/evalscope/benchmarks/med_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-04-24T06:57:36,274 copying build/lib/evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-04-24T06:57:36,276 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hle 2026-04-24T06:57:36,277 copying build/lib/evalscope/benchmarks/hle/hle_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-04-24T06:57:36,280 copying build/lib/evalscope/benchmarks/hle/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-04-24T06:57:36,283 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmmu 2026-04-24T06:57:36,284 copying build/lib/evalscope/benchmarks/cmmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 
2026-04-24T06:57:36,286 copying build/lib/evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-04-24T06:57:36,288 copying build/lib/evalscope/benchmarks/cmmmu/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-04-24T06:57:36,291 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hmmt25 2026-04-24T06:57:36,292 copying build/lib/evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hmmt25 2026-04-24T06:57:36,296 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/vstar_bench 2026-04-24T06:57:36,297 copying build/lib/evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-04-24T06:57:36,299 copying build/lib/evalscope/benchmarks/vstar_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-04-24T06:57:36,301 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh 2026-04-24T06:57:36,304 copying build/lib/evalscope/benchmarks/bbh/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-04-24T06:57:36,306 copying build/lib/evalscope/benchmarks/bbh/bbh_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-04-24T06:57:36,309 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,311 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,313 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,315 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,317 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,319 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,321 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,323 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,325 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,327 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,329 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,331 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,334 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,336 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,338 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,340 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,342 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,345 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,347 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,349 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,351 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,353 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,355 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,357 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,359 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,361 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,363 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,366 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-24T06:57:36,368 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:36,370 copying build/lib/evalscope/benchmarks/zebralogicbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:36,371 copying build/lib/evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:36,374 copying build/lib/evalscope/benchmarks/zebralogicbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-04-24T06:57:36,377 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/needle_haystack 2026-04-24T06:57:36,378 copying build/lib/evalscope/benchmarks/needle_haystack/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-24T06:57:36,380 copying build/lib/evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-24T06:57:36,383 copying build/lib/evalscope/benchmarks/needle_haystack/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-24T06:57:36,385 copying build/lib/evalscope/benchmarks/needle_haystack/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-24T06:57:36,387 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/coin_flip 2026-04-24T06:57:36,389 copying build/lib/evalscope/benchmarks/coin_flip/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-04-24T06:57:36,391 copying 
build/lib/evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-04-24T06:57:36,394 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k 2026-04-24T06:57:36,395 copying build/lib/evalscope/benchmarks/gsm8k/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-04-24T06:57:36,397 copying build/lib/evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-04-24T06:57:36,400 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cl_bench 2026-04-24T06:57:36,401 copying build/lib/evalscope/benchmarks/cl_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-04-24T06:57:36,402 copying build/lib/evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-04-24T06:57:36,405 copying build/lib/evalscope/benchmarks/cl_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-04-24T06:57:36,408 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math 2026-04-24T06:57:36,409 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math/utils 2026-04-24T06:57:36,411 copying build/lib/evalscope/benchmarks/poly_math/utils/instruction.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math/utils 2026-04-24T06:57:36,413 copying build/lib/evalscope/benchmarks/poly_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-04-24T06:57:36,415 copying build/lib/evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-04-24T06:57:36,418 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit 2026-04-24T06:57:36,419 copying build/lib/evalscope/benchmarks/image_edit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit 
2026-04-24T06:57:36,422 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:36,423 copying build/lib/evalscope/benchmarks/image_edit/gedit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:36,425 copying build/lib/evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:36,427 copying build/lib/evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:36,430 copying build/lib/evalscope/benchmarks/image_edit/gedit/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-24T06:57:36,434 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/real_world_qa 2026-04-24T06:57:36,435 copying build/lib/evalscope/benchmarks/real_world_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-04-24T06:57:36,437 copying build/lib/evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-04-24T06:57:36,439 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbppplus 2026-04-24T06:57:36,440 copying build/lib/evalscope/benchmarks/mbppplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-04-24T06:57:36,442 copying build/lib/evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-04-24T06:57:36,445 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/refcoco 2026-04-24T06:57:36,447 copying build/lib/evalscope/benchmarks/refcoco/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-24T06:57:36,448 copying build/lib/evalscope/benchmarks/refcoco/refcoco_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-24T06:57:36,451 copying build/lib/evalscope/benchmarks/refcoco/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-24T06:57:36,453 copying build/lib/evalscope/benchmarks/refcoco/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-24T06:57:36,455 copying build/lib/evalscope/benchmarks/refcoco/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-24T06:57:36,458 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vqa 2026-04-24T06:57:36,459 copying build/lib/evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-04-24T06:57:36,461 copying build/lib/evalscope/benchmarks/general_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-04-24T06:57:36,464 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multipl_e 2026-04-24T06:57:36,465 copying build/lib/evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-24T06:57:36,467 copying build/lib/evalscope/benchmarks/multipl_e/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-24T06:57:36,469 copying build/lib/evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-24T06:57:36,471 copying build/lib/evalscope/benchmarks/multipl_e/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-24T06:57:36,474 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ai2d 2026-04-24T06:57:36,475 copying build/lib/evalscope/benchmarks/ai2d/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-04-24T06:57:36,477 copying build/lib/evalscope/benchmarks/ai2d/ai2d_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-04-24T06:57:36,480 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:36,481 copying build/lib/evalscope/benchmarks/openai_mrcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:36,483 copying build/lib/evalscope/benchmarks/openai_mrcr/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:36,485 copying build/lib/evalscope/benchmarks/openai_mrcr/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:36,487 copying build/lib/evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-24T06:57:36,490 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_qa 2026-04-24T06:57:36,491 copying build/lib/evalscope/benchmarks/general_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-04-24T06:57:36,493 copying build/lib/evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-04-24T06:57:36,495 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifbench 2026-04-24T06:57:36,496 copying build/lib/evalscope/benchmarks/ifbench/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,501 copying build/lib/evalscope/benchmarks/ifbench/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,504 copying build/lib/evalscope/benchmarks/ifbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,506 copying build/lib/evalscope/benchmarks/ifbench/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,508 copying 
build/lib/evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,510 copying build/lib/evalscope/benchmarks/ifbench/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,513 copying build/lib/evalscope/benchmarks/ifbench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-24T06:57:36,515 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,516 copying build/lib/evalscope/benchmarks/live_code_bench/extract_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,519 copying build/lib/evalscope/benchmarks/live_code_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,520 copying build/lib/evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,522 copying build/lib/evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,524 copying build/lib/evalscope/benchmarks/live_code_bench/load_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,527 copying build/lib/evalscope/benchmarks/live_code_bench/testing_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,529 copying build/lib/evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,532 copying build/lib/evalscope/benchmarks/live_code_bench/prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,534 copying build/lib/evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-24T06:57:36,537 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/trivia_qa 2026-04-24T06:57:36,538 copying build/lib/evalscope/benchmarks/trivia_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-04-24T06:57:36,541 copying build/lib/evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-04-24T06:57:36,542 copying build/lib/evalscope/benchmarks/trivia_qa/samples.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-04-24T06:57:36,545 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/halu_eval 2026-04-24T06:57:36,546 copying build/lib/evalscope/benchmarks/halu_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-04-24T06:57:36,548 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-04-24T06:57:36,550 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-04-24T06:57:36,553 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/winogrande 2026-04-24T06:57:36,554 copying build/lib/evalscope/benchmarks/winogrande/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-04-24T06:57:36,556 copying build/lib/evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-04-24T06:57:36,559 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/micro_vqa 2026-04-24T06:57:36,560 copying build/lib/evalscope/benchmarks/micro_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-04-24T06:57:36,562 copying build/lib/evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-04-24T06:57:36,565 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/terminal_bench 2026-04-24T06:57:36,566 copying build/lib/evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-24T06:57:36,568 copying build/lib/evalscope/benchmarks/terminal_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-24T06:57:36,570 copying build/lib/evalscope/benchmarks/terminal_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-24T06:57:36,572 copying build/lib/evalscope/benchmarks/terminal_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-24T06:57:36,574 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/iquiz 2026-04-24T06:57:36,575 copying build/lib/evalscope/benchmarks/iquiz/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-04-24T06:57:36,577 copying build/lib/evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-04-24T06:57:36,580 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/torgo 2026-04-24T06:57:36,581 copying build/lib/evalscope/benchmarks/torgo/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-04-24T06:57:36,583 copying build/lib/evalscope/benchmarks/torgo/torgo_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-04-24T06:57:36,585 copying build/lib/evalscope/benchmarks/torgo/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-04-24T06:57:36,587 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_redux 2026-04-24T06:57:36,589 copying build/lib/evalscope/benchmarks/mmlu_redux/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 
2026-04-24T06:57:36,590 copying build/lib/evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-04-24T06:57:36,594 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_500 2026-04-24T06:57:36,595 copying build/lib/evalscope/benchmarks/math_500/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-04-24T06:57:36,597 copying build/lib/evalscope/benchmarks/math_500/math_500_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-04-24T06:57:36,599 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus 2026-04-24T06:57:36,600 copying build/lib/evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-04-24T06:57:36,603 copying build/lib/evalscope/benchmarks/humanevalplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-04-24T06:57:36,605 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus/docker 2026-04-24T06:57:36,606 copying build/lib/evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus/docker 2026-04-24T06:57:36,608 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aa_lcr 2026-04-24T06:57:36,609 copying build/lib/evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-04-24T06:57:36,612 copying build/lib/evalscope/benchmarks/aa_lcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-04-24T06:57:36,615 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/wmt 2026-04-24T06:57:36,616 copying build/lib/evalscope/benchmarks/wmt/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-04-24T06:57:36,618 copying build/lib/evalscope/benchmarks/wmt/requirements.txt -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-04-24T06:57:36,620 copying build/lib/evalscope/benchmarks/wmt/wmt24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-04-24T06:57:36,623 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/race 2026-04-24T06:57:36,624 copying build/lib/evalscope/benchmarks/race/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-04-24T06:57:36,626 copying build/lib/evalscope/benchmarks/race/race_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-04-24T06:57:36,629 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/data_collection 2026-04-24T06:57:36,630 copying build/lib/evalscope/benchmarks/data_collection/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-04-24T06:57:36,632 copying build/lib/evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-04-24T06:57:36,635 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:36,636 copying build/lib/evalscope/benchmarks/olympiad_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:36,638 copying build/lib/evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:36,641 copying build/lib/evalscope/benchmarks/olympiad_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:36,643 copying build/lib/evalscope/benchmarks/olympiad_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-24T06:57:36,646 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmu 2026-04-24T06:57:36,647 copying build/lib/evalscope/benchmarks/cmmu/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-04-24T06:57:36,649 copying build/lib/evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-04-24T06:57:36,651 copying build/lib/evalscope/benchmarks/cmmu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-04-24T06:57:36,654 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gpqa 2026-04-24T06:57:36,655 copying build/lib/evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-04-24T06:57:36,657 copying build/lib/evalscope/benchmarks/gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-04-24T06:57:36,659 copying build/lib/evalscope/benchmarks/gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-04-24T06:57:36,664 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/_meta 2026-04-24T06:57:36,665 copying build/lib/evalscope/benchmarks/_meta/commonsense_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,668 copying build/lib/evalscope/benchmarks/_meta/ai2d.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,671 copying build/lib/evalscope/benchmarks/_meta/multiple_humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,673 copying build/lib/evalscope/benchmarks/_meta/med_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,676 copying build/lib/evalscope/benchmarks/_meta/mmlu_redux.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,679 copying build/lib/evalscope/benchmarks/_meta/ncbi.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,682 copying build/lib/evalscope/benchmarks/_meta/bc5cdr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 
2026-04-24T06:57:36,684 copying build/lib/evalscope/benchmarks/_meta/wmt24pp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,687 copying build/lib/evalscope/benchmarks/_meta/general_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,690 copying build/lib/evalscope/benchmarks/_meta/chartqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,692 copying build/lib/evalscope/benchmarks/_meta/simple_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,695 copying build/lib/evalscope/benchmarks/_meta/cmmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,698 copying build/lib/evalscope/benchmarks/_meta/hpdv2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,701 copying build/lib/evalscope/benchmarks/_meta/arena_hard.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,703 copying build/lib/evalscope/benchmarks/_meta/micro_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,706 copying build/lib/evalscope/benchmarks/_meta/cl_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,709 copying build/lib/evalscope/benchmarks/_meta/scicode.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,714 copying build/lib/evalscope/benchmarks/_meta/biomix_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,717 copying build/lib/evalscope/benchmarks/_meta/swe_bench_lite.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,719 copying build/lib/evalscope/benchmarks/_meta/math_vision.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,722 copying build/lib/evalscope/benchmarks/_meta/bc4chemd.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,725 copying build/lib/evalscope/benchmarks/_meta/pubmedqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,727 copying build/lib/evalscope/benchmarks/_meta/docvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,730 copying build/lib/evalscope/benchmarks/_meta/jnlpba_rare.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,733 copying build/lib/evalscope/benchmarks/_meta/cmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,736 copying build/lib/evalscope/benchmarks/_meta/hmmt25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,738 copying build/lib/evalscope/benchmarks/_meta/mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,741 copying build/lib/evalscope/benchmarks/_meta/gsm8k_v.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,744 copying build/lib/evalscope/benchmarks/_meta/mmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,747 copying build/lib/evalscope/benchmarks/_meta/tau2_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,749 copying build/lib/evalscope/benchmarks/_meta/health_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,752 copying build/lib/evalscope/benchmarks/_meta/eq_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,755 copying build/lib/evalscope/benchmarks/_meta/general_mcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,757 copying build/lib/evalscope/benchmarks/_meta/genia_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,760 copying 
build/lib/evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,762 copying build/lib/evalscope/benchmarks/_meta/real_world_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,765 copying build/lib/evalscope/benchmarks/_meta/tau_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,768 copying build/lib/evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,770 copying build/lib/evalscope/benchmarks/_meta/winogrande.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,773 copying build/lib/evalscope/benchmarks/_meta/logi_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,776 copying build/lib/evalscope/benchmarks/_meta/general_fc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,779 copying build/lib/evalscope/benchmarks/_meta/tweet_ner_7.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,782 copying build/lib/evalscope/benchmarks/_meta/alpaca_eval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,784 copying build/lib/evalscope/benchmarks/_meta/needle_haystack.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,787 copying build/lib/evalscope/benchmarks/_meta/bc2gm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,790 copying build/lib/evalscope/benchmarks/_meta/math_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,793 copying build/lib/evalscope/benchmarks/_meta/anat_em.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,795 copying build/lib/evalscope/benchmarks/_meta/mia_bench.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,798 copying build/lib/evalscope/benchmarks/_meta/competition_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,801 copying build/lib/evalscope/benchmarks/_meta/jnlpba.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,804 copying build/lib/evalscope/benchmarks/_meta/minerva_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,806 copying build/lib/evalscope/benchmarks/_meta/musr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,809 copying build/lib/evalscope/benchmarks/_meta/infovqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,812 copying build/lib/evalscope/benchmarks/_meta/fin_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,815 copying build/lib/evalscope/benchmarks/_meta/frames.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,817 copying build/lib/evalscope/benchmarks/_meta/truthful_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,820 copying build/lib/evalscope/benchmarks/_meta/blink.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,823 copying build/lib/evalscope/benchmarks/_meta/mm_star.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,826 copying build/lib/evalscope/benchmarks/_meta/tool_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,829 copying build/lib/evalscope/benchmarks/_meta/drivel_selection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,832 copying build/lib/evalscope/benchmarks/_meta/live_code_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,835 
copying build/lib/evalscope/benchmarks/_meta/olympiad_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,837 copying build/lib/evalscope/benchmarks/_meta/iquiz.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,840 copying build/lib/evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,843 copying build/lib/evalscope/benchmarks/_meta/ifbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,846 copying build/lib/evalscope/benchmarks/_meta/gedit.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,849 copying build/lib/evalscope/benchmarks/_meta/general_vmcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,851 copying build/lib/evalscope/benchmarks/_meta/aime24.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,854 copying build/lib/evalscope/benchmarks/_meta/mmlu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,857 copying build/lib/evalscope/benchmarks/_meta/humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,859 copying build/lib/evalscope/benchmarks/_meta/torgo.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,862 copying build/lib/evalscope/benchmarks/_meta/vstar_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,865 copying build/lib/evalscope/benchmarks/_meta/super_gpqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,868 copying build/lib/evalscope/benchmarks/_meta/math_verse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,871 copying build/lib/evalscope/benchmarks/_meta/arc.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,874 copying build/lib/evalscope/benchmarks/_meta/maritime_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,877 copying build/lib/evalscope/benchmarks/_meta/bfcl_v4.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,880 copying build/lib/evalscope/benchmarks/_meta/qasc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,882 copying build/lib/evalscope/benchmarks/_meta/tweebank_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,885 copying build/lib/evalscope/benchmarks/_meta/cross_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,888 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,890 copying build/lib/evalscope/benchmarks/_meta/refcoco.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,893 copying build/lib/evalscope/benchmarks/_meta/docmath.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,896 copying build/lib/evalscope/benchmarks/_meta/mmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,899 copying build/lib/evalscope/benchmarks/_meta/general_arena.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,902 copying build/lib/evalscope/benchmarks/_meta/drivel_multilabel.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,905 copying build/lib/evalscope/benchmarks/_meta/gpqa_diamond.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,907 copying build/lib/evalscope/benchmarks/_meta/humaneval_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 
2026-04-24T06:57:36,910 copying build/lib/evalscope/benchmarks/_meta/a_okvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,913 copying build/lib/evalscope/benchmarks/_meta/hellaswag.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,915 copying build/lib/evalscope/benchmarks/_meta/zerobench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,918 copying build/lib/evalscope/benchmarks/_meta/multi_nerd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,921 copying build/lib/evalscope/benchmarks/_meta/cc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,924 copying build/lib/evalscope/benchmarks/_meta/data_collection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,926 copying build/lib/evalscope/benchmarks/_meta/fleurs.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,929 copying build/lib/evalscope/benchmarks/_meta/general_t2i.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,931 copying build/lib/evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,934 copying build/lib/evalscope/benchmarks/_meta/mgsm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,937 copying build/lib/evalscope/benchmarks/_meta/ifeval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,940 copying build/lib/evalscope/benchmarks/_meta/mm_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,943 copying build/lib/evalscope/benchmarks/_meta/drivel_binary.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,945 copying build/lib/evalscope/benchmarks/_meta/multiple_mbpp.json 
-> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,948 copying build/lib/evalscope/benchmarks/_meta/wnut2017.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,951 copying build/lib/evalscope/benchmarks/_meta/pope.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,953 copying build/lib/evalscope/benchmarks/_meta/drop.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,956 copying build/lib/evalscope/benchmarks/_meta/mbpp_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,959 copying build/lib/evalscope/benchmarks/_meta/copious.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,963 copying build/lib/evalscope/benchmarks/_meta/race.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,965 copying build/lib/evalscope/benchmarks/_meta/tifa160.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,967 copying build/lib/evalscope/benchmarks/_meta/hallusion_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,970 copying build/lib/evalscope/benchmarks/_meta/evalmuse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,972 copying build/lib/evalscope/benchmarks/_meta/mmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,976 copying build/lib/evalscope/benchmarks/_meta/halueval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,978 copying build/lib/evalscope/benchmarks/_meta/harvey_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,981 copying build/lib/evalscope/benchmarks/_meta/siqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,983 copying 
build/lib/evalscope/benchmarks/_meta/simple_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,986 copying build/lib/evalscope/benchmarks/_meta/conll2003.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,989 copying build/lib/evalscope/benchmarks/_meta/music_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,991 copying build/lib/evalscope/benchmarks/_meta/math_vista.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,994 copying build/lib/evalscope/benchmarks/_meta/mmmu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:36,997 copying build/lib/evalscope/benchmarks/_meta/bfcl_v3.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,000 copying build/lib/evalscope/benchmarks/_meta/poly_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,003 copying build/lib/evalscope/benchmarks/_meta/math_500.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,006 copying build/lib/evalscope/benchmarks/_meta/genai_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,008 copying build/lib/evalscope/benchmarks/_meta/tir_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,011 copying build/lib/evalscope/benchmarks/_meta/hle.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,014 copying build/lib/evalscope/benchmarks/_meta/process_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,016 copying build/lib/evalscope/benchmarks/_meta/ceval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,019 copying build/lib/evalscope/benchmarks/_meta/broad_twitter_corpus.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,023 copying build/lib/evalscope/benchmarks/_meta/omni_doc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,027 copying build/lib/evalscope/benchmarks/_meta/mri_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,029 copying build/lib/evalscope/benchmarks/_meta/drivel_writing.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,032 copying build/lib/evalscope/benchmarks/_meta/longbench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,034 copying build/lib/evalscope/benchmarks/_meta/mit_restaurant.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,037 copying build/lib/evalscope/benchmarks/_meta/omni_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,040 copying build/lib/evalscope/benchmarks/_meta/multi_if.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,043 copying build/lib/evalscope/benchmarks/_meta/bbh.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,046 copying build/lib/evalscope/benchmarks/_meta/general_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,048 copying build/lib/evalscope/benchmarks/_meta/aime26.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,050 copying build/lib/evalscope/benchmarks/_meta/librispeech.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,053 copying build/lib/evalscope/benchmarks/_meta/visulogic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,055 copying build/lib/evalscope/benchmarks/_meta/aa_lcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 
2026-04-24T06:57:37,058 copying build/lib/evalscope/benchmarks/_meta/aime25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,060 copying build/lib/evalscope/benchmarks/_meta/science_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,063 copying build/lib/evalscope/benchmarks/_meta/piqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,065 copying build/lib/evalscope/benchmarks/_meta/zebralogicbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,068 copying build/lib/evalscope/benchmarks/_meta/conllpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,070 copying build/lib/evalscope/benchmarks/_meta/sciq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,073 copying build/lib/evalscope/benchmarks/_meta/amc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,075 copying build/lib/evalscope/benchmarks/_meta/trivia_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,078 copying build/lib/evalscope/benchmarks/_meta/coin_flip.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,080 copying build/lib/evalscope/benchmarks/_meta/ocr_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,083 copying build/lib/evalscope/benchmarks/_meta/openai_mrcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,085 copying build/lib/evalscope/benchmarks/_meta/cmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,088 copying build/lib/evalscope/benchmarks/_meta/ontonotes5.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,091 copying build/lib/evalscope/benchmarks/_meta/gsm8k.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,094 copying build/lib/evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,096 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-24T06:57:37,099 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl 2026-04-24T06:57:37,100 copying build/lib/evalscope/benchmarks/bfcl/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-04-24T06:57:37,103 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:37,104 copying build/lib/evalscope/benchmarks/bfcl/v4/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:37,106 copying build/lib/evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:37,108 copying build/lib/evalscope/benchmarks/bfcl/v4/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-04-24T06:57:37,112 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:37,113 copying build/lib/evalscope/benchmarks/bfcl/v3/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:37,115 copying build/lib/evalscope/benchmarks/bfcl/v3/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:37,117 copying build/lib/evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:37,120 copying build/lib/evalscope/benchmarks/bfcl/v3/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-24T06:57:37,122 copying build/lib/evalscope/benchmarks/bfcl/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 
2026-04-24T06:57:37,124 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/alpaca_eval 2026-04-24T06:57:37,125 copying build/lib/evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-04-24T06:57:37,127 copying build/lib/evalscope/benchmarks/alpaca_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-04-24T06:57:37,129 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mri_mcqa 2026-04-24T06:57:37,130 copying build/lib/evalscope/benchmarks/mri_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-04-24T06:57:37,132 copying build/lib/evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-04-24T06:57:37,135 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ceval 2026-04-24T06:57:37,136 copying build/lib/evalscope/benchmarks/ceval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-04-24T06:57:37,138 copying build/lib/evalscope/benchmarks/ceval/ceval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-04-24T06:57:37,141 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/frames 2026-04-24T06:57:37,142 copying build/lib/evalscope/benchmarks/frames/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-04-24T06:57:37,144 copying build/lib/evalscope/benchmarks/frames/frames_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-04-24T06:57:37,146 copying build/lib/evalscope/benchmarks/frames/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-04-24T06:57:37,148 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_pro 2026-04-24T06:57:37,150 copying build/lib/evalscope/benchmarks/mmlu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 
2026-04-24T06:57:37,151 copying build/lib/evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-04-24T06:57:37,154 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/healthbench 2026-04-24T06:57:37,156 copying build/lib/evalscope/benchmarks/healthbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-04-24T06:57:37,157 copying build/lib/evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-04-24T06:57:37,161 copying build/lib/evalscope/benchmarks/healthbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-04-24T06:57:37,163 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/seed_bench_2_plus 2026-04-24T06:57:37,164 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-04-24T06:57:37,166 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-04-24T06:57:37,169 creating build/bdist.linux-armv7l/wheel/evalscope/app 2026-04-24T06:57:37,170 copying build/lib/evalscope/app/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-24T06:57:37,173 creating build/bdist.linux-armv7l/wheel/evalscope/app/utils 2026-04-24T06:57:37,174 copying build/lib/evalscope/app/utils/data_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-24T06:57:37,176 copying build/lib/evalscope/app/utils/text_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-24T06:57:37,178 copying build/lib/evalscope/app/utils/localization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-24T06:57:37,180 copying build/lib/evalscope/app/utils/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 
2026-04-24T06:57:37,182 copying build/lib/evalscope/app/utils/env_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-24T06:57:37,184 copying build/lib/evalscope/app/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-24T06:57:37,187 creating build/bdist.linux-armv7l/wheel/evalscope/app/ui 2026-04-24T06:57:37,188 copying build/lib/evalscope/app/ui/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-24T06:57:37,190 copying build/lib/evalscope/app/ui/multi_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-24T06:57:37,192 copying build/lib/evalscope/app/ui/app_ui.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-24T06:57:37,195 copying build/lib/evalscope/app/ui/sidebar.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-24T06:57:37,197 copying build/lib/evalscope/app/ui/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-24T06:57:37,199 copying build/lib/evalscope/app/ui/single_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-24T06:57:37,201 copying build/lib/evalscope/app/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-24T06:57:37,203 copying build/lib/evalscope/app/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-24T06:57:37,205 creating build/bdist.linux-armv7l/wheel/evalscope/third_party 2026-04-24T06:57:37,206 copying build/lib/evalscope/third_party/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party 2026-04-24T06:57:37,209 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench 2026-04-24T06:57:37,210 copying build/lib/evalscope/third_party/thinkbench/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-04-24T06:57:37,213 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/resources 2026-04-24T06:57:37,215 copying 
build/lib/evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-04-24T06:57:37,217 copying build/lib/evalscope/third_party/thinkbench/resources/critique_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-04-24T06:57:37,218 copying build/lib/evalscope/third_party/thinkbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-04-24T06:57:37,221 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/tools 2026-04-24T06:57:37,222 copying build/lib/evalscope/third_party/thinkbench/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-04-24T06:57:37,224 copying build/lib/evalscope/third_party/thinkbench/tools/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-04-24T06:57:37,226 copying build/lib/evalscope/third_party/thinkbench/tools/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-04-24T06:57:37,228 copying build/lib/evalscope/third_party/thinkbench/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-04-24T06:57:37,230 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write 2026-04-24T06:57:37,231 copying build/lib/evalscope/third_party/longbench_write/default_task.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,233 copying build/lib/evalscope/third_party/longbench_write/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,236 copying build/lib/evalscope/third_party/longbench_write/default_task.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,238 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/resources 2026-04-24T06:57:37,239 copying 
build/lib/evalscope/third_party/longbench_write/resources/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-24T06:57:37,242 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-24T06:57:37,244 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-24T06:57:37,247 copying build/lib/evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-24T06:57:37,249 copying build/lib/evalscope/third_party/longbench_write/resources/judge.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-24T06:57:37,251 copying build/lib/evalscope/third_party/longbench_write/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,254 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/tools 2026-04-24T06:57:37,255 copying build/lib/evalscope/third_party/longbench_write/tools/data_etl.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-04-24T06:57:37,257 copying build/lib/evalscope/third_party/longbench_write/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-04-24T06:57:37,259 copying build/lib/evalscope/third_party/longbench_write/tools/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-04-24T06:57:37,262 copying build/lib/evalscope/third_party/longbench_write/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,264 copying build/lib/evalscope/third_party/longbench_write/README.md -> 
build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,266 copying build/lib/evalscope/third_party/longbench_write/longbench_write.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,268 copying build/lib/evalscope/third_party/longbench_write/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-24T06:57:37,271 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static 2026-04-24T06:57:37,272 copying build/lib/evalscope/third_party/toolbench_static/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,274 copying build/lib/evalscope/third_party/toolbench_static/config_default.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,276 copying build/lib/evalscope/third_party/toolbench_static/config_default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,278 copying build/lib/evalscope/third_party/toolbench_static/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,281 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static/llm 2026-04-24T06:57:37,282 copying build/lib/evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-04-24T06:57:37,284 copying build/lib/evalscope/third_party/toolbench_static/llm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-04-24T06:57:37,286 copying build/lib/evalscope/third_party/toolbench_static/toolbench_static.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,288 copying build/lib/evalscope/third_party/toolbench_static/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 
2026-04-24T06:57:37,290 copying build/lib/evalscope/third_party/toolbench_static/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,292 copying build/lib/evalscope/third_party/toolbench_static/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-24T06:57:37,295 creating build/bdist.linux-armv7l/wheel/evalscope/report 2026-04-24T06:57:37,296 copying build/lib/evalscope/report/generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-24T06:57:37,299 copying build/lib/evalscope/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-24T06:57:37,301 copying build/lib/evalscope/report/combinator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-24T06:57:37,303 copying build/lib/evalscope/report/renderer.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-24T06:57:37,307 creating build/bdist.linux-armv7l/wheel/evalscope/report/template 2026-04-24T06:57:37,308 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/css 2026-04-24T06:57:37,310 copying build/lib/evalscope/report/template/css/base.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-04-24T06:57:37,313 copying build/lib/evalscope/report/template/css/perf_extra.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-04-24T06:57:37,315 copying build/lib/evalscope/report/template/perf_report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-04-24T06:57:37,318 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/js 2026-04-24T06:57:37,319 copying build/lib/evalscope/report/template/js/shared.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-24T06:57:37,321 copying build/lib/evalscope/report/template/js/eval_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-24T06:57:37,323 copying 
build/lib/evalscope/report/template/js/i18n_eval.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-24T06:57:37,326 copying build/lib/evalscope/report/template/js/i18n_perf.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-24T06:57:37,328 copying build/lib/evalscope/report/template/js/perf_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-24T06:57:37,331 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/partials 2026-04-24T06:57:37,332 copying build/lib/evalscope/report/template/partials/header_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-24T06:57:37,334 copying build/lib/evalscope/report/template/partials/footer.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-24T06:57:37,336 copying build/lib/evalscope/report/template/partials/toc_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-24T06:57:37,338 copying build/lib/evalscope/report/template/partials/header_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-24T06:57:37,340 copying build/lib/evalscope/report/template/partials/toc_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-24T06:57:37,342 copying build/lib/evalscope/report/template/partials/brand_logo.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-24T06:57:37,344 copying build/lib/evalscope/report/template/report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-04-24T06:57:37,347 copying build/lib/evalscope/report/report.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-24T06:57:37,350 creating build/bdist.linux-armv7l/wheel/evalscope/service 2026-04-24T06:57:37,351 creating build/bdist.linux-armv7l/wheel/evalscope/service/blueprints 2026-04-24T06:57:37,353 copying 
build/lib/evalscope/service/blueprints/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-04-24T06:57:37,355 copying build/lib/evalscope/service/blueprints/perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-04-24T06:57:37,358 copying build/lib/evalscope/service/blueprints/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-04-24T06:57:37,360 copying build/lib/evalscope/service/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-04-24T06:57:37,362 creating build/bdist.linux-armv7l/wheel/evalscope/service/utils 2026-04-24T06:57:37,363 copying build/lib/evalscope/service/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-24T06:57:37,365 copying build/lib/evalscope/service/utils/benchmarks.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-24T06:57:37,368 copying build/lib/evalscope/service/utils/process.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-24T06:57:37,370 copying build/lib/evalscope/service/utils/log.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-24T06:57:37,372 copying build/lib/evalscope/service/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-04-24T06:57:37,375 creating build/bdist.linux-armv7l/wheel/evalscope/service/frontend 2026-04-24T06:57:37,376 copying build/lib/evalscope/service/frontend/async_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-24T06:57:37,378 copying build/lib/evalscope/service/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-24T06:57:37,380 copying build/lib/evalscope/service/frontend/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-24T06:57:37,382 copying build/lib/evalscope/service/frontend/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-24T06:57:37,385 copying 
build/lib/evalscope/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-24T06:57:37,387 copying build/lib/evalscope/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-24T06:57:37,390 creating build/bdist.linux-armv7l/wheel/evalscope/evaluator 2026-04-24T06:57:37,391 copying build/lib/evalscope/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-24T06:57:37,393 copying build/lib/evalscope/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-24T06:57:37,396 copying build/lib/evalscope/evaluator/perf_collector.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-24T06:57:37,399 copying build/lib/evalscope/evaluator/batch_reviewer.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-24T06:57:37,403 creating build/bdist.linux-armv7l/wheel/evalscope/models 2026-04-24T06:57:37,404 copying build/lib/evalscope/models/model_apis.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,407 creating build/bdist.linux-armv7l/wheel/evalscope/models/utils 2026-04-24T06:57:37,408 copying build/lib/evalscope/models/utils/anthropic.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-04-24T06:57:37,411 copying build/lib/evalscope/models/utils/openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-04-24T06:57:37,414 copying build/lib/evalscope/models/modelscope.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,416 copying build/lib/evalscope/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,418 copying build/lib/evalscope/models/mockllm.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,420 copying build/lib/evalscope/models/text2image_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,422 copying build/lib/evalscope/models/anthropic_compatible.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,424 copying build/lib/evalscope/models/openai_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,427 copying build/lib/evalscope/models/image_edit_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-24T06:57:37,429 running install_egg_info 2026-04-24T06:57:37,434 Copying evalscope.egg-info to build/bdist.linux-armv7l/wheel/./evalscope-1.6.1-py3.11.egg-info 2026-04-24T06:57:37,449 running install_scripts 2026-04-24T06:57:37,463 creating build/bdist.linux-armv7l/wheel/evalscope-1.6.1.dist-info/WHEEL 2026-04-24T06:57:37,466 creating '/tmp/pip-wheel-06f9xic9/.tmp-vmxr20rm/evalscope-1.6.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-24T06:57:37,468 adding 'evalscope/__init__.py' 2026-04-24T06:57:37,470 adding 'evalscope/arguments.py' 2026-04-24T06:57:37,472 adding 'evalscope/config.py' 2026-04-24T06:57:37,474 adding 'evalscope/constants.py' 2026-04-24T06:57:37,476 adding 'evalscope/run.py' 2026-04-24T06:57:37,478 adding 'evalscope/version.py' 2026-04-24T06:57:37,480 adding 'evalscope/api/__init__.py' 2026-04-24T06:57:37,482 adding 'evalscope/api/registry.py' 2026-04-24T06:57:37,484 adding 'evalscope/api/benchmark/__init__.py' 2026-04-24T06:57:37,486 adding 'evalscope/api/benchmark/benchmark.py' 2026-04-24T06:57:37,487 adding 'evalscope/api/benchmark/meta.py' 2026-04-24T06:57:37,491 adding 'evalscope/api/benchmark/statistics.py' 2026-04-24T06:57:37,493 adding 'evalscope/api/benchmark/adapters/__init__.py' 2026-04-24T06:57:37,494 adding 'evalscope/api/benchmark/adapters/agent_adapter.py' 2026-04-24T06:57:37,500 adding 'evalscope/api/benchmark/adapters/default_data_adapter.py' 2026-04-24T06:57:37,502 adding 'evalscope/api/benchmark/adapters/image_edit_adapter.py' 2026-04-24T06:57:37,504 adding 'evalscope/api/benchmark/adapters/multi_choice_adapter.py' 2026-04-24T06:57:37,507 adding 
'evalscope/api/benchmark/adapters/ner_adapter.py' 2026-04-24T06:57:37,510 adding 'evalscope/api/benchmark/adapters/text2image_adapter.py' 2026-04-24T06:57:37,511 adding 'evalscope/api/benchmark/adapters/vision_language_adapter.py' 2026-04-24T06:57:37,514 adding 'evalscope/api/dataset/__init__.py' 2026-04-24T06:57:37,517 adding 'evalscope/api/dataset/dataset.py' 2026-04-24T06:57:37,520 adding 'evalscope/api/dataset/loader.py' 2026-04-24T06:57:37,522 adding 'evalscope/api/dataset/utils.py' 2026-04-24T06:57:37,524 adding 'evalscope/api/evaluator/__init__.py' 2026-04-24T06:57:37,527 adding 'evalscope/api/evaluator/cache.py' 2026-04-24T06:57:37,529 adding 'evalscope/api/evaluator/evaluator.py' 2026-04-24T06:57:37,532 adding 'evalscope/api/evaluator/state.py' 2026-04-24T06:57:37,534 adding 'evalscope/api/filter/__init__.py' 2026-04-24T06:57:37,536 adding 'evalscope/api/filter/filter.py' 2026-04-24T06:57:37,538 adding 'evalscope/api/messages/__init__.py' 2026-04-24T06:57:37,541 adding 'evalscope/api/messages/chat_message.py' 2026-04-24T06:57:37,543 adding 'evalscope/api/messages/content.py' 2026-04-24T06:57:37,545 adding 'evalscope/api/messages/utils.py' 2026-04-24T06:57:37,547 adding 'evalscope/api/metric/__init__.py' 2026-04-24T06:57:37,549 adding 'evalscope/api/metric/metric.py' 2026-04-24T06:57:37,551 adding 'evalscope/api/metric/scorer.py' 2026-04-24T06:57:37,553 adding 'evalscope/api/mixin/__init__.py' 2026-04-24T06:57:37,555 adding 'evalscope/api/mixin/llm_judge_mixin.py' 2026-04-24T06:57:37,558 adding 'evalscope/api/mixin/sandbox_mixin.py' 2026-04-24T06:57:37,561 adding 'evalscope/api/model/__init__.py' 2026-04-24T06:57:37,563 adding 'evalscope/api/model/generate_config.py' 2026-04-24T06:57:37,566 adding 'evalscope/api/model/lazy_model.py' 2026-04-24T06:57:37,569 adding 'evalscope/api/model/model.py' 2026-04-24T06:57:37,571 adding 'evalscope/api/model/model_output.py' 2026-04-24T06:57:37,574 adding 'evalscope/api/model/perf_metrics.py' 2026-04-24T06:57:37,576 
adding 'evalscope/api/tool/__init__.py' 2026-04-24T06:57:37,578 adding 'evalscope/api/tool/tool_call.py' 2026-04-24T06:57:37,580 adding 'evalscope/api/tool/tool_info.py' 2026-04-24T06:57:37,582 adding 'evalscope/api/tool/utils.py' 2026-04-24T06:57:37,584 adding 'evalscope/app/__init__.py' 2026-04-24T06:57:37,586 adding 'evalscope/app/app.py' 2026-04-24T06:57:37,587 adding 'evalscope/app/arguments.py' 2026-04-24T06:57:37,589 adding 'evalscope/app/constants.py' 2026-04-24T06:57:37,591 adding 'evalscope/app/ui/__init__.py' 2026-04-24T06:57:37,593 adding 'evalscope/app/ui/app_ui.py' 2026-04-24T06:57:37,595 adding 'evalscope/app/ui/multi_model.py' 2026-04-24T06:57:37,596 adding 'evalscope/app/ui/sidebar.py' 2026-04-24T06:57:37,598 adding 'evalscope/app/ui/single_model.py' 2026-04-24T06:57:37,599 adding 'evalscope/app/ui/visualization.py' 2026-04-24T06:57:37,601 adding 'evalscope/app/utils/data_utils.py' 2026-04-24T06:57:37,603 adding 'evalscope/app/utils/env_utils.py' 2026-04-24T06:57:37,604 adding 'evalscope/app/utils/localization.py' 2026-04-24T06:57:37,606 adding 'evalscope/app/utils/text_utils.py' 2026-04-24T06:57:37,607 adding 'evalscope/app/utils/visualization.py' 2026-04-24T06:57:37,609 adding 'evalscope/backend/__init__.py' 2026-04-24T06:57:37,610 adding 'evalscope/backend/base.py' 2026-04-24T06:57:37,612 adding 'evalscope/backend/opencompass/__init__.py' 2026-04-24T06:57:37,613 adding 'evalscope/backend/opencompass/api_meta_template.py' 2026-04-24T06:57:37,615 adding 'evalscope/backend/opencompass/backend_manager.py' 2026-04-24T06:57:37,617 adding 'evalscope/backend/opencompass/tasks/__init__.py' 2026-04-24T06:57:37,618 adding 'evalscope/backend/opencompass/tasks/eval_api.py' 2026-04-24T06:57:37,619 adding 'evalscope/backend/opencompass/tasks/eval_datasets.py' 2026-04-24T06:57:37,621 adding 'evalscope/backend/rag_eval/__init__.py' 2026-04-24T06:57:37,623 adding 'evalscope/backend/rag_eval/backend_manager.py' 2026-04-24T06:57:37,625 adding 
'evalscope/backend/rag_eval/clip_benchmark/__init__.py' 2026-04-24T06:57:37,626 adding 'evalscope/backend/rag_eval/clip_benchmark/arguments.py' 2026-04-24T06:57:37,628 adding 'evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py' 2026-04-24T06:57:37,629 adding 'evalscope/backend/rag_eval/clip_benchmark/task_template.py' 2026-04-24T06:57:37,631 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py' 2026-04-24T06:57:37,633 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py' 2026-04-24T06:57:37,634 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py' 2026-04-24T06:57:37,636 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py' 2026-04-24T06:57:37,638 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py' 2026-04-24T06:57:37,640 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt' 2026-04-24T06:57:37,641 adding 'evalscope/backend/rag_eval/cmteb/__init__.py' 2026-04-24T06:57:37,643 adding 'evalscope/backend/rag_eval/cmteb/arguments.py' 2026-04-24T06:57:37,644 adding 'evalscope/backend/rag_eval/cmteb/base.py' 2026-04-24T06:57:37,646 adding 'evalscope/backend/rag_eval/cmteb/task_template.py' 2026-04-24T06:57:37,648 adding 'evalscope/backend/rag_eval/cmteb/tasks/Classification.py' 2026-04-24T06:57:37,649 adding 'evalscope/backend/rag_eval/cmteb/tasks/Clustering.py' 2026-04-24T06:57:37,651 adding 'evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py' 2026-04-24T06:57:37,652 adding 'evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py' 2026-04-24T06:57:37,653 adding 'evalscope/backend/rag_eval/cmteb/tasks/Reranking.py' 2026-04-24T06:57:37,655 adding 'evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py' 2026-04-24T06:57:37,657 adding 'evalscope/backend/rag_eval/cmteb/tasks/STS.py' 2026-04-24T06:57:37,658 adding 'evalscope/backend/rag_eval/cmteb/tasks/__init__.py' 2026-04-24T06:57:37,659 adding 
'evalscope/backend/rag_eval/ragas/__init__.py' 2026-04-24T06:57:37,661 adding 'evalscope/backend/rag_eval/ragas/arguments.py' 2026-04-24T06:57:37,662 adding 'evalscope/backend/rag_eval/ragas/task_template.py' 2026-04-24T06:57:37,664 adding 'evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py' 2026-04-24T06:57:37,665 adding 'evalscope/backend/rag_eval/ragas/tasks/__init__.py' 2026-04-24T06:57:37,666 adding 'evalscope/backend/rag_eval/ragas/tasks/build_distribution.py' 2026-04-24T06:57:37,668 adding 'evalscope/backend/rag_eval/ragas/tasks/build_transform.py' 2026-04-24T06:57:37,669 adding 'evalscope/backend/rag_eval/ragas/tasks/testset_generation.py' 2026-04-24T06:57:37,671 adding 'evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py' 2026-04-24T06:57:37,672 adding 'evalscope/backend/rag_eval/utils/__init__.py' 2026-04-24T06:57:37,674 adding 'evalscope/backend/rag_eval/utils/clip.py' 2026-04-24T06:57:37,675 adding 'evalscope/backend/rag_eval/utils/embedding.py' 2026-04-24T06:57:37,677 adding 'evalscope/backend/rag_eval/utils/llm.py' 2026-04-24T06:57:37,678 adding 'evalscope/backend/rag_eval/utils/tools.py' 2026-04-24T06:57:37,680 adding 'evalscope/backend/vlm_eval_kit/__init__.py' 2026-04-24T06:57:37,681 adding 'evalscope/backend/vlm_eval_kit/backend_manager.py' 2026-04-24T06:57:37,685 adding 'evalscope/benchmarks/__init__.py' 2026-04-24T06:57:37,690 adding 'evalscope/benchmarks/_meta/a_okvqa.json' 2026-04-24T06:57:37,692 adding 'evalscope/benchmarks/_meta/aa_lcr.json' 2026-04-24T06:57:37,694 adding 'evalscope/benchmarks/_meta/ai2d.json' 2026-04-24T06:57:37,696 adding 'evalscope/benchmarks/_meta/aime24.json' 2026-04-24T06:57:37,698 adding 'evalscope/benchmarks/_meta/aime25.json' 2026-04-24T06:57:37,700 adding 'evalscope/benchmarks/_meta/aime26.json' 2026-04-24T06:57:37,701 adding 'evalscope/benchmarks/_meta/alpaca_eval.json' 2026-04-24T06:57:37,703 adding 'evalscope/benchmarks/_meta/amc.json' 2026-04-24T06:57:37,705 adding 
'evalscope/benchmarks/_meta/anat_em.json' 2026-04-24T06:57:37,707 adding 'evalscope/benchmarks/_meta/arc.json' 2026-04-24T06:57:37,709 adding 'evalscope/benchmarks/_meta/arena_hard.json' 2026-04-24T06:57:37,711 adding 'evalscope/benchmarks/_meta/bbh.json' 2026-04-24T06:57:37,714 adding 'evalscope/benchmarks/_meta/bc2gm.json' 2026-04-24T06:57:37,716 adding 'evalscope/benchmarks/_meta/bc4chemd.json' 2026-04-24T06:57:37,718 adding 'evalscope/benchmarks/_meta/bc5cdr.json' 2026-04-24T06:57:37,721 adding 'evalscope/benchmarks/_meta/bfcl_v3.json' 2026-04-24T06:57:37,724 adding 'evalscope/benchmarks/_meta/bfcl_v4.json' 2026-04-24T06:57:37,725 adding 'evalscope/benchmarks/_meta/biomix_qa.json' 2026-04-24T06:57:37,728 adding 'evalscope/benchmarks/_meta/blink.json' 2026-04-24T06:57:37,730 adding 'evalscope/benchmarks/_meta/broad_twitter_corpus.json' 2026-04-24T06:57:37,732 adding 'evalscope/benchmarks/_meta/cc_bench.json' 2026-04-24T06:57:37,735 adding 'evalscope/benchmarks/_meta/ceval.json' 2026-04-24T06:57:37,737 adding 'evalscope/benchmarks/_meta/chartqa.json' 2026-04-24T06:57:37,739 adding 'evalscope/benchmarks/_meta/chinese_simpleqa.json' 2026-04-24T06:57:37,742 adding 'evalscope/benchmarks/_meta/cl_bench.json' 2026-04-24T06:57:37,745 adding 'evalscope/benchmarks/_meta/cmmlu.json' 2026-04-24T06:57:37,749 adding 'evalscope/benchmarks/_meta/cmmmu.json' 2026-04-24T06:57:37,752 adding 'evalscope/benchmarks/_meta/cmmu.json' 2026-04-24T06:57:37,754 adding 'evalscope/benchmarks/_meta/coin_flip.json' 2026-04-24T06:57:37,755 adding 'evalscope/benchmarks/_meta/commonsense_qa.json' 2026-04-24T06:57:37,757 adding 'evalscope/benchmarks/_meta/competition_math.json' 2026-04-24T06:57:37,759 adding 'evalscope/benchmarks/_meta/conll2003.json' 2026-04-24T06:57:37,761 adding 'evalscope/benchmarks/_meta/conllpp.json' 2026-04-24T06:57:37,765 adding 'evalscope/benchmarks/_meta/copious.json' 2026-04-24T06:57:37,767 adding 'evalscope/benchmarks/_meta/cross_ner.json' 2026-04-24T06:57:37,769 
adding 'evalscope/benchmarks/_meta/data_collection.json' 2026-04-24T06:57:37,771 adding 'evalscope/benchmarks/_meta/docmath.json' 2026-04-24T06:57:37,773 adding 'evalscope/benchmarks/_meta/docvqa.json' 2026-04-24T06:57:37,774 adding 'evalscope/benchmarks/_meta/drivel_binary.json' 2026-04-24T06:57:37,776 adding 'evalscope/benchmarks/_meta/drivel_multilabel.json' 2026-04-24T06:57:37,778 adding 'evalscope/benchmarks/_meta/drivel_selection.json' 2026-04-24T06:57:37,780 adding 'evalscope/benchmarks/_meta/drivel_writing.json' 2026-04-24T06:57:37,782 adding 'evalscope/benchmarks/_meta/drop.json' 2026-04-24T06:57:37,784 adding 'evalscope/benchmarks/_meta/eq_bench.json' 2026-04-24T06:57:37,786 adding 'evalscope/benchmarks/_meta/evalmuse.json' 2026-04-24T06:57:37,788 adding 'evalscope/benchmarks/_meta/fin_ner.json' 2026-04-24T06:57:37,790 adding 'evalscope/benchmarks/_meta/fleurs.json' 2026-04-24T06:57:37,792 adding 'evalscope/benchmarks/_meta/frames.json' 2026-04-24T06:57:37,794 adding 'evalscope/benchmarks/_meta/gedit.json' 2026-04-24T06:57:37,796 adding 'evalscope/benchmarks/_meta/genai_bench.json' 2026-04-24T06:57:37,798 adding 'evalscope/benchmarks/_meta/general_arena.json' 2026-04-24T06:57:37,801 adding 'evalscope/benchmarks/_meta/general_fc.json' 2026-04-24T06:57:37,803 adding 'evalscope/benchmarks/_meta/general_mcq.json' 2026-04-24T06:57:37,805 adding 'evalscope/benchmarks/_meta/general_qa.json' 2026-04-24T06:57:37,806 adding 'evalscope/benchmarks/_meta/general_t2i.json' 2026-04-24T06:57:37,808 adding 'evalscope/benchmarks/_meta/general_vmcq.json' 2026-04-24T06:57:37,809 adding 'evalscope/benchmarks/_meta/general_vqa.json' 2026-04-24T06:57:37,811 adding 'evalscope/benchmarks/_meta/genia_ner.json' 2026-04-24T06:57:37,813 adding 'evalscope/benchmarks/_meta/gpqa_diamond.json' 2026-04-24T06:57:37,815 adding 'evalscope/benchmarks/_meta/gsm8k.json' 2026-04-24T06:57:37,817 adding 'evalscope/benchmarks/_meta/gsm8k_v.json' 2026-04-24T06:57:37,819 adding 
'evalscope/benchmarks/_meta/hallusion_bench.json' 2026-04-24T06:57:37,821 adding 'evalscope/benchmarks/_meta/halueval.json' 2026-04-24T06:57:37,823 adding 'evalscope/benchmarks/_meta/harvey_ner.json' 2026-04-24T06:57:37,826 adding 'evalscope/benchmarks/_meta/health_bench.json' 2026-04-24T06:57:37,827 adding 'evalscope/benchmarks/_meta/hellaswag.json' 2026-04-24T06:57:37,830 adding 'evalscope/benchmarks/_meta/hle.json' 2026-04-24T06:57:37,832 adding 'evalscope/benchmarks/_meta/hmmt25.json' 2026-04-24T06:57:37,834 adding 'evalscope/benchmarks/_meta/hpdv2.json' 2026-04-24T06:57:37,836 adding 'evalscope/benchmarks/_meta/humaneval.json' 2026-04-24T06:57:37,838 adding 'evalscope/benchmarks/_meta/humaneval_plus.json' 2026-04-24T06:57:37,840 adding 'evalscope/benchmarks/_meta/ifbench.json' 2026-04-24T06:57:37,842 adding 'evalscope/benchmarks/_meta/ifeval.json' 2026-04-24T06:57:37,844 adding 'evalscope/benchmarks/_meta/infovqa.json' 2026-04-24T06:57:37,846 adding 'evalscope/benchmarks/_meta/iquiz.json' 2026-04-24T06:57:37,848 adding 'evalscope/benchmarks/_meta/jnlpba.json' 2026-04-24T06:57:37,850 adding 'evalscope/benchmarks/_meta/jnlpba_rare.json' 2026-04-24T06:57:37,852 adding 'evalscope/benchmarks/_meta/librispeech.json' 2026-04-24T06:57:37,854 adding 'evalscope/benchmarks/_meta/live_code_bench.json' 2026-04-24T06:57:37,856 adding 'evalscope/benchmarks/_meta/logi_qa.json' 2026-04-24T06:57:37,858 adding 'evalscope/benchmarks/_meta/longbench_v2.json' 2026-04-24T06:57:37,860 adding 'evalscope/benchmarks/_meta/maritime_bench.json' 2026-04-24T06:57:37,862 adding 'evalscope/benchmarks/_meta/math_500.json' 2026-04-24T06:57:37,863 adding 'evalscope/benchmarks/_meta/math_qa.json' 2026-04-24T06:57:37,866 adding 'evalscope/benchmarks/_meta/math_verse.json' 2026-04-24T06:57:37,868 adding 'evalscope/benchmarks/_meta/math_vision.json' 2026-04-24T06:57:37,870 adding 'evalscope/benchmarks/_meta/math_vista.json' 2026-04-24T06:57:37,872 adding 'evalscope/benchmarks/_meta/mbpp.json' 
2026-04-24T06:57:37,874 adding 'evalscope/benchmarks/_meta/mbpp_plus.json' 2026-04-24T06:57:37,876 adding 'evalscope/benchmarks/_meta/med_mcqa.json' 2026-04-24T06:57:37,878 adding 'evalscope/benchmarks/_meta/mgsm.json' 2026-04-24T06:57:37,880 adding 'evalscope/benchmarks/_meta/mia_bench.json' 2026-04-24T06:57:37,882 adding 'evalscope/benchmarks/_meta/micro_vqa.json' 2026-04-24T06:57:37,884 adding 'evalscope/benchmarks/_meta/minerva_math.json' 2026-04-24T06:57:37,886 adding 'evalscope/benchmarks/_meta/mit_movie_trivia.json' 2026-04-24T06:57:37,888 adding 'evalscope/benchmarks/_meta/mit_restaurant.json' 2026-04-24T06:57:37,890 adding 'evalscope/benchmarks/_meta/mm_bench.json' 2026-04-24T06:57:37,892 adding 'evalscope/benchmarks/_meta/mm_star.json' 2026-04-24T06:57:37,896 adding 'evalscope/benchmarks/_meta/mmlu.json' 2026-04-24T06:57:37,898 adding 'evalscope/benchmarks/_meta/mmlu_pro.json' 2026-04-24T06:57:37,901 adding 'evalscope/benchmarks/_meta/mmlu_redux.json' 2026-04-24T06:57:37,903 adding 'evalscope/benchmarks/_meta/mmmlu.json' 2026-04-24T06:57:37,907 adding 'evalscope/benchmarks/_meta/mmmu.json' 2026-04-24T06:57:37,911 adding 'evalscope/benchmarks/_meta/mmmu_pro.json' 2026-04-24T06:57:37,913 adding 'evalscope/benchmarks/_meta/mri_mcqa.json' 2026-04-24T06:57:37,915 adding 'evalscope/benchmarks/_meta/multi_if.json' 2026-04-24T06:57:37,917 adding 'evalscope/benchmarks/_meta/multi_nerd.json' 2026-04-24T06:57:37,920 adding 'evalscope/benchmarks/_meta/multiple_humaneval.json' 2026-04-24T06:57:37,922 adding 'evalscope/benchmarks/_meta/multiple_mbpp.json' 2026-04-24T06:57:37,924 adding 'evalscope/benchmarks/_meta/music_trivia.json' 2026-04-24T06:57:37,926 adding 'evalscope/benchmarks/_meta/musr.json' 2026-04-24T06:57:37,928 adding 'evalscope/benchmarks/_meta/ncbi.json' 2026-04-24T06:57:37,930 adding 'evalscope/benchmarks/_meta/needle_haystack.json' 2026-04-24T06:57:37,932 adding 'evalscope/benchmarks/_meta/ocr_bench.json' 2026-04-24T06:57:37,936 adding 
'evalscope/benchmarks/_meta/ocr_bench_v2.json' 2026-04-24T06:57:37,939 adding 'evalscope/benchmarks/_meta/olympiad_bench.json' 2026-04-24T06:57:37,941 adding 'evalscope/benchmarks/_meta/omni_bench.json' 2026-04-24T06:57:37,945 adding 'evalscope/benchmarks/_meta/omni_doc_bench.json' 2026-04-24T06:57:37,948 adding 'evalscope/benchmarks/_meta/ontonotes5.json' 2026-04-24T06:57:37,950 adding 'evalscope/benchmarks/_meta/openai_mrcr.json' 2026-04-24T06:57:37,952 adding 'evalscope/benchmarks/_meta/piqa.json' 2026-04-24T06:57:37,955 adding 'evalscope/benchmarks/_meta/poly_math.json' 2026-04-24T06:57:37,957 adding 'evalscope/benchmarks/_meta/pope.json' 2026-04-24T06:57:37,959 adding 'evalscope/benchmarks/_meta/process_bench.json' 2026-04-24T06:57:37,961 adding 'evalscope/benchmarks/_meta/pubmedqa.json' 2026-04-24T06:57:37,963 adding 'evalscope/benchmarks/_meta/qasc.json' 2026-04-24T06:57:37,965 adding 'evalscope/benchmarks/_meta/race.json' 2026-04-24T06:57:37,967 adding 'evalscope/benchmarks/_meta/real_world_qa.json' 2026-04-24T06:57:37,969 adding 'evalscope/benchmarks/_meta/refcoco.json' 2026-04-24T06:57:37,975 adding 'evalscope/benchmarks/_meta/scicode.json' 2026-04-24T06:57:37,978 adding 'evalscope/benchmarks/_meta/science_qa.json' 2026-04-24T06:57:37,980 adding 'evalscope/benchmarks/_meta/sciq.json' 2026-04-24T06:57:37,982 adding 'evalscope/benchmarks/_meta/seed_bench_2_plus.json' 2026-04-24T06:57:37,984 adding 'evalscope/benchmarks/_meta/simple_qa.json' 2026-04-24T06:57:37,986 adding 'evalscope/benchmarks/_meta/simple_vqa.json' 2026-04-24T06:57:37,987 adding 'evalscope/benchmarks/_meta/siqa.json' 2026-04-24T06:57:37,991 adding 'evalscope/benchmarks/_meta/super_gpqa.json' 2026-04-24T06:57:37,993 adding 'evalscope/benchmarks/_meta/swe_bench_lite.json' 2026-04-24T06:57:37,994 adding 'evalscope/benchmarks/_meta/swe_bench_verified.json' 2026-04-24T06:57:37,996 adding 'evalscope/benchmarks/_meta/swe_bench_verified_mini.json' 2026-04-24T06:57:37,998 adding 
'evalscope/benchmarks/_meta/tau2_bench.json' 2026-04-24T06:57:38,000 adding 'evalscope/benchmarks/_meta/tau_bench.json' 2026-04-24T06:57:38,001 adding 'evalscope/benchmarks/_meta/terminal_bench_v2.json' 2026-04-24T06:57:38,003 adding 'evalscope/benchmarks/_meta/tifa160.json' 2026-04-24T06:57:38,006 adding 'evalscope/benchmarks/_meta/tir_bench.json' 2026-04-24T06:57:38,008 adding 'evalscope/benchmarks/_meta/tool_bench.json' 2026-04-24T06:57:38,010 adding 'evalscope/benchmarks/_meta/torgo.json' 2026-04-24T06:57:38,012 adding 'evalscope/benchmarks/_meta/trivia_qa.json' 2026-04-24T06:57:38,014 adding 'evalscope/benchmarks/_meta/truthful_qa.json' 2026-04-24T06:57:38,016 adding 'evalscope/benchmarks/_meta/tweebank_ner.json' 2026-04-24T06:57:38,018 adding 'evalscope/benchmarks/_meta/tweet_ner_7.json' 2026-04-24T06:57:38,020 adding 'evalscope/benchmarks/_meta/visulogic.json' 2026-04-24T06:57:38,022 adding 'evalscope/benchmarks/_meta/vstar_bench.json' 2026-04-24T06:57:38,024 adding 'evalscope/benchmarks/_meta/winogrande.json' 2026-04-24T06:57:38,027 adding 'evalscope/benchmarks/_meta/wmt24pp.json' 2026-04-24T06:57:38,029 adding 'evalscope/benchmarks/_meta/wnut2017.json' 2026-04-24T06:57:38,031 adding 'evalscope/benchmarks/_meta/zebralogicbench.json' 2026-04-24T06:57:38,033 adding 'evalscope/benchmarks/_meta/zerobench.json' 2026-04-24T06:57:38,035 adding 'evalscope/benchmarks/a_okvqa/__init__.py' 2026-04-24T06:57:38,036 adding 'evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py' 2026-04-24T06:57:38,038 adding 'evalscope/benchmarks/aa_lcr/__init__.py' 2026-04-24T06:57:38,040 adding 'evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py' 2026-04-24T06:57:38,041 adding 'evalscope/benchmarks/ai2d/__init__.py' 2026-04-24T06:57:38,043 adding 'evalscope/benchmarks/ai2d/ai2d_adapter.py' 2026-04-24T06:57:38,044 adding 'evalscope/benchmarks/aime/__init__.py' 2026-04-24T06:57:38,046 adding 'evalscope/benchmarks/aime/aime_adapter.py' 2026-04-24T06:57:38,048 adding 
'evalscope/benchmarks/aime/grader.py' 2026-04-24T06:57:38,050 adding 'evalscope/benchmarks/aime/math_normalize.py' 2026-04-24T06:57:38,051 adding 'evalscope/benchmarks/alpaca_eval/__init__.py' 2026-04-24T06:57:38,053 adding 'evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py' 2026-04-24T06:57:38,055 adding 'evalscope/benchmarks/amc/__init__.py' 2026-04-24T06:57:38,056 adding 'evalscope/benchmarks/amc/amc_adapter.py' 2026-04-24T06:57:38,058 adding 'evalscope/benchmarks/arc/__init__.py' 2026-04-24T06:57:38,059 adding 'evalscope/benchmarks/arc/arc_adapter.py' 2026-04-24T06:57:38,061 adding 'evalscope/benchmarks/arena_hard/__init__.py' 2026-04-24T06:57:38,063 adding 'evalscope/benchmarks/arena_hard/arena_hard_adapter.py' 2026-04-24T06:57:38,064 adding 'evalscope/benchmarks/arena_hard/requirements.txt' 2026-04-24T06:57:38,065 adding 'evalscope/benchmarks/arena_hard/utils.py' 2026-04-24T06:57:38,067 adding 'evalscope/benchmarks/bbh/__init__.py' 2026-04-24T06:57:38,069 adding 'evalscope/benchmarks/bbh/bbh_adapter.py' 2026-04-24T06:57:38,071 adding 'evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt' 2026-04-24T06:57:38,073 adding 'evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt' 2026-04-24T06:57:38,074 adding 'evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt' 2026-04-24T06:57:38,075 adding 'evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt' 2026-04-24T06:57:38,077 adding 'evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt' 2026-04-24T06:57:38,078 adding 'evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt' 2026-04-24T06:57:38,080 adding 'evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt' 2026-04-24T06:57:38,081 adding 'evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt' 2026-04-24T06:57:38,082 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt' 2026-04-24T06:57:38,083 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt' 2026-04-24T06:57:38,084 adding 
'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt' 2026-04-24T06:57:38,086 adding 'evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt' 2026-04-24T06:57:38,087 adding 'evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt' 2026-04-24T06:57:38,088 adding 'evalscope/benchmarks/bbh/cot_prompts/navigate.txt' 2026-04-24T06:57:38,089 adding 'evalscope/benchmarks/bbh/cot_prompts/object_counting.txt' 2026-04-24T06:57:38,091 adding 'evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt' 2026-04-24T06:57:38,092 adding 'evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt' 2026-04-24T06:57:38,093 adding 'evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt' 2026-04-24T06:57:38,095 adding 'evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt' 2026-04-24T06:57:38,096 adding 'evalscope/benchmarks/bbh/cot_prompts/snarks.txt' 2026-04-24T06:57:38,098 adding 'evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt' 2026-04-24T06:57:38,099 adding 'evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt' 2026-04-24T06:57:38,100 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt' 2026-04-24T06:57:38,102 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt' 2026-04-24T06:57:38,103 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt' 2026-04-24T06:57:38,104 adding 'evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt' 2026-04-24T06:57:38,105 adding 'evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt' 2026-04-24T06:57:38,107 adding 'evalscope/benchmarks/bfcl/__init__.py' 2026-04-24T06:57:38,109 adding 'evalscope/benchmarks/bfcl/requirements.txt' 2026-04-24T06:57:38,111 adding 'evalscope/benchmarks/bfcl/v3/__init__.py' 2026-04-24T06:57:38,113 adding 'evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py' 2026-04-24T06:57:38,115 adding 'evalscope/benchmarks/bfcl/v3/generation.py' 
2026-04-24T06:57:38,116 adding 'evalscope/benchmarks/bfcl/v3/utils.py' 2026-04-24T06:57:38,118 adding 'evalscope/benchmarks/bfcl/v4/__init__.py' 2026-04-24T06:57:38,120 adding 'evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py' 2026-04-24T06:57:38,122 adding 'evalscope/benchmarks/bfcl/v4/utils.py' 2026-04-24T06:57:38,124 adding 'evalscope/benchmarks/biomix_qa/__init__.py' 2026-04-24T06:57:38,125 adding 'evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py' 2026-04-24T06:57:38,127 adding 'evalscope/benchmarks/blink/__init__.py' 2026-04-24T06:57:38,128 adding 'evalscope/benchmarks/blink/blink_adapter.py' 2026-04-24T06:57:38,130 adding 'evalscope/benchmarks/ceval/__init__.py' 2026-04-24T06:57:38,132 adding 'evalscope/benchmarks/ceval/ceval_adapter.py' 2026-04-24T06:57:38,134 adding 'evalscope/benchmarks/chartqa/__init__.py' 2026-04-24T06:57:38,135 adding 'evalscope/benchmarks/chartqa/chartqa_adapter.py' 2026-04-24T06:57:38,136 adding 'evalscope/benchmarks/chartqa/utils.py' 2026-04-24T06:57:38,138 adding 'evalscope/benchmarks/chinese_simple_qa/__init__.py' 2026-04-24T06:57:38,140 adding 'evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py' 2026-04-24T06:57:38,142 adding 'evalscope/benchmarks/cl_bench/__init__.py' 2026-04-24T06:57:38,144 adding 'evalscope/benchmarks/cl_bench/cl_bench_adapter.py' 2026-04-24T06:57:38,145 adding 'evalscope/benchmarks/cl_bench/utils.py' 2026-04-24T06:57:38,147 adding 'evalscope/benchmarks/cmmlu/__init__.py' 2026-04-24T06:57:38,149 adding 'evalscope/benchmarks/cmmlu/cmmlu_adapter.py' 2026-04-24T06:57:38,150 adding 'evalscope/benchmarks/cmmmu/__init__.py' 2026-04-24T06:57:38,152 adding 'evalscope/benchmarks/cmmmu/cmmmu_adapter.py' 2026-04-24T06:57:38,154 adding 'evalscope/benchmarks/cmmmu/utils.py' 2026-04-24T06:57:38,156 adding 'evalscope/benchmarks/cmmu/__init__.py' 2026-04-24T06:57:38,157 adding 'evalscope/benchmarks/cmmu/cmmu_adapter.py' 2026-04-24T06:57:38,159 adding 'evalscope/benchmarks/cmmu/prompt.py' 2026-04-24T06:57:38,160 
adding 'evalscope/benchmarks/coin_flip/__init__.py' 2026-04-24T06:57:38,162 adding 'evalscope/benchmarks/coin_flip/coin_flip_adapter.py' 2026-04-24T06:57:38,164 adding 'evalscope/benchmarks/commonsense_qa/__init__.py' 2026-04-24T06:57:38,165 adding 'evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py' 2026-04-24T06:57:38,167 adding 'evalscope/benchmarks/competition_math/__init__.py' 2026-04-24T06:57:38,168 adding 'evalscope/benchmarks/competition_math/competition_math_adapter.py' 2026-04-24T06:57:38,170 adding 'evalscope/benchmarks/data_collection/__init__.py' 2026-04-24T06:57:38,172 adding 'evalscope/benchmarks/data_collection/data_collection_adapter.py' 2026-04-24T06:57:38,173 adding 'evalscope/benchmarks/docmath/__init__.py' 2026-04-24T06:57:38,175 adding 'evalscope/benchmarks/docmath/docmath_adapter.py' 2026-04-24T06:57:38,176 adding 'evalscope/benchmarks/docmath/utils.py' 2026-04-24T06:57:38,178 adding 'evalscope/benchmarks/docvqa/__init__.py' 2026-04-24T06:57:38,179 adding 'evalscope/benchmarks/docvqa/docvqa_adapter.py' 2026-04-24T06:57:38,181 adding 'evalscope/benchmarks/drivelology/__init__.py' 2026-04-24T06:57:38,183 adding 'evalscope/benchmarks/drivelology/drivelology_binary_adapter.py' 2026-04-24T06:57:38,185 adding 'evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py' 2026-04-24T06:57:38,186 adding 'evalscope/benchmarks/drivelology/drivelology_selection_adapter.py' 2026-04-24T06:57:38,188 adding 'evalscope/benchmarks/drivelology/drivelology_writing_adapter.py' 2026-04-24T06:57:38,190 adding 'evalscope/benchmarks/drop/__init__.py' 2026-04-24T06:57:38,192 adding 'evalscope/benchmarks/drop/drop_adapter.py' 2026-04-24T06:57:38,194 adding 'evalscope/benchmarks/drop/utils.py' 2026-04-24T06:57:38,195 adding 'evalscope/benchmarks/eq_bench/__init__.py' 2026-04-24T06:57:38,197 adding 'evalscope/benchmarks/eq_bench/answer_validation.py' 2026-04-24T06:57:38,199 adding 'evalscope/benchmarks/eq_bench/eq_bench_adapter.py' 
2026-04-24T06:57:38,201 adding 'evalscope/benchmarks/fleurs/__init__.py' 2026-04-24T06:57:38,203 adding 'evalscope/benchmarks/fleurs/fleurs_adapter.py' 2026-04-24T06:57:38,204 adding 'evalscope/benchmarks/frames/__init__.py' 2026-04-24T06:57:38,206 adding 'evalscope/benchmarks/frames/frames_adapter.py' 2026-04-24T06:57:38,207 adding 'evalscope/benchmarks/frames/utils.py' 2026-04-24T06:57:38,209 adding 'evalscope/benchmarks/general_arena/__init__.py' 2026-04-24T06:57:38,212 adding 'evalscope/benchmarks/general_arena/general_arena_adapter.py' 2026-04-24T06:57:38,213 adding 'evalscope/benchmarks/general_arena/requirements.txt' 2026-04-24T06:57:38,215 adding 'evalscope/benchmarks/general_arena/utils.py' 2026-04-24T06:57:38,217 adding 'evalscope/benchmarks/general_fc/__init__.py' 2026-04-24T06:57:38,219 adding 'evalscope/benchmarks/general_fc/general_fc_adapter.py' 2026-04-24T06:57:38,221 adding 'evalscope/benchmarks/general_mcq/__init__.py' 2026-04-24T06:57:38,222 adding 'evalscope/benchmarks/general_mcq/general_mcq_adapter.py' 2026-04-24T06:57:38,224 adding 'evalscope/benchmarks/general_qa/__init__.py' 2026-04-24T06:57:38,226 adding 'evalscope/benchmarks/general_qa/general_qa_adapter.py' 2026-04-24T06:57:38,228 adding 'evalscope/benchmarks/general_vmcq/__init__.py' 2026-04-24T06:57:38,229 adding 'evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py' 2026-04-24T06:57:38,231 adding 'evalscope/benchmarks/general_vqa/__init__.py' 2026-04-24T06:57:38,233 adding 'evalscope/benchmarks/general_vqa/general_vqa_adapter.py' 2026-04-24T06:57:38,235 adding 'evalscope/benchmarks/gpqa/__init__.py' 2026-04-24T06:57:38,236 adding 'evalscope/benchmarks/gpqa/gpqa_adapter.py' 2026-04-24T06:57:38,238 adding 'evalscope/benchmarks/gpqa/prompt.py' 2026-04-24T06:57:38,239 adding 'evalscope/benchmarks/gsm8k/__init__.py' 2026-04-24T06:57:38,241 adding 'evalscope/benchmarks/gsm8k/gsm8k_adapter.py' 2026-04-24T06:57:38,242 adding 'evalscope/benchmarks/gsm8k_v/__init__.py' 
2026-04-24T06:57:38,244 adding 'evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py' 2026-04-24T06:57:38,246 adding 'evalscope/benchmarks/hallusion_bench/__init__.py' 2026-04-24T06:57:38,247 adding 'evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py' 2026-04-24T06:57:38,249 adding 'evalscope/benchmarks/halu_eval/__init__.py' 2026-04-24T06:57:38,250 adding 'evalscope/benchmarks/halu_eval/halu_eval_adapter.py' 2026-04-24T06:57:38,252 adding 'evalscope/benchmarks/halu_eval/halu_eval_instructions.py' 2026-04-24T06:57:38,254 adding 'evalscope/benchmarks/healthbench/__init__.py' 2026-04-24T06:57:38,256 adding 'evalscope/benchmarks/healthbench/healthbench_adapter.py' 2026-04-24T06:57:38,258 adding 'evalscope/benchmarks/healthbench/utils.py' 2026-04-24T06:57:38,259 adding 'evalscope/benchmarks/hellaswag/__init__.py' 2026-04-24T06:57:38,261 adding 'evalscope/benchmarks/hellaswag/hellaswag_adapter.py' 2026-04-24T06:57:38,263 adding 'evalscope/benchmarks/hle/__init__.py' 2026-04-24T06:57:38,264 adding 'evalscope/benchmarks/hle/hle_adapter.py' 2026-04-24T06:57:38,267 adding 'evalscope/benchmarks/hmmt25/hmmt25_adapter.py' 2026-04-24T06:57:38,269 adding 'evalscope/benchmarks/humaneval/__init__.py' 2026-04-24T06:57:38,270 adding 'evalscope/benchmarks/humaneval/humaneval_adapter.py' 2026-04-24T06:57:38,272 adding 'evalscope/benchmarks/humaneval/utils.py' 2026-04-24T06:57:38,274 adding 'evalscope/benchmarks/humanevalplus/__init__.py' 2026-04-24T06:57:38,275 adding 'evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py' 2026-04-24T06:57:38,277 adding 'evalscope/benchmarks/humanevalplus/docker/Dockerfile' 2026-04-24T06:57:38,279 adding 'evalscope/benchmarks/ifbench/__init__.py' 2026-04-24T06:57:38,281 adding 'evalscope/benchmarks/ifbench/evaluation_lib.py' 2026-04-24T06:57:38,282 adding 'evalscope/benchmarks/ifbench/ifbench_adapter.py' 2026-04-24T06:57:38,290 adding 'evalscope/benchmarks/ifbench/instructions.py' 2026-04-24T06:57:38,292 adding 
'evalscope/benchmarks/ifbench/instructions_registry.py' 2026-04-24T06:57:38,295 adding 'evalscope/benchmarks/ifbench/instructions_util.py' 2026-04-24T06:57:38,296 adding 'evalscope/benchmarks/ifbench/requirements.txt' 2026-04-24T06:57:38,298 adding 'evalscope/benchmarks/ifeval/__init__.py' 2026-04-24T06:57:38,299 adding 'evalscope/benchmarks/ifeval/ifeval_adapter.py' 2026-04-24T06:57:38,304 adding 'evalscope/benchmarks/ifeval/instructions.py' 2026-04-24T06:57:38,306 adding 'evalscope/benchmarks/ifeval/instructions_registry.py' 2026-04-24T06:57:38,310 adding 'evalscope/benchmarks/ifeval/instructions_util.py' 2026-04-24T06:57:38,311 adding 'evalscope/benchmarks/ifeval/requirements.txt' 2026-04-24T06:57:38,312 adding 'evalscope/benchmarks/ifeval/utils.py' 2026-04-24T06:57:38,314 adding 'evalscope/benchmarks/image_edit/__init__.py' 2026-04-24T06:57:38,316 adding 'evalscope/benchmarks/image_edit/gedit/__init__.py' 2026-04-24T06:57:38,317 adding 'evalscope/benchmarks/image_edit/gedit/gedit_adapter.py' 2026-04-24T06:57:38,319 adding 'evalscope/benchmarks/image_edit/gedit/utils.py' 2026-04-24T06:57:38,321 adding 'evalscope/benchmarks/image_edit/gedit/vie_prompts.py' 2026-04-24T06:57:38,324 adding 'evalscope/benchmarks/infovqa/__init__.py' 2026-04-24T06:57:38,325 adding 'evalscope/benchmarks/infovqa/infovqa_adapter.py' 2026-04-24T06:57:38,327 adding 'evalscope/benchmarks/iquiz/__init__.py' 2026-04-24T06:57:38,328 adding 'evalscope/benchmarks/iquiz/iquiz_adapter.py' 2026-04-24T06:57:38,330 adding 'evalscope/benchmarks/librispeech/__init__.py' 2026-04-24T06:57:38,332 adding 'evalscope/benchmarks/librispeech/librispeech_adapter.py' 2026-04-24T06:57:38,334 adding 'evalscope/benchmarks/live_code_bench/__init__.py' 2026-04-24T06:57:38,335 adding 'evalscope/benchmarks/live_code_bench/evaluate_utils.py' 2026-04-24T06:57:38,337 adding 'evalscope/benchmarks/live_code_bench/extract_utils.py' 2026-04-24T06:57:38,338 adding 
'evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py' 2026-04-24T06:57:38,340 adding 'evalscope/benchmarks/live_code_bench/load_utils.py' 2026-04-24T06:57:38,341 adding 'evalscope/benchmarks/live_code_bench/pass_k_utils.py' 2026-04-24T06:57:38,343 adding 'evalscope/benchmarks/live_code_bench/prompts.py' 2026-04-24T06:57:38,345 adding 'evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py' 2026-04-24T06:57:38,347 adding 'evalscope/benchmarks/live_code_bench/testing_util.py' 2026-04-24T06:57:38,349 adding 'evalscope/benchmarks/logi_qa/__int__.py' 2026-04-24T06:57:38,350 adding 'evalscope/benchmarks/logi_qa/logi_qa_adapter.py' 2026-04-24T06:57:38,352 adding 'evalscope/benchmarks/longbench_v2/__init__.py' 2026-04-24T06:57:38,353 adding 'evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py' 2026-04-24T06:57:38,355 adding 'evalscope/benchmarks/maritime_bench/__init__.py' 2026-04-24T06:57:38,356 adding 'evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py' 2026-04-24T06:57:38,358 adding 'evalscope/benchmarks/math_500/__init__.py' 2026-04-24T06:57:38,359 adding 'evalscope/benchmarks/math_500/math_500_adapter.py' 2026-04-24T06:57:38,361 adding 'evalscope/benchmarks/math_qa/__init__.py' 2026-04-24T06:57:38,362 adding 'evalscope/benchmarks/math_qa/math_qa_adapter.py' 2026-04-24T06:57:38,364 adding 'evalscope/benchmarks/math_verse/__init__.py' 2026-04-24T06:57:38,366 adding 'evalscope/benchmarks/math_verse/math_verse_adapter.py' 2026-04-24T06:57:38,367 adding 'evalscope/benchmarks/math_vision/__init__.py' 2026-04-24T06:57:38,369 adding 'evalscope/benchmarks/math_vision/math_vision_adapter.py' 2026-04-24T06:57:38,370 adding 'evalscope/benchmarks/math_vista/__init__.py' 2026-04-24T06:57:38,372 adding 'evalscope/benchmarks/math_vista/math_vista_adapter.py' 2026-04-24T06:57:38,374 adding 'evalscope/benchmarks/mbpp/__init__.py' 2026-04-24T06:57:38,375 adding 'evalscope/benchmarks/mbpp/mbpp_adapter.py' 2026-04-24T06:57:38,377 adding 
'evalscope/benchmarks/mbppplus/__init__.py' 2026-04-24T06:57:38,379 adding 'evalscope/benchmarks/mbppplus/mbppplus_adapter.py' 2026-04-24T06:57:38,380 adding 'evalscope/benchmarks/med_mcqa/__init__.py' 2026-04-24T06:57:38,382 adding 'evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py' 2026-04-24T06:57:38,383 adding 'evalscope/benchmarks/mgsm/__init__.py' 2026-04-24T06:57:38,385 adding 'evalscope/benchmarks/mgsm/mgsm_adapter.py' 2026-04-24T06:57:38,386 adding 'evalscope/benchmarks/mia_bench/__init__.py' 2026-04-24T06:57:38,388 adding 'evalscope/benchmarks/mia_bench/mia_bench_adapter.py' 2026-04-24T06:57:38,390 adding 'evalscope/benchmarks/mia_bench/utils.py' 2026-04-24T06:57:38,392 adding 'evalscope/benchmarks/micro_vqa/__init__.py' 2026-04-24T06:57:38,393 adding 'evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py' 2026-04-24T06:57:38,395 adding 'evalscope/benchmarks/minerva_math/__init__.py' 2026-04-24T06:57:38,396 adding 'evalscope/benchmarks/minerva_math/minerva_math_adapter.py' 2026-04-24T06:57:38,398 adding 'evalscope/benchmarks/mm_bench/__init__.py' 2026-04-24T06:57:38,400 adding 'evalscope/benchmarks/mm_bench/mm_bench_adapter.py' 2026-04-24T06:57:38,402 adding 'evalscope/benchmarks/mm_star/__init__.py' 2026-04-24T06:57:38,403 adding 'evalscope/benchmarks/mm_star/mm_star_adapter.py' 2026-04-24T06:57:38,405 adding 'evalscope/benchmarks/mmlu/__init__.py' 2026-04-24T06:57:38,407 adding 'evalscope/benchmarks/mmlu/mmlu_adapter.py' 2026-04-24T06:57:38,408 adding 'evalscope/benchmarks/mmlu_pro/__init__.py' 2026-04-24T06:57:38,410 adding 'evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py' 2026-04-24T06:57:38,412 adding 'evalscope/benchmarks/mmlu_redux/__init__.py' 2026-04-24T06:57:38,413 adding 'evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py' 2026-04-24T06:57:38,415 adding 'evalscope/benchmarks/mmmlu/__init__.py' 2026-04-24T06:57:38,416 adding 'evalscope/benchmarks/mmmlu/mmmlu_adapter.py' 2026-04-24T06:57:38,418 adding 'evalscope/benchmarks/mmmlu/prompt.py' 
2026-04-24T06:57:38,420 adding 'evalscope/benchmarks/mmmu/__init__.py' 2026-04-24T06:57:38,421 adding 'evalscope/benchmarks/mmmu/mmmu_adapter.py' 2026-04-24T06:57:38,423 adding 'evalscope/benchmarks/mmmu_pro/__init__.py' 2026-04-24T06:57:38,424 adding 'evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py' 2026-04-24T06:57:38,426 adding 'evalscope/benchmarks/mri_mcqa/__init__.py' 2026-04-24T06:57:38,427 adding 'evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py' 2026-04-24T06:57:38,429 adding 'evalscope/benchmarks/multi_if/__init__.py' 2026-04-24T06:57:38,437 adding 'evalscope/benchmarks/multi_if/ifeval.py' 2026-04-24T06:57:38,439 adding 'evalscope/benchmarks/multi_if/metrics.py' 2026-04-24T06:57:38,441 adding 'evalscope/benchmarks/multi_if/multi_if_adapter.py' 2026-04-24T06:57:38,442 adding 'evalscope/benchmarks/multi_if/requirements.txt' 2026-04-24T06:57:38,444 adding 'evalscope/benchmarks/multipl_e/__init__.py' 2026-04-24T06:57:38,445 adding 'evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py' 2026-04-24T06:57:38,447 adding 'evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py' 2026-04-24T06:57:38,448 adding 'evalscope/benchmarks/multipl_e/utils.py' 2026-04-24T06:57:38,450 adding 'evalscope/benchmarks/music_trivia/__init__.py' 2026-04-24T06:57:38,451 adding 'evalscope/benchmarks/music_trivia/music_trivia_adapter.py' 2026-04-24T06:57:38,453 adding 'evalscope/benchmarks/musr/__init__.py' 2026-04-24T06:57:38,454 adding 'evalscope/benchmarks/musr/musr_adapter.py' 2026-04-24T06:57:38,456 adding 'evalscope/benchmarks/needle_haystack/__init__.py' 2026-04-24T06:57:38,458 adding 'evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py' 2026-04-24T06:57:38,460 adding 'evalscope/benchmarks/needle_haystack/requirements.txt' 2026-04-24T06:57:38,461 adding 'evalscope/benchmarks/needle_haystack/utils.py' 2026-04-24T06:57:38,463 adding 'evalscope/benchmarks/ner/__init__.py' 2026-04-24T06:57:38,465 adding 'evalscope/benchmarks/ner/anat_em_adapter.py' 
2026-04-24T06:57:38,466 adding 'evalscope/benchmarks/ner/bc2gm_adapter.py' 2026-04-24T06:57:38,467 adding 'evalscope/benchmarks/ner/bc4chemd_adapter.py' 2026-04-24T06:57:38,469 adding 'evalscope/benchmarks/ner/bc5cdr_adapter.py' 2026-04-24T06:57:38,470 adding 'evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py' 2026-04-24T06:57:38,471 adding 'evalscope/benchmarks/ner/conll2003_adapter.py' 2026-04-24T06:57:38,473 adding 'evalscope/benchmarks/ner/conllpp_adapter.py' 2026-04-24T06:57:38,474 adding 'evalscope/benchmarks/ner/copious_adapter.py' 2026-04-24T06:57:38,476 adding 'evalscope/benchmarks/ner/cross_ner_adapter.py' 2026-04-24T06:57:38,477 adding 'evalscope/benchmarks/ner/fin_ner_adapter.py' 2026-04-24T06:57:38,479 adding 'evalscope/benchmarks/ner/genia_ner_adapter.py' 2026-04-24T06:57:38,480 adding 'evalscope/benchmarks/ner/harvey_ner_adapter.py' 2026-04-24T06:57:38,482 adding 'evalscope/benchmarks/ner/jnlpba_adapter.py' 2026-04-24T06:57:38,483 adding 'evalscope/benchmarks/ner/jnlpba_rare_adapter.py' 2026-04-24T06:57:38,484 adding 'evalscope/benchmarks/ner/mit_movie_trivia_adapter.py' 2026-04-24T06:57:38,486 adding 'evalscope/benchmarks/ner/mit_restaurant_adapter.py' 2026-04-24T06:57:38,488 adding 'evalscope/benchmarks/ner/multi_nerd_adapter.py' 2026-04-24T06:57:38,489 adding 'evalscope/benchmarks/ner/ncbi_adapter.py' 2026-04-24T06:57:38,491 adding 'evalscope/benchmarks/ner/ontonotes5_adapter.py' 2026-04-24T06:57:38,492 adding 'evalscope/benchmarks/ner/tweebank_ner_adapter.py' 2026-04-24T06:57:38,494 adding 'evalscope/benchmarks/ner/tweet_ner_7_adapter.py' 2026-04-24T06:57:38,496 adding 'evalscope/benchmarks/ner/wnut2017_adapter.py' 2026-04-24T06:57:38,497 adding 'evalscope/benchmarks/ner/cross_ner_entities/__init__.py' 2026-04-24T06:57:38,499 adding 'evalscope/benchmarks/ner/cross_ner_entities/ai.py' 2026-04-24T06:57:38,500 adding 'evalscope/benchmarks/ner/cross_ner_entities/literature.py' 2026-04-24T06:57:38,501 adding 
'evalscope/benchmarks/ner/cross_ner_entities/music.py' 2026-04-24T06:57:38,502 adding 'evalscope/benchmarks/ner/cross_ner_entities/politics.py' 2026-04-24T06:57:38,504 adding 'evalscope/benchmarks/ner/cross_ner_entities/science.py' 2026-04-24T06:57:38,505 adding 'evalscope/benchmarks/ocr_bench/__init__.py' 2026-04-24T06:57:38,506 adding 'evalscope/benchmarks/ocr_bench/requirements.txt' 2026-04-24T06:57:38,508 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py' 2026-04-24T06:57:38,510 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py' 2026-04-24T06:57:38,512 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py' 2026-04-24T06:57:38,516 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py' 2026-04-24T06:57:38,517 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py' 2026-04-24T06:57:38,519 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py' 2026-04-24T06:57:38,520 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py' 2026-04-24T06:57:38,522 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py' 2026-04-24T06:57:38,523 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py' 2026-04-24T06:57:38,525 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py' 2026-04-24T06:57:38,527 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py' 2026-04-24T06:57:38,529 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py' 2026-04-24T06:57:38,530 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt' 2026-04-24T06:57:38,533 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py' 2026-04-24T06:57:38,535 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py' 2026-04-24T06:57:38,537 adding 'evalscope/benchmarks/olympiad_bench/__init__.py' 2026-04-24T06:57:38,539 adding 
'evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py' 2026-04-24T06:57:38,540 adding 'evalscope/benchmarks/olympiad_bench/requirements.txt' 2026-04-24T06:57:38,543 adding 'evalscope/benchmarks/olympiad_bench/utils.py' 2026-04-24T06:57:38,545 adding 'evalscope/benchmarks/omni_bench/__init__.py' 2026-04-24T06:57:38,546 adding 'evalscope/benchmarks/omni_bench/omni_bench_adapter.py' 2026-04-24T06:57:38,548 adding 'evalscope/benchmarks/omnidoc_bench/__init__.py' 2026-04-24T06:57:38,550 adding 'evalscope/benchmarks/omnidoc_bench/end2end_eval.py' 2026-04-24T06:57:38,553 adding 'evalscope/benchmarks/omnidoc_bench/metrics.py' 2026-04-24T06:57:38,555 adding 'evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py' 2026-04-24T06:57:38,556 adding 'evalscope/benchmarks/omnidoc_bench/requirements.txt' 2026-04-24T06:57:38,564 adding 'evalscope/benchmarks/omnidoc_bench/utils.py' 2026-04-24T06:57:38,566 adding 'evalscope/benchmarks/openai_mrcr/__init__.py' 2026-04-24T06:57:38,568 adding 'evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py' 2026-04-24T06:57:38,570 adding 'evalscope/benchmarks/openai_mrcr/requirements.txt' 2026-04-24T06:57:38,571 adding 'evalscope/benchmarks/openai_mrcr/utils.py' 2026-04-24T06:57:38,573 adding 'evalscope/benchmarks/piqa/__init__.py' 2026-04-24T06:57:38,574 adding 'evalscope/benchmarks/piqa/piqa_adapter.py' 2026-04-24T06:57:38,576 adding 'evalscope/benchmarks/poly_math/__init__.py' 2026-04-24T06:57:38,577 adding 'evalscope/benchmarks/poly_math/poly_math_adapter.py' 2026-04-24T06:57:38,580 adding 'evalscope/benchmarks/poly_math/utils/instruction.py' 2026-04-24T06:57:38,581 adding 'evalscope/benchmarks/pope/__init__.py' 2026-04-24T06:57:38,583 adding 'evalscope/benchmarks/pope/pope_adapter.py' 2026-04-24T06:57:38,585 adding 'evalscope/benchmarks/process_bench/__init__.py' 2026-04-24T06:57:38,586 adding 'evalscope/benchmarks/process_bench/process_bench_adapter.py' 2026-04-24T06:57:38,588 adding 
'evalscope/benchmarks/pumed_qa/__init__.py' 2026-04-24T06:57:38,590 adding 'evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py' 2026-04-24T06:57:38,591 adding 'evalscope/benchmarks/qasc/__init__.py' 2026-04-24T06:57:38,593 adding 'evalscope/benchmarks/qasc/qasc_adapter.py' 2026-04-24T06:57:38,594 adding 'evalscope/benchmarks/race/__init__.py' 2026-04-24T06:57:38,596 adding 'evalscope/benchmarks/race/race_adapter.py' 2026-04-24T06:57:38,598 adding 'evalscope/benchmarks/real_world_qa/__init__.py' 2026-04-24T06:57:38,599 adding 'evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py' 2026-04-24T06:57:38,601 adding 'evalscope/benchmarks/refcoco/__init__.py' 2026-04-24T06:57:38,603 adding 'evalscope/benchmarks/refcoco/evaluation_lib.py' 2026-04-24T06:57:38,605 adding 'evalscope/benchmarks/refcoco/refcoco_adapter.py' 2026-04-24T06:57:38,606 adding 'evalscope/benchmarks/refcoco/requirements.txt' 2026-04-24T06:57:38,607 adding 'evalscope/benchmarks/refcoco/utils.py' 2026-04-24T06:57:38,609 adding 'evalscope/benchmarks/scicode/__init__.py' 2026-04-24T06:57:38,610 adding 'evalscope/benchmarks/scicode/prompt_templates.py' 2026-04-24T06:57:38,612 adding 'evalscope/benchmarks/scicode/scicode_adapter.py' 2026-04-24T06:57:38,613 adding 'evalscope/benchmarks/scicode/util.py' 2026-04-24T06:57:38,615 adding 'evalscope/benchmarks/scicode/docker/Dockerfile' 2026-04-24T06:57:38,616 adding 'evalscope/benchmarks/scicode/docker/docker_requirements.txt' 2026-04-24T06:57:38,618 adding 'evalscope/benchmarks/scicode/docker/process_data.py' 2026-04-24T06:57:38,619 adding 'evalscope/benchmarks/scicode/docker/test_util.py' 2026-04-24T06:57:38,621 adding 'evalscope/benchmarks/science_qa/__init__.py' 2026-04-24T06:57:38,622 adding 'evalscope/benchmarks/science_qa/science_qa_adapter.py' 2026-04-24T06:57:38,624 adding 'evalscope/benchmarks/sciq/__init__.py' 2026-04-24T06:57:38,625 adding 'evalscope/benchmarks/sciq/sciq_adapter.py' 2026-04-24T06:57:38,627 adding 
'evalscope/benchmarks/seed_bench_2_plus/__init__.py' 2026-04-24T06:57:38,628 adding 'evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py' 2026-04-24T06:57:38,630 adding 'evalscope/benchmarks/simple_qa/__init__.py' 2026-04-24T06:57:38,632 adding 'evalscope/benchmarks/simple_qa/simple_qa_adapter.py' 2026-04-24T06:57:38,634 adding 'evalscope/benchmarks/simple_vqa/__init__.py' 2026-04-24T06:57:38,636 adding 'evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py' 2026-04-24T06:57:38,638 adding 'evalscope/benchmarks/siqa/__init__.py' 2026-04-24T06:57:38,639 adding 'evalscope/benchmarks/siqa/siqa_adapter.py' 2026-04-24T06:57:38,641 adding 'evalscope/benchmarks/super_gpqa/__init__.py' 2026-04-24T06:57:38,643 adding 'evalscope/benchmarks/super_gpqa/prompt.py' 2026-04-24T06:57:38,645 adding 'evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py' 2026-04-24T06:57:38,646 adding 'evalscope/benchmarks/super_gpqa/utils.py' 2026-04-24T06:57:38,648 adding 'evalscope/benchmarks/swe_bench/__init__.py' 2026-04-24T06:57:38,650 adding 'evalscope/benchmarks/swe_bench/build_images.py' 2026-04-24T06:57:38,651 adding 'evalscope/benchmarks/swe_bench/requirements.txt' 2026-04-24T06:57:38,653 adding 'evalscope/benchmarks/swe_bench/swe_bench_adapter.py' 2026-04-24T06:57:38,655 adding 'evalscope/benchmarks/swe_bench/utils.py' 2026-04-24T06:57:38,657 adding 'evalscope/benchmarks/tau_bench/__init__.py' 2026-04-24T06:57:38,658 adding 'evalscope/benchmarks/tau_bench/tau2_bench/__init__.py' 2026-04-24T06:57:38,660 adding 'evalscope/benchmarks/tau_bench/tau2_bench/generation.py' 2026-04-24T06:57:38,661 adding 'evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt' 2026-04-24T06:57:38,663 adding 'evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py' 2026-04-24T06:57:38,665 adding 'evalscope/benchmarks/tau_bench/tau_bench/__init__.py' 2026-04-24T06:57:38,667 adding 'evalscope/benchmarks/tau_bench/tau_bench/generation.py' 2026-04-24T06:57:38,668 adding 
'evalscope/benchmarks/tau_bench/tau_bench/requirements.txt' 2026-04-24T06:57:38,669 adding 'evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py' 2026-04-24T06:57:38,671 adding 'evalscope/benchmarks/terminal_bench/__init__.py' 2026-04-24T06:57:38,672 adding 'evalscope/benchmarks/terminal_bench/requirements.txt' 2026-04-24T06:57:38,674 adding 'evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py' 2026-04-24T06:57:38,675 adding 'evalscope/benchmarks/terminal_bench/utils.py' 2026-04-24T06:57:38,677 adding 'evalscope/benchmarks/text2image/__init__.py' 2026-04-24T06:57:38,678 adding 'evalscope/benchmarks/text2image/evalmuse_adapter.py' 2026-04-24T06:57:38,680 adding 'evalscope/benchmarks/text2image/genai_bench_adapter.py' 2026-04-24T06:57:38,681 adding 'evalscope/benchmarks/text2image/general_t2i_adapter.py' 2026-04-24T06:57:38,683 adding 'evalscope/benchmarks/text2image/hpdv2_adapter.py' 2026-04-24T06:57:38,685 adding 'evalscope/benchmarks/text2image/tifa_adapter.py' 2026-04-24T06:57:38,686 adding 'evalscope/benchmarks/tir_bench/__init__.py' 2026-04-24T06:57:38,689 adding 'evalscope/benchmarks/tir_bench/tir_bench_adapter.py' 2026-04-24T06:57:38,691 adding 'evalscope/benchmarks/tir_bench/utils.py' 2026-04-24T06:57:38,693 adding 'evalscope/benchmarks/tool_bench/__init__.py' 2026-04-24T06:57:38,694 adding 'evalscope/benchmarks/tool_bench/tool_bench_adapter.py' 2026-04-24T06:57:38,696 adding 'evalscope/benchmarks/tool_bench/utils.py' 2026-04-24T06:57:38,698 adding 'evalscope/benchmarks/torgo/__init__.py' 2026-04-24T06:57:38,699 adding 'evalscope/benchmarks/torgo/requirements.txt' 2026-04-24T06:57:38,700 adding 'evalscope/benchmarks/torgo/torgo_adapter.py' 2026-04-24T06:57:38,702 adding 'evalscope/benchmarks/trivia_qa/__init__.py' 2026-04-24T06:57:38,704 adding 'evalscope/benchmarks/trivia_qa/samples.jsonl' 2026-04-24T06:57:38,705 adding 'evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py' 2026-04-24T06:57:38,707 adding 
'evalscope/benchmarks/truthful_qa/__init__.py' 2026-04-24T06:57:38,708 adding 'evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py' 2026-04-24T06:57:38,710 adding 'evalscope/benchmarks/visu_logic/__init__.py' 2026-04-24T06:57:38,711 adding 'evalscope/benchmarks/visu_logic/visu_logic_adapter.py' 2026-04-24T06:57:38,713 adding 'evalscope/benchmarks/vstar_bench/__init__.py' 2026-04-24T06:57:38,715 adding 'evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py' 2026-04-24T06:57:38,716 adding 'evalscope/benchmarks/winogrande/__init__.py' 2026-04-24T06:57:38,718 adding 'evalscope/benchmarks/winogrande/winogrande_adapter.py' 2026-04-24T06:57:38,719 adding 'evalscope/benchmarks/wmt/__init__.py' 2026-04-24T06:57:38,720 adding 'evalscope/benchmarks/wmt/requirements.txt' 2026-04-24T06:57:38,722 adding 'evalscope/benchmarks/wmt/wmt24_adapter.py' 2026-04-24T06:57:38,724 adding 'evalscope/benchmarks/zebralogicbench/__init__.py' 2026-04-24T06:57:38,726 adding 'evalscope/benchmarks/zebralogicbench/utils.py' 2026-04-24T06:57:38,728 adding 'evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py' 2026-04-24T06:57:38,729 adding 'evalscope/benchmarks/zerobench/__init__.py' 2026-04-24T06:57:38,731 adding 'evalscope/benchmarks/zerobench/zerobench_adapter.py' 2026-04-24T06:57:38,733 adding 'evalscope/cli/__init__.py' 2026-04-24T06:57:38,734 adding 'evalscope/cli/base.py' 2026-04-24T06:57:38,736 adding 'evalscope/cli/benchmark_info.py' 2026-04-24T06:57:38,737 adding 'evalscope/cli/cli.py' 2026-04-24T06:57:38,739 adding 'evalscope/cli/start_app.py' 2026-04-24T06:57:38,740 adding 'evalscope/cli/start_eval.py' 2026-04-24T06:57:38,741 adding 'evalscope/cli/start_perf.py' 2026-04-24T06:57:38,743 adding 'evalscope/cli/start_service.py' 2026-04-24T06:57:38,745 adding 'evalscope/collections/__init__.py' 2026-04-24T06:57:38,746 adding 'evalscope/collections/sampler.py' 2026-04-24T06:57:38,748 adding 'evalscope/collections/schema.py' 2026-04-24T06:57:38,750 adding 
'evalscope/evaluator/__init__.py' 2026-04-24T06:57:38,751 adding 'evalscope/evaluator/batch_reviewer.py' 2026-04-24T06:57:38,754 adding 'evalscope/evaluator/evaluator.py' 2026-04-24T06:57:38,756 adding 'evalscope/evaluator/perf_collector.py' 2026-04-24T06:57:38,757 adding 'evalscope/filters/__init__.py' 2026-04-24T06:57:38,759 adding 'evalscope/filters/extraction.py' 2026-04-24T06:57:38,760 adding 'evalscope/filters/selection.py' 2026-04-24T06:57:38,762 adding 'evalscope/metrics/__init__.py' 2026-04-24T06:57:38,764 adding 'evalscope/metrics/llm_judge.py' 2026-04-24T06:57:38,766 adding 'evalscope/metrics/math_parser.py' 2026-04-24T06:57:38,769 adding 'evalscope/metrics/metric.py' 2026-04-24T06:57:38,771 adding 'evalscope/metrics/metrics.py' 2026-04-24T06:57:38,773 adding 'evalscope/metrics/rouge_metric.py' 2026-04-24T06:57:38,775 adding 'evalscope/metrics/bert_score/__init__.py' 2026-04-24T06:57:38,777 adding 'evalscope/metrics/bert_score/scorer.py' 2026-04-24T06:57:38,780 adding 'evalscope/metrics/bert_score/utils.py' 2026-04-24T06:57:38,782 adding 'evalscope/metrics/bundled_rouge_score/__init__.py' 2026-04-24T06:57:38,784 adding 'evalscope/metrics/bundled_rouge_score/rouge_scorer.py' 2026-04-24T06:57:38,786 adding 'evalscope/metrics/sem_score/__init__.py' 2026-04-24T06:57:38,788 adding 'evalscope/metrics/sem_score/scorer.py' 2026-04-24T06:57:38,789 adding 'evalscope/metrics/t2v_metrics/__init__.py' 2026-04-24T06:57:38,791 adding 'evalscope/metrics/t2v_metrics/clipscore.py' 2026-04-24T06:57:38,792 adding 'evalscope/metrics/t2v_metrics/constants.py' 2026-04-24T06:57:38,793 adding 'evalscope/metrics/t2v_metrics/itmscore.py' 2026-04-24T06:57:38,794 adding 'evalscope/metrics/t2v_metrics/score.py' 2026-04-24T06:57:38,795 adding 'evalscope/metrics/t2v_metrics/vqascore.py' 2026-04-24T06:57:38,797 adding 'evalscope/metrics/t2v_metrics/models/__init__.py' 2026-04-24T06:57:38,798 adding 'evalscope/metrics/t2v_metrics/models/model.py' 2026-04-24T06:57:38,800 adding 
'evalscope/metrics/t2v_metrics/models/utils.py' 2026-04-24T06:57:38,801 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py' 2026-04-24T06:57:38,803 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py' 2026-04-24T06:57:38,804 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py' 2026-04-24T06:57:38,806 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py' 2026-04-24T06:57:38,807 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py' 2026-04-24T06:57:38,809 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py' 2026-04-24T06:57:38,810 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py' 2026-04-24T06:57:38,812 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py' 2026-04-24T06:57:38,814 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py' 2026-04-24T06:57:38,815 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py' 2026-04-24T06:57:38,817 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py' 2026-04-24T06:57:38,818 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py' 2026-04-24T06:57:38,820 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py' 2026-04-24T06:57:38,822 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py' 2026-04-24T06:57:38,823 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py' 2026-04-24T06:57:38,825 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py' 2026-04-24T06:57:38,827 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py' 2026-04-24T06:57:38,828 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py' 
2026-04-24T06:57:38,830 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py' 2026-04-24T06:57:38,831 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py' 2026-04-24T06:57:38,832 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py' 2026-04-24T06:57:38,834 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py' 2026-04-24T06:57:38,836 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py' 2026-04-24T06:57:38,838 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py' 2026-04-24T06:57:38,840 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py' 2026-04-24T06:57:38,841 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py' 2026-04-24T06:57:38,843 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py' 2026-04-24T06:57:38,845 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py' 2026-04-24T06:57:38,848 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py' 2026-04-24T06:57:38,849 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py' 2026-04-24T06:57:38,850 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py' 2026-04-24T06:57:38,852 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py' 2026-04-24T06:57:38,853 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py' 2026-04-24T06:57:38,855 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py' 2026-04-24T06:57:38,857 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py' 2026-04-24T06:57:38,859 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py' 2026-04-24T06:57:38,861 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py' 2026-04-24T06:57:38,863 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py' 2026-04-24T06:57:38,865 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml' 2026-04-24T06:57:38,867 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json' 2026-04-24T06:57:38,868 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json' 2026-04-24T06:57:38,869 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json' 2026-04-24T06:57:38,871 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml' 2026-04-24T06:57:38,873 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml' 2026-04-24T06:57:38,874 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml' 2026-04-24T06:57:38,875 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml' 2026-04-24T06:57:38,877 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml' 2026-04-24T06:57:38,878 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml' 2026-04-24T06:57:38,879 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml' 2026-04-24T06:57:38,880 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml' 2026-04-24T06:57:38,882 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml' 2026-04-24T06:57:38,883 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml' 2026-04-24T06:57:38,884 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml' 2026-04-24T06:57:38,886 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml' 2026-04-24T06:57:38,887 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml' 2026-04-24T06:57:38,888 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml' 2026-04-24T06:57:38,889 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml' 2026-04-24T06:57:38,890 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml' 2026-04-24T06:57:38,892 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml' 2026-04-24T06:57:38,893 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml' 2026-04-24T06:57:38,894 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml' 2026-04-24T06:57:38,897 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py' 2026-04-24T06:57:38,898 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py' 2026-04-24T06:57:38,900 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py' 2026-04-24T06:57:38,903 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py' 2026-04-24T06:57:38,908 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py' 2026-04-24T06:57:38,911 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py' 2026-04-24T06:57:38,916 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py' 2026-04-24T06:57:38,918 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py' 2026-04-24T06:57:38,919 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py' 2026-04-24T06:57:38,921 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py' 2026-04-24T06:57:38,924 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py' 2026-04-24T06:57:38,926 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py' 2026-04-24T06:57:38,929 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py' 2026-04-24T06:57:38,931 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py' 2026-04-24T06:57:38,935 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py' 2026-04-24T06:57:38,943 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py' 2026-04-24T06:57:38,946 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py' 2026-04-24T06:57:38,947 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py' 2026-04-24T06:57:38,949 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py' 2026-04-24T06:57:38,950 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py' 2026-04-24T06:57:38,952 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py' 2026-04-24T06:57:38,954 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py' 2026-04-24T06:57:38,955 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py' 2026-04-24T06:57:38,957 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py' 2026-04-24T06:57:38,959 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py' 2026-04-24T06:57:38,961 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py' 2026-04-24T06:57:38,965 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py' 2026-04-24T06:57:38,967 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py' 2026-04-24T06:57:38,968 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py' 2026-04-24T06:57:38,970 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py' 2026-04-24T06:57:38,972 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py' 2026-04-24T06:57:38,973 adding 'evalscope/metrics/text_normalizer/__init__.py' 2026-04-24T06:57:38,975 adding 'evalscope/metrics/text_normalizer/basic.py' 2026-04-24T06:57:38,981 adding 'evalscope/metrics/text_normalizer/chinese.py' 2026-04-24T06:57:38,990 adding 'evalscope/metrics/text_normalizer/english.json' 2026-04-24T06:57:38,992 adding 'evalscope/metrics/text_normalizer/english.py' 2026-04-24T06:57:38,994 adding 'evalscope/metrics/text_normalizer/wer.py' 2026-04-24T06:57:38,996 adding 
'evalscope/models/__init__.py' 2026-04-24T06:57:38,997 adding 'evalscope/models/anthropic_compatible.py' 2026-04-24T06:57:38,999 adding 'evalscope/models/image_edit_model.py' 2026-04-24T06:57:39,000 adding 'evalscope/models/mockllm.py' 2026-04-24T06:57:39,002 adding 'evalscope/models/model_apis.py' 2026-04-24T06:57:39,004 adding 'evalscope/models/modelscope.py' 2026-04-24T06:57:39,006 adding 'evalscope/models/openai_compatible.py' 2026-04-24T06:57:39,007 adding 'evalscope/models/text2image_model.py' 2026-04-24T06:57:39,011 adding 'evalscope/models/utils/anthropic.py' 2026-04-24T06:57:39,015 adding 'evalscope/models/utils/openai.py' 2026-04-24T06:57:39,017 adding 'evalscope/perf/__init__.py' 2026-04-24T06:57:39,019 adding 'evalscope/perf/arguments.py' 2026-04-24T06:57:39,021 adding 'evalscope/perf/benchmark.py' 2026-04-24T06:57:39,023 adding 'evalscope/perf/http_client.py' 2026-04-24T06:57:39,024 adding 'evalscope/perf/main.py' 2026-04-24T06:57:39,026 adding 'evalscope/perf/multi_turn_benchmark.py' 2026-04-24T06:57:39,028 adding 'evalscope/perf/plugin/__init__.py' 2026-04-24T06:57:39,029 adding 'evalscope/perf/plugin/registry.py' 2026-04-24T06:57:39,031 adding 'evalscope/perf/plugin/api/__init__.py' 2026-04-24T06:57:39,032 adding 'evalscope/perf/plugin/api/base.py' 2026-04-24T06:57:39,034 adding 'evalscope/perf/plugin/api/custom_api.py' 2026-04-24T06:57:39,036 adding 'evalscope/perf/plugin/api/dashscope_api.py' 2026-04-24T06:57:39,038 adding 'evalscope/perf/plugin/api/default_api.py' 2026-04-24T06:57:39,040 adding 'evalscope/perf/plugin/api/openai_api.py' 2026-04-24T06:57:39,042 adding 'evalscope/perf/plugin/api/openai_embedding_api.py' 2026-04-24T06:57:39,044 adding 'evalscope/perf/plugin/api/openai_rerank_api.py' 2026-04-24T06:57:39,046 adding 'evalscope/perf/plugin/datasets/__init__.py' 2026-04-24T06:57:39,047 adding 'evalscope/perf/plugin/datasets/base.py' 2026-04-24T06:57:39,049 adding 'evalscope/perf/plugin/datasets/custom.py' 2026-04-24T06:57:39,050 adding 
'evalscope/perf/plugin/datasets/embedding_dataset.py' 2026-04-24T06:57:39,052 adding 'evalscope/perf/plugin/datasets/flickr8k.py' 2026-04-24T06:57:39,053 adding 'evalscope/perf/plugin/datasets/kontext_bench.py' 2026-04-24T06:57:39,055 adding 'evalscope/perf/plugin/datasets/line_by_line.py' 2026-04-24T06:57:39,056 adding 'evalscope/perf/plugin/datasets/longalpaca.py' 2026-04-24T06:57:39,058 adding 'evalscope/perf/plugin/datasets/multi_turn.py' 2026-04-24T06:57:39,060 adding 'evalscope/perf/plugin/datasets/openqa.py' 2026-04-24T06:57:39,062 adding 'evalscope/perf/plugin/datasets/random_dataset.py' 2026-04-24T06:57:39,063 adding 'evalscope/perf/plugin/datasets/random_vl_dataset.py' 2026-04-24T06:57:39,064 adding 'evalscope/perf/plugin/datasets/rerank_dataset.py' 2026-04-24T06:57:39,066 adding 'evalscope/perf/plugin/datasets/share_gpt.py' 2026-04-24T06:57:39,067 adding 'evalscope/perf/plugin/datasets/speed_benchmark.py' 2026-04-24T06:57:39,069 adding 'evalscope/perf/plugin/datasets/utils.py' 2026-04-24T06:57:39,070 adding 'evalscope/perf/sla/__init__.py' 2026-04-24T06:57:39,071 adding 'evalscope/perf/sla/sla_criterion.py' 2026-04-24T06:57:39,074 adding 'evalscope/perf/sla/sla_run.py' 2026-04-24T06:57:39,076 adding 'evalscope/perf/utils/__init__.py' 2026-04-24T06:57:39,077 adding 'evalscope/perf/utils/analysis_result.py' 2026-04-24T06:57:39,079 adding 'evalscope/perf/utils/benchmark_util.py' 2026-04-24T06:57:39,082 adding 'evalscope/perf/utils/db_util.py' 2026-04-24T06:57:39,083 adding 'evalscope/perf/utils/handler.py' 2026-04-24T06:57:39,084 adding 'evalscope/perf/utils/local_server.py' 2026-04-24T06:57:39,086 adding 'evalscope/perf/utils/log_utils.py' 2026-04-24T06:57:39,089 adding 'evalscope/perf/utils/rich_display.py' 2026-04-24T06:57:39,090 adding 'evalscope/perf/utils/report/__init__.py' 2026-04-24T06:57:39,092 adding 'evalscope/perf/utils/report/generate_report.py' 2026-04-24T06:57:39,094 adding 'evalscope/perf/utils/report/perf_charts.py' 2026-04-24T06:57:39,096 
adding 'evalscope/perf/utils/report/perf_data.py' 2026-04-24T06:57:39,098 adding 'evalscope/report/__init__.py' 2026-04-24T06:57:39,100 adding 'evalscope/report/combinator.py' 2026-04-24T06:57:39,101 adding 'evalscope/report/generator.py' 2026-04-24T06:57:39,103 adding 'evalscope/report/renderer.py' 2026-04-24T06:57:39,105 adding 'evalscope/report/report.py' 2026-04-24T06:57:39,108 adding 'evalscope/report/template/perf_report.html.j2' 2026-04-24T06:57:39,109 adding 'evalscope/report/template/report.html.j2' 2026-04-24T06:57:39,112 adding 'evalscope/report/template/css/base.css' 2026-04-24T06:57:39,114 adding 'evalscope/report/template/css/perf_extra.css' 2026-04-24T06:57:39,115 adding 'evalscope/report/template/js/eval_extra.js' 2026-04-24T06:57:39,117 adding 'evalscope/report/template/js/i18n_eval.js' 2026-04-24T06:57:39,118 adding 'evalscope/report/template/js/i18n_perf.js' 2026-04-24T06:57:39,120 adding 'evalscope/report/template/js/perf_extra.js' 2026-04-24T06:57:39,121 adding 'evalscope/report/template/js/shared.js' 2026-04-24T06:57:39,123 adding 'evalscope/report/template/partials/brand_logo.html' 2026-04-24T06:57:39,124 adding 'evalscope/report/template/partials/footer.html' 2026-04-24T06:57:39,126 adding 'evalscope/report/template/partials/header_eval.html' 2026-04-24T06:57:39,127 adding 'evalscope/report/template/partials/header_perf.html' 2026-04-24T06:57:39,128 adding 'evalscope/report/template/partials/toc_eval.html' 2026-04-24T06:57:39,130 adding 'evalscope/report/template/partials/toc_perf.html' 2026-04-24T06:57:39,132 adding 'evalscope/sandbox/__init__.py' 2026-04-24T06:57:39,134 adding 'evalscope/sandbox/volcengine.py' 2026-04-24T06:57:39,136 adding 'evalscope/service/__init__.py' 2026-04-24T06:57:39,137 adding 'evalscope/service/app.py' 2026-04-24T06:57:39,139 adding 'evalscope/service/blueprints/__init__.py' 2026-04-24T06:57:39,141 adding 'evalscope/service/blueprints/eval.py' 2026-04-24T06:57:39,143 adding 'evalscope/service/blueprints/perf.py' 
2026-04-24T06:57:39,145 adding 'evalscope/service/frontend/__init__.py' 2026-04-24T06:57:39,147 adding 'evalscope/service/frontend/async_client.py' 2026-04-24T06:57:39,149 adding 'evalscope/service/frontend/main.py' 2026-04-24T06:57:39,150 adding 'evalscope/service/frontend/utils.py' 2026-04-24T06:57:39,152 adding 'evalscope/service/utils/__init__.py' 2026-04-24T06:57:39,154 adding 'evalscope/service/utils/benchmarks.py' 2026-04-24T06:57:39,155 adding 'evalscope/service/utils/log.py' 2026-04-24T06:57:39,156 adding 'evalscope/service/utils/process.py' 2026-04-24T06:57:39,158 adding 'evalscope/summarizer/__init__.py' 2026-04-24T06:57:39,159 adding 'evalscope/summarizer/summarizer.py' 2026-04-24T06:57:39,161 adding 'evalscope/third_party/__init__.py' 2026-04-24T06:57:39,163 adding 'evalscope/third_party/longbench_write/README.md' 2026-04-24T06:57:39,164 adding 'evalscope/third_party/longbench_write/__init__.py' 2026-04-24T06:57:39,165 adding 'evalscope/third_party/longbench_write/default_task.json' 2026-04-24T06:57:39,166 adding 'evalscope/third_party/longbench_write/default_task.yaml' 2026-04-24T06:57:39,168 adding 'evalscope/third_party/longbench_write/eval.py' 2026-04-24T06:57:39,170 adding 'evalscope/third_party/longbench_write/infer.py' 2026-04-24T06:57:39,171 adding 'evalscope/third_party/longbench_write/longbench_write.py' 2026-04-24T06:57:39,172 adding 'evalscope/third_party/longbench_write/utils.py' 2026-04-24T06:57:39,174 adding 'evalscope/third_party/longbench_write/resources/__init__.py' 2026-04-24T06:57:39,175 adding 'evalscope/third_party/longbench_write/resources/judge.txt' 2026-04-24T06:57:39,182 adding 'evalscope/third_party/longbench_write/resources/longbench_write.jsonl' 2026-04-24T06:57:39,186 adding 'evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl' 2026-04-24T06:57:39,188 adding 'evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl' 2026-04-24T06:57:39,189 adding 
'evalscope/third_party/longbench_write/tools/__init__.py' 2026-04-24T06:57:39,191 adding 'evalscope/third_party/longbench_write/tools/data_etl.py' 2026-04-24T06:57:39,193 adding 'evalscope/third_party/longbench_write/tools/openai_api.py' 2026-04-24T06:57:39,194 adding 'evalscope/third_party/thinkbench/__init__.py' 2026-04-24T06:57:39,197 adding 'evalscope/third_party/thinkbench/eval.py' 2026-04-24T06:57:39,198 adding 'evalscope/third_party/thinkbench/infer.py' 2026-04-24T06:57:39,200 adding 'evalscope/third_party/thinkbench/resources/critique_template.txt' 2026-04-24T06:57:39,201 adding 'evalscope/third_party/thinkbench/resources/reformat_template.txt' 2026-04-24T06:57:39,203 adding 'evalscope/third_party/thinkbench/tools/__init__.py' 2026-04-24T06:57:39,204 adding 'evalscope/third_party/thinkbench/tools/llm.py' 2026-04-24T06:57:39,205 adding 'evalscope/third_party/thinkbench/tools/utils.py' 2026-04-24T06:57:39,208 adding 'evalscope/third_party/toolbench_static/README.md' 2026-04-24T06:57:39,209 adding 'evalscope/third_party/toolbench_static/__init__.py' 2026-04-24T06:57:39,210 adding 'evalscope/third_party/toolbench_static/config_default.json' 2026-04-24T06:57:39,211 adding 'evalscope/third_party/toolbench_static/config_default.yaml' 2026-04-24T06:57:39,213 adding 'evalscope/third_party/toolbench_static/eval.py' 2026-04-24T06:57:39,215 adding 'evalscope/third_party/toolbench_static/infer.py' 2026-04-24T06:57:39,216 adding 'evalscope/third_party/toolbench_static/requirements.txt' 2026-04-24T06:57:39,218 adding 'evalscope/third_party/toolbench_static/toolbench_static.py' 2026-04-24T06:57:39,219 adding 'evalscope/third_party/toolbench_static/llm/__init__.py' 2026-04-24T06:57:39,220 adding 'evalscope/third_party/toolbench_static/llm/swift_infer.py' 2026-04-24T06:57:39,222 adding 'evalscope/utils/__init__.py' 2026-04-24T06:57:39,224 adding 'evalscope/utils/argument_utils.py' 2026-04-24T06:57:39,225 adding 'evalscope/utils/chat_service.py' 2026-04-24T06:57:39,228 adding 
'evalscope/utils/code_utils.py' 2026-04-24T06:57:39,230 adding 'evalscope/utils/deprecation_utils.py' 2026-04-24T06:57:39,232 adding 'evalscope/utils/function_utils.py' 2026-04-24T06:57:39,234 adding 'evalscope/utils/import_utils.py' 2026-04-24T06:57:39,237 adding 'evalscope/utils/io_utils.py' 2026-04-24T06:57:39,239 adding 'evalscope/utils/json_schema.py' 2026-04-24T06:57:39,241 adding 'evalscope/utils/logger.py' 2026-04-24T06:57:39,242 adding 'evalscope/utils/model_utils.py' 2026-04-24T06:57:39,244 adding 'evalscope/utils/multi_choices.py' 2026-04-24T06:57:39,246 adding 'evalscope/utils/ner.py' 2026-04-24T06:57:39,248 adding 'evalscope/utils/resource_utils.py' 2026-04-24T06:57:39,249 adding 'evalscope/utils/url_utils.py' 2026-04-24T06:57:39,251 adding 'evalscope/utils/doc_utils/__init__.py' 2026-04-24T06:57:39,254 adding 'evalscope/utils/doc_utils/benchmark_stats.py' 2026-04-24T06:57:39,257 adding 'evalscope/utils/doc_utils/generate_dataset_md.py' 2026-04-24T06:57:39,259 adding 'evalscope/utils/doc_utils/readme_generator.py' 2026-04-24T06:57:39,261 adding 'evalscope/utils/doc_utils/translate_description.py' 2026-04-24T06:57:39,262 adding 'evalscope/utils/tqdm_utils/__init__.py' 2026-04-24T06:57:39,264 adding 'evalscope/utils/tqdm_utils/progress_tracker.py' 2026-04-24T06:57:39,266 adding 'evalscope/utils/tqdm_utils/tqdm_logging.py' 2026-04-24T06:57:39,269 adding 'evalscope-1.6.1.dist-info/licenses/LICENSE' 2026-04-24T06:57:39,274 adding 'evalscope-1.6.1.dist-info/METADATA' 2026-04-24T06:57:39,275 adding 'evalscope-1.6.1.dist-info/WHEEL' 2026-04-24T06:57:39,276 adding 'evalscope-1.6.1.dist-info/entry_points.txt' 2026-04-24T06:57:39,277 adding 'evalscope-1.6.1.dist-info/top_level.txt' 2026-04-24T06:57:39,293 adding 'evalscope-1.6.1.dist-info/RECORD' 2026-04-24T06:57:39,328 removing build/bdist.linux-armv7l/wheel 2026-04-24T06:57:39,720 Building wheel for evalscope (pyproject.toml): finished with status 'done' 2026-04-24T06:57:39,792 Created wheel for evalscope: 
filename=evalscope-1.6.1-py3-none-any.whl size=2151974 sha256=603f6982401ac313daa073da9590d3168948b1638a8a4d45faaa22d41ae6b92f 2026-04-24T06:57:39,794 Stored in directory: /tmp/pip-ephem-wheel-cache-qtchmhm0/wheels/24/d1/c7/d9c0786504fe4d5223da7294826dcfda008743782ea7d3cfeb 2026-04-24T06:57:39,852 Successfully built evalscope 2026-04-24T06:57:39,906 Removed build tracker: '/tmp/pip-build-tracker-hudghgmg'