2026-04-13T12:59:29,150 Created temporary directory: /tmp/pip-ephem-wheel-cache-nvrsya03
2026-04-13T12:59:29,152 Created temporary directory: /tmp/pip-build-tracker-29rga0pi
2026-04-13T12:59:29,152 Initialized build tracking at /tmp/pip-build-tracker-29rga0pi
2026-04-13T12:59:29,153 Created build tracker: /tmp/pip-build-tracker-29rga0pi
2026-04-13T12:59:29,153 Entered build tracker: /tmp/pip-build-tracker-29rga0pi
2026-04-13T12:59:29,154 Created temporary directory: /tmp/pip-wheel-coynym30
2026-04-13T12:59:29,157 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453
2026-04-13T12:59:29,159 Created temporary directory: /tmp/pip-ephem-wheel-cache-wi6h2a8t
2026-04-13T12:59:29,181 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2026-04-13T12:59:29,184 2 location(s) to search for versions of evalscope:
2026-04-13T12:59:29,184 * https://pypi.org/simple/evalscope/
2026-04-13T12:59:29,184 * https://www.piwheels.org/simple/evalscope/
2026-04-13T12:59:29,185 Fetching project page and analyzing links: https://pypi.org/simple/evalscope/
2026-04-13T12:59:29,186 Getting page https://pypi.org/simple/evalscope/
2026-04-13T12:59:29,187 Found index url https://pypi.org/simple
2026-04-13T12:59:29,419 Fetched page https://pypi.org/simple/evalscope/ as application/vnd.pypi.simple.v1+json
2026-04-13T12:59:29,436 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/94/d2/dc5e929802776bf4e662a46d794b765876bb93e2300189cafd113cac74d6/evalscope-0.5.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8)
2026-04-13T12:59:29,438
Found link https://files.pythonhosted.org/packages/e5/70/45a5dad24b1fa535bff194b99a4668e7f5f328be972b51b3b91eafb4cdbb/evalscope-0.5.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0rc0 2026-04-13T12:59:29,438 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ac/eb/341fe367df2bc9a0ae7ef5eb2037a5d549d9bb8c0d7ad84844c9926e0947/evalscope-0.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,439 Found link https://files.pythonhosted.org/packages/3b/3f/585f7f1cf2ce90b234c1cfd654bb26977be4d889c0e5eed0122cb3024c45/evalscope-0.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0 2026-04-13T12:59:29,440 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ad/58/c0ce004159cfac6df9b5736c011576d52bdceb778943de8a022a419d86eb/evalscope-0.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,441 Found link https://files.pythonhosted.org/packages/94/33/7ad2f2285f5b68953ad4466d23dc5de1a2e57e7cc63d5924ab0e84d156ba/evalscope-0.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.2 2026-04-13T12:59:29,442 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/59/a9e1c4cf88018ece1fdd8d8b7fa976e28f9b4b181ef7ceb74a5e2db533ab/evalscope-0.5.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,443 Found link https://files.pythonhosted.org/packages/04/57/9ca7b1fd68f2acc32802b22236c83b597a5690a483d5938d38183b549d22/evalscope-0.5.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.3 2026-04-13T12:59:29,444 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/fa/c8/fcbaf01b7486c3b29b7790167c2cda560f00a04d100cec808ee9a3349ca0/evalscope-0.5.4-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,445 Found link https://files.pythonhosted.org/packages/8c/cc/abd412bad714c0266be1f0159b49a817d45db099c3bd031134d223589e93/evalscope-0.5.4.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.4 2026-04-13T12:59:29,446 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4d/da/d705d457683223f289e8c5d6cadbaad15d15e098692c53ea7e6196a94373/evalscope-0.5.5rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,446 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7e/4c/414dd545a1833245a53797d70a35a78ae1b9cfbcc81b35a4e1763e678437/evalscope-0.5.5rc1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,447 Found link https://files.pythonhosted.org/packages/56/5f/aa7fcf62102694dd66b69e88cb2523094bf04b53f785854e17ee6b7234a7/evalscope-0.5.5rc1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5rc1 2026-04-13T12:59:29,448 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3e/d0/91a7a1f95f3fa19dd8d4e434dc711768abe5c006f32a514a8602b429e049/evalscope-0.5.5-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,449 Found link https://files.pythonhosted.org/packages/04/7e/a7f065d6ebac15fe172d3b0906ff5b26a71df5a9975c0f14978044211cf1/evalscope-0.5.5.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5 2026-04-13T12:59:29,449 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/11/89/b812b01a5ed91fc079dea052e1341860cd65d25d463c75c90e5a30ab6ae8/evalscope-0.6.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,450 Found link https://files.pythonhosted.org/packages/d5/ad/57a2d5f33c5b7d5066f8a5dcb1d34f14bf246112f7900228b9f2fb41b21b/evalscope-0.6.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0rc0 2026-04-13T12:59:29,451 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/25/3f03d9d924f1b65610724c9f10727b48ef952afdfee8687a461949c88c78/evalscope-0.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,452 Found link https://files.pythonhosted.org/packages/9d/ac/1f432bcc46ccb8348b869b80d2aaabde5e583b370418ba48714083e31068/evalscope-0.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0 2026-04-13T12:59:29,453 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7b/e1/42c9e58b4690f23ef48bce841fc95cbf0744c1579cdc80fa6f33b0453344/evalscope-0.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,453 Found link https://files.pythonhosted.org/packages/cd/d6/1d9d2db9acda6e61d4210074f51e7e3dee4d0212fabdd94999105db23eed/evalscope-0.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.1 2026-04-13T12:59:29,454 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/92/6fff1dfd12a4f73489c451dab56351b6a3c1095a92bb55025c2934fc625d/evalscope-0.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,455 Found link https://files.pythonhosted.org/packages/fe/3c/500d655a27ca80e1aba3fb2b1e8886951942732b869ad1516422d9e6ac97/evalscope-0.7.0.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.0 2026-04-13T12:59:29,456 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a5/3a/ae22d4d9a44ad37ac887da21b948bf6e784307001a09802d36c5bf04018d/evalscope-0.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,457 Found link https://files.pythonhosted.org/packages/88/47/69d067f0d3d784a7975cc4ea067fdc55c8f785ece6fbe86e5e21edc8b36f/evalscope-0.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.1 2026-04-13T12:59:29,458 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/23/acbcbc2ed6f00d3bb81220054651e6a2b1b02714d42b1aeb018a2f5574c4/evalscope-0.7.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,459 Found link https://files.pythonhosted.org/packages/a4/38/9126b9329cd2ad6ccfd4a73f04402bf71b65921564c3c12cb0d62b3b421d/evalscope-0.7.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.2 2026-04-13T12:59:29,460 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/1f/9598183da3026696adc19a6727a19d57f9abbf4fd7aeb20cdf12faee7693/evalscope-0.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,461 Found link https://files.pythonhosted.org/packages/63/7e/fc44d30e3a83dbc3070396d78279c9e3e8716cbc4cc05811a70f1b463bfb/evalscope-0.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.0 2026-04-13T12:59:29,461 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/37/a9/a9c6cda95a6b837c9303e9a1598999f2c4e605abb507365c6ff70b372a5a/evalscope-0.8.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,462 Found link 
https://files.pythonhosted.org/packages/63/5a/bbc230bb06a7bc40dd3985dde4615a8a71f111bf95761e40f6f0f8a7e1a6/evalscope-0.8.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.1 2026-04-13T12:59:29,463 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/07/39/a1eb2efed77d21e8253daa37f082669a004c4d288813a2ee9e15398f2e80/evalscope-0.8.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,464 Found link https://files.pythonhosted.org/packages/57/bc/5ff6d538e459d8b3c567577c991eec72ab6adbc19497dc164d96cd634d2f/evalscope-0.8.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.2 2026-04-13T12:59:29,464 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f6/29/81a188c03b272bf7def0c9bb556b9e9465adbc68ecb18907b636f1e8cbd7/evalscope-0.9.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,465 Found link https://files.pythonhosted.org/packages/c0/c2/19da3be1fbd6b548ecdc877d47269e92503518de53acbbfe96120c5c9753/evalscope-0.9.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.9.0 2026-04-13T12:59:29,466 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/6d/45d7407f31d6878494c3b493d7e49a8318b1839161839293c1a2e66aadcf/evalscope-0.10.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,467 Found link https://files.pythonhosted.org/packages/16/b1/b6cef37a0dd0acfa5873ca4763ac6b4ac4b19a0b15ca6bdc8f30d4443682/evalscope-0.10.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.0 2026-04-13T12:59:29,468 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/a7/8f/907045290d359b4e07e7acc96ec60173380748b49e9f3c91b7ddd8e8342d/evalscope-0.10.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,469 Found link https://files.pythonhosted.org/packages/e3/d3/dda2ac0513904bff8aa0c2efe77bc851d3acc6d514707db36648e4a903d2/evalscope-0.10.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.1 2026-04-13T12:59:29,470 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d6/e5/852326943d86c85b5ca6b548a5f3753c217b771d93968c61ec2ca46ee0b1/evalscope-0.11.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,471 Found link https://files.pythonhosted.org/packages/5f/3f/e2816b99487b4ead257453a242b5282a85881f9d26fbb5efb21cc5cf88fa/evalscope-0.11.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.11.0 2026-04-13T12:59:29,472 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7d/8e/f9eceecf8bc7d740603f915eef7fab3e9d657a01f5de2c523e531445299c/evalscope-0.12.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,473 Found link https://files.pythonhosted.org/packages/33/82/7765517ff80a73eac7465369767aa45a5be3d5e0fb7c4f4a3ff743811f8c/evalscope-0.12.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.0 2026-04-13T12:59:29,473 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ee/4c/84a5e18985e149eb4283fef2b58a81ff2cec2d099c017684938ec3a3935f/evalscope-0.12.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,474 Found link https://files.pythonhosted.org/packages/b2/24/83b5530319bdb02142289e04640c6008dda1b988043c42feb7f0a5eab3b7/evalscope-0.12.1.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.1 2026-04-13T12:59:29,475 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/a5/65faf0660cd8ae2660354b002b2b3a586b9419bc894120fea97efd506cb6/evalscope-0.13.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,476 Found link https://files.pythonhosted.org/packages/d0/39/d5eb469a94191760c61d1bfcb235e28be1d2a080d88b44792f53d76c45d1/evalscope-0.13.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.0 2026-04-13T12:59:29,476 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f7/fc/91b7b4379131d2e15ca1f575f533daea589357293e492ef2c93e0aac6b55/evalscope-0.13.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,477 Found link https://files.pythonhosted.org/packages/9e/a3/33b4ce270d5500fe7c8f32fa2160749b607f248141328d4785b6032c8f2a/evalscope-0.13.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.1 2026-04-13T12:59:29,478 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e6/66/83752c305879cf3dea54170398839ab08a046485bc18c41a34f41aca11ab/evalscope-0.13.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,479 Found link https://files.pythonhosted.org/packages/b7/8d/711ae30b80329e2dd7da760c001d9a5b45e4d8e5292f317f1ea10c744c29/evalscope-0.13.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.2 2026-04-13T12:59:29,480 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a8/b4/b22c7e52e6a7381333bdfa0bf92fae0258e7812064b1e208cbab56a62d08/evalscope-0.14.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,481 Found link 
https://files.pythonhosted.org/packages/40/44/7db2cb90e6ca0c9db92124f10c0273d7c6ef4b81523e1c98c34a88e67faa/evalscope-0.14.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.14.0 2026-04-13T12:59:29,481 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b2/47/976633e0f29b58b8c9f3faacf7373a2da734771a7915ac45d721d96e0ad7/evalscope-0.15.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,482 Found link https://files.pythonhosted.org/packages/b8/e3/bd534d69328afa98bdd497b5eaaf4b7416da9e8f56109d045c332b17d016/evalscope-0.15.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.0 2026-04-13T12:59:29,483 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5a/54/237aee5f0317fa04450c9ad67c3bf28b730460b2e6dc1e65b74b4bf2cd67/evalscope-0.15.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,484 Found link https://files.pythonhosted.org/packages/7a/d4/8b87e83a3a08f87ce5b4325f0cd5ab9bc54d296dc3f3492a1d3216a97a6d/evalscope-0.15.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.1 2026-04-13T12:59:29,485 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/1e/b5/6fec1cbb02a41ab79430eb3fb51eea7709525df0f9753ae2c54fbc4633f1/evalscope-0.16.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,486 Found link https://files.pythonhosted.org/packages/99/69/63997bedfa6fd33af671b539f77c375b111017eff23313f76693e24b8872/evalscope-0.16.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.0 2026-04-13T12:59:29,486 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/ea/76/639641578bdc92d25211eb8dae24d1fae19e40cd1649e17946f6ad8a5dc3/evalscope-0.16.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,487 Found link https://files.pythonhosted.org/packages/94/13/616bf9c33b0769db44a2bae32b54d33cf7129874392682aa76326a51e085/evalscope-0.16.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.1 2026-04-13T12:59:29,488 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f1/d3/9e83cc1b5a132342a05ef6ee79018bd7561f90a6406dc9db5c85fb0a281f/evalscope-0.16.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,489 Found link https://files.pythonhosted.org/packages/94/47/edd3faaddd321e464ae72db6e7bf82246dd4ca2f0f67127ca8c427cac664/evalscope-0.16.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.2 2026-04-13T12:59:29,490 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5d/4e/34c56086cdfe7d1ad912241af819b3e67f20373f382016e33ac89dd43dde/evalscope-0.16.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,491 Found link https://files.pythonhosted.org/packages/22/07/14603c038a8019472881f574f1c47bd4193481f256b6dc702c65d8b8f984/evalscope-0.16.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.3 2026-04-13T12:59:29,491 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f3/e2/104156f74779cf2849f53566bba585015492d7320c36a7cb76c7196b0ef5/evalscope-0.17.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,492 Found link https://files.pythonhosted.org/packages/c5/d0/b66d1b97ec67d65b6df54f18682e30cfeb6401604b93c9b1bdd1e97b8d79/evalscope-0.17.0.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.17.0 2026-04-13T12:59:29,493 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a9/b5/630d2c5dc5c32e9fbad5034e04d8aba6f6461dc08f255df77dd8d463857f/evalscope-0.17.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,494 Found link https://files.pythonhosted.org/packages/38/88/326e48929bb9577a6a36e07afa65bbf6bd870c1b644f82e5713874ae3238/evalscope-0.17.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 0.17.1 2026-04-13T12:59:29,495 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/21/18/9a6c208d2bc119ac67b5537b60851c6bccc99f25229eaa96cbe6e38721bd/evalscope-1.0.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,496 Found link https://files.pythonhosted.org/packages/d8/44/3d727dd28fcc50317c95d12e5bb850ed2863105f812373ac120877875434/evalscope-1.0.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.0 2026-04-13T12:59:29,497 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e1/8a/50456fa7dd77be4c3a0ea0b3d96cb7ae5b2557454fdd35cbf0009a9d792f/evalscope-1.0.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,498 Found link https://files.pythonhosted.org/packages/8d/52/93569134b3d8dea2a0d0bc2134c03056f0ffee1840f7299eb83d475457df/evalscope-1.0.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.1 2026-04-13T12:59:29,499 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/22/f6/32fb0fef08a6c881ac840117455a5697a0d63226db8a24cce5208b720829/evalscope-1.0.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,499 Found link 
https://files.pythonhosted.org/packages/3d/f5/025baefe432d9af1ed845ab5738b638b7b97f2dd3767e9478b8eee10966f/evalscope-1.0.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.2 2026-04-13T12:59:29,500 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/6e/12/f31dbb18daa7e3c6cfecf856ddc323a303e115271166c100e06af58ea6b6/evalscope-1.1.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,501 Found link https://files.pythonhosted.org/packages/b4/98/7449040e89beaa4556bf35ba1e171b2d4955ff15b2b4c43f2ff55b048aeb/evalscope-1.1.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.1.0 2026-04-13T12:59:29,502 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/79/d1/afc8b23345ad8f11a5e1f8c6c3112a8679604d833bdbc02aa06787952fd1/evalscope-1.1.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,503 Found link https://files.pythonhosted.org/packages/e8/3f/d67b73ce19789e914d6a78740fa7bfd0c07f161bc239b92cd3c26541f2fb/evalscope-1.1.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.1.1 2026-04-13T12:59:29,503 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/82/0a/d984751e4b5751f064209da3745b6aff6cc0f1d9d93f13cab0a1017a8639/evalscope-1.2.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,504 Found link https://files.pythonhosted.org/packages/7f/f9/0a2a069ee4500666ec5c3d10b302fc71d176c17bbe70447336e610953e1a/evalscope-1.2.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.2.0 2026-04-13T12:59:29,505 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/d9/58/823d009dfa49cfdc750ac257e744eb97456d69528a09ac108ee8cab15316/evalscope-1.3.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,506 Found link https://files.pythonhosted.org/packages/42/5a/a309f7ce1fbe2b39e4b0a1f26cfcd7864eaa90e4792a5290e8cdd2ce3b4f/evalscope-1.3.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.3.0 2026-04-13T12:59:29,507 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c6/57/005d3ef07ecdd5163e1bbed3413537b653b637d9c8b62a2bcdd97546607b/evalscope-1.4.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,508 Found link https://files.pythonhosted.org/packages/11/d5/268f610ac7db9c5c2109936f65ac4df8b4bc52106ed4369509d3b3c4f127/evalscope-1.4.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.0 2026-04-13T12:59:29,508 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b8/96/a2b4fabf6fa6cf09ac6669aa01dc483ac53576a8fd7c2c4be21ea281840f/evalscope-1.4.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,509 Found link https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.1 2026-04-13T12:59:29,510 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/bf/0f/97e68e89f7925160df49ea1dbbcef7f3f8e808a51756c199aaaadc75f5a5/evalscope-1.4.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,511 Found link https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.2 2026-04-13T12:59:29,512 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c0/fb/e6b1a396bad204e38591a6d6de1172dac2ce3e0d15b87e812d57e22d0e4f/evalscope-1.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,512 Found link https://files.pythonhosted.org/packages/a7/32/518a920ac8a73c4c6e39f7e443df6da6ea9a3be6567c4a425def866b8f5e/evalscope-1.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.0 2026-04-13T12:59:29,513 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/eb/68/0c870a84e38d5a8d3e7c9df918739a4ba6a45c3ddb624d2792a41a8d3293/evalscope-1.5.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,514 Found link https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.1 2026-04-13T12:59:29,514 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/55/1f/b1b087b1d646f635e9e225c8b610f80b1e6e2590802228c15d1d58ae026e/evalscope-1.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,515 Found link https://files.pythonhosted.org/packages/ff/13/56b351e22964e93e6d74dbfdb71a4d5e2f96b4ae716f76d5b5ed4d88bae7/evalscope-1.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.2 2026-04-13T12:59:29,516 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/53/5b/3d5e1067f98e08cc2aac5c45f4fa67e6a183d62471439e61528469bc5e61/evalscope-1.5.2.post1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,517 Found link 
https://files.pythonhosted.org/packages/64/d7/ab3fad322268613661de3c4451df7f236475ef4aef7645619e6998b3199d/evalscope-1.5.2.post1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.2.post1 2026-04-13T12:59:29,518 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/02/f3/d38eb7c4488f85d92caf2a6c4b9af852e72ac4d23f5132715d6d5062a82a/evalscope-1.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,518 Found link https://files.pythonhosted.org/packages/ba/13/b96ac9df484c1ff1f3e202bd103bc715c87c338c465ac9dff585109bee98/evalscope-1.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.6.0 2026-04-13T12:59:29,519 Fetching project page and analyzing links: https://www.piwheels.org/simple/evalscope/ 2026-04-13T12:59:29,520 Getting page https://www.piwheels.org/simple/evalscope/ 2026-04-13T12:59:29,521 Found index url https://www.piwheels.org/simple 2026-04-13T12:59:29,692 Fetched page https://www.piwheels.org/simple/evalscope/ as text/html 2026-04-13T12:59:29,703 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.2.post1-py3-none-any.whl#sha256=76e17bb53fe492f1648148d0cebe8d500169cef934d52fee6e0e3edbf5351b90 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,704 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.2-py3-none-any.whl#sha256=db9427c4cca0bcaa951a6faac7db6c94854d1384ab0914a0ba0f9d377c947f66 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,705 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.1-py3-none-any.whl#sha256=a95eabf175191595bfeebe4e6face613a6c137a65067e8fd7dca613567bba440 (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,705 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.5.0-py3-none-any.whl#sha256=933f1aa9915ed658bc3ae6901e0b96efbdbf80db96eca40c2d42453b26530d9b (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,706 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.2-py3-none-any.whl#sha256=222e938fe502394b9935f3c00677cf1372892caa530b6fb48476ae909a91399a (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,706 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.1-py3-none-any.whl#sha256=d74ddb7150b19de1eb995026c697b598cdfc0b75fee3a2d110219256c4241688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,707 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.0-py3-none-any.whl#sha256=307c9ed70f562ba776fdf9b0136b35fe4e361b1b974bb0f8ca39e425f4738e6e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,707 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.3.0-py3-none-any.whl#sha256=d9949bacf6c08b5ab341f872c6ee4b31995d724fe963f64ddf3c129e0e39145e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,708 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.2.0-py3-none-any.whl#sha256=5969cf4a3132f6a29f9ef39aa35a8a3be24f114bbcd7a77a19c145bbec432be9 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,708 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.1-py3-none-any.whl#sha256=5bfb8c55f45e1bcd5df5cb0cd4ecceae2ccb93cd09c9477b0b0e6c097ecb1d1f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-04-13T12:59:29,709 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.0-py3-none-any.whl#sha256=ca7c951fa316bb7ec6fb0e38ad6503632067f07f469e37828fe9e39e51591994 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,710 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.2-py3-none-any.whl#sha256=e14016040022bcd666c05ffa806f3713775e9de19a98290bd2a0a36e5c435409 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,710 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.1-py3-none-any.whl#sha256=61b9c14f3409804d84ddf31b749b587c7c7441f9d1aa0d453991b1bd0bbda74c (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,711 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.0-py3-none-any.whl#sha256=ea1044755177db8d9e94cc4ccafbd56e35b173a8425db7d1653dd9f66e1463ad (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,711 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.1-py3-none-any.whl#sha256=aa0054b8aed77684e56d0836d4568080fa4827799a16d62fef6ec13802cd4050 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-04-13T12:59:29,712 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.0-py3-none-any.whl#sha256=e7a2549f9f5ac5b0061d01f8ca99900e06f6340b91d3e546163423b896287862 (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,712 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.3-py3-none-any.whl#sha256=75dcbd7cc0a0336f68d407a3925fe065ecaaea0fa9030ceebfaa3f22f0f3b417 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,713 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.2-py3-none-any.whl#sha256=14e00f16b506b723a359799e3cb271370a3da768da3667563612e458982d6847 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,713 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.1-py3-none-any.whl#sha256=296b90c06a33f69c9e7049a768f16929acd1279af0830f47f18fc598560d0e13 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,714 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.0-py3-none-any.whl#sha256=0a6f11a7a4d564d4a1dca5fe424cda19608cf947b9ab739079f5b60842651c7e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,714 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.1-py3-none-any.whl#sha256=504717cfc96a8fdbb0d4bc080d0292e80423891b658d61283385737794c4e95f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,715 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.0-py3-none-any.whl#sha256=263020a7b7f7788e17515c738604ea64904ca0da34d09664d2b7ee16d3522a00 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,715 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-0.14.0-py3-none-any.whl#sha256=cdf26beb4b188e1dd5e09feeb1832ccf909c9d2a535eb76e702fb3c66fc65688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,716 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.2-py3-none-any.whl#sha256=7d360083cf9dd960996847cde2140085d0830ddc8b12aa8007b4f72d395c5211 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,717 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.1-py3-none-any.whl#sha256=6b26f11daca05d6b56da3cb1b78ea0a8f28de94901b7520171c66e2a16b1c638 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,717 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.0-py3-none-any.whl#sha256=8ca93c4011f04e35df239b5486248e03100a9c9e265664e01330ef4b1cc691f7 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,718 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.1-py3-none-any.whl#sha256=72e5815923789c6cab6c32425477393bf645f670edcc493f8e4dec6e93f3da23 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,718 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.0-py3-none-any.whl#sha256=f26c7317a5bacac8527806723d45bac74563a2608ed761d822ace03be3a5e45f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,719 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.11.0-py3-none-any.whl#sha256=6e7d3242c5bf97b54d24644a1300575fdf41c1d90eef7c1344c20bf0d1518671 (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,720 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.1-py3-none-any.whl#sha256=a7e645410333c1aaec5a024be4c02166dfb0d6b4635b020c181ce672d31102d2 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,720 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.0-py3-none-any.whl#sha256=494f1742178def5f86e552c004562c877cd8b8b6f5d4267d29f961f20e8bf569 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,721 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.9.0-py3-none-any.whl#sha256=26fd59fd387850f3e84f995fd38357994460e71d0124bc09954ca8837027ec52 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,721 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.2-py3-none-any.whl#sha256=3c4ed25b5c39d7a706927607e552e6ced6d50532bc442bba74979f70720f4894 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,722 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.1-py3-none-any.whl#sha256=64e9306453082a95b0c0507d6fd1dbf50ed0ab210ebcfebdcb52c534769d1856 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,722 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.0-py3-none-any.whl#sha256=7758550ef3406d9c6096de05909cc1c97ed0c2f7be6edaf3fe5344244e34f233 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,723 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.2-py3-none-any.whl#sha256=21a1f4448ffca926853b7516b6375f801ef6a8067501dde546e7d794fb759f20 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,723 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.1-py3-none-any.whl#sha256=b959f2d9850544f2d96ac765b06ed112cf579a11899618fcc5245407f4e33843 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,724 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.0-py3-none-any.whl#sha256=2deddeb89bfb6fa02844b72a2be24627526136cbd45315e8e4dadb672d53c9fc (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-04-13T12:59:29,725 Skipping link: not a file: https://www.piwheels.org/simple/evalscope/ 2026-04-13T12:59:29,725 Skipping link: not a file: https://pypi.org/simple/evalscope/ 2026-04-13T12:59:29,751 Given no hashes to check 1 links for project 'evalscope': discarding no candidates 2026-04-13T12:59:29,771 Collecting evalscope==1.6.0 2026-04-13T12:59:29,773 Created temporary directory: /tmp/pip-unpack-nmqdcmol 2026-04-13T12:59:30,010 Downloading evalscope-1.6.0.tar.gz (1.5 MB) 2026-04-13T12:59:32,284 Added evalscope==1.6.0 from https://files.pythonhosted.org/packages/ba/13/b96ac9df484c1ff1f3e202bd103bc715c87c338c465ac9dff585109bee98/evalscope-1.6.0.tar.gz to build tracker '/tmp/pip-build-tracker-29rga0pi' 2026-04-13T12:59:32,291 Created temporary directory: /tmp/pip-build-env-97152t6f 2026-04-13T12:59:32,296 Installing build dependencies: started 2026-04-13T12:59:32,297 Running command pip subprocess to install build dependencies 2026-04-13T12:59:33,428 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-13T12:59:33,922 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. 
In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-13T12:59:33,945 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-13T12:59:35,685 Collecting setuptools>=69 2026-04-13T12:59:35,762 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-13T12:59:36,041 Collecting wheel 2026-04-13T12:59:36,061 Using cached https://www.piwheels.org/simple/wheel/wheel-0.46.3-py3-none-any.whl (30 kB) 2026-04-13T12:59:36,246 Collecting packaging>=24.0 2026-04-13T12:59:36,262 Using cached https://www.piwheels.org/simple/packaging/packaging-26.0-py3-none-any.whl (74 kB) 2026-04-13T12:59:39,191 Installing collected packages: setuptools, packaging, wheel 2026-04-13T12:59:42,629 Creating /tmp/pip-build-env-97152t6f/overlay/local/bin 2026-04-13T12:59:42,631 changing mode of /tmp/pip-build-env-97152t6f/overlay/local/bin/wheel to 755 2026-04-13T12:59:42,652 Successfully installed packaging-26.0 setuptools-82.0.1 wheel-0.46.3 2026-04-13T12:59:42,931 Installing build dependencies: finished with status 'done' 2026-04-13T12:59:42,938 Getting requirements to build wheel: started 2026-04-13T12:59:42,940 Running command Getting requirements to build wheel 2026-04-13T12:59:43,710 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-04-13T12:59:43,711 !! 2026-04-13T12:59:43,712 ******************************************************************************** 2026-04-13T12:59:43,713 Please use a simple string containing a SPDX expression for `project.license`. 
You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-04-13T12:59:43,714 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-13T12:59:43,714 or your builds will no longer be supported. 2026-04-13T12:59:43,715 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:43,716 ******************************************************************************** 2026-04-13T12:59:43,717 !! 2026-04-13T12:59:43,717 corresp(dist, value, root_dir) 2026-04-13T12:59:43,797 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-13T12:59:43,797 !! 2026-04-13T12:59:43,798 ******************************************************************************** 2026-04-13T12:59:43,799 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-13T12:59:43,800 License :: OSI Approved :: Apache Software License 2026-04-13T12:59:43,801 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:43,802 ******************************************************************************** 2026-04-13T12:59:43,803 !! 2026-04-13T12:59:43,804 dist._finalize_license_expression() 2026-04-13T12:59:43,805 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-13T12:59:43,805 !! 
2026-04-13T12:59:43,807 ******************************************************************************** 2026-04-13T12:59:43,807 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-13T12:59:43,808 License :: OSI Approved :: Apache Software License 2026-04-13T12:59:43,809 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:43,809 ******************************************************************************** 2026-04-13T12:59:43,810 !! 2026-04-13T12:59:43,810 self._finalize_license_expression() 2026-04-13T12:59:43,811 running egg_info 2026-04-13T12:59:43,817 writing evalscope.egg-info/PKG-INFO 2026-04-13T12:59:43,844 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-04-13T12:59:43,846 writing entry points to evalscope.egg-info/entry_points.txt 2026-04-13T12:59:43,861 writing requirements to evalscope.egg-info/requires.txt 2026-04-13T12:59:43,862 writing top-level names to evalscope.egg-info/top_level.txt 2026-04-13T12:59:44,116 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:44,169 reading manifest template 'MANIFEST.in' 2026-04-13T12:59:44,587 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-04-13T12:59:44,592 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-04-13T12:59:44,599 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-04-13T12:59:44,605 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-04-13T12:59:44,606 adding license file 'LICENSE' 2026-04-13T12:59:44,663 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:44,764 Getting requirements to build wheel: finished with status 'done' 2026-04-13T12:59:44,768 Created temporary directory: /tmp/pip-modern-metadata-migec575 2026-04-13T12:59:44,770 Preparing metadata 
(pyproject.toml): started 2026-04-13T12:59:44,771 Running command Preparing metadata (pyproject.toml) 2026-04-13T12:59:45,448 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-04-13T12:59:45,448 !! 2026-04-13T12:59:45,450 ******************************************************************************** 2026-04-13T12:59:45,450 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-04-13T12:59:45,452 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-13T12:59:45,452 or your builds will no longer be supported. 2026-04-13T12:59:45,453 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:45,454 ******************************************************************************** 2026-04-13T12:59:45,455 !! 2026-04-13T12:59:45,456 corresp(dist, value, root_dir) 2026-04-13T12:59:45,535 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-13T12:59:45,536 !! 2026-04-13T12:59:45,537 ******************************************************************************** 2026-04-13T12:59:45,538 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-13T12:59:45,539 License :: OSI Approved :: Apache Software License 2026-04-13T12:59:45,540 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:45,541 ******************************************************************************** 2026-04-13T12:59:45,542 !! 
2026-04-13T12:59:45,543 dist._finalize_license_expression() 2026-04-13T12:59:45,545 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-13T12:59:45,546 !! 2026-04-13T12:59:45,547 ******************************************************************************** 2026-04-13T12:59:45,547 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-13T12:59:45,548 License :: OSI Approved :: Apache Software License 2026-04-13T12:59:45,549 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:45,550 ******************************************************************************** 2026-04-13T12:59:45,551 !! 2026-04-13T12:59:45,551 self._finalize_license_expression() 2026-04-13T12:59:45,553 running dist_info 2026-04-13T12:59:45,563 creating /tmp/pip-modern-metadata-migec575/evalscope.egg-info 2026-04-13T12:59:45,564 writing /tmp/pip-modern-metadata-migec575/evalscope.egg-info/PKG-INFO 2026-04-13T12:59:45,591 writing dependency_links to /tmp/pip-modern-metadata-migec575/evalscope.egg-info/dependency_links.txt 2026-04-13T12:59:45,593 writing entry points to /tmp/pip-modern-metadata-migec575/evalscope.egg-info/entry_points.txt 2026-04-13T12:59:45,607 writing requirements to /tmp/pip-modern-metadata-migec575/evalscope.egg-info/requires.txt 2026-04-13T12:59:45,609 writing top-level names to /tmp/pip-modern-metadata-migec575/evalscope.egg-info/top_level.txt 2026-04-13T12:59:45,610 writing manifest file '/tmp/pip-modern-metadata-migec575/evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:45,824 reading manifest file '/tmp/pip-modern-metadata-migec575/evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:45,826 reading manifest template 'MANIFEST.in' 2026-04-13T12:59:46,257 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 
2026-04-13T12:59:46,260 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-04-13T12:59:46,264 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-04-13T12:59:46,268 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-04-13T12:59:46,269 adding license file 'LICENSE' 2026-04-13T12:59:46,311 writing manifest file '/tmp/pip-modern-metadata-migec575/evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:46,314 creating '/tmp/pip-modern-metadata-migec575/evalscope-1.6.0.dist-info' 2026-04-13T12:59:46,445 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-13T12:59:46,454 Source in /tmp/pip-wheel-coynym30/evalscope_94ca307076fe4af5abb956ce3c85a8a4 has version 1.6.0, which satisfies requirement evalscope==1.6.0 from https://files.pythonhosted.org/packages/ba/13/b96ac9df484c1ff1f3e202bd103bc715c87c338c465ac9dff585109bee98/evalscope-1.6.0.tar.gz 2026-04-13T12:59:46,455 Removed evalscope==1.6.0 from https://files.pythonhosted.org/packages/ba/13/b96ac9df484c1ff1f3e202bd103bc715c87c338c465ac9dff585109bee98/evalscope-1.6.0.tar.gz from build tracker '/tmp/pip-build-tracker-29rga0pi' 2026-04-13T12:59:46,468 Created temporary directory: /tmp/pip-unpack-y2shpmlc 2026-04-13T12:59:46,468 Building wheels for collected packages: evalscope 2026-04-13T12:59:46,473 Created temporary directory: /tmp/pip-wheel-85ia55ga 2026-04-13T12:59:46,474 Destination directory: /tmp/pip-wheel-85ia55ga 2026-04-13T12:59:46,476 Building wheel for evalscope (pyproject.toml): started 2026-04-13T12:59:46,478 Running command Building wheel for evalscope (pyproject.toml) 2026-04-13T12:59:47,155 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-04-13T12:59:47,156 !! 
2026-04-13T12:59:47,157 ******************************************************************************** 2026-04-13T12:59:47,158 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-04-13T12:59:47,159 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-13T12:59:47,160 or your builds will no longer be supported. 2026-04-13T12:59:47,161 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:47,161 ******************************************************************************** 2026-04-13T12:59:47,162 !! 2026-04-13T12:59:47,162 corresp(dist, value, root_dir) 2026-04-13T12:59:47,242 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-13T12:59:47,243 !! 2026-04-13T12:59:47,244 ******************************************************************************** 2026-04-13T12:59:47,244 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-13T12:59:47,245 License :: OSI Approved :: Apache Software License 2026-04-13T12:59:47,246 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:47,246 ******************************************************************************** 2026-04-13T12:59:47,247 !! 2026-04-13T12:59:47,248 dist._finalize_license_expression() 2026-04-13T12:59:47,254 /tmp/pip-build-env-97152t6f/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-04-13T12:59:47,254 !! 
2026-04-13T12:59:47,255 ******************************************************************************** 2026-04-13T12:59:47,256 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-04-13T12:59:47,257 License :: OSI Approved :: Apache Software License 2026-04-13T12:59:47,258 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-04-13T12:59:47,259 ******************************************************************************** 2026-04-13T12:59:47,260 !! 2026-04-13T12:59:47,261 self._finalize_license_expression() 2026-04-13T12:59:47,261 running bdist_wheel 2026-04-13T12:59:47,277 running build 2026-04-13T12:59:47,278 running build_py 2026-04-13T12:59:47,284 creating build/lib/evalscope 2026-04-13T12:59:47,287 copying evalscope/version.py -> build/lib/evalscope 2026-04-13T12:59:47,289 copying evalscope/run.py -> build/lib/evalscope 2026-04-13T12:59:47,293 copying evalscope/arguments.py -> build/lib/evalscope 2026-04-13T12:59:47,295 copying evalscope/constants.py -> build/lib/evalscope 2026-04-13T12:59:47,298 copying evalscope/config.py -> build/lib/evalscope 2026-04-13T12:59:47,301 copying evalscope/__init__.py -> build/lib/evalscope 2026-04-13T12:59:47,304 creating build/lib/evalscope/collections 2026-04-13T12:59:47,305 copying evalscope/collections/sampler.py -> build/lib/evalscope/collections 2026-04-13T12:59:47,308 copying evalscope/collections/schema.py -> build/lib/evalscope/collections 2026-04-13T12:59:47,310 copying evalscope/collections/__init__.py -> build/lib/evalscope/collections 2026-04-13T12:59:47,313 creating build/lib/evalscope/benchmarks 2026-04-13T12:59:47,315 copying evalscope/benchmarks/__init__.py -> build/lib/evalscope/benchmarks 2026-04-13T12:59:47,317 creating build/lib/evalscope/service 2026-04-13T12:59:47,319 copying evalscope/service/app.py -> build/lib/evalscope/service 2026-04-13T12:59:47,322 copying evalscope/service/__init__.py -> 
build/lib/evalscope/service 2026-04-13T12:59:47,324 creating build/lib/evalscope/cli 2026-04-13T12:59:47,326 copying evalscope/cli/start_app.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,328 copying evalscope/cli/cli.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,330 copying evalscope/cli/start_perf.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,332 copying evalscope/cli/base.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,334 copying evalscope/cli/start_eval.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,336 copying evalscope/cli/benchmark_info.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,339 copying evalscope/cli/start_service.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,341 copying evalscope/cli/__init__.py -> build/lib/evalscope/cli 2026-04-13T12:59:47,344 creating build/lib/evalscope/filters 2026-04-13T12:59:47,345 copying evalscope/filters/extraction.py -> build/lib/evalscope/filters 2026-04-13T12:59:47,347 copying evalscope/filters/__init__.py -> build/lib/evalscope/filters 2026-04-13T12:59:47,350 copying evalscope/filters/selection.py -> build/lib/evalscope/filters 2026-04-13T12:59:47,352 creating build/lib/evalscope/app 2026-04-13T12:59:47,354 copying evalscope/app/app.py -> build/lib/evalscope/app 2026-04-13T12:59:47,356 copying evalscope/app/arguments.py -> build/lib/evalscope/app 2026-04-13T12:59:47,358 copying evalscope/app/constants.py -> build/lib/evalscope/app 2026-04-13T12:59:47,360 copying evalscope/app/__init__.py -> build/lib/evalscope/app 2026-04-13T12:59:47,363 creating build/lib/evalscope/perf 2026-04-13T12:59:47,364 copying evalscope/perf/main.py -> build/lib/evalscope/perf 2026-04-13T12:59:47,367 copying evalscope/perf/arguments.py -> build/lib/evalscope/perf 2026-04-13T12:59:47,370 copying evalscope/perf/http_client.py -> build/lib/evalscope/perf 2026-04-13T12:59:47,373 copying evalscope/perf/benchmark.py -> build/lib/evalscope/perf 2026-04-13T12:59:47,375 copying evalscope/perf/__init__.py -> 
build/lib/evalscope/perf 2026-04-13T12:59:47,378 creating build/lib/evalscope/summarizer 2026-04-13T12:59:47,379 copying evalscope/summarizer/summarizer.py -> build/lib/evalscope/summarizer 2026-04-13T12:59:47,382 copying evalscope/summarizer/__init__.py -> build/lib/evalscope/summarizer 2026-04-13T12:59:47,385 creating build/lib/evalscope/sandbox 2026-04-13T12:59:47,386 copying evalscope/sandbox/volcengine.py -> build/lib/evalscope/sandbox 2026-04-13T12:59:47,389 copying evalscope/sandbox/__init__.py -> build/lib/evalscope/sandbox 2026-04-13T12:59:47,392 creating build/lib/evalscope/third_party 2026-04-13T12:59:47,393 copying evalscope/third_party/__init__.py -> build/lib/evalscope/third_party 2026-04-13T12:59:47,396 creating build/lib/evalscope/evaluator 2026-04-13T12:59:47,397 copying evalscope/evaluator/evaluator.py -> build/lib/evalscope/evaluator 2026-04-13T12:59:47,401 copying evalscope/evaluator/batch_reviewer.py -> build/lib/evalscope/evaluator 2026-04-13T12:59:47,403 copying evalscope/evaluator/__init__.py -> build/lib/evalscope/evaluator 2026-04-13T12:59:47,406 creating build/lib/evalscope/utils 2026-04-13T12:59:47,407 copying evalscope/utils/chat_service.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,410 copying evalscope/utils/url_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,413 copying evalscope/utils/multi_choices.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,416 copying evalscope/utils/json_schema.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,418 copying evalscope/utils/logger.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,421 copying evalscope/utils/deprecation_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,423 copying evalscope/utils/import_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,426 copying evalscope/utils/code_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,430 copying evalscope/utils/model_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,432 copying 
evalscope/utils/io_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,435 copying evalscope/utils/ner.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,438 copying evalscope/utils/function_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,441 copying evalscope/utils/argument_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,443 copying evalscope/utils/resource_utils.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,446 copying evalscope/utils/__init__.py -> build/lib/evalscope/utils 2026-04-13T12:59:47,449 creating build/lib/evalscope/report 2026-04-13T12:59:47,450 copying evalscope/report/report.py -> build/lib/evalscope/report 2026-04-13T12:59:47,453 copying evalscope/report/combinator.py -> build/lib/evalscope/report 2026-04-13T12:59:47,456 copying evalscope/report/generator.py -> build/lib/evalscope/report 2026-04-13T12:59:47,458 copying evalscope/report/renderer.py -> build/lib/evalscope/report 2026-04-13T12:59:47,461 copying evalscope/report/__init__.py -> build/lib/evalscope/report 2026-04-13T12:59:47,464 creating build/lib/evalscope/models 2026-04-13T12:59:47,465 copying evalscope/models/anthropic_compatible.py -> build/lib/evalscope/models 2026-04-13T12:59:47,468 copying evalscope/models/text2image_model.py -> build/lib/evalscope/models 2026-04-13T12:59:47,470 copying evalscope/models/model_apis.py -> build/lib/evalscope/models 2026-04-13T12:59:47,472 copying evalscope/models/image_edit_model.py -> build/lib/evalscope/models 2026-04-13T12:59:47,475 copying evalscope/models/modelscope.py -> build/lib/evalscope/models 2026-04-13T12:59:47,478 copying evalscope/models/mockllm.py -> build/lib/evalscope/models 2026-04-13T12:59:47,480 copying evalscope/models/__init__.py -> build/lib/evalscope/models 2026-04-13T12:59:47,482 copying evalscope/models/openai_compatible.py -> build/lib/evalscope/models 2026-04-13T12:59:47,485 creating build/lib/evalscope/api 2026-04-13T12:59:47,487 copying evalscope/api/registry.py -> 
build/lib/evalscope/api 2026-04-13T12:59:47,490 copying evalscope/api/__init__.py -> build/lib/evalscope/api 2026-04-13T12:59:47,492 creating build/lib/evalscope/metrics 2026-04-13T12:59:47,494 copying evalscope/metrics/math_parser.py -> build/lib/evalscope/metrics 2026-04-13T12:59:47,497 copying evalscope/metrics/llm_judge.py -> build/lib/evalscope/metrics 2026-04-13T12:59:47,499 copying evalscope/metrics/rouge_metric.py -> build/lib/evalscope/metrics 2026-04-13T12:59:47,502 copying evalscope/metrics/metrics.py -> build/lib/evalscope/metrics 2026-04-13T12:59:47,505 copying evalscope/metrics/metric.py -> build/lib/evalscope/metrics 2026-04-13T12:59:47,508 copying evalscope/metrics/__init__.py -> build/lib/evalscope/metrics 2026-04-13T12:59:47,511 creating build/lib/evalscope/backend 2026-04-13T12:59:47,513 copying evalscope/backend/base.py -> build/lib/evalscope/backend 2026-04-13T12:59:47,515 copying evalscope/backend/__init__.py -> build/lib/evalscope/backend 2026-04-13T12:59:47,517 creating build/lib/evalscope/benchmarks/mmlu_pro 2026-04-13T12:59:47,519 copying evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-04-13T12:59:47,522 copying evalscope/benchmarks/mmlu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-04-13T12:59:47,524 creating build/lib/evalscope/benchmarks/musr 2026-04-13T12:59:47,525 copying evalscope/benchmarks/musr/__init__.py -> build/lib/evalscope/benchmarks/musr 2026-04-13T12:59:47,527 copying evalscope/benchmarks/musr/musr_adapter.py -> build/lib/evalscope/benchmarks/musr 2026-04-13T12:59:47,530 creating build/lib/evalscope/benchmarks/wmt 2026-04-13T12:59:47,531 copying evalscope/benchmarks/wmt/wmt24_adapter.py -> build/lib/evalscope/benchmarks/wmt 2026-04-13T12:59:47,534 copying evalscope/benchmarks/wmt/__init__.py -> build/lib/evalscope/benchmarks/wmt 2026-04-13T12:59:47,537 creating build/lib/evalscope/benchmarks/terminal_bench 2026-04-13T12:59:47,538 copying 
evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-13T12:59:47,541 copying evalscope/benchmarks/terminal_bench/utils.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-13T12:59:47,544 copying evalscope/benchmarks/terminal_bench/__init__.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-13T12:59:47,546 creating build/lib/evalscope/benchmarks/librispeech 2026-04-13T12:59:47,547 copying evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/lib/evalscope/benchmarks/librispeech 2026-04-13T12:59:47,550 copying evalscope/benchmarks/librispeech/__init__.py -> build/lib/evalscope/benchmarks/librispeech 2026-04-13T12:59:47,552 creating build/lib/evalscope/benchmarks/super_gpqa 2026-04-13T12:59:47,553 copying evalscope/benchmarks/super_gpqa/prompt.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-13T12:59:47,556 copying evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-13T12:59:47,559 copying evalscope/benchmarks/super_gpqa/utils.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-13T12:59:47,561 copying evalscope/benchmarks/super_gpqa/__init__.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-04-13T12:59:47,564 creating build/lib/evalscope/benchmarks/bfcl 2026-04-13T12:59:47,565 copying evalscope/benchmarks/bfcl/__init__.py -> build/lib/evalscope/benchmarks/bfcl 2026-04-13T12:59:47,568 creating build/lib/evalscope/benchmarks/hmmt25 2026-04-13T12:59:47,569 copying evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/lib/evalscope/benchmarks/hmmt25 2026-04-13T12:59:47,572 creating build/lib/evalscope/benchmarks/halu_eval 2026-04-13T12:59:47,573 copying evalscope/benchmarks/halu_eval/__init__.py -> build/lib/evalscope/benchmarks/halu_eval 2026-04-13T12:59:47,575 copying evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/lib/evalscope/benchmarks/halu_eval 2026-04-13T12:59:47,577 copying 
evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/lib/evalscope/benchmarks/halu_eval 2026-04-13T12:59:47,580 creating build/lib/evalscope/benchmarks/science_qa 2026-04-13T12:59:47,581 copying evalscope/benchmarks/science_qa/__init__.py -> build/lib/evalscope/benchmarks/science_qa 2026-04-13T12:59:47,583 copying evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/lib/evalscope/benchmarks/science_qa 2026-04-13T12:59:47,585 creating build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,587 copying evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,589 copying evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,591 copying evalscope/benchmarks/text2image/tifa_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,593 copying evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,595 copying evalscope/benchmarks/text2image/__init__.py -> build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,596 copying evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-04-13T12:59:47,599 creating build/lib/evalscope/benchmarks/healthbench 2026-04-13T12:59:47,600 copying evalscope/benchmarks/healthbench/utils.py -> build/lib/evalscope/benchmarks/healthbench 2026-04-13T12:59:47,602 copying evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/lib/evalscope/benchmarks/healthbench 2026-04-13T12:59:47,605 copying evalscope/benchmarks/healthbench/__init__.py -> build/lib/evalscope/benchmarks/healthbench 2026-04-13T12:59:47,607 creating build/lib/evalscope/benchmarks/competition_math 2026-04-13T12:59:47,608 copying evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/lib/evalscope/benchmarks/competition_math 2026-04-13T12:59:47,610 copying 
evalscope/benchmarks/competition_math/__init__.py -> build/lib/evalscope/benchmarks/competition_math 2026-04-13T12:59:47,613 creating build/lib/evalscope/benchmarks/real_world_qa 2026-04-13T12:59:47,614 copying evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-04-13T12:59:47,616 copying evalscope/benchmarks/real_world_qa/__init__.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-04-13T12:59:47,618 creating build/lib/evalscope/benchmarks/logi_qa 2026-04-13T12:59:47,619 copying evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/lib/evalscope/benchmarks/logi_qa 2026-04-13T12:59:47,621 copying evalscope/benchmarks/logi_qa/__int__.py -> build/lib/evalscope/benchmarks/logi_qa 2026-04-13T12:59:47,623 creating build/lib/evalscope/benchmarks/a_okvqa 2026-04-13T12:59:47,624 copying evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-04-13T12:59:47,626 copying evalscope/benchmarks/a_okvqa/__init__.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-04-13T12:59:47,628 creating build/lib/evalscope/benchmarks/siqa 2026-04-13T12:59:47,629 copying evalscope/benchmarks/siqa/siqa_adapter.py -> build/lib/evalscope/benchmarks/siqa 2026-04-13T12:59:47,631 copying evalscope/benchmarks/siqa/__init__.py -> build/lib/evalscope/benchmarks/siqa 2026-04-13T12:59:47,633 creating build/lib/evalscope/benchmarks/arena_hard 2026-04-13T12:59:47,634 copying evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/lib/evalscope/benchmarks/arena_hard 2026-04-13T12:59:47,637 copying evalscope/benchmarks/arena_hard/utils.py -> build/lib/evalscope/benchmarks/arena_hard 2026-04-13T12:59:47,639 copying evalscope/benchmarks/arena_hard/__init__.py -> build/lib/evalscope/benchmarks/arena_hard 2026-04-13T12:59:47,641 creating build/lib/evalscope/benchmarks/mbpp 2026-04-13T12:59:47,642 copying evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/lib/evalscope/benchmarks/mbpp 
2026-04-13T12:59:47,644 copying evalscope/benchmarks/mbpp/__init__.py -> build/lib/evalscope/benchmarks/mbpp 2026-04-13T12:59:47,646 creating build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,647 copying evalscope/benchmarks/ifeval/instructions_util.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,650 copying evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,653 copying evalscope/benchmarks/ifeval/instructions.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,655 copying evalscope/benchmarks/ifeval/utils.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,658 copying evalscope/benchmarks/ifeval/instructions_registry.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,660 copying evalscope/benchmarks/ifeval/__init__.py -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:47,662 creating build/lib/evalscope/benchmarks/race 2026-04-13T12:59:47,663 copying evalscope/benchmarks/race/race_adapter.py -> build/lib/evalscope/benchmarks/race 2026-04-13T12:59:47,666 copying evalscope/benchmarks/race/__init__.py -> build/lib/evalscope/benchmarks/race 2026-04-13T12:59:47,668 creating build/lib/evalscope/benchmarks/ceval 2026-04-13T12:59:47,669 copying evalscope/benchmarks/ceval/ceval_adapter.py -> build/lib/evalscope/benchmarks/ceval 2026-04-13T12:59:47,672 copying evalscope/benchmarks/ceval/__init__.py -> build/lib/evalscope/benchmarks/ceval 2026-04-13T12:59:47,674 creating build/lib/evalscope/benchmarks/qasc 2026-04-13T12:59:47,675 copying evalscope/benchmarks/qasc/qasc_adapter.py -> build/lib/evalscope/benchmarks/qasc 2026-04-13T12:59:47,677 copying evalscope/benchmarks/qasc/__init__.py -> build/lib/evalscope/benchmarks/qasc 2026-04-13T12:59:47,679 creating build/lib/evalscope/benchmarks/math_vista 2026-04-13T12:59:47,681 copying evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/lib/evalscope/benchmarks/math_vista 2026-04-13T12:59:47,683 
copying evalscope/benchmarks/math_vista/__init__.py -> build/lib/evalscope/benchmarks/math_vista 2026-04-13T12:59:47,685 creating build/lib/evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:47,686 copying evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:47,689 copying evalscope/benchmarks/openai_mrcr/utils.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:47,691 copying evalscope/benchmarks/openai_mrcr/__init__.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:47,693 creating build/lib/evalscope/benchmarks/eq_bench 2026-04-13T12:59:47,694 copying evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/lib/evalscope/benchmarks/eq_bench 2026-04-13T12:59:47,697 copying evalscope/benchmarks/eq_bench/answer_validation.py -> build/lib/evalscope/benchmarks/eq_bench 2026-04-13T12:59:47,699 copying evalscope/benchmarks/eq_bench/__init__.py -> build/lib/evalscope/benchmarks/eq_bench 2026-04-13T12:59:47,702 creating build/lib/evalscope/benchmarks/arc 2026-04-13T12:59:47,702 copying evalscope/benchmarks/arc/arc_adapter.py -> build/lib/evalscope/benchmarks/arc 2026-04-13T12:59:47,705 copying evalscope/benchmarks/arc/__init__.py -> build/lib/evalscope/benchmarks/arc 2026-04-13T12:59:47,707 creating build/lib/evalscope/benchmarks/process_bench 2026-04-13T12:59:47,708 copying evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/lib/evalscope/benchmarks/process_bench 2026-04-13T12:59:47,711 copying evalscope/benchmarks/process_bench/__init__.py -> build/lib/evalscope/benchmarks/process_bench 2026-04-13T12:59:47,713 creating build/lib/evalscope/benchmarks/math_vision 2026-04-13T12:59:47,714 copying evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/lib/evalscope/benchmarks/math_vision 2026-04-13T12:59:47,716 copying evalscope/benchmarks/math_vision/__init__.py -> build/lib/evalscope/benchmarks/math_vision 2026-04-13T12:59:47,718 creating 
build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,719 copying evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,722 copying evalscope/benchmarks/ifbench/instructions_util.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,724 copying evalscope/benchmarks/ifbench/instructions.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,728 copying evalscope/benchmarks/ifbench/evaluation_lib.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,730 copying evalscope/benchmarks/ifbench/instructions_registry.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,733 copying evalscope/benchmarks/ifbench/__init__.py -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:47,735 creating build/lib/evalscope/benchmarks/micro_vqa 2026-04-13T12:59:47,736 copying evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-04-13T12:59:47,738 copying evalscope/benchmarks/micro_vqa/__init__.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-04-13T12:59:47,740 creating build/lib/evalscope/benchmarks/mmlu 2026-04-13T12:59:47,742 copying evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/lib/evalscope/benchmarks/mmlu 2026-04-13T12:59:47,744 copying evalscope/benchmarks/mmlu/__init__.py -> build/lib/evalscope/benchmarks/mmlu 2026-04-13T12:59:47,747 creating build/lib/evalscope/benchmarks/maritime_bench 2026-04-13T12:59:47,748 copying evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-04-13T12:59:47,750 copying evalscope/benchmarks/maritime_bench/__init__.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-04-13T12:59:47,752 creating build/lib/evalscope/benchmarks/math_500 2026-04-13T12:59:47,753 copying evalscope/benchmarks/math_500/math_500_adapter.py -> build/lib/evalscope/benchmarks/math_500 2026-04-13T12:59:47,755 copying 
evalscope/benchmarks/math_500/__init__.py -> build/lib/evalscope/benchmarks/math_500 2026-04-13T12:59:47,758 creating build/lib/evalscope/benchmarks/simple_vqa 2026-04-13T12:59:47,759 copying evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-04-13T12:59:47,761 copying evalscope/benchmarks/simple_vqa/__init__.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-04-13T12:59:47,763 creating build/lib/evalscope/benchmarks/general_fc 2026-04-13T12:59:47,764 copying evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/lib/evalscope/benchmarks/general_fc 2026-04-13T12:59:47,767 copying evalscope/benchmarks/general_fc/__init__.py -> build/lib/evalscope/benchmarks/general_fc 2026-04-13T12:59:47,769 creating build/lib/evalscope/benchmarks/mbppplus 2026-04-13T12:59:47,770 copying evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/lib/evalscope/benchmarks/mbppplus 2026-04-13T12:59:47,772 copying evalscope/benchmarks/mbppplus/__init__.py -> build/lib/evalscope/benchmarks/mbppplus 2026-04-13T12:59:47,774 creating build/lib/evalscope/benchmarks/omni_bench 2026-04-13T12:59:47,775 copying evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/lib/evalscope/benchmarks/omni_bench 2026-04-13T12:59:47,778 copying evalscope/benchmarks/omni_bench/__init__.py -> build/lib/evalscope/benchmarks/omni_bench 2026-04-13T12:59:47,780 creating build/lib/evalscope/benchmarks/piqa 2026-04-13T12:59:47,780 copying evalscope/benchmarks/piqa/piqa_adapter.py -> build/lib/evalscope/benchmarks/piqa 2026-04-13T12:59:47,782 copying evalscope/benchmarks/piqa/__init__.py -> build/lib/evalscope/benchmarks/piqa 2026-04-13T12:59:47,784 creating build/lib/evalscope/benchmarks/hellaswag 2026-04-13T12:59:47,785 copying evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/lib/evalscope/benchmarks/hellaswag 2026-04-13T12:59:47,787 copying evalscope/benchmarks/hellaswag/__init__.py -> build/lib/evalscope/benchmarks/hellaswag 
2026-04-13T12:59:47,790 creating build/lib/evalscope/benchmarks/swe_bench 2026-04-13T12:59:47,791 copying evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-13T12:59:47,793 copying evalscope/benchmarks/swe_bench/build_images.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-13T12:59:47,796 copying evalscope/benchmarks/swe_bench/utils.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-13T12:59:47,798 copying evalscope/benchmarks/swe_bench/__init__.py -> build/lib/evalscope/benchmarks/swe_bench 2026-04-13T12:59:47,801 creating build/lib/evalscope/benchmarks/drivelology 2026-04-13T12:59:47,802 copying evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-13T12:59:47,804 copying evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-13T12:59:47,807 copying evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-13T12:59:47,810 copying evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-13T12:59:47,812 copying evalscope/benchmarks/drivelology/__init__.py -> build/lib/evalscope/benchmarks/drivelology 2026-04-13T12:59:47,814 creating build/lib/evalscope/benchmarks/alpaca_eval 2026-04-13T12:59:47,815 copying evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-04-13T12:59:47,818 copying evalscope/benchmarks/alpaca_eval/__init__.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-04-13T12:59:47,820 creating build/lib/evalscope/benchmarks/mmlu_redux 2026-04-13T12:59:47,821 copying evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-04-13T12:59:47,824 copying evalscope/benchmarks/mmlu_redux/__init__.py -> build/lib/evalscope/benchmarks/mmlu_redux 
2026-04-13T12:59:47,826 creating build/lib/evalscope/benchmarks/docvqa 2026-04-13T12:59:47,827 copying evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/lib/evalscope/benchmarks/docvqa 2026-04-13T12:59:47,829 copying evalscope/benchmarks/docvqa/__init__.py -> build/lib/evalscope/benchmarks/docvqa 2026-04-13T12:59:47,831 creating build/lib/evalscope/benchmarks/mmmlu 2026-04-13T12:59:47,832 copying evalscope/benchmarks/mmmlu/prompt.py -> build/lib/evalscope/benchmarks/mmmlu 2026-04-13T12:59:47,835 copying evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/lib/evalscope/benchmarks/mmmlu 2026-04-13T12:59:47,837 copying evalscope/benchmarks/mmmlu/__init__.py -> build/lib/evalscope/benchmarks/mmmlu 2026-04-13T12:59:47,839 creating build/lib/evalscope/benchmarks/trivia_qa 2026-04-13T12:59:47,840 copying evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-04-13T12:59:47,842 copying evalscope/benchmarks/trivia_qa/__init__.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-04-13T12:59:47,845 creating build/lib/evalscope/benchmarks/aime 2026-04-13T12:59:47,846 copying evalscope/benchmarks/aime/grader.py -> build/lib/evalscope/benchmarks/aime 2026-04-13T12:59:47,849 copying evalscope/benchmarks/aime/aime_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-04-13T12:59:47,852 copying evalscope/benchmarks/aime/__init__.py -> build/lib/evalscope/benchmarks/aime 2026-04-13T12:59:47,854 copying evalscope/benchmarks/aime/math_normalize.py -> build/lib/evalscope/benchmarks/aime 2026-04-13T12:59:47,856 creating build/lib/evalscope/benchmarks/truthful_qa 2026-04-13T12:59:47,857 copying evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-04-13T12:59:47,860 copying evalscope/benchmarks/truthful_qa/__init__.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-04-13T12:59:47,862 creating build/lib/evalscope/benchmarks/general_mcq 2026-04-13T12:59:47,863 copying 
evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/lib/evalscope/benchmarks/general_mcq 2026-04-13T12:59:47,865 copying evalscope/benchmarks/general_mcq/__init__.py -> build/lib/evalscope/benchmarks/general_mcq 2026-04-13T12:59:47,868 creating build/lib/evalscope/benchmarks/chartqa 2026-04-13T12:59:47,869 copying evalscope/benchmarks/chartqa/utils.py -> build/lib/evalscope/benchmarks/chartqa 2026-04-13T12:59:47,871 copying evalscope/benchmarks/chartqa/__init__.py -> build/lib/evalscope/benchmarks/chartqa 2026-04-13T12:59:47,872 copying evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/lib/evalscope/benchmarks/chartqa 2026-04-13T12:59:47,875 creating build/lib/evalscope/benchmarks/general_qa 2026-04-13T12:59:47,876 copying evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/lib/evalscope/benchmarks/general_qa 2026-04-13T12:59:47,878 copying evalscope/benchmarks/general_qa/__init__.py -> build/lib/evalscope/benchmarks/general_qa 2026-04-13T12:59:47,881 creating build/lib/evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:47,882 copying evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:47,885 copying evalscope/benchmarks/olympiad_bench/utils.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:47,888 copying evalscope/benchmarks/olympiad_bench/__init__.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:47,890 creating build/lib/evalscope/benchmarks/math_qa 2026-04-13T12:59:47,891 copying evalscope/benchmarks/math_qa/__init__.py -> build/lib/evalscope/benchmarks/math_qa 2026-04-13T12:59:47,893 copying evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/lib/evalscope/benchmarks/math_qa 2026-04-13T12:59:47,897 creating build/lib/evalscope/benchmarks/chinese_simple_qa 2026-04-13T12:59:47,898 copying evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 
2026-04-13T12:59:47,900 copying evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-04-13T12:59:47,902 creating build/lib/evalscope/benchmarks/med_mcqa 2026-04-13T12:59:47,904 copying evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-04-13T12:59:47,906 copying evalscope/benchmarks/med_mcqa/__init__.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-04-13T12:59:47,908 creating build/lib/evalscope/benchmarks/mmmu_pro 2026-04-13T12:59:47,909 copying evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-04-13T12:59:47,912 copying evalscope/benchmarks/mmmu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-04-13T12:59:47,914 creating build/lib/evalscope/benchmarks/hallusion_bench 2026-04-13T12:59:47,915 copying evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-04-13T12:59:47,917 copying evalscope/benchmarks/hallusion_bench/__init__.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-04-13T12:59:47,919 creating build/lib/evalscope/benchmarks/gsm8k 2026-04-13T12:59:47,920 copying evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/lib/evalscope/benchmarks/gsm8k 2026-04-13T12:59:47,922 copying evalscope/benchmarks/gsm8k/__init__.py -> build/lib/evalscope/benchmarks/gsm8k 2026-04-13T12:59:47,924 creating build/lib/evalscope/benchmarks/iquiz 2026-04-13T12:59:47,925 copying evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/lib/evalscope/benchmarks/iquiz 2026-04-13T12:59:47,927 copying evalscope/benchmarks/iquiz/__init__.py -> build/lib/evalscope/benchmarks/iquiz 2026-04-13T12:59:47,930 creating build/lib/evalscope/benchmarks/data_collection 2026-04-13T12:59:47,931 copying evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/lib/evalscope/benchmarks/data_collection 2026-04-13T12:59:47,934 copying 
evalscope/benchmarks/data_collection/__init__.py -> build/lib/evalscope/benchmarks/data_collection 2026-04-13T12:59:47,936 creating build/lib/evalscope/benchmarks/commonsense_qa 2026-04-13T12:59:47,937 copying evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-04-13T12:59:47,939 copying evalscope/benchmarks/commonsense_qa/__init__.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-04-13T12:59:47,941 creating build/lib/evalscope/benchmarks/gsm8k_v 2026-04-13T12:59:47,942 copying evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-04-13T12:59:47,944 copying evalscope/benchmarks/gsm8k_v/__init__.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-04-13T12:59:47,946 creating build/lib/evalscope/benchmarks/visu_logic 2026-04-13T12:59:47,947 copying evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/lib/evalscope/benchmarks/visu_logic 2026-04-13T12:59:47,950 copying evalscope/benchmarks/visu_logic/__init__.py -> build/lib/evalscope/benchmarks/visu_logic 2026-04-13T12:59:47,952 creating build/lib/evalscope/benchmarks/cmmlu 2026-04-13T12:59:47,953 copying evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/lib/evalscope/benchmarks/cmmlu 2026-04-13T12:59:47,955 copying evalscope/benchmarks/cmmlu/__init__.py -> build/lib/evalscope/benchmarks/cmmlu 2026-04-13T12:59:47,958 creating build/lib/evalscope/benchmarks/mm_bench 2026-04-13T12:59:47,959 copying evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/lib/evalscope/benchmarks/mm_bench 2026-04-13T12:59:47,961 copying evalscope/benchmarks/mm_bench/__init__.py -> build/lib/evalscope/benchmarks/mm_bench 2026-04-13T12:59:47,963 creating build/lib/evalscope/benchmarks/general_vqa 2026-04-13T12:59:47,965 copying evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/lib/evalscope/benchmarks/general_vqa 2026-04-13T12:59:47,967 copying evalscope/benchmarks/general_vqa/__init__.py -> 
build/lib/evalscope/benchmarks/general_vqa 2026-04-13T12:59:47,969 creating build/lib/evalscope/benchmarks/refcoco 2026-04-13T12:59:47,970 copying evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-13T12:59:47,973 copying evalscope/benchmarks/refcoco/evaluation_lib.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-13T12:59:47,975 copying evalscope/benchmarks/refcoco/utils.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-13T12:59:47,977 copying evalscope/benchmarks/refcoco/__init__.py -> build/lib/evalscope/benchmarks/refcoco 2026-04-13T12:59:47,979 creating build/lib/evalscope/benchmarks/needle_haystack 2026-04-13T12:59:47,980 copying evalscope/benchmarks/needle_haystack/utils.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-13T12:59:47,982 copying evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-13T12:59:47,985 copying evalscope/benchmarks/needle_haystack/__init__.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-13T12:59:47,987 creating build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:47,988 copying evalscope/benchmarks/live_code_bench/prompts.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:47,991 copying evalscope/benchmarks/live_code_bench/load_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:47,993 copying evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:47,996 copying evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:47,998 copying evalscope/benchmarks/live_code_bench/testing_util.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:48,001 copying evalscope/benchmarks/live_code_bench/extract_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:48,003 copying 
evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:48,005 copying evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:48,007 copying evalscope/benchmarks/live_code_bench/__init__.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:48,009 creating build/lib/evalscope/benchmarks/mia_bench 2026-04-13T12:59:48,011 copying evalscope/benchmarks/mia_bench/mia_bench_adapter.py -> build/lib/evalscope/benchmarks/mia_bench 2026-04-13T12:59:48,013 copying evalscope/benchmarks/mia_bench/utils.py -> build/lib/evalscope/benchmarks/mia_bench 2026-04-13T12:59:48,015 copying evalscope/benchmarks/mia_bench/__init__.py -> build/lib/evalscope/benchmarks/mia_bench 2026-04-13T12:59:48,017 creating build/lib/evalscope/benchmarks/pope 2026-04-13T12:59:48,018 copying evalscope/benchmarks/pope/pope_adapter.py -> build/lib/evalscope/benchmarks/pope 2026-04-13T12:59:48,021 copying evalscope/benchmarks/pope/__init__.py -> build/lib/evalscope/benchmarks/pope 2026-04-13T12:59:48,023 creating build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-04-13T12:59:48,024 copying evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-04-13T12:59:48,026 copying evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-04-13T12:59:48,028 creating build/lib/evalscope/benchmarks/gpqa 2026-04-13T12:59:48,029 copying evalscope/benchmarks/gpqa/prompt.py -> build/lib/evalscope/benchmarks/gpqa 2026-04-13T12:59:48,032 copying evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/lib/evalscope/benchmarks/gpqa 2026-04-13T12:59:48,034 copying evalscope/benchmarks/gpqa/__init__.py -> build/lib/evalscope/benchmarks/gpqa 2026-04-13T12:59:48,036 creating build/lib/evalscope/benchmarks/humaneval 2026-04-13T12:59:48,038 copying 
evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/lib/evalscope/benchmarks/humaneval 2026-04-13T12:59:48,040 copying evalscope/benchmarks/humaneval/utils.py -> build/lib/evalscope/benchmarks/humaneval 2026-04-13T12:59:48,043 copying evalscope/benchmarks/humaneval/__init__.py -> build/lib/evalscope/benchmarks/humaneval 2026-04-13T12:59:48,045 creating build/lib/evalscope/benchmarks/ocr_bench 2026-04-13T12:59:48,046 copying evalscope/benchmarks/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench 2026-04-13T12:59:48,048 creating build/lib/evalscope/benchmarks/tool_bench 2026-04-13T12:59:48,050 copying evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/lib/evalscope/benchmarks/tool_bench 2026-04-13T12:59:48,052 copying evalscope/benchmarks/tool_bench/utils.py -> build/lib/evalscope/benchmarks/tool_bench 2026-04-13T12:59:48,055 copying evalscope/benchmarks/tool_bench/__init__.py -> build/lib/evalscope/benchmarks/tool_bench 2026-04-13T12:59:48,057 creating build/lib/evalscope/benchmarks/poly_math 2026-04-13T12:59:48,058 copying evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/lib/evalscope/benchmarks/poly_math 2026-04-13T12:59:48,060 copying evalscope/benchmarks/poly_math/__init__.py -> build/lib/evalscope/benchmarks/poly_math 2026-04-13T12:59:48,062 creating build/lib/evalscope/benchmarks/infovqa 2026-04-13T12:59:48,063 copying evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/lib/evalscope/benchmarks/infovqa 2026-04-13T12:59:48,065 copying evalscope/benchmarks/infovqa/__init__.py -> build/lib/evalscope/benchmarks/infovqa 2026-04-13T12:59:48,067 creating build/lib/evalscope/benchmarks/humanevalplus 2026-04-13T12:59:48,068 copying evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-04-13T12:59:48,071 copying evalscope/benchmarks/humanevalplus/__init__.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-04-13T12:59:48,073 creating 
build/lib/evalscope/benchmarks/image_edit 2026-04-13T12:59:48,074 copying evalscope/benchmarks/image_edit/__init__.py -> build/lib/evalscope/benchmarks/image_edit 2026-04-13T12:59:48,077 creating build/lib/evalscope/benchmarks/vstar_bench 2026-04-13T12:59:48,078 copying evalscope/benchmarks/vstar_bench/__init__.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-04-13T12:59:48,080 copying evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-04-13T12:59:48,082 creating build/lib/evalscope/benchmarks/torgo 2026-04-13T12:59:48,083 copying evalscope/benchmarks/torgo/torgo_adapter.py -> build/lib/evalscope/benchmarks/torgo 2026-04-13T12:59:48,086 copying evalscope/benchmarks/torgo/__init__.py -> build/lib/evalscope/benchmarks/torgo 2026-04-13T12:59:48,088 creating build/lib/evalscope/benchmarks/aa_lcr 2026-04-13T12:59:48,089 copying evalscope/benchmarks/aa_lcr/__init__.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-04-13T12:59:48,091 copying evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-04-13T12:59:48,094 creating build/lib/evalscope/benchmarks/amc 2026-04-13T12:59:48,095 copying evalscope/benchmarks/amc/amc_adapter.py -> build/lib/evalscope/benchmarks/amc 2026-04-13T12:59:48,097 copying evalscope/benchmarks/amc/__init__.py -> build/lib/evalscope/benchmarks/amc 2026-04-13T12:59:48,099 creating build/lib/evalscope/benchmarks/simple_qa 2026-04-13T12:59:48,100 copying evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/lib/evalscope/benchmarks/simple_qa 2026-04-13T12:59:48,104 copying evalscope/benchmarks/simple_qa/__init__.py -> build/lib/evalscope/benchmarks/simple_qa 2026-04-13T12:59:48,106 creating build/lib/evalscope/benchmarks/fleurs 2026-04-13T12:59:48,107 copying evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/lib/evalscope/benchmarks/fleurs 2026-04-13T12:59:48,110 copying evalscope/benchmarks/fleurs/__init__.py -> 
build/lib/evalscope/benchmarks/fleurs 2026-04-13T12:59:48,112 creating build/lib/evalscope/benchmarks/tau_bench 2026-04-13T12:59:48,113 copying evalscope/benchmarks/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench 2026-04-13T12:59:48,115 creating build/lib/evalscope/benchmarks/mgsm 2026-04-13T12:59:48,116 copying evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/lib/evalscope/benchmarks/mgsm 2026-04-13T12:59:48,119 copying evalscope/benchmarks/mgsm/__init__.py -> build/lib/evalscope/benchmarks/mgsm 2026-04-13T12:59:48,121 creating build/lib/evalscope/benchmarks/docmath 2026-04-13T12:59:48,122 copying evalscope/benchmarks/docmath/utils.py -> build/lib/evalscope/benchmarks/docmath 2026-04-13T12:59:48,125 copying evalscope/benchmarks/docmath/__init__.py -> build/lib/evalscope/benchmarks/docmath 2026-04-13T12:59:48,126 copying evalscope/benchmarks/docmath/docmath_adapter.py -> build/lib/evalscope/benchmarks/docmath 2026-04-13T12:59:48,129 creating build/lib/evalscope/benchmarks/mmmu 2026-04-13T12:59:48,130 copying evalscope/benchmarks/mmmu/__init__.py -> build/lib/evalscope/benchmarks/mmmu 2026-04-13T12:59:48,132 copying evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/lib/evalscope/benchmarks/mmmu 2026-04-13T12:59:48,135 creating build/lib/evalscope/benchmarks/drop 2026-04-13T12:59:48,137 copying evalscope/benchmarks/drop/drop_adapter.py -> build/lib/evalscope/benchmarks/drop 2026-04-13T12:59:48,139 copying evalscope/benchmarks/drop/utils.py -> build/lib/evalscope/benchmarks/drop 2026-04-13T12:59:48,142 copying evalscope/benchmarks/drop/__init__.py -> build/lib/evalscope/benchmarks/drop 2026-04-13T12:59:48,144 creating build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:48,145 copying evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:48,148 copying evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/lib/evalscope/benchmarks/omnidoc_bench 
2026-04-13T12:59:48,150 copying evalscope/benchmarks/omnidoc_bench/metrics.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:48,153 copying evalscope/benchmarks/omnidoc_bench/utils.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:48,158 copying evalscope/benchmarks/omnidoc_bench/__init__.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:48,160 creating build/lib/evalscope/benchmarks/hle 2026-04-13T12:59:48,161 copying evalscope/benchmarks/hle/hle_adapter.py -> build/lib/evalscope/benchmarks/hle 2026-04-13T12:59:48,164 copying evalscope/benchmarks/hle/__init__.py -> build/lib/evalscope/benchmarks/hle 2026-04-13T12:59:48,166 creating build/lib/evalscope/benchmarks/cl_bench 2026-04-13T12:59:48,167 copying evalscope/benchmarks/cl_bench/utils.py -> build/lib/evalscope/benchmarks/cl_bench 2026-04-13T12:59:48,169 copying evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/lib/evalscope/benchmarks/cl_bench 2026-04-13T12:59:48,172 copying evalscope/benchmarks/cl_bench/__init__.py -> build/lib/evalscope/benchmarks/cl_bench 2026-04-13T12:59:48,174 creating build/lib/evalscope/benchmarks/general_arena 2026-04-13T12:59:48,175 copying evalscope/benchmarks/general_arena/utils.py -> build/lib/evalscope/benchmarks/general_arena 2026-04-13T12:59:48,178 copying evalscope/benchmarks/general_arena/__init__.py -> build/lib/evalscope/benchmarks/general_arena 2026-04-13T12:59:48,180 copying evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/lib/evalscope/benchmarks/general_arena 2026-04-13T12:59:48,183 creating build/lib/evalscope/benchmarks/zerobench 2026-04-13T12:59:48,184 copying evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/lib/evalscope/benchmarks/zerobench 2026-04-13T12:59:48,186 copying evalscope/benchmarks/zerobench/__init__.py -> build/lib/evalscope/benchmarks/zerobench 2026-04-13T12:59:48,188 creating build/lib/evalscope/benchmarks/general_vmcq 2026-04-13T12:59:48,189 copying 
evalscope/benchmarks/general_vmcq/__init__.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-04-13T12:59:48,191 copying evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-04-13T12:59:48,194 creating build/lib/evalscope/benchmarks/winogrande 2026-04-13T12:59:48,195 copying evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/lib/evalscope/benchmarks/winogrande 2026-04-13T12:59:48,197 copying evalscope/benchmarks/winogrande/__init__.py -> build/lib/evalscope/benchmarks/winogrande 2026-04-13T12:59:48,199 creating build/lib/evalscope/benchmarks/scicode 2026-04-13T12:59:48,201 copying evalscope/benchmarks/scicode/util.py -> build/lib/evalscope/benchmarks/scicode 2026-04-13T12:59:48,203 copying evalscope/benchmarks/scicode/scicode_adapter.py -> build/lib/evalscope/benchmarks/scicode 2026-04-13T12:59:48,205 copying evalscope/benchmarks/scicode/prompt_templates.py -> build/lib/evalscope/benchmarks/scicode 2026-04-13T12:59:48,207 copying evalscope/benchmarks/scicode/__init__.py -> build/lib/evalscope/benchmarks/scicode 2026-04-13T12:59:48,209 creating build/lib/evalscope/benchmarks/bbh 2026-04-13T12:59:48,210 copying evalscope/benchmarks/bbh/bbh_adapter.py -> build/lib/evalscope/benchmarks/bbh 2026-04-13T12:59:48,213 copying evalscope/benchmarks/bbh/__init__.py -> build/lib/evalscope/benchmarks/bbh 2026-04-13T12:59:48,215 creating build/lib/evalscope/benchmarks/sciq 2026-04-13T12:59:48,216 copying evalscope/benchmarks/sciq/sciq_adapter.py -> build/lib/evalscope/benchmarks/sciq 2026-04-13T12:59:48,218 copying evalscope/benchmarks/sciq/__init__.py -> build/lib/evalscope/benchmarks/sciq 2026-04-13T12:59:48,220 creating build/lib/evalscope/benchmarks/longbench_v2 2026-04-13T12:59:48,222 copying evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-04-13T12:59:48,224 copying evalscope/benchmarks/longbench_v2/__init__.py -> 
build/lib/evalscope/benchmarks/longbench_v2 2026-04-13T12:59:48,226 creating build/lib/evalscope/benchmarks/mm_star 2026-04-13T12:59:48,227 copying evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/lib/evalscope/benchmarks/mm_star 2026-04-13T12:59:48,229 copying evalscope/benchmarks/mm_star/__init__.py -> build/lib/evalscope/benchmarks/mm_star 2026-04-13T12:59:48,232 creating build/lib/evalscope/benchmarks/multi_if 2026-04-13T12:59:48,233 copying evalscope/benchmarks/multi_if/ifeval.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-13T12:59:48,238 copying evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-13T12:59:48,240 copying evalscope/benchmarks/multi_if/metrics.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-13T12:59:48,243 copying evalscope/benchmarks/multi_if/__init__.py -> build/lib/evalscope/benchmarks/multi_if 2026-04-13T12:59:48,245 creating build/lib/evalscope/benchmarks/math_verse 2026-04-13T12:59:48,246 copying evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/lib/evalscope/benchmarks/math_verse 2026-04-13T12:59:48,248 copying evalscope/benchmarks/math_verse/__init__.py -> build/lib/evalscope/benchmarks/math_verse 2026-04-13T12:59:48,251 creating build/lib/evalscope/benchmarks/multipl_e 2026-04-13T12:59:48,252 copying evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-13T12:59:48,254 copying evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-13T12:59:48,257 copying evalscope/benchmarks/multipl_e/utils.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-13T12:59:48,259 copying evalscope/benchmarks/multipl_e/__init__.py -> build/lib/evalscope/benchmarks/multipl_e 2026-04-13T12:59:48,261 creating build/lib/evalscope/benchmarks/blink 2026-04-13T12:59:48,263 copying evalscope/benchmarks/blink/blink_adapter.py -> build/lib/evalscope/benchmarks/blink 
2026-04-13T12:59:48,265 copying evalscope/benchmarks/blink/__init__.py -> build/lib/evalscope/benchmarks/blink 2026-04-13T12:59:48,267 creating build/lib/evalscope/benchmarks/cmmmu 2026-04-13T12:59:48,268 copying evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmmu 2026-04-13T12:59:48,271 copying evalscope/benchmarks/cmmmu/utils.py -> build/lib/evalscope/benchmarks/cmmmu 2026-04-13T12:59:48,273 copying evalscope/benchmarks/cmmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmmu 2026-04-13T12:59:48,275 creating build/lib/evalscope/benchmarks/frames 2026-04-13T12:59:48,276 copying evalscope/benchmarks/frames/utils.py -> build/lib/evalscope/benchmarks/frames 2026-04-13T12:59:48,279 copying evalscope/benchmarks/frames/__init__.py -> build/lib/evalscope/benchmarks/frames 2026-04-13T12:59:48,280 copying evalscope/benchmarks/frames/frames_adapter.py -> build/lib/evalscope/benchmarks/frames 2026-04-13T12:59:48,283 creating build/lib/evalscope/benchmarks/mri_mcqa 2026-04-13T12:59:48,284 copying evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-04-13T12:59:48,286 copying evalscope/benchmarks/mri_mcqa/__init__.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-04-13T12:59:48,289 creating build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,290 copying evalscope/benchmarks/ner/fin_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,292 copying evalscope/benchmarks/ner/genia_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,294 copying evalscope/benchmarks/ner/copious_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,296 copying evalscope/benchmarks/ner/bc2gm_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,298 copying evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,300 copying evalscope/benchmarks/ner/anat_em_adapter.py -> build/lib/evalscope/benchmarks/ner 
2026-04-13T12:59:48,302 copying evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,304 copying evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,306 copying evalscope/benchmarks/ner/jnlpba_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,308 copying evalscope/benchmarks/ner/cross_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,310 copying evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,312 copying evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,314 copying evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,317 copying evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,319 copying evalscope/benchmarks/ner/conll2003_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,321 copying evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,323 copying evalscope/benchmarks/ner/wnut2017_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,325 copying evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,327 copying evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,328 copying evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,330 copying evalscope/benchmarks/ner/ncbi_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,332 copying evalscope/benchmarks/ner/__init__.py -> build/lib/evalscope/benchmarks/ner 2026-04-13T12:59:48,334 copying evalscope/benchmarks/ner/conllpp_adapter.py -> build/lib/evalscope/benchmarks/ner 
2026-04-13T12:59:48,336 creating build/lib/evalscope/benchmarks/coin_flip 2026-04-13T12:59:48,338 copying evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/lib/evalscope/benchmarks/coin_flip 2026-04-13T12:59:48,340 copying evalscope/benchmarks/coin_flip/__init__.py -> build/lib/evalscope/benchmarks/coin_flip 2026-04-13T12:59:48,342 creating build/lib/evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:48,343 copying evalscope/benchmarks/zebralogicbench/utils.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:48,346 copying evalscope/benchmarks/zebralogicbench/__init__.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:48,347 copying evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:48,350 creating build/lib/evalscope/benchmarks/pumed_qa 2026-04-13T12:59:48,351 copying evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-04-13T12:59:48,354 copying evalscope/benchmarks/pumed_qa/__init__.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-04-13T12:59:48,355 creating build/lib/evalscope/benchmarks/biomix_qa 2026-04-13T12:59:48,356 copying evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-04-13T12:59:48,358 copying evalscope/benchmarks/biomix_qa/__init__.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-04-13T12:59:48,360 creating build/lib/evalscope/benchmarks/cmmu 2026-04-13T12:59:48,361 copying evalscope/benchmarks/cmmu/prompt.py -> build/lib/evalscope/benchmarks/cmmu 2026-04-13T12:59:48,363 copying evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmu 2026-04-13T12:59:48,366 copying evalscope/benchmarks/cmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmu 2026-04-13T12:59:48,367 creating build/lib/evalscope/benchmarks/ai2d 2026-04-13T12:59:48,368 copying evalscope/benchmarks/ai2d/ai2d_adapter.py -> 
build/lib/evalscope/benchmarks/ai2d 2026-04-13T12:59:48,370 copying evalscope/benchmarks/ai2d/__init__.py -> build/lib/evalscope/benchmarks/ai2d 2026-04-13T12:59:48,372 creating build/lib/evalscope/benchmarks/minerva_math 2026-04-13T12:59:48,373 copying evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/lib/evalscope/benchmarks/minerva_math 2026-04-13T12:59:48,375 copying evalscope/benchmarks/minerva_math/__init__.py -> build/lib/evalscope/benchmarks/minerva_math 2026-04-13T12:59:48,377 creating build/lib/evalscope/benchmarks/music_trivia 2026-04-13T12:59:48,378 copying evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/lib/evalscope/benchmarks/music_trivia 2026-04-13T12:59:48,380 copying evalscope/benchmarks/music_trivia/__init__.py -> build/lib/evalscope/benchmarks/music_trivia 2026-04-13T12:59:48,382 creating build/lib/evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:48,383 copying evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:48,386 copying evalscope/benchmarks/bfcl/v4/utils.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:48,388 copying evalscope/benchmarks/bfcl/v4/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:48,390 creating build/lib/evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:48,391 copying evalscope/benchmarks/bfcl/v3/generation.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:48,393 copying evalscope/benchmarks/bfcl/v3/utils.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:48,395 copying evalscope/benchmarks/bfcl/v3/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:48,397 copying evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:48,400 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-13T12:59:48,401 copying evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> 
build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-13T12:59:48,403 copying evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-13T12:59:48,405 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,406 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,409 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,411 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,413 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,414 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,417 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,419 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,421 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,422 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:48,425 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:48,426 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:48,429 copying 
evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:48,430 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:48,433 creating build/lib/evalscope/benchmarks/poly_math/utils 2026-04-13T12:59:48,434 copying evalscope/benchmarks/poly_math/utils/instruction.py -> build/lib/evalscope/benchmarks/poly_math/utils 2026-04-13T12:59:48,437 creating build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:48,437 copying evalscope/benchmarks/image_edit/gedit/utils.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:48,440 copying evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:48,442 copying evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:48,445 copying evalscope/benchmarks/image_edit/gedit/__init__.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:48,447 creating build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:48,447 copying evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:48,450 copying evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:48,451 copying evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:48,454 creating build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:48,455 copying evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:48,457 copying evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> 
build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:48,459 copying evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:48,461 creating build/lib/evalscope/benchmarks/scicode/docker 2026-04-13T12:59:48,462 copying evalscope/benchmarks/scicode/docker/test_util.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-13T12:59:48,464 copying evalscope/benchmarks/scicode/docker/process_data.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-13T12:59:48,467 creating build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,468 copying evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,470 copying evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,472 copying evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,474 copying evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,476 copying evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,477 copying evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:48,479 creating build/lib/evalscope/service/frontend 2026-04-13T12:59:48,480 copying evalscope/service/frontend/main.py -> build/lib/evalscope/service/frontend 2026-04-13T12:59:48,482 copying evalscope/service/frontend/async_client.py -> build/lib/evalscope/service/frontend 2026-04-13T12:59:48,485 copying evalscope/service/frontend/utils.py -> build/lib/evalscope/service/frontend 2026-04-13T12:59:48,487 copying evalscope/service/frontend/__init__.py -> build/lib/evalscope/service/frontend 
2026-04-13T12:59:48,488 creating build/lib/evalscope/service/blueprints 2026-04-13T12:59:48,489 copying evalscope/service/blueprints/perf.py -> build/lib/evalscope/service/blueprints 2026-04-13T12:59:48,491 copying evalscope/service/blueprints/eval.py -> build/lib/evalscope/service/blueprints 2026-04-13T12:59:48,494 copying evalscope/service/blueprints/__init__.py -> build/lib/evalscope/service/blueprints 2026-04-13T12:59:48,496 creating build/lib/evalscope/service/utils 2026-04-13T12:59:48,496 copying evalscope/service/utils/benchmarks.py -> build/lib/evalscope/service/utils 2026-04-13T12:59:48,499 copying evalscope/service/utils/process.py -> build/lib/evalscope/service/utils 2026-04-13T12:59:48,501 copying evalscope/service/utils/log.py -> build/lib/evalscope/service/utils 2026-04-13T12:59:48,503 copying evalscope/service/utils/__init__.py -> build/lib/evalscope/service/utils 2026-04-13T12:59:48,505 creating build/lib/evalscope/app/utils 2026-04-13T12:59:48,506 copying evalscope/app/utils/text_utils.py -> build/lib/evalscope/app/utils 2026-04-13T12:59:48,508 copying evalscope/app/utils/visualization.py -> build/lib/evalscope/app/utils 2026-04-13T12:59:48,510 copying evalscope/app/utils/localization.py -> build/lib/evalscope/app/utils 2026-04-13T12:59:48,512 copying evalscope/app/utils/data_utils.py -> build/lib/evalscope/app/utils 2026-04-13T12:59:48,515 copying evalscope/app/utils/env_utils.py -> build/lib/evalscope/app/utils 2026-04-13T12:59:48,517 creating build/lib/evalscope/app/ui 2026-04-13T12:59:48,518 copying evalscope/app/ui/visualization.py -> build/lib/evalscope/app/ui 2026-04-13T12:59:48,520 copying evalscope/app/ui/multi_model.py -> build/lib/evalscope/app/ui 2026-04-13T12:59:48,522 copying evalscope/app/ui/sidebar.py -> build/lib/evalscope/app/ui 2026-04-13T12:59:48,524 copying evalscope/app/ui/single_model.py -> build/lib/evalscope/app/ui 2026-04-13T12:59:48,526 copying evalscope/app/ui/app_ui.py -> build/lib/evalscope/app/ui 
2026-04-13T12:59:48,528 copying evalscope/app/ui/__init__.py -> build/lib/evalscope/app/ui 2026-04-13T12:59:48,531 creating build/lib/evalscope/perf/plugin 2026-04-13T12:59:48,531 copying evalscope/perf/plugin/registry.py -> build/lib/evalscope/perf/plugin 2026-04-13T12:59:48,533 copying evalscope/perf/plugin/__init__.py -> build/lib/evalscope/perf/plugin 2026-04-13T12:59:48,536 creating build/lib/evalscope/perf/sla 2026-04-13T12:59:48,536 copying evalscope/perf/sla/sla_criterion.py -> build/lib/evalscope/perf/sla 2026-04-13T12:59:48,538 copying evalscope/perf/sla/sla_run.py -> build/lib/evalscope/perf/sla 2026-04-13T12:59:48,541 copying evalscope/perf/sla/__init__.py -> build/lib/evalscope/perf/sla 2026-04-13T12:59:48,543 creating build/lib/evalscope/perf/utils 2026-04-13T12:59:48,544 copying evalscope/perf/utils/handler.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,546 copying evalscope/perf/utils/db_util.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,548 copying evalscope/perf/utils/local_server.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,550 copying evalscope/perf/utils/benchmark_util.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,553 copying evalscope/perf/utils/analysis_result.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,554 copying evalscope/perf/utils/log_utils.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,556 copying evalscope/perf/utils/rich_display.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,559 copying evalscope/perf/utils/__init__.py -> build/lib/evalscope/perf/utils 2026-04-13T12:59:48,561 creating build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,562 copying evalscope/perf/plugin/datasets/longalpaca.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,563 copying evalscope/perf/plugin/datasets/kontext_bench.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,565 copying evalscope/perf/plugin/datasets/random_dataset.py -> 
build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,567 copying evalscope/perf/plugin/datasets/openqa.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,569 copying evalscope/perf/plugin/datasets/line_by_line.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,570 copying evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,572 copying evalscope/perf/plugin/datasets/rerank_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,574 copying evalscope/perf/plugin/datasets/utils.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,576 copying evalscope/perf/plugin/datasets/flickr8k.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,578 copying evalscope/perf/plugin/datasets/base.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,580 copying evalscope/perf/plugin/datasets/embedding_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,582 copying evalscope/perf/plugin/datasets/speed_benchmark.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,584 copying evalscope/perf/plugin/datasets/custom.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,586 copying evalscope/perf/plugin/datasets/__init__.py -> build/lib/evalscope/perf/plugin/datasets 2026-04-13T12:59:48,588 creating build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,589 copying evalscope/perf/plugin/api/default_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,591 copying evalscope/perf/plugin/api/dashscope_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,593 copying evalscope/perf/plugin/api/openai_rerank_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,595 copying evalscope/perf/plugin/api/openai_embedding_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,598 copying evalscope/perf/plugin/api/custom_api.py -> 
build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,600 copying evalscope/perf/plugin/api/openai_api.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,603 copying evalscope/perf/plugin/api/base.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,604 copying evalscope/perf/plugin/api/__init__.py -> build/lib/evalscope/perf/plugin/api 2026-04-13T12:59:48,607 creating build/lib/evalscope/perf/utils/report 2026-04-13T12:59:48,608 copying evalscope/perf/utils/report/generate_report.py -> build/lib/evalscope/perf/utils/report 2026-04-13T12:59:48,610 copying evalscope/perf/utils/report/perf_data.py -> build/lib/evalscope/perf/utils/report 2026-04-13T12:59:48,612 copying evalscope/perf/utils/report/__init__.py -> build/lib/evalscope/perf/utils/report 2026-04-13T12:59:48,614 copying evalscope/perf/utils/report/perf_charts.py -> build/lib/evalscope/perf/utils/report 2026-04-13T12:59:48,617 creating build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:48,618 copying evalscope/third_party/toolbench_static/infer.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:48,620 copying evalscope/third_party/toolbench_static/toolbench_static.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:48,622 copying evalscope/third_party/toolbench_static/eval.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:48,624 copying evalscope/third_party/toolbench_static/__init__.py -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:48,627 creating build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:48,628 copying evalscope/third_party/longbench_write/infer.py -> build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:48,630 copying evalscope/third_party/longbench_write/utils.py -> build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:48,632 copying evalscope/third_party/longbench_write/eval.py -> build/lib/evalscope/third_party/longbench_write 
2026-04-13T12:59:48,634 copying evalscope/third_party/longbench_write/longbench_write.py -> build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:48,636 copying evalscope/third_party/longbench_write/__init__.py -> build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:48,638 creating build/lib/evalscope/third_party/thinkbench 2026-04-13T12:59:48,639 copying evalscope/third_party/thinkbench/infer.py -> build/lib/evalscope/third_party/thinkbench 2026-04-13T12:59:48,641 copying evalscope/third_party/thinkbench/eval.py -> build/lib/evalscope/third_party/thinkbench 2026-04-13T12:59:48,644 copying evalscope/third_party/thinkbench/__init__.py -> build/lib/evalscope/third_party/thinkbench 2026-04-13T12:59:48,646 creating build/lib/evalscope/third_party/toolbench_static/llm 2026-04-13T12:59:48,647 copying evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-04-13T12:59:48,649 copying evalscope/third_party/toolbench_static/llm/__init__.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-04-13T12:59:48,651 creating build/lib/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:48,652 copying evalscope/third_party/longbench_write/resources/__init__.py -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:48,655 creating build/lib/evalscope/third_party/longbench_write/tools 2026-04-13T12:59:48,656 copying evalscope/third_party/longbench_write/tools/data_etl.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-04-13T12:59:48,658 copying evalscope/third_party/longbench_write/tools/openai_api.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-04-13T12:59:48,660 copying evalscope/third_party/longbench_write/tools/__init__.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-04-13T12:59:48,663 creating build/lib/evalscope/third_party/thinkbench/tools 2026-04-13T12:59:48,663 copying 
evalscope/third_party/thinkbench/tools/utils.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-04-13T12:59:48,665 copying evalscope/third_party/thinkbench/tools/__init__.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-04-13T12:59:48,667 copying evalscope/third_party/thinkbench/tools/llm.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-04-13T12:59:48,669 creating build/lib/evalscope/utils/tqdm_utils 2026-04-13T12:59:48,670 copying evalscope/utils/tqdm_utils/tqdm_logging.py -> build/lib/evalscope/utils/tqdm_utils 2026-04-13T12:59:48,673 copying evalscope/utils/tqdm_utils/progress_tracker.py -> build/lib/evalscope/utils/tqdm_utils 2026-04-13T12:59:48,675 copying evalscope/utils/tqdm_utils/__init__.py -> build/lib/evalscope/utils/tqdm_utils 2026-04-13T12:59:48,677 creating build/lib/evalscope/utils/doc_utils 2026-04-13T12:59:48,678 copying evalscope/utils/doc_utils/generate_dataset_md.py -> build/lib/evalscope/utils/doc_utils 2026-04-13T12:59:48,681 copying evalscope/utils/doc_utils/readme_generator.py -> build/lib/evalscope/utils/doc_utils 2026-04-13T12:59:48,684 copying evalscope/utils/doc_utils/__init__.py -> build/lib/evalscope/utils/doc_utils 2026-04-13T12:59:48,686 copying evalscope/utils/doc_utils/benchmark_stats.py -> build/lib/evalscope/utils/doc_utils 2026-04-13T12:59:48,689 copying evalscope/utils/doc_utils/translate_description.py -> build/lib/evalscope/utils/doc_utils 2026-04-13T12:59:48,693 creating build/lib/evalscope/models/utils 2026-04-13T12:59:48,694 copying evalscope/models/utils/anthropic.py -> build/lib/evalscope/models/utils 2026-04-13T12:59:48,696 copying evalscope/models/utils/openai.py -> build/lib/evalscope/models/utils 2026-04-13T12:59:48,699 creating build/lib/evalscope/api/mixin 2026-04-13T12:59:48,700 copying evalscope/api/mixin/llm_judge_mixin.py -> build/lib/evalscope/api/mixin 2026-04-13T12:59:48,703 copying evalscope/api/mixin/sandbox_mixin.py -> build/lib/evalscope/api/mixin 2026-04-13T12:59:48,705 
copying evalscope/api/mixin/__init__.py -> build/lib/evalscope/api/mixin 2026-04-13T12:59:48,707 creating build/lib/evalscope/api/benchmark 2026-04-13T12:59:48,708 copying evalscope/api/benchmark/meta.py -> build/lib/evalscope/api/benchmark 2026-04-13T12:59:48,711 copying evalscope/api/benchmark/benchmark.py -> build/lib/evalscope/api/benchmark 2026-04-13T12:59:48,713 copying evalscope/api/benchmark/statistics.py -> build/lib/evalscope/api/benchmark 2026-04-13T12:59:48,716 copying evalscope/api/benchmark/__init__.py -> build/lib/evalscope/api/benchmark 2026-04-13T12:59:48,718 creating build/lib/evalscope/api/model 2026-04-13T12:59:48,720 copying evalscope/api/model/model.py -> build/lib/evalscope/api/model 2026-04-13T12:59:48,722 copying evalscope/api/model/generate_config.py -> build/lib/evalscope/api/model 2026-04-13T12:59:48,724 copying evalscope/api/model/__init__.py -> build/lib/evalscope/api/model 2026-04-13T12:59:48,726 copying evalscope/api/model/model_output.py -> build/lib/evalscope/api/model 2026-04-13T12:59:48,729 copying evalscope/api/model/lazy_model.py -> build/lib/evalscope/api/model 2026-04-13T12:59:48,731 creating build/lib/evalscope/api/filter 2026-04-13T12:59:48,732 copying evalscope/api/filter/filter.py -> build/lib/evalscope/api/filter 2026-04-13T12:59:48,734 copying evalscope/api/filter/__init__.py -> build/lib/evalscope/api/filter 2026-04-13T12:59:48,737 creating build/lib/evalscope/api/messages 2026-04-13T12:59:48,738 copying evalscope/api/messages/utils.py -> build/lib/evalscope/api/messages 2026-04-13T12:59:48,740 copying evalscope/api/messages/content.py -> build/lib/evalscope/api/messages 2026-04-13T12:59:48,742 copying evalscope/api/messages/chat_message.py -> build/lib/evalscope/api/messages 2026-04-13T12:59:48,744 copying evalscope/api/messages/__init__.py -> build/lib/evalscope/api/messages 2026-04-13T12:59:48,746 creating build/lib/evalscope/api/evaluator 2026-04-13T12:59:48,747 copying evalscope/api/evaluator/evaluator.py -> 
build/lib/evalscope/api/evaluator 2026-04-13T12:59:48,749 copying evalscope/api/evaluator/cache.py -> build/lib/evalscope/api/evaluator 2026-04-13T12:59:48,752 copying evalscope/api/evaluator/__init__.py -> build/lib/evalscope/api/evaluator 2026-04-13T12:59:48,754 copying evalscope/api/evaluator/state.py -> build/lib/evalscope/api/evaluator 2026-04-13T12:59:48,756 creating build/lib/evalscope/api/dataset 2026-04-13T12:59:48,757 copying evalscope/api/dataset/loader.py -> build/lib/evalscope/api/dataset 2026-04-13T12:59:48,760 copying evalscope/api/dataset/utils.py -> build/lib/evalscope/api/dataset 2026-04-13T12:59:48,762 copying evalscope/api/dataset/dataset.py -> build/lib/evalscope/api/dataset 2026-04-13T12:59:48,765 copying evalscope/api/dataset/__init__.py -> build/lib/evalscope/api/dataset 2026-04-13T12:59:48,767 creating build/lib/evalscope/api/tool 2026-04-13T12:59:48,768 copying evalscope/api/tool/tool_info.py -> build/lib/evalscope/api/tool 2026-04-13T12:59:48,770 copying evalscope/api/tool/utils.py -> build/lib/evalscope/api/tool 2026-04-13T12:59:48,772 copying evalscope/api/tool/tool_call.py -> build/lib/evalscope/api/tool 2026-04-13T12:59:48,774 copying evalscope/api/tool/__init__.py -> build/lib/evalscope/api/tool 2026-04-13T12:59:48,776 creating build/lib/evalscope/api/metric 2026-04-13T12:59:48,777 copying evalscope/api/metric/scorer.py -> build/lib/evalscope/api/metric 2026-04-13T12:59:48,779 copying evalscope/api/metric/metric.py -> build/lib/evalscope/api/metric 2026-04-13T12:59:48,781 copying evalscope/api/metric/__init__.py -> build/lib/evalscope/api/metric 2026-04-13T12:59:48,783 creating build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,784 copying evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,786 copying evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,788 copying 
evalscope/api/benchmark/adapters/ner_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,791 copying evalscope/api/benchmark/adapters/default_data_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,793 copying evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,795 copying evalscope/api/benchmark/adapters/text2image_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,797 copying evalscope/api/benchmark/adapters/agent_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,799 copying evalscope/api/benchmark/adapters/__init__.py -> build/lib/evalscope/api/benchmark/adapters 2026-04-13T12:59:48,801 creating build/lib/evalscope/metrics/sem_score 2026-04-13T12:59:48,802 copying evalscope/metrics/sem_score/scorer.py -> build/lib/evalscope/metrics/sem_score 2026-04-13T12:59:48,804 copying evalscope/metrics/sem_score/__init__.py -> build/lib/evalscope/metrics/sem_score 2026-04-13T12:59:48,806 creating build/lib/evalscope/metrics/bundled_rouge_score 2026-04-13T12:59:48,807 copying evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-04-13T12:59:48,810 copying evalscope/metrics/bundled_rouge_score/__init__.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-04-13T12:59:48,812 creating build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,813 copying evalscope/metrics/t2v_metrics/itmscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,815 copying evalscope/metrics/t2v_metrics/vqascore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,817 copying evalscope/metrics/t2v_metrics/constants.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,818 copying evalscope/metrics/t2v_metrics/score.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,820 copying 
evalscope/metrics/t2v_metrics/clipscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,822 copying evalscope/metrics/t2v_metrics/__init__.py -> build/lib/evalscope/metrics/t2v_metrics 2026-04-13T12:59:48,824 creating build/lib/evalscope/metrics/bert_score 2026-04-13T12:59:48,825 copying evalscope/metrics/bert_score/scorer.py -> build/lib/evalscope/metrics/bert_score 2026-04-13T12:59:48,827 copying evalscope/metrics/bert_score/utils.py -> build/lib/evalscope/metrics/bert_score 2026-04-13T12:59:48,830 copying evalscope/metrics/bert_score/__init__.py -> build/lib/evalscope/metrics/bert_score 2026-04-13T12:59:48,832 creating build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:48,833 copying evalscope/metrics/text_normalizer/english.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:48,836 copying evalscope/metrics/text_normalizer/wer.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:48,838 copying evalscope/metrics/text_normalizer/basic.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:48,840 copying evalscope/metrics/text_normalizer/chinese.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:48,843 copying evalscope/metrics/text_normalizer/__init__.py -> build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:48,845 creating build/lib/evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:48,846 copying evalscope/metrics/t2v_metrics/models/model.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:48,848 copying evalscope/metrics/t2v_metrics/models/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:48,851 copying evalscope/metrics/t2v_metrics/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:48,853 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:48,854 copying evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> 
build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:48,856 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:48,858 copying evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:48,860 copying evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:48,863 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:48,864 copying evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:48,867 copying evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:48,869 copying evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:48,872 copying evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:48,874 copying evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:48,877 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:48,878 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:48,881 copying evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:48,884 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:48,886 copying evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:48,889 copying evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:48,891 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:48,893 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:48,895 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:48,898 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:48,900 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:48,901 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:48,904 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:48,907 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:48,910 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> 
build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:48,912 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-13T12:59:48,913 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-13T12:59:48,916 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-13T12:59:48,917 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-13T12:59:48,920 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-13T12:59:48,921 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-13T12:59:48,924 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-13T12:59:48,925 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-13T12:59:48,928 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-13T12:59:48,930 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-13T12:59:48,932 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-13T12:59:48,935 creating 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-13T12:59:48,936 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-13T12:59:48,940 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,941 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,944 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,947 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,950 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,953 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,956 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:48,958 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,960 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,962 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,964 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,966 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,968 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,970 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,972 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:48,974 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:48,975 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:48,977 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:48,980 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:48,982 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:48,985 creating 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:48,986 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:48,989 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:48,991 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:48,994 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:48,997 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:49,001 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:49,003 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:49,005 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:49,008 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:49,010 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:49,013 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,013 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,016 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,019 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,021 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,024 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,026 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,028 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,030 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,032 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,034 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,037 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:49,039 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:49,040 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:49,043 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:49,045 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:49,047 creating build/lib/evalscope/backend/rag_eval 2026-04-13T12:59:49,049 copying evalscope/backend/rag_eval/__init__.py -> build/lib/evalscope/backend/rag_eval 2026-04-13T12:59:49,051 copying evalscope/backend/rag_eval/backend_manager.py -> build/lib/evalscope/backend/rag_eval 2026-04-13T12:59:49,053 creating build/lib/evalscope/backend/opencompass 2026-04-13T12:59:49,054 copying 
evalscope/backend/opencompass/api_meta_template.py -> build/lib/evalscope/backend/opencompass 2026-04-13T12:59:49,056 copying evalscope/backend/opencompass/__init__.py -> build/lib/evalscope/backend/opencompass 2026-04-13T12:59:49,057 copying evalscope/backend/opencompass/backend_manager.py -> build/lib/evalscope/backend/opencompass 2026-04-13T12:59:49,060 creating build/lib/evalscope/backend/vlm_eval_kit 2026-04-13T12:59:49,061 copying evalscope/backend/vlm_eval_kit/__init__.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-04-13T12:59:49,063 copying evalscope/backend/vlm_eval_kit/backend_manager.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-04-13T12:59:49,065 creating build/lib/evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:49,066 copying evalscope/backend/rag_eval/cmteb/arguments.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:49,068 copying evalscope/backend/rag_eval/cmteb/task_template.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:49,070 copying evalscope/backend/rag_eval/cmteb/base.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:49,072 copying evalscope/backend/rag_eval/cmteb/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:49,074 creating build/lib/evalscope/backend/rag_eval/ragas 2026-04-13T12:59:49,075 copying evalscope/backend/rag_eval/ragas/arguments.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-04-13T12:59:49,077 copying evalscope/backend/rag_eval/ragas/task_template.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-04-13T12:59:49,079 copying evalscope/backend/rag_eval/ragas/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-04-13T12:59:49,081 creating build/lib/evalscope/backend/rag_eval/utils 2026-04-13T12:59:49,082 copying evalscope/backend/rag_eval/utils/clip.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-13T12:59:49,085 copying evalscope/backend/rag_eval/utils/tools.py -> build/lib/evalscope/backend/rag_eval/utils 
2026-04-13T12:59:49,087 copying evalscope/backend/rag_eval/utils/embedding.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-13T12:59:49,090 copying evalscope/backend/rag_eval/utils/__init__.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-13T12:59:49,091 copying evalscope/backend/rag_eval/utils/llm.py -> build/lib/evalscope/backend/rag_eval/utils 2026-04-13T12:59:49,094 creating build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:49,095 copying evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:49,097 copying evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:49,100 copying evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:49,102 copying evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:49,104 creating build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,105 copying evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,108 copying evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,111 copying evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,113 copying evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,115 copying evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,117 copying evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,119 copying 
evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,121 copying evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:49,124 creating build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:49,125 copying evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:49,128 copying evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:49,129 copying evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:49,131 copying evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:49,133 copying evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:49,135 creating build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-04-13T12:59:49,136 copying evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-04-13T12:59:49,139 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:49,140 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:49,142 copying evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:49,144 copying evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:49,145 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 
2026-04-13T12:59:49,148 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-13T12:59:49,149 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-13T12:59:49,152 creating build/lib/evalscope/backend/opencompass/tasks 2026-04-13T12:59:49,153 copying evalscope/backend/opencompass/tasks/eval_datasets.py -> build/lib/evalscope/backend/opencompass/tasks 2026-04-13T12:59:49,156 copying evalscope/backend/opencompass/tasks/__init__.py -> build/lib/evalscope/backend/opencompass/tasks 2026-04-13T12:59:49,157 copying evalscope/backend/opencompass/tasks/eval_api.py -> build/lib/evalscope/backend/opencompass/tasks 2026-04-13T12:59:49,161 running egg_info 2026-04-13T12:59:49,171 writing evalscope.egg-info/PKG-INFO 2026-04-13T12:59:49,198 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-04-13T12:59:49,200 writing entry points to evalscope.egg-info/entry_points.txt 2026-04-13T12:59:49,214 writing requirements to evalscope.egg-info/requires.txt 2026-04-13T12:59:49,215 writing top-level names to evalscope.egg-info/top_level.txt 2026-04-13T12:59:49,403 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:49,453 reading manifest template 'MANIFEST.in' 2026-04-13T12:59:49,879 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-04-13T12:59:49,884 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-04-13T12:59:49,890 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-04-13T12:59:49,895 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-04-13T12:59:49,896 adding license file 'LICENSE' 2026-04-13T12:59:49,951 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-04-13T12:59:50,067 copying evalscope/benchmarks/wmt/requirements.txt -> 
build/lib/evalscope/benchmarks/wmt 2026-04-13T12:59:50,069 copying evalscope/benchmarks/terminal_bench/requirements.txt -> build/lib/evalscope/benchmarks/terminal_bench 2026-04-13T12:59:50,071 copying evalscope/benchmarks/bfcl/requirements.txt -> build/lib/evalscope/benchmarks/bfcl 2026-04-13T12:59:50,073 copying evalscope/benchmarks/arena_hard/requirements.txt -> build/lib/evalscope/benchmarks/arena_hard 2026-04-13T12:59:50,075 copying evalscope/benchmarks/ifeval/requirements.txt -> build/lib/evalscope/benchmarks/ifeval 2026-04-13T12:59:50,077 copying evalscope/benchmarks/openai_mrcr/requirements.txt -> build/lib/evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:50,079 copying evalscope/benchmarks/ifbench/requirements.txt -> build/lib/evalscope/benchmarks/ifbench 2026-04-13T12:59:50,081 copying evalscope/benchmarks/swe_bench/requirements.txt -> build/lib/evalscope/benchmarks/swe_bench 2026-04-13T12:59:50,083 copying evalscope/benchmarks/trivia_qa/samples.jsonl -> build/lib/evalscope/benchmarks/trivia_qa 2026-04-13T12:59:50,085 copying evalscope/benchmarks/olympiad_bench/requirements.txt -> build/lib/evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:50,087 creating build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,088 copying evalscope/benchmarks/_meta/a_okvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,091 copying evalscope/benchmarks/_meta/aa_lcr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,094 copying evalscope/benchmarks/_meta/ai2d.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,097 copying evalscope/benchmarks/_meta/aime24.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,099 copying evalscope/benchmarks/_meta/aime25.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,102 copying evalscope/benchmarks/_meta/aime26.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,105 copying evalscope/benchmarks/_meta/alpaca_eval.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,108 copying evalscope/benchmarks/_meta/amc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,111 copying evalscope/benchmarks/_meta/anat_em.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,114 copying evalscope/benchmarks/_meta/arc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,116 copying evalscope/benchmarks/_meta/arena_hard.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,119 copying evalscope/benchmarks/_meta/bbh.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,122 copying evalscope/benchmarks/_meta/bc2gm.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,125 copying evalscope/benchmarks/_meta/bc4chemd.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,128 copying evalscope/benchmarks/_meta/bc5cdr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,131 copying evalscope/benchmarks/_meta/bfcl_v3.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,134 copying evalscope/benchmarks/_meta/bfcl_v4.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,137 copying evalscope/benchmarks/_meta/biomix_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,140 copying evalscope/benchmarks/_meta/blink.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,143 copying evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,146 copying evalscope/benchmarks/_meta/cc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,148 copying evalscope/benchmarks/_meta/ceval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,151 copying evalscope/benchmarks/_meta/chartqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,154 copying evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,157 copying 
evalscope/benchmarks/_meta/cl_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,160 copying evalscope/benchmarks/_meta/cmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,163 copying evalscope/benchmarks/_meta/cmmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,166 copying evalscope/benchmarks/_meta/cmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,169 copying evalscope/benchmarks/_meta/coin_flip.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,172 copying evalscope/benchmarks/_meta/commonsense_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,174 copying evalscope/benchmarks/_meta/competition_math.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,177 copying evalscope/benchmarks/_meta/conll2003.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,180 copying evalscope/benchmarks/_meta/conllpp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,183 copying evalscope/benchmarks/_meta/copious.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,186 copying evalscope/benchmarks/_meta/cross_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,189 copying evalscope/benchmarks/_meta/data_collection.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,191 copying evalscope/benchmarks/_meta/docmath.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,194 copying evalscope/benchmarks/_meta/docvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,197 copying evalscope/benchmarks/_meta/drivel_binary.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,200 copying evalscope/benchmarks/_meta/drivel_multilabel.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,202 copying evalscope/benchmarks/_meta/drivel_selection.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,205 copying evalscope/benchmarks/_meta/drivel_writing.json 
-> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,208 copying evalscope/benchmarks/_meta/drop.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,210 copying evalscope/benchmarks/_meta/eq_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,213 copying evalscope/benchmarks/_meta/evalmuse.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,216 copying evalscope/benchmarks/_meta/fin_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,219 copying evalscope/benchmarks/_meta/fleurs.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,221 copying evalscope/benchmarks/_meta/frames.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,224 copying evalscope/benchmarks/_meta/gedit.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,227 copying evalscope/benchmarks/_meta/genai_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,230 copying evalscope/benchmarks/_meta/general_arena.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,233 copying evalscope/benchmarks/_meta/general_fc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,236 copying evalscope/benchmarks/_meta/general_mcq.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,239 copying evalscope/benchmarks/_meta/general_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,242 copying evalscope/benchmarks/_meta/general_t2i.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,245 copying evalscope/benchmarks/_meta/general_vmcq.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,248 copying evalscope/benchmarks/_meta/general_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,250 copying evalscope/benchmarks/_meta/genia_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,254 copying evalscope/benchmarks/_meta/gpqa_diamond.json -> build/lib/evalscope/benchmarks/_meta 
2026-04-13T12:59:50,257 copying evalscope/benchmarks/_meta/gsm8k.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,259 copying evalscope/benchmarks/_meta/gsm8k_v.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,262 copying evalscope/benchmarks/_meta/hallusion_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,265 copying evalscope/benchmarks/_meta/halueval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,268 copying evalscope/benchmarks/_meta/harvey_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,271 copying evalscope/benchmarks/_meta/health_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,274 copying evalscope/benchmarks/_meta/hellaswag.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,276 copying evalscope/benchmarks/_meta/hle.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,279 copying evalscope/benchmarks/_meta/hmmt25.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,282 copying evalscope/benchmarks/_meta/hpdv2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,284 copying evalscope/benchmarks/_meta/humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,287 copying evalscope/benchmarks/_meta/humaneval_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,290 copying evalscope/benchmarks/_meta/ifbench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,293 copying evalscope/benchmarks/_meta/ifeval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,296 copying evalscope/benchmarks/_meta/infovqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,299 copying evalscope/benchmarks/_meta/iquiz.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,301 copying evalscope/benchmarks/_meta/jnlpba.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,304 copying evalscope/benchmarks/_meta/jnlpba_rare.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,306 copying evalscope/benchmarks/_meta/librispeech.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,309 copying evalscope/benchmarks/_meta/live_code_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,312 copying evalscope/benchmarks/_meta/logi_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,314 copying evalscope/benchmarks/_meta/longbench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,317 copying evalscope/benchmarks/_meta/maritime_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,320 copying evalscope/benchmarks/_meta/math_500.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,322 copying evalscope/benchmarks/_meta/math_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,325 copying evalscope/benchmarks/_meta/math_verse.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,328 copying evalscope/benchmarks/_meta/math_vision.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,331 copying evalscope/benchmarks/_meta/math_vista.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,333 copying evalscope/benchmarks/_meta/mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,336 copying evalscope/benchmarks/_meta/mbpp_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,338 copying evalscope/benchmarks/_meta/med_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,341 copying evalscope/benchmarks/_meta/mgsm.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,343 copying evalscope/benchmarks/_meta/mia_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,346 copying evalscope/benchmarks/_meta/micro_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,349 copying evalscope/benchmarks/_meta/minerva_math.json -> build/lib/evalscope/benchmarks/_meta 
2026-04-13T12:59:50,352 copying evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,354 copying evalscope/benchmarks/_meta/mit_restaurant.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,357 copying evalscope/benchmarks/_meta/mm_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,360 copying evalscope/benchmarks/_meta/mm_star.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,363 copying evalscope/benchmarks/_meta/mmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,366 copying evalscope/benchmarks/_meta/mmlu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,368 copying evalscope/benchmarks/_meta/mmlu_redux.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,372 copying evalscope/benchmarks/_meta/mmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,374 copying evalscope/benchmarks/_meta/mmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,377 copying evalscope/benchmarks/_meta/mmmu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,380 copying evalscope/benchmarks/_meta/mri_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,383 copying evalscope/benchmarks/_meta/multi_if.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,386 copying evalscope/benchmarks/_meta/multi_nerd.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,389 copying evalscope/benchmarks/_meta/multiple_humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,392 copying evalscope/benchmarks/_meta/multiple_mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,394 copying evalscope/benchmarks/_meta/music_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,397 copying evalscope/benchmarks/_meta/musr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,399 copying 
evalscope/benchmarks/_meta/ncbi.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,402 copying evalscope/benchmarks/_meta/needle_haystack.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,405 copying evalscope/benchmarks/_meta/ocr_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,407 copying evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,411 copying evalscope/benchmarks/_meta/olympiad_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,413 copying evalscope/benchmarks/_meta/omni_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,416 copying evalscope/benchmarks/_meta/omni_doc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,420 copying evalscope/benchmarks/_meta/ontonotes5.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,423 copying evalscope/benchmarks/_meta/openai_mrcr.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,426 copying evalscope/benchmarks/_meta/piqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,429 copying evalscope/benchmarks/_meta/poly_math.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,432 copying evalscope/benchmarks/_meta/pope.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,435 copying evalscope/benchmarks/_meta/process_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,437 copying evalscope/benchmarks/_meta/pubmedqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,440 copying evalscope/benchmarks/_meta/qasc.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,442 copying evalscope/benchmarks/_meta/race.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,445 copying evalscope/benchmarks/_meta/real_world_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,447 copying evalscope/benchmarks/_meta/refcoco.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,450 copying evalscope/benchmarks/_meta/scicode.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,454 copying evalscope/benchmarks/_meta/science_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,457 copying evalscope/benchmarks/_meta/sciq.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,460 copying evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,462 copying evalscope/benchmarks/_meta/simple_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,465 copying evalscope/benchmarks/_meta/simple_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,468 copying evalscope/benchmarks/_meta/siqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,470 copying evalscope/benchmarks/_meta/super_gpqa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,473 copying evalscope/benchmarks/_meta/swe_bench_lite.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,476 copying evalscope/benchmarks/_meta/swe_bench_verified.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,478 copying evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,481 copying evalscope/benchmarks/_meta/tau2_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,484 copying evalscope/benchmarks/_meta/tau_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,486 copying evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,489 copying evalscope/benchmarks/_meta/tifa160.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,492 copying evalscope/benchmarks/_meta/tool_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,495 copying evalscope/benchmarks/_meta/torgo.json -> 
build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,498 copying evalscope/benchmarks/_meta/trivia_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,501 copying evalscope/benchmarks/_meta/truthful_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,503 copying evalscope/benchmarks/_meta/tweebank_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,506 copying evalscope/benchmarks/_meta/tweet_ner_7.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,509 copying evalscope/benchmarks/_meta/visulogic.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,512 copying evalscope/benchmarks/_meta/vstar_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,515 copying evalscope/benchmarks/_meta/winogrande.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,517 copying evalscope/benchmarks/_meta/wmt24pp.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,520 copying evalscope/benchmarks/_meta/wnut2017.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,523 copying evalscope/benchmarks/_meta/zebralogicbench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,526 copying evalscope/benchmarks/_meta/zerobench.json -> build/lib/evalscope/benchmarks/_meta 2026-04-13T12:59:50,529 copying evalscope/benchmarks/refcoco/requirements.txt -> build/lib/evalscope/benchmarks/refcoco 2026-04-13T12:59:50,531 copying evalscope/benchmarks/needle_haystack/requirements.txt -> build/lib/evalscope/benchmarks/needle_haystack 2026-04-13T12:59:50,533 copying evalscope/benchmarks/ocr_bench/requirements.txt -> build/lib/evalscope/benchmarks/ocr_bench 2026-04-13T12:59:50,535 copying evalscope/benchmarks/torgo/requirements.txt -> build/lib/evalscope/benchmarks/torgo 2026-04-13T12:59:50,537 copying evalscope/benchmarks/omnidoc_bench/requirements.txt -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:50,540 copying 
evalscope/benchmarks/general_arena/requirements.txt -> build/lib/evalscope/benchmarks/general_arena 2026-04-13T12:59:50,542 copying evalscope/benchmarks/multi_if/requirements.txt -> build/lib/evalscope/benchmarks/multi_if 2026-04-13T12:59:50,544 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:50,546 creating build/lib/evalscope/benchmarks/humanevalplus/docker 2026-04-13T12:59:50,547 copying evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/lib/evalscope/benchmarks/humanevalplus/docker 2026-04-13T12:59:50,549 copying evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:50,552 copying evalscope/benchmarks/tau_bench/tau_bench/requirements.txt -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:50,554 copying evalscope/benchmarks/scicode/docker/Dockerfile -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-13T12:59:50,556 copying evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/lib/evalscope/benchmarks/scicode/docker 2026-04-13T12:59:50,558 creating build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,559 copying evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,561 copying evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,563 copying evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,566 copying evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,568 copying evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,570 copying 
evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,573 copying evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,575 copying evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,578 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,580 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,582 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,584 copying evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,587 copying evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,589 copying evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,591 copying evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,593 copying evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,596 copying evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,598 copying evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,600 copying evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 
2026-04-13T12:59:50,603 copying evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,605 copying evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,607 copying evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,610 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,612 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,614 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,616 copying evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,619 copying evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:50,621 copying evalscope/third_party/toolbench_static/README.md -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:50,623 copying evalscope/third_party/toolbench_static/config_default.json -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:50,625 copying evalscope/third_party/toolbench_static/config_default.yaml -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:50,627 copying evalscope/third_party/toolbench_static/requirements.txt -> build/lib/evalscope/third_party/toolbench_static 2026-04-13T12:59:50,629 copying evalscope/third_party/longbench_write/README.md -> build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:50,632 copying evalscope/third_party/longbench_write/default_task.json -> 
build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:50,634 copying evalscope/third_party/longbench_write/default_task.yaml -> build/lib/evalscope/third_party/longbench_write 2026-04-13T12:59:50,636 copying evalscope/third_party/longbench_write/resources/judge.txt -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:50,638 copying evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:50,642 copying evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:50,645 copying evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:50,648 creating build/lib/evalscope/third_party/thinkbench/resources 2026-04-13T12:59:50,649 copying evalscope/third_party/thinkbench/resources/critique_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-04-13T12:59:50,651 copying evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-04-13T12:59:50,653 creating build/lib/evalscope/report/template 2026-04-13T12:59:50,654 copying evalscope/report/template/perf_report.html.j2 -> build/lib/evalscope/report/template 2026-04-13T12:59:50,657 copying evalscope/report/template/report.html.j2 -> build/lib/evalscope/report/template 2026-04-13T12:59:50,660 creating build/lib/evalscope/report/template/css 2026-04-13T12:59:50,661 copying evalscope/report/template/css/base.css -> build/lib/evalscope/report/template/css 2026-04-13T12:59:50,664 copying evalscope/report/template/css/perf_extra.css -> build/lib/evalscope/report/template/css 2026-04-13T12:59:50,666 creating build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,667 copying evalscope/report/template/partials/brand_logo.html -> 
build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,670 copying evalscope/report/template/partials/footer.html -> build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,672 copying evalscope/report/template/partials/header_eval.html -> build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,674 copying evalscope/report/template/partials/header_perf.html -> build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,677 copying evalscope/report/template/partials/toc_eval.html -> build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,679 copying evalscope/report/template/partials/toc_perf.html -> build/lib/evalscope/report/template/partials 2026-04-13T12:59:50,681 creating build/lib/evalscope/report/template/js 2026-04-13T12:59:50,682 copying evalscope/report/template/js/eval_extra.js -> build/lib/evalscope/report/template/js 2026-04-13T12:59:50,684 copying evalscope/report/template/js/i18n_eval.js -> build/lib/evalscope/report/template/js 2026-04-13T12:59:50,687 copying evalscope/report/template/js/i18n_perf.js -> build/lib/evalscope/report/template/js 2026-04-13T12:59:50,690 copying evalscope/report/template/js/perf_extra.js -> build/lib/evalscope/report/template/js 2026-04-13T12:59:50,692 copying evalscope/report/template/js/shared.js -> build/lib/evalscope/report/template/js 2026-04-13T12:59:50,694 copying evalscope/metrics/text_normalizer/english.json -> build/lib/evalscope/metrics/text_normalizer 2026-04-13T12:59:50,697 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-13T12:59:50,698 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-13T12:59:50,701 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:50,702 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:50,704 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:50,706 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:50,709 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,710 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,712 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,714 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,717 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,719 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,721 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,723 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,726 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,728 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,730 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,732 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,735 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,737 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,740 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,742 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,744 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,746 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,749 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,751 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:50,753 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-13T12:59:50,852 installing to build/bdist.linux-armv7l/wheel 2026-04-13T12:59:50,853 running install 2026-04-13T12:59:50,877 running install_lib 2026-04-13T12:59:50,882 creating build/bdist.linux-armv7l/wheel 2026-04-13T12:59:50,885 creating build/bdist.linux-armv7l/wheel/evalscope 2026-04-13T12:59:50,886 creating build/bdist.linux-armv7l/wheel/evalscope/collections 2026-04-13T12:59:50,888 copying build/lib/evalscope/collections/sampler.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-04-13T12:59:50,890 copying build/lib/evalscope/collections/schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-04-13T12:59:50,893 copying build/lib/evalscope/collections/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-04-13T12:59:50,898 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks 2026-04-13T12:59:50,899 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_pro 2026-04-13T12:59:50,901 copying build/lib/evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-04-13T12:59:50,903 copying build/lib/evalscope/benchmarks/mmlu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-04-13T12:59:50,905 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/musr 2026-04-13T12:59:50,906 copying build/lib/evalscope/benchmarks/musr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-04-13T12:59:50,908 copying build/lib/evalscope/benchmarks/musr/musr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-04-13T12:59:50,911 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/wmt 2026-04-13T12:59:50,912 copying build/lib/evalscope/benchmarks/wmt/wmt24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-04-13T12:59:50,915 copying build/lib/evalscope/benchmarks/wmt/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-04-13T12:59:50,916 copying build/lib/evalscope/benchmarks/wmt/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-04-13T12:59:50,919 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/terminal_bench 2026-04-13T12:59:50,920 copying build/lib/evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 
2026-04-13T12:59:50,922 copying build/lib/evalscope/benchmarks/terminal_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-13T12:59:50,924 copying build/lib/evalscope/benchmarks/terminal_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-13T12:59:50,926 copying build/lib/evalscope/benchmarks/terminal_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-04-13T12:59:50,928 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/librispeech 2026-04-13T12:59:50,929 copying build/lib/evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-04-13T12:59:50,931 copying build/lib/evalscope/benchmarks/librispeech/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-04-13T12:59:50,934 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/super_gpqa 2026-04-13T12:59:50,935 copying build/lib/evalscope/benchmarks/super_gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-13T12:59:50,937 copying build/lib/evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-13T12:59:50,940 copying build/lib/evalscope/benchmarks/super_gpqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-13T12:59:50,942 copying build/lib/evalscope/benchmarks/super_gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-04-13T12:59:50,944 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl 2026-04-13T12:59:50,946 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:50,947 copying build/lib/evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:50,950 copying 
build/lib/evalscope/benchmarks/bfcl/v4/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:50,953 copying build/lib/evalscope/benchmarks/bfcl/v4/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-04-13T12:59:50,955 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:50,956 copying build/lib/evalscope/benchmarks/bfcl/v3/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:50,959 copying build/lib/evalscope/benchmarks/bfcl/v3/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:50,961 copying build/lib/evalscope/benchmarks/bfcl/v3/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:50,962 copying build/lib/evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-04-13T12:59:50,965 copying build/lib/evalscope/benchmarks/bfcl/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-04-13T12:59:50,967 copying build/lib/evalscope/benchmarks/bfcl/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-04-13T12:59:50,970 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hmmt25 2026-04-13T12:59:50,971 copying build/lib/evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hmmt25 2026-04-13T12:59:50,974 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/halu_eval 2026-04-13T12:59:50,975 copying build/lib/evalscope/benchmarks/halu_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-04-13T12:59:50,976 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-04-13T12:59:50,979 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-04-13T12:59:50,982 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/science_qa 2026-04-13T12:59:50,983 copying build/lib/evalscope/benchmarks/science_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-04-13T12:59:50,985 copying build/lib/evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-04-13T12:59:50,987 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/text2image 2026-04-13T12:59:50,988 copying build/lib/evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-13T12:59:50,991 copying build/lib/evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-13T12:59:50,993 copying build/lib/evalscope/benchmarks/text2image/tifa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-13T12:59:50,995 copying build/lib/evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-13T12:59:50,997 copying build/lib/evalscope/benchmarks/text2image/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-13T12:59:50,998 copying build/lib/evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-04-13T12:59:51,001 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/healthbench 2026-04-13T12:59:51,002 copying build/lib/evalscope/benchmarks/healthbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-04-13T12:59:51,004 copying build/lib/evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-04-13T12:59:51,007 copying 
build/lib/evalscope/benchmarks/healthbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-04-13T12:59:51,009 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/competition_math 2026-04-13T12:59:51,011 copying build/lib/evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-04-13T12:59:51,013 copying build/lib/evalscope/benchmarks/competition_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-04-13T12:59:51,015 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/real_world_qa 2026-04-13T12:59:51,017 copying build/lib/evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-04-13T12:59:51,019 copying build/lib/evalscope/benchmarks/real_world_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-04-13T12:59:51,021 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/logi_qa 2026-04-13T12:59:51,022 copying build/lib/evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-04-13T12:59:51,024 copying build/lib/evalscope/benchmarks/logi_qa/__int__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-04-13T12:59:51,027 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/a_okvqa 2026-04-13T12:59:51,028 copying build/lib/evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-04-13T12:59:51,030 copying build/lib/evalscope/benchmarks/a_okvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-04-13T12:59:51,032 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/siqa 2026-04-13T12:59:51,034 copying build/lib/evalscope/benchmarks/siqa/siqa_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-04-13T12:59:51,036 copying build/lib/evalscope/benchmarks/siqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-04-13T12:59:51,038 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arena_hard 2026-04-13T12:59:51,039 copying build/lib/evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-13T12:59:51,042 copying build/lib/evalscope/benchmarks/arena_hard/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-13T12:59:51,045 copying build/lib/evalscope/benchmarks/arena_hard/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-13T12:59:51,046 copying build/lib/evalscope/benchmarks/arena_hard/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-04-13T12:59:51,049 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbpp 2026-04-13T12:59:51,051 copying build/lib/evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-04-13T12:59:51,054 copying build/lib/evalscope/benchmarks/mbpp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-04-13T12:59:51,056 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifeval 2026-04-13T12:59:51,057 copying build/lib/evalscope/benchmarks/ifeval/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-13T12:59:51,061 copying build/lib/evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-13T12:59:51,063 copying build/lib/evalscope/benchmarks/ifeval/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-13T12:59:51,066 copying build/lib/evalscope/benchmarks/ifeval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 
2026-04-13T12:59:51,069 copying build/lib/evalscope/benchmarks/ifeval/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-13T12:59:51,071 copying build/lib/evalscope/benchmarks/ifeval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-13T12:59:51,073 copying build/lib/evalscope/benchmarks/ifeval/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-04-13T12:59:51,075 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/race 2026-04-13T12:59:51,076 copying build/lib/evalscope/benchmarks/race/race_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-04-13T12:59:51,079 copying build/lib/evalscope/benchmarks/race/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-04-13T12:59:51,081 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ceval 2026-04-13T12:59:51,082 copying build/lib/evalscope/benchmarks/ceval/ceval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-04-13T12:59:51,085 copying build/lib/evalscope/benchmarks/ceval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-04-13T12:59:51,088 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/qasc 2026-04-13T12:59:51,089 copying build/lib/evalscope/benchmarks/qasc/qasc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-04-13T12:59:51,091 copying build/lib/evalscope/benchmarks/qasc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-04-13T12:59:51,093 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vista 2026-04-13T12:59:51,094 copying build/lib/evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-04-13T12:59:51,096 copying build/lib/evalscope/benchmarks/math_vista/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-04-13T12:59:51,098 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:51,099 copying build/lib/evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:51,102 copying build/lib/evalscope/benchmarks/openai_mrcr/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:51,104 copying build/lib/evalscope/benchmarks/openai_mrcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:51,105 copying build/lib/evalscope/benchmarks/openai_mrcr/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-04-13T12:59:51,108 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/eq_bench 2026-04-13T12:59:51,109 copying build/lib/evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-04-13T12:59:51,111 copying build/lib/evalscope/benchmarks/eq_bench/answer_validation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-04-13T12:59:51,114 copying build/lib/evalscope/benchmarks/eq_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-04-13T12:59:51,116 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arc 2026-04-13T12:59:51,117 copying build/lib/evalscope/benchmarks/arc/arc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-04-13T12:59:51,120 copying build/lib/evalscope/benchmarks/arc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-04-13T12:59:51,122 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/process_bench 2026-04-13T12:59:51,123 copying build/lib/evalscope/benchmarks/process_bench/process_bench_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-04-13T12:59:51,125 copying build/lib/evalscope/benchmarks/process_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-04-13T12:59:51,128 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vision 2026-04-13T12:59:51,129 copying build/lib/evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-04-13T12:59:51,131 copying build/lib/evalscope/benchmarks/math_vision/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-04-13T12:59:51,134 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifbench 2026-04-13T12:59:51,135 copying build/lib/evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,137 copying build/lib/evalscope/benchmarks/ifbench/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,140 copying build/lib/evalscope/benchmarks/ifbench/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,143 copying build/lib/evalscope/benchmarks/ifbench/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,146 copying build/lib/evalscope/benchmarks/ifbench/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,148 copying build/lib/evalscope/benchmarks/ifbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,149 copying build/lib/evalscope/benchmarks/ifbench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-04-13T12:59:51,151 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/micro_vqa 2026-04-13T12:59:51,152 copying 
build/lib/evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-04-13T12:59:51,154 copying build/lib/evalscope/benchmarks/micro_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-04-13T12:59:51,157 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu 2026-04-13T12:59:51,158 copying build/lib/evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-04-13T12:59:51,160 copying build/lib/evalscope/benchmarks/mmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-04-13T12:59:51,163 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/maritime_bench 2026-04-13T12:59:51,165 copying build/lib/evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-04-13T12:59:51,167 copying build/lib/evalscope/benchmarks/maritime_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-04-13T12:59:51,169 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_500 2026-04-13T12:59:51,170 copying build/lib/evalscope/benchmarks/math_500/math_500_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-04-13T12:59:51,173 copying build/lib/evalscope/benchmarks/math_500/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-04-13T12:59:51,175 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_vqa 2026-04-13T12:59:51,176 copying build/lib/evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-04-13T12:59:51,179 copying build/lib/evalscope/benchmarks/simple_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-04-13T12:59:51,181 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_fc 2026-04-13T12:59:51,182 copying build/lib/evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-04-13T12:59:51,185 copying build/lib/evalscope/benchmarks/general_fc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-04-13T12:59:51,187 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbppplus 2026-04-13T12:59:51,188 copying build/lib/evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-04-13T12:59:51,191 copying build/lib/evalscope/benchmarks/mbppplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-04-13T12:59:51,193 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omni_bench 2026-04-13T12:59:51,194 copying build/lib/evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-04-13T12:59:51,196 copying build/lib/evalscope/benchmarks/omni_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-04-13T12:59:51,198 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/piqa 2026-04-13T12:59:51,200 copying build/lib/evalscope/benchmarks/piqa/piqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-04-13T12:59:51,202 copying build/lib/evalscope/benchmarks/piqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-04-13T12:59:51,204 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hellaswag 2026-04-13T12:59:51,205 copying build/lib/evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-04-13T12:59:51,207 copying build/lib/evalscope/benchmarks/hellaswag/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 
2026-04-13T12:59:51,210 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench 2026-04-13T12:59:51,211 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-13T12:59:51,213 copying build/lib/evalscope/benchmarks/swe_bench/build_images.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-13T12:59:51,215 copying build/lib/evalscope/benchmarks/swe_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-13T12:59:51,218 copying build/lib/evalscope/benchmarks/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-13T12:59:51,219 copying build/lib/evalscope/benchmarks/swe_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-04-13T12:59:51,221 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drivelology 2026-04-13T12:59:51,222 copying build/lib/evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-13T12:59:51,225 copying build/lib/evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-13T12:59:51,227 copying build/lib/evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-13T12:59:51,229 copying build/lib/evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-13T12:59:51,232 copying build/lib/evalscope/benchmarks/drivelology/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-04-13T12:59:51,234 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/alpaca_eval 2026-04-13T12:59:51,235 copying 
build/lib/evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-04-13T12:59:51,237 copying build/lib/evalscope/benchmarks/alpaca_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-04-13T12:59:51,239 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_redux 2026-04-13T12:59:51,240 copying build/lib/evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-04-13T12:59:51,243 copying build/lib/evalscope/benchmarks/mmlu_redux/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-04-13T12:59:51,245 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docvqa 2026-04-13T12:59:51,246 copying build/lib/evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-04-13T12:59:51,249 copying build/lib/evalscope/benchmarks/docvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-04-13T12:59:51,251 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmlu 2026-04-13T12:59:51,252 copying build/lib/evalscope/benchmarks/mmmlu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-04-13T12:59:51,255 copying build/lib/evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-04-13T12:59:51,257 copying build/lib/evalscope/benchmarks/mmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-04-13T12:59:51,259 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/trivia_qa 2026-04-13T12:59:51,260 copying build/lib/evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-04-13T12:59:51,262 copying build/lib/evalscope/benchmarks/trivia_qa/samples.jsonl -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-04-13T12:59:51,264 copying build/lib/evalscope/benchmarks/trivia_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-04-13T12:59:51,267 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aime 2026-04-13T12:59:51,268 copying build/lib/evalscope/benchmarks/aime/grader.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-13T12:59:51,270 copying build/lib/evalscope/benchmarks/aime/aime_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-13T12:59:51,273 copying build/lib/evalscope/benchmarks/aime/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-13T12:59:51,274 copying build/lib/evalscope/benchmarks/aime/math_normalize.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-04-13T12:59:51,277 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/truthful_qa 2026-04-13T12:59:51,278 copying build/lib/evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-04-13T12:59:51,280 copying build/lib/evalscope/benchmarks/truthful_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-04-13T12:59:51,283 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_mcq 2026-04-13T12:59:51,284 copying build/lib/evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-04-13T12:59:51,286 copying build/lib/evalscope/benchmarks/general_mcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-04-13T12:59:51,288 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chartqa 2026-04-13T12:59:51,289 copying build/lib/evalscope/benchmarks/chartqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-04-13T12:59:51,292 copying 
build/lib/evalscope/benchmarks/chartqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-04-13T12:59:51,293 copying build/lib/evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-04-13T12:59:51,296 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_qa 2026-04-13T12:59:51,297 copying build/lib/evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-04-13T12:59:51,299 copying build/lib/evalscope/benchmarks/general_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-04-13T12:59:51,302 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:51,303 copying build/lib/evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:51,305 copying build/lib/evalscope/benchmarks/olympiad_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:51,308 copying build/lib/evalscope/benchmarks/olympiad_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:51,310 copying build/lib/evalscope/benchmarks/olympiad_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-04-13T12:59:51,312 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_qa 2026-04-13T12:59:51,313 copying build/lib/evalscope/benchmarks/math_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-04-13T12:59:51,315 copying build/lib/evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-04-13T12:59:51,317 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chinese_simple_qa 2026-04-13T12:59:51,318 copying 
build/lib/evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-04-13T12:59:51,320 copying build/lib/evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-04-13T12:59:51,322 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/med_mcqa 2026-04-13T12:59:51,323 copying build/lib/evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-04-13T12:59:51,326 copying build/lib/evalscope/benchmarks/med_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-04-13T12:59:51,328 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu_pro 2026-04-13T12:59:51,329 copying build/lib/evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-04-13T12:59:51,331 copying build/lib/evalscope/benchmarks/mmmu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-04-13T12:59:51,334 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hallusion_bench 2026-04-13T12:59:51,335 copying build/lib/evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-04-13T12:59:51,337 copying build/lib/evalscope/benchmarks/hallusion_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-04-13T12:59:51,340 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k 2026-04-13T12:59:51,341 copying build/lib/evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-04-13T12:59:51,343 copying build/lib/evalscope/benchmarks/gsm8k/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-04-13T12:59:51,345 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/iquiz 2026-04-13T12:59:51,346 copying build/lib/evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-04-13T12:59:51,348 copying build/lib/evalscope/benchmarks/iquiz/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-04-13T12:59:51,353 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/_meta 2026-04-13T12:59:51,354 copying build/lib/evalscope/benchmarks/_meta/aime24.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,357 copying build/lib/evalscope/benchmarks/_meta/process_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,360 copying build/lib/evalscope/benchmarks/_meta/docvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,362 copying build/lib/evalscope/benchmarks/_meta/wmt24pp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,365 copying build/lib/evalscope/benchmarks/_meta/general_fc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,369 copying build/lib/evalscope/benchmarks/_meta/visulogic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,372 copying build/lib/evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,374 copying build/lib/evalscope/benchmarks/_meta/torgo.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,377 copying build/lib/evalscope/benchmarks/_meta/swe_bench_lite.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,379 copying build/lib/evalscope/benchmarks/_meta/bc2gm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,382 copying build/lib/evalscope/benchmarks/_meta/arc.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,385 copying build/lib/evalscope/benchmarks/_meta/cl_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,388 copying build/lib/evalscope/benchmarks/_meta/musr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,390 copying build/lib/evalscope/benchmarks/_meta/qasc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,393 copying build/lib/evalscope/benchmarks/_meta/pope.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,395 copying build/lib/evalscope/benchmarks/_meta/ontonotes5.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,398 copying build/lib/evalscope/benchmarks/_meta/mbpp_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,400 copying build/lib/evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,403 copying build/lib/evalscope/benchmarks/_meta/aime25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,405 copying build/lib/evalscope/benchmarks/_meta/vstar_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,408 copying build/lib/evalscope/benchmarks/_meta/siqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,410 copying build/lib/evalscope/benchmarks/_meta/openai_mrcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,413 copying build/lib/evalscope/benchmarks/_meta/maritime_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,415 copying build/lib/evalscope/benchmarks/_meta/mm_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,418 copying 
build/lib/evalscope/benchmarks/_meta/mri_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,420 copying build/lib/evalscope/benchmarks/_meta/cross_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,423 copying build/lib/evalscope/benchmarks/_meta/simple_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,426 copying build/lib/evalscope/benchmarks/_meta/ifbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,428 copying build/lib/evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,431 copying build/lib/evalscope/benchmarks/_meta/ceval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,434 copying build/lib/evalscope/benchmarks/_meta/logi_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,436 copying build/lib/evalscope/benchmarks/_meta/humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,439 copying build/lib/evalscope/benchmarks/_meta/zebralogicbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,442 copying build/lib/evalscope/benchmarks/_meta/health_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,445 copying build/lib/evalscope/benchmarks/_meta/bc4chemd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,447 copying build/lib/evalscope/benchmarks/_meta/trivia_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,450 copying build/lib/evalscope/benchmarks/_meta/hallusion_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,452 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,455 copying build/lib/evalscope/benchmarks/_meta/conllpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,457 copying build/lib/evalscope/benchmarks/_meta/live_code_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,460 copying build/lib/evalscope/benchmarks/_meta/mit_restaurant.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,462 copying build/lib/evalscope/benchmarks/_meta/general_t2i.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,464 copying build/lib/evalscope/benchmarks/_meta/data_collection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,467 copying build/lib/evalscope/benchmarks/_meta/general_arena.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,469 copying build/lib/evalscope/benchmarks/_meta/mmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,472 copying build/lib/evalscope/benchmarks/_meta/aime26.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,475 copying build/lib/evalscope/benchmarks/_meta/multiple_humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,477 copying build/lib/evalscope/benchmarks/_meta/hpdv2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,480 copying build/lib/evalscope/benchmarks/_meta/alpaca_eval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,483 copying build/lib/evalscope/benchmarks/_meta/general_mcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,485 copying build/lib/evalscope/benchmarks/_meta/math_verse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 
2026-04-13T12:59:51,488 copying build/lib/evalscope/benchmarks/_meta/med_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,490 copying build/lib/evalscope/benchmarks/_meta/tau2_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,493 copying build/lib/evalscope/benchmarks/_meta/longbench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,495 copying build/lib/evalscope/benchmarks/_meta/science_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,498 copying build/lib/evalscope/benchmarks/_meta/commonsense_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,500 copying build/lib/evalscope/benchmarks/_meta/mmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,503 copying build/lib/evalscope/benchmarks/_meta/a_okvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,506 copying build/lib/evalscope/benchmarks/_meta/general_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,508 copying build/lib/evalscope/benchmarks/_meta/gpqa_diamond.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,510 copying build/lib/evalscope/benchmarks/_meta/copious.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,514 copying build/lib/evalscope/benchmarks/_meta/mmlu_redux.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,517 copying build/lib/evalscope/benchmarks/_meta/mmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,520 copying build/lib/evalscope/benchmarks/_meta/cc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,523 copying 
build/lib/evalscope/benchmarks/_meta/drivel_selection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,525 copying build/lib/evalscope/benchmarks/_meta/math_vision.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,527 copying build/lib/evalscope/benchmarks/_meta/genai_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,530 copying build/lib/evalscope/benchmarks/_meta/mm_star.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,532 copying build/lib/evalscope/benchmarks/_meta/genia_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,535 copying build/lib/evalscope/benchmarks/_meta/bfcl_v4.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,537 copying build/lib/evalscope/benchmarks/_meta/music_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,540 copying build/lib/evalscope/benchmarks/_meta/librispeech.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,543 copying build/lib/evalscope/benchmarks/_meta/race.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,545 copying build/lib/evalscope/benchmarks/_meta/ncbi.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,548 copying build/lib/evalscope/benchmarks/_meta/chartqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,550 copying build/lib/evalscope/benchmarks/_meta/docmath.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,552 copying build/lib/evalscope/benchmarks/_meta/tweebank_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,555 copying build/lib/evalscope/benchmarks/_meta/drivel_writing.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,557 copying build/lib/evalscope/benchmarks/_meta/fin_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,560 copying build/lib/evalscope/benchmarks/_meta/mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,563 copying build/lib/evalscope/benchmarks/_meta/gsm8k_v.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,565 copying build/lib/evalscope/benchmarks/_meta/refcoco.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,568 copying build/lib/evalscope/benchmarks/_meta/jnlpba_rare.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,570 copying build/lib/evalscope/benchmarks/_meta/amc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,573 copying build/lib/evalscope/benchmarks/_meta/math_vista.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,576 copying build/lib/evalscope/benchmarks/_meta/gedit.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,578 copying build/lib/evalscope/benchmarks/_meta/drivel_multilabel.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,581 copying build/lib/evalscope/benchmarks/_meta/aa_lcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,584 copying build/lib/evalscope/benchmarks/_meta/wnut2017.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,587 copying build/lib/evalscope/benchmarks/_meta/blink.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,590 copying build/lib/evalscope/benchmarks/_meta/omni_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,592 copying 
build/lib/evalscope/benchmarks/_meta/ai2d.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,595 copying build/lib/evalscope/benchmarks/_meta/hmmt25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,597 copying build/lib/evalscope/benchmarks/_meta/hle.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,600 copying build/lib/evalscope/benchmarks/_meta/arena_hard.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,602 copying build/lib/evalscope/benchmarks/_meta/general_vmcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,605 copying build/lib/evalscope/benchmarks/_meta/multi_nerd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,607 copying build/lib/evalscope/benchmarks/_meta/ocr_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,610 copying build/lib/evalscope/benchmarks/_meta/humaneval_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,613 copying build/lib/evalscope/benchmarks/_meta/competition_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,615 copying build/lib/evalscope/benchmarks/_meta/scicode.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,619 copying build/lib/evalscope/benchmarks/_meta/needle_haystack.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,622 copying build/lib/evalscope/benchmarks/_meta/omni_doc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,626 copying build/lib/evalscope/benchmarks/_meta/eq_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,628 copying build/lib/evalscope/benchmarks/_meta/real_world_qa.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,631 copying build/lib/evalscope/benchmarks/_meta/tool_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,634 copying build/lib/evalscope/benchmarks/_meta/fleurs.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,636 copying build/lib/evalscope/benchmarks/_meta/cmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,640 copying build/lib/evalscope/benchmarks/_meta/pubmedqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,642 copying build/lib/evalscope/benchmarks/_meta/micro_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,645 copying build/lib/evalscope/benchmarks/_meta/poly_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,648 copying build/lib/evalscope/benchmarks/_meta/tweet_ner_7.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,651 copying build/lib/evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,654 copying build/lib/evalscope/benchmarks/_meta/math_500.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,657 copying build/lib/evalscope/benchmarks/_meta/mia_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,659 copying build/lib/evalscope/benchmarks/_meta/bfcl_v3.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,662 copying build/lib/evalscope/benchmarks/_meta/mmmu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,666 copying build/lib/evalscope/benchmarks/_meta/olympiad_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,669 
copying build/lib/evalscope/benchmarks/_meta/iquiz.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,672 copying build/lib/evalscope/benchmarks/_meta/truthful_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,675 copying build/lib/evalscope/benchmarks/_meta/simple_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,677 copying build/lib/evalscope/benchmarks/_meta/harvey_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,680 copying build/lib/evalscope/benchmarks/_meta/frames.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,683 copying build/lib/evalscope/benchmarks/_meta/cmmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,686 copying build/lib/evalscope/benchmarks/_meta/mmlu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,689 copying build/lib/evalscope/benchmarks/_meta/conll2003.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,692 copying build/lib/evalscope/benchmarks/_meta/general_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,695 copying build/lib/evalscope/benchmarks/_meta/ifeval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,699 copying build/lib/evalscope/benchmarks/_meta/zerobench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,702 copying build/lib/evalscope/benchmarks/_meta/coin_flip.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,704 copying build/lib/evalscope/benchmarks/_meta/tau_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,707 copying build/lib/evalscope/benchmarks/_meta/super_gpqa.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,710 copying build/lib/evalscope/benchmarks/_meta/gsm8k.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,713 copying build/lib/evalscope/benchmarks/_meta/anat_em.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,716 copying build/lib/evalscope/benchmarks/_meta/winogrande.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,719 copying build/lib/evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,723 copying build/lib/evalscope/benchmarks/_meta/tifa160.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,725 copying build/lib/evalscope/benchmarks/_meta/drop.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,728 copying build/lib/evalscope/benchmarks/_meta/drivel_binary.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,731 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,734 copying build/lib/evalscope/benchmarks/_meta/math_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,737 copying build/lib/evalscope/benchmarks/_meta/bc5cdr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,740 copying build/lib/evalscope/benchmarks/_meta/jnlpba.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,743 copying build/lib/evalscope/benchmarks/_meta/multiple_mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,746 copying build/lib/evalscope/benchmarks/_meta/mgsm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,749 
copying build/lib/evalscope/benchmarks/_meta/hellaswag.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,752 copying build/lib/evalscope/benchmarks/_meta/halueval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,754 copying build/lib/evalscope/benchmarks/_meta/evalmuse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,757 copying build/lib/evalscope/benchmarks/_meta/sciq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,760 copying build/lib/evalscope/benchmarks/_meta/cmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,763 copying build/lib/evalscope/benchmarks/_meta/multi_if.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,766 copying build/lib/evalscope/benchmarks/_meta/infovqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,768 copying build/lib/evalscope/benchmarks/_meta/bbh.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,772 copying build/lib/evalscope/benchmarks/_meta/biomix_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,776 copying build/lib/evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,779 copying build/lib/evalscope/benchmarks/_meta/piqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,783 copying build/lib/evalscope/benchmarks/_meta/minerva_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-04-13T12:59:51,787 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/data_collection 2026-04-13T12:59:51,788 copying build/lib/evalscope/benchmarks/data_collection/data_collection_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-04-13T12:59:51,792 copying build/lib/evalscope/benchmarks/data_collection/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-04-13T12:59:51,795 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/commonsense_qa 2026-04-13T12:59:51,796 copying build/lib/evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-04-13T12:59:51,799 copying build/lib/evalscope/benchmarks/commonsense_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-04-13T12:59:51,802 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k_v 2026-04-13T12:59:51,804 copying build/lib/evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-04-13T12:59:51,807 copying build/lib/evalscope/benchmarks/gsm8k_v/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-04-13T12:59:51,810 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/visu_logic 2026-04-13T12:59:51,811 copying build/lib/evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-04-13T12:59:51,814 copying build/lib/evalscope/benchmarks/visu_logic/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-04-13T12:59:51,817 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmlu 2026-04-13T12:59:51,819 copying build/lib/evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-04-13T12:59:51,822 copying build/lib/evalscope/benchmarks/cmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-04-13T12:59:51,825 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_bench 2026-04-13T12:59:51,827 copying 
build/lib/evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-04-13T12:59:51,831 copying build/lib/evalscope/benchmarks/mm_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-04-13T12:59:51,834 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vqa 2026-04-13T12:59:51,835 copying build/lib/evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-04-13T12:59:51,839 copying build/lib/evalscope/benchmarks/general_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-04-13T12:59:51,842 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/refcoco 2026-04-13T12:59:51,843 copying build/lib/evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-13T12:59:51,846 copying build/lib/evalscope/benchmarks/refcoco/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-13T12:59:51,849 copying build/lib/evalscope/benchmarks/refcoco/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-13T12:59:51,851 copying build/lib/evalscope/benchmarks/refcoco/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-13T12:59:51,853 copying build/lib/evalscope/benchmarks/refcoco/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-04-13T12:59:51,857 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/needle_haystack 2026-04-13T12:59:51,858 copying build/lib/evalscope/benchmarks/needle_haystack/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-13T12:59:51,861 copying build/lib/evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 
2026-04-13T12:59:51,864 copying build/lib/evalscope/benchmarks/needle_haystack/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-13T12:59:51,866 copying build/lib/evalscope/benchmarks/needle_haystack/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-04-13T12:59:51,870 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,871 copying build/lib/evalscope/benchmarks/live_code_bench/prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,874 copying build/lib/evalscope/benchmarks/live_code_bench/load_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,877 copying build/lib/evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,879 copying build/lib/evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,882 copying build/lib/evalscope/benchmarks/live_code_bench/testing_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,884 copying build/lib/evalscope/benchmarks/live_code_bench/extract_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,886 copying build/lib/evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,888 copying build/lib/evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,890 copying build/lib/evalscope/benchmarks/live_code_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-04-13T12:59:51,892 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mia_bench 2026-04-13T12:59:51,893 copying build/lib/evalscope/benchmarks/mia_bench/mia_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-04-13T12:59:51,896 copying build/lib/evalscope/benchmarks/mia_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-04-13T12:59:51,899 copying build/lib/evalscope/benchmarks/mia_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mia_bench 2026-04-13T12:59:51,901 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pope 2026-04-13T12:59:51,902 copying build/lib/evalscope/benchmarks/pope/pope_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-04-13T12:59:51,904 copying build/lib/evalscope/benchmarks/pope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-04-13T12:59:51,906 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/seed_bench_2_plus 2026-04-13T12:59:51,907 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-04-13T12:59:51,909 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-04-13T12:59:51,911 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gpqa 2026-04-13T12:59:51,912 copying build/lib/evalscope/benchmarks/gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-04-13T12:59:51,914 copying build/lib/evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-04-13T12:59:51,917 copying build/lib/evalscope/benchmarks/gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-04-13T12:59:51,919 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humaneval 2026-04-13T12:59:51,920 
copying build/lib/evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-04-13T12:59:51,922 copying build/lib/evalscope/benchmarks/humaneval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-04-13T12:59:51,924 copying build/lib/evalscope/benchmarks/humaneval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-04-13T12:59:51,927 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench 2026-04-13T12:59:51,928 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-13T12:59:51,929 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-13T12:59:51,932 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-04-13T12:59:51,934 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,935 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,938 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,940 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,942 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,944 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,947 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,949 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,951 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,953 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:51,954 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:51,957 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:51,959 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:51,960 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-04-13T12:59:51,963 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-04-13T12:59:51,965 copying build/lib/evalscope/benchmarks/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-04-13T12:59:51,967 copying 
build/lib/evalscope/benchmarks/ocr_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-04-13T12:59:51,969 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tool_bench 2026-04-13T12:59:51,970 copying build/lib/evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-04-13T12:59:51,972 copying build/lib/evalscope/benchmarks/tool_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-04-13T12:59:51,975 copying build/lib/evalscope/benchmarks/tool_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-04-13T12:59:51,977 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math 2026-04-13T12:59:51,978 copying build/lib/evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-04-13T12:59:51,980 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math/utils 2026-04-13T12:59:51,981 copying build/lib/evalscope/benchmarks/poly_math/utils/instruction.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math/utils 2026-04-13T12:59:51,984 copying build/lib/evalscope/benchmarks/poly_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-04-13T12:59:51,986 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/infovqa 2026-04-13T12:59:51,987 copying build/lib/evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-04-13T12:59:51,989 copying build/lib/evalscope/benchmarks/infovqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-04-13T12:59:51,991 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus 2026-04-13T12:59:51,992 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus/docker 2026-04-13T12:59:51,993 copying 
build/lib/evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus/docker 2026-04-13T12:59:51,995 copying build/lib/evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-04-13T12:59:51,998 copying build/lib/evalscope/benchmarks/humanevalplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-04-13T12:59:52,000 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit 2026-04-13T12:59:52,001 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:52,002 copying build/lib/evalscope/benchmarks/image_edit/gedit/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:52,005 copying build/lib/evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:52,007 copying build/lib/evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:52,009 copying build/lib/evalscope/benchmarks/image_edit/gedit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-04-13T12:59:52,011 copying build/lib/evalscope/benchmarks/image_edit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit 2026-04-13T12:59:52,013 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/vstar_bench 2026-04-13T12:59:52,014 copying build/lib/evalscope/benchmarks/vstar_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-04-13T12:59:52,016 copying build/lib/evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-04-13T12:59:52,018 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/torgo 2026-04-13T12:59:52,019 copying build/lib/evalscope/benchmarks/torgo/torgo_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-04-13T12:59:52,021 copying build/lib/evalscope/benchmarks/torgo/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-04-13T12:59:52,023 copying build/lib/evalscope/benchmarks/torgo/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-04-13T12:59:52,025 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aa_lcr 2026-04-13T12:59:52,026 copying build/lib/evalscope/benchmarks/aa_lcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-04-13T12:59:52,028 copying build/lib/evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-04-13T12:59:52,031 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/amc 2026-04-13T12:59:52,032 copying build/lib/evalscope/benchmarks/amc/amc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-04-13T12:59:52,034 copying build/lib/evalscope/benchmarks/amc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-04-13T12:59:52,036 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_qa 2026-04-13T12:59:52,037 copying build/lib/evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-04-13T12:59:52,040 copying build/lib/evalscope/benchmarks/simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-04-13T12:59:52,042 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/fleurs 2026-04-13T12:59:52,043 copying build/lib/evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-04-13T12:59:52,045 copying build/lib/evalscope/benchmarks/fleurs/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-04-13T12:59:52,048 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench 2026-04-13T12:59:52,049 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:52,051 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:52,053 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:52,055 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:52,057 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-04-13T12:59:52,060 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:52,061 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:52,064 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:52,066 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:52,067 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-04-13T12:59:52,069 copying build/lib/evalscope/benchmarks/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench 2026-04-13T12:59:52,071 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mgsm 2026-04-13T12:59:52,072 copying build/lib/evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-04-13T12:59:52,074 copying build/lib/evalscope/benchmarks/mgsm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-04-13T12:59:52,076 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docmath 2026-04-13T12:59:52,078 copying build/lib/evalscope/benchmarks/docmath/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-04-13T12:59:52,080 copying build/lib/evalscope/benchmarks/docmath/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-04-13T12:59:52,082 copying build/lib/evalscope/benchmarks/docmath/docmath_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-04-13T12:59:52,084 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu 2026-04-13T12:59:52,085 copying build/lib/evalscope/benchmarks/mmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-04-13T12:59:52,087 copying build/lib/evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-04-13T12:59:52,090 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drop 2026-04-13T12:59:52,091 copying build/lib/evalscope/benchmarks/drop/drop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-04-13T12:59:52,094 copying build/lib/evalscope/benchmarks/drop/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-04-13T12:59:52,096 copying build/lib/evalscope/benchmarks/drop/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-04-13T12:59:52,098 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,099 copying build/lib/evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,102 copying build/lib/evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,104 copying build/lib/evalscope/benchmarks/omnidoc_bench/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,107 copying build/lib/evalscope/benchmarks/omnidoc_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,111 copying build/lib/evalscope/benchmarks/omnidoc_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,112 copying build/lib/evalscope/benchmarks/omnidoc_bench/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-04-13T12:59:52,114 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hle 2026-04-13T12:59:52,115 copying build/lib/evalscope/benchmarks/hle/hle_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-04-13T12:59:52,118 copying build/lib/evalscope/benchmarks/hle/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-04-13T12:59:52,120 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cl_bench 2026-04-13T12:59:52,121 copying build/lib/evalscope/benchmarks/cl_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-04-13T12:59:52,123 copying build/lib/evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-04-13T12:59:52,125 copying build/lib/evalscope/benchmarks/cl_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-04-13T12:59:52,127 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_arena 2026-04-13T12:59:52,128 copying build/lib/evalscope/benchmarks/general_arena/utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-13T12:59:52,130 copying build/lib/evalscope/benchmarks/general_arena/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-13T12:59:52,132 copying build/lib/evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-13T12:59:52,135 copying build/lib/evalscope/benchmarks/general_arena/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-04-13T12:59:52,137 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zerobench 2026-04-13T12:59:52,138 copying build/lib/evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-04-13T12:59:52,140 copying build/lib/evalscope/benchmarks/zerobench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-04-13T12:59:52,142 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vmcq 2026-04-13T12:59:52,144 copying build/lib/evalscope/benchmarks/general_vmcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-04-13T12:59:52,145 copying build/lib/evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-04-13T12:59:52,148 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/winogrande 2026-04-13T12:59:52,149 copying build/lib/evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-04-13T12:59:52,151 copying build/lib/evalscope/benchmarks/winogrande/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-04-13T12:59:52,154 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode 2026-04-13T12:59:52,155 copying build/lib/evalscope/benchmarks/scicode/util.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-13T12:59:52,157 copying build/lib/evalscope/benchmarks/scicode/scicode_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-13T12:59:52,160 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode/docker 2026-04-13T12:59:52,161 copying build/lib/evalscope/benchmarks/scicode/docker/test_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-13T12:59:52,163 copying build/lib/evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-13T12:59:52,165 copying build/lib/evalscope/benchmarks/scicode/docker/process_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-13T12:59:52,167 copying build/lib/evalscope/benchmarks/scicode/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-04-13T12:59:52,169 copying build/lib/evalscope/benchmarks/scicode/prompt_templates.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-13T12:59:52,172 copying build/lib/evalscope/benchmarks/scicode/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-04-13T12:59:52,174 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh 2026-04-13T12:59:52,175 copying build/lib/evalscope/benchmarks/bbh/bbh_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-04-13T12:59:52,178 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,180 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,182 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,183 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,185 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,187 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,189 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,191 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,193 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,195 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,197 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,199 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,201 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,203 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,205 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,207 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,209 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,211 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,213 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,215 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,217 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,219 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,221 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,223 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,224 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,227 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,228 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,230 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-04-13T12:59:52,232 copying build/lib/evalscope/benchmarks/bbh/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-04-13T12:59:52,234 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/sciq 2026-04-13T12:59:52,235 copying build/lib/evalscope/benchmarks/sciq/sciq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-04-13T12:59:52,238 copying build/lib/evalscope/benchmarks/sciq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-04-13T12:59:52,240 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/longbench_v2 2026-04-13T12:59:52,241 copying build/lib/evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-04-13T12:59:52,243 copying build/lib/evalscope/benchmarks/longbench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-04-13T12:59:52,245 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_star 2026-04-13T12:59:52,246 copying build/lib/evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-04-13T12:59:52,249 copying build/lib/evalscope/benchmarks/mm_star/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-04-13T12:59:52,251 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multi_if 2026-04-13T12:59:52,252 copying 
build/lib/evalscope/benchmarks/multi_if/ifeval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-13T12:59:52,257 copying build/lib/evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-13T12:59:52,259 copying build/lib/evalscope/benchmarks/multi_if/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-13T12:59:52,261 copying build/lib/evalscope/benchmarks/multi_if/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-13T12:59:52,263 copying build/lib/evalscope/benchmarks/multi_if/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-04-13T12:59:52,265 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_verse 2026-04-13T12:59:52,266 copying build/lib/evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-04-13T12:59:52,269 copying build/lib/evalscope/benchmarks/math_verse/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-04-13T12:59:52,271 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multipl_e 2026-04-13T12:59:52,272 copying build/lib/evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-13T12:59:52,275 copying build/lib/evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-13T12:59:52,277 copying build/lib/evalscope/benchmarks/multipl_e/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-13T12:59:52,279 copying build/lib/evalscope/benchmarks/multipl_e/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-04-13T12:59:52,281 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/blink 2026-04-13T12:59:52,282 copying 
build/lib/evalscope/benchmarks/blink/blink_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-04-13T12:59:52,284 copying build/lib/evalscope/benchmarks/blink/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-04-13T12:59:52,286 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmmu 2026-04-13T12:59:52,287 copying build/lib/evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-04-13T12:59:52,290 copying build/lib/evalscope/benchmarks/cmmmu/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-04-13T12:59:52,292 copying build/lib/evalscope/benchmarks/cmmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-04-13T12:59:52,294 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/frames 2026-04-13T12:59:52,295 copying build/lib/evalscope/benchmarks/frames/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-04-13T12:59:52,297 copying build/lib/evalscope/benchmarks/frames/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-04-13T12:59:52,299 copying build/lib/evalscope/benchmarks/frames/frames_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-04-13T12:59:52,302 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mri_mcqa 2026-04-13T12:59:52,303 copying build/lib/evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-04-13T12:59:52,305 copying build/lib/evalscope/benchmarks/mri_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-04-13T12:59:52,307 copying build/lib/evalscope/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks 2026-04-13T12:59:52,310 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner 2026-04-13T12:59:52,311 copying 
build/lib/evalscope/benchmarks/ner/fin_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,313 copying build/lib/evalscope/benchmarks/ner/genia_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,315 copying build/lib/evalscope/benchmarks/ner/copious_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,317 copying build/lib/evalscope/benchmarks/ner/bc2gm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,319 copying build/lib/evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,321 copying build/lib/evalscope/benchmarks/ner/anat_em_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,323 copying build/lib/evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,325 copying build/lib/evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,327 copying build/lib/evalscope/benchmarks/ner/jnlpba_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,329 copying build/lib/evalscope/benchmarks/ner/cross_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,331 copying build/lib/evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,333 copying build/lib/evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,335 copying build/lib/evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,337 copying 
build/lib/evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,340 copying build/lib/evalscope/benchmarks/ner/conll2003_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,342 copying build/lib/evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,343 copying build/lib/evalscope/benchmarks/ner/wnut2017_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,346 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,347 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,349 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,351 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,353 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,355 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,357 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-04-13T12:59:52,359 copying build/lib/evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,361 copying build/lib/evalscope/benchmarks/ner/harvey_ner_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,363 copying build/lib/evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,365 copying build/lib/evalscope/benchmarks/ner/ncbi_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,367 copying build/lib/evalscope/benchmarks/ner/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,368 copying build/lib/evalscope/benchmarks/ner/conllpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-04-13T12:59:52,371 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/coin_flip 2026-04-13T12:59:52,372 copying build/lib/evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-04-13T12:59:52,374 copying build/lib/evalscope/benchmarks/coin_flip/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-04-13T12:59:52,376 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:52,377 copying build/lib/evalscope/benchmarks/zebralogicbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:52,380 copying build/lib/evalscope/benchmarks/zebralogicbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:52,381 copying build/lib/evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-04-13T12:59:52,384 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pumed_qa 2026-04-13T12:59:52,385 copying build/lib/evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-04-13T12:59:52,387 copying build/lib/evalscope/benchmarks/pumed_qa/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-04-13T12:59:52,389 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/biomix_qa 2026-04-13T12:59:52,390 copying build/lib/evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-04-13T12:59:52,392 copying build/lib/evalscope/benchmarks/biomix_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-04-13T12:59:52,394 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmu 2026-04-13T12:59:52,395 copying build/lib/evalscope/benchmarks/cmmu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-04-13T12:59:52,397 copying build/lib/evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-04-13T12:59:52,400 copying build/lib/evalscope/benchmarks/cmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-04-13T12:59:52,402 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ai2d 2026-04-13T12:59:52,403 copying build/lib/evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-04-13T12:59:52,405 copying build/lib/evalscope/benchmarks/ai2d/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-04-13T12:59:52,407 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minerva_math 2026-04-13T12:59:52,408 copying build/lib/evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-04-13T12:59:52,410 copying build/lib/evalscope/benchmarks/minerva_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-04-13T12:59:52,412 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/music_trivia 2026-04-13T12:59:52,413 copying build/lib/evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-04-13T12:59:52,415 copying build/lib/evalscope/benchmarks/music_trivia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-04-13T12:59:52,417 creating build/bdist.linux-armv7l/wheel/evalscope/service 2026-04-13T12:59:52,418 copying build/lib/evalscope/service/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-04-13T12:59:52,421 creating build/bdist.linux-armv7l/wheel/evalscope/service/frontend 2026-04-13T12:59:52,422 copying build/lib/evalscope/service/frontend/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-13T12:59:52,424 copying build/lib/evalscope/service/frontend/async_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-13T12:59:52,427 copying build/lib/evalscope/service/frontend/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-13T12:59:52,429 copying build/lib/evalscope/service/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-04-13T12:59:52,431 creating build/bdist.linux-armv7l/wheel/evalscope/service/blueprints 2026-04-13T12:59:52,432 copying build/lib/evalscope/service/blueprints/perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-04-13T12:59:52,435 copying build/lib/evalscope/service/blueprints/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-04-13T12:59:52,437 copying build/lib/evalscope/service/blueprints/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-04-13T12:59:52,439 creating build/bdist.linux-armv7l/wheel/evalscope/service/utils 2026-04-13T12:59:52,440 copying build/lib/evalscope/service/utils/benchmarks.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-13T12:59:52,442 copying build/lib/evalscope/service/utils/process.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 
2026-04-13T12:59:52,445 copying build/lib/evalscope/service/utils/log.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-13T12:59:52,446 copying build/lib/evalscope/service/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-04-13T12:59:52,448 copying build/lib/evalscope/service/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-04-13T12:59:52,450 creating build/bdist.linux-armv7l/wheel/evalscope/cli 2026-04-13T12:59:52,451 copying build/lib/evalscope/cli/start_app.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,453 copying build/lib/evalscope/cli/cli.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,455 copying build/lib/evalscope/cli/start_perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,457 copying build/lib/evalscope/cli/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,458 copying build/lib/evalscope/cli/start_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,460 copying build/lib/evalscope/cli/benchmark_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,463 copying build/lib/evalscope/cli/start_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,464 copying build/lib/evalscope/cli/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-04-13T12:59:52,466 creating build/bdist.linux-armv7l/wheel/evalscope/filters 2026-04-13T12:59:52,468 copying build/lib/evalscope/filters/extraction.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-04-13T12:59:52,470 copying build/lib/evalscope/filters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-04-13T12:59:52,472 copying build/lib/evalscope/filters/selection.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-04-13T12:59:52,473 copying build/lib/evalscope/version.py -> build/bdist.linux-armv7l/wheel/./evalscope 
2026-04-13T12:59:52,476 creating build/bdist.linux-armv7l/wheel/evalscope/app 2026-04-13T12:59:52,477 copying build/lib/evalscope/app/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-13T12:59:52,478 copying build/lib/evalscope/app/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-13T12:59:52,480 copying build/lib/evalscope/app/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-13T12:59:52,482 creating build/bdist.linux-armv7l/wheel/evalscope/app/utils 2026-04-13T12:59:52,483 copying build/lib/evalscope/app/utils/text_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-13T12:59:52,486 copying build/lib/evalscope/app/utils/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-13T12:59:52,488 copying build/lib/evalscope/app/utils/localization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-13T12:59:52,490 copying build/lib/evalscope/app/utils/data_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-13T12:59:52,492 copying build/lib/evalscope/app/utils/env_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-04-13T12:59:52,495 creating build/bdist.linux-armv7l/wheel/evalscope/app/ui 2026-04-13T12:59:52,496 copying build/lib/evalscope/app/ui/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-13T12:59:52,497 copying build/lib/evalscope/app/ui/multi_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-13T12:59:52,500 copying build/lib/evalscope/app/ui/sidebar.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-13T12:59:52,501 copying build/lib/evalscope/app/ui/single_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-13T12:59:52,504 copying build/lib/evalscope/app/ui/app_ui.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-13T12:59:52,506 copying build/lib/evalscope/app/ui/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-04-13T12:59:52,507 copying build/lib/evalscope/app/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-04-13T12:59:52,509 copying build/lib/evalscope/run.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-13T12:59:52,512 creating build/bdist.linux-armv7l/wheel/evalscope/perf 2026-04-13T12:59:52,513 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin 2026-04-13T12:59:52,515 copying build/lib/evalscope/perf/plugin/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-04-13T12:59:52,517 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/datasets 2026-04-13T12:59:52,518 copying build/lib/evalscope/perf/plugin/datasets/longalpaca.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,520 copying build/lib/evalscope/perf/plugin/datasets/kontext_bench.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,522 copying build/lib/evalscope/perf/plugin/datasets/random_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,524 copying build/lib/evalscope/perf/plugin/datasets/openqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,526 copying build/lib/evalscope/perf/plugin/datasets/line_by_line.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,528 copying build/lib/evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,529 copying build/lib/evalscope/perf/plugin/datasets/rerank_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,532 copying build/lib/evalscope/perf/plugin/datasets/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,534 copying build/lib/evalscope/perf/plugin/datasets/flickr8k.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,535 copying build/lib/evalscope/perf/plugin/datasets/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,537 copying build/lib/evalscope/perf/plugin/datasets/embedding_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,540 copying build/lib/evalscope/perf/plugin/datasets/speed_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,541 copying build/lib/evalscope/perf/plugin/datasets/custom.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,543 copying build/lib/evalscope/perf/plugin/datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-04-13T12:59:52,545 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/api 2026-04-13T12:59:52,546 copying build/lib/evalscope/perf/plugin/api/default_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,549 copying build/lib/evalscope/perf/plugin/api/dashscope_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,551 copying build/lib/evalscope/perf/plugin/api/openai_rerank_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,553 copying build/lib/evalscope/perf/plugin/api/openai_embedding_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,555 copying build/lib/evalscope/perf/plugin/api/custom_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,557 copying build/lib/evalscope/perf/plugin/api/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,559 copying build/lib/evalscope/perf/plugin/api/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,561 copying 
build/lib/evalscope/perf/plugin/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-04-13T12:59:52,563 copying build/lib/evalscope/perf/plugin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-04-13T12:59:52,565 copying build/lib/evalscope/perf/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-13T12:59:52,568 creating build/bdist.linux-armv7l/wheel/evalscope/perf/sla 2026-04-13T12:59:52,569 copying build/lib/evalscope/perf/sla/sla_criterion.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-04-13T12:59:52,572 copying build/lib/evalscope/perf/sla/sla_run.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-04-13T12:59:52,574 copying build/lib/evalscope/perf/sla/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-04-13T12:59:52,576 copying build/lib/evalscope/perf/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-13T12:59:52,579 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils 2026-04-13T12:59:52,581 copying build/lib/evalscope/perf/utils/handler.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,583 copying build/lib/evalscope/perf/utils/db_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,585 copying build/lib/evalscope/perf/utils/local_server.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,588 copying build/lib/evalscope/perf/utils/benchmark_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,590 copying build/lib/evalscope/perf/utils/analysis_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,593 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils/report 2026-04-13T12:59:52,594 copying build/lib/evalscope/perf/utils/report/generate_report.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-13T12:59:52,597 copying 
build/lib/evalscope/perf/utils/report/perf_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-13T12:59:52,600 copying build/lib/evalscope/perf/utils/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-13T12:59:52,602 copying build/lib/evalscope/perf/utils/report/perf_charts.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-04-13T12:59:52,605 copying build/lib/evalscope/perf/utils/log_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,608 copying build/lib/evalscope/perf/utils/rich_display.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,611 copying build/lib/evalscope/perf/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-04-13T12:59:52,612 copying build/lib/evalscope/perf/http_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-13T12:59:52,615 copying build/lib/evalscope/perf/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-13T12:59:52,617 copying build/lib/evalscope/perf/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-04-13T12:59:52,620 creating build/bdist.linux-armv7l/wheel/evalscope/summarizer 2026-04-13T12:59:52,621 copying build/lib/evalscope/summarizer/summarizer.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-04-13T12:59:52,623 copying build/lib/evalscope/summarizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-04-13T12:59:52,626 creating build/bdist.linux-armv7l/wheel/evalscope/sandbox 2026-04-13T12:59:52,627 copying build/lib/evalscope/sandbox/volcengine.py -> build/bdist.linux-armv7l/wheel/./evalscope/sandbox 2026-04-13T12:59:52,629 copying build/lib/evalscope/sandbox/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/sandbox 2026-04-13T12:59:52,632 creating build/bdist.linux-armv7l/wheel/evalscope/third_party 2026-04-13T12:59:52,633 creating 
build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static 2026-04-13T12:59:52,634 copying build/lib/evalscope/third_party/toolbench_static/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,636 copying build/lib/evalscope/third_party/toolbench_static/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,639 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static/llm 2026-04-13T12:59:52,641 copying build/lib/evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-04-13T12:59:52,643 copying build/lib/evalscope/third_party/toolbench_static/llm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-04-13T12:59:52,645 copying build/lib/evalscope/third_party/toolbench_static/config_default.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,647 copying build/lib/evalscope/third_party/toolbench_static/toolbench_static.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,649 copying build/lib/evalscope/third_party/toolbench_static/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,653 copying build/lib/evalscope/third_party/toolbench_static/config_default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,655 copying build/lib/evalscope/third_party/toolbench_static/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,657 copying build/lib/evalscope/third_party/toolbench_static/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-04-13T12:59:52,660 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write 
2026-04-13T12:59:52,661 copying build/lib/evalscope/third_party/longbench_write/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,663 copying build/lib/evalscope/third_party/longbench_write/default_task.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,665 copying build/lib/evalscope/third_party/longbench_write/default_task.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,667 copying build/lib/evalscope/third_party/longbench_write/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,670 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/resources 2026-04-13T12:59:52,671 copying build/lib/evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-13T12:59:52,674 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-13T12:59:52,676 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-13T12:59:52,680 copying build/lib/evalscope/third_party/longbench_write/resources/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-13T12:59:52,682 copying build/lib/evalscope/third_party/longbench_write/resources/judge.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-04-13T12:59:52,684 copying build/lib/evalscope/third_party/longbench_write/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,686 copying build/lib/evalscope/third_party/longbench_write/eval.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,689 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/tools 2026-04-13T12:59:52,690 copying build/lib/evalscope/third_party/longbench_write/tools/data_etl.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-04-13T12:59:52,693 copying build/lib/evalscope/third_party/longbench_write/tools/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-04-13T12:59:52,696 copying build/lib/evalscope/third_party/longbench_write/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-04-13T12:59:52,698 copying build/lib/evalscope/third_party/longbench_write/longbench_write.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,700 copying build/lib/evalscope/third_party/longbench_write/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-04-13T12:59:52,703 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench 2026-04-13T12:59:52,704 copying build/lib/evalscope/third_party/thinkbench/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-04-13T12:59:52,707 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/resources 2026-04-13T12:59:52,708 copying build/lib/evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-04-13T12:59:52,710 copying build/lib/evalscope/third_party/thinkbench/resources/critique_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-04-13T12:59:52,712 copying build/lib/evalscope/third_party/thinkbench/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-04-13T12:59:52,715 creating 
build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/tools 2026-04-13T12:59:52,716 copying build/lib/evalscope/third_party/thinkbench/tools/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-04-13T12:59:52,718 copying build/lib/evalscope/third_party/thinkbench/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-04-13T12:59:52,720 copying build/lib/evalscope/third_party/thinkbench/tools/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-04-13T12:59:52,722 copying build/lib/evalscope/third_party/thinkbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-04-13T12:59:52,724 copying build/lib/evalscope/third_party/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party 2026-04-13T12:59:52,726 copying build/lib/evalscope/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-13T12:59:52,729 creating build/bdist.linux-armv7l/wheel/evalscope/evaluator 2026-04-13T12:59:52,730 copying build/lib/evalscope/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-13T12:59:52,733 copying build/lib/evalscope/evaluator/batch_reviewer.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-13T12:59:52,735 copying build/lib/evalscope/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-04-13T12:59:52,737 copying build/lib/evalscope/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-13T12:59:52,740 creating build/bdist.linux-armv7l/wheel/evalscope/utils 2026-04-13T12:59:52,742 creating build/bdist.linux-armv7l/wheel/evalscope/utils/tqdm_utils 2026-04-13T12:59:52,743 copying build/lib/evalscope/utils/tqdm_utils/tqdm_logging.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-04-13T12:59:52,745 copying build/lib/evalscope/utils/tqdm_utils/progress_tracker.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-04-13T12:59:52,747 copying build/lib/evalscope/utils/tqdm_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-04-13T12:59:52,750 creating build/bdist.linux-armv7l/wheel/evalscope/utils/doc_utils 2026-04-13T12:59:52,751 copying build/lib/evalscope/utils/doc_utils/generate_dataset_md.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-13T12:59:52,753 copying build/lib/evalscope/utils/doc_utils/readme_generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-13T12:59:52,756 copying build/lib/evalscope/utils/doc_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-13T12:59:52,758 copying build/lib/evalscope/utils/doc_utils/benchmark_stats.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-13T12:59:52,760 copying build/lib/evalscope/utils/doc_utils/translate_description.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-04-13T12:59:52,763 copying build/lib/evalscope/utils/chat_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,765 copying build/lib/evalscope/utils/url_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,767 copying build/lib/evalscope/utils/multi_choices.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,770 copying build/lib/evalscope/utils/json_schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,772 copying build/lib/evalscope/utils/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,774 copying build/lib/evalscope/utils/deprecation_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,776 copying build/lib/evalscope/utils/import_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,779 copying build/lib/evalscope/utils/code_utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,781 copying build/lib/evalscope/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,783 copying build/lib/evalscope/utils/io_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,786 copying build/lib/evalscope/utils/ner.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,789 copying build/lib/evalscope/utils/function_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,791 copying build/lib/evalscope/utils/argument_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,794 copying build/lib/evalscope/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,796 copying build/lib/evalscope/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-04-13T12:59:52,799 creating build/bdist.linux-armv7l/wheel/evalscope/report 2026-04-13T12:59:52,800 copying build/lib/evalscope/report/report.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-13T12:59:52,803 copying build/lib/evalscope/report/combinator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-13T12:59:52,805 copying build/lib/evalscope/report/generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-13T12:59:52,807 copying build/lib/evalscope/report/renderer.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-13T12:59:52,811 creating build/bdist.linux-armv7l/wheel/evalscope/report/template 2026-04-13T12:59:52,812 copying build/lib/evalscope/report/template/report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-04-13T12:59:52,815 copying build/lib/evalscope/report/template/perf_report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-04-13T12:59:52,818 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/css 2026-04-13T12:59:52,819 
copying build/lib/evalscope/report/template/css/perf_extra.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-04-13T12:59:52,821 copying build/lib/evalscope/report/template/css/base.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-04-13T12:59:52,824 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/partials 2026-04-13T12:59:52,826 copying build/lib/evalscope/report/template/partials/brand_logo.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-13T12:59:52,828 copying build/lib/evalscope/report/template/partials/footer.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-13T12:59:52,830 copying build/lib/evalscope/report/template/partials/toc_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-13T12:59:52,832 copying build/lib/evalscope/report/template/partials/toc_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-13T12:59:52,834 copying build/lib/evalscope/report/template/partials/header_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-13T12:59:52,836 copying build/lib/evalscope/report/template/partials/header_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-04-13T12:59:52,838 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/js 2026-04-13T12:59:52,839 copying build/lib/evalscope/report/template/js/i18n_perf.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-13T12:59:52,842 copying build/lib/evalscope/report/template/js/shared.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-13T12:59:52,844 copying build/lib/evalscope/report/template/js/perf_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-13T12:59:52,846 copying build/lib/evalscope/report/template/js/eval_extra.js -> 
build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-13T12:59:52,848 copying build/lib/evalscope/report/template/js/i18n_eval.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-04-13T12:59:52,851 copying build/lib/evalscope/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-04-13T12:59:52,853 copying build/lib/evalscope/config.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-13T12:59:52,856 creating build/bdist.linux-armv7l/wheel/evalscope/models 2026-04-13T12:59:52,857 copying build/lib/evalscope/models/anthropic_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,859 copying build/lib/evalscope/models/text2image_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,862 copying build/lib/evalscope/models/model_apis.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,864 creating build/bdist.linux-armv7l/wheel/evalscope/models/utils 2026-04-13T12:59:52,865 copying build/lib/evalscope/models/utils/anthropic.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-04-13T12:59:52,868 copying build/lib/evalscope/models/utils/openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-04-13T12:59:52,871 copying build/lib/evalscope/models/image_edit_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,873 copying build/lib/evalscope/models/modelscope.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,876 copying build/lib/evalscope/models/mockllm.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,878 copying build/lib/evalscope/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,880 copying build/lib/evalscope/models/openai_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-04-13T12:59:52,883 creating build/bdist.linux-armv7l/wheel/evalscope/api 
2026-04-13T12:59:52,885 creating build/bdist.linux-armv7l/wheel/evalscope/api/mixin 2026-04-13T12:59:52,886 copying build/lib/evalscope/api/mixin/llm_judge_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-04-13T12:59:52,888 copying build/lib/evalscope/api/mixin/sandbox_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-04-13T12:59:52,891 copying build/lib/evalscope/api/mixin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-04-13T12:59:52,893 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark 2026-04-13T12:59:52,894 copying build/lib/evalscope/api/benchmark/meta.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-13T12:59:52,897 copying build/lib/evalscope/api/benchmark/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-13T12:59:52,899 copying build/lib/evalscope/api/benchmark/statistics.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-13T12:59:52,903 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark/adapters 2026-04-13T12:59:52,904 copying build/lib/evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,906 copying build/lib/evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,908 copying build/lib/evalscope/api/benchmark/adapters/ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,911 copying build/lib/evalscope/api/benchmark/adapters/default_data_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,914 copying build/lib/evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,916 copying 
build/lib/evalscope/api/benchmark/adapters/text2image_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,918 copying build/lib/evalscope/api/benchmark/adapters/agent_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,920 copying build/lib/evalscope/api/benchmark/adapters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-04-13T12:59:52,923 copying build/lib/evalscope/api/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-04-13T12:59:52,926 creating build/bdist.linux-armv7l/wheel/evalscope/api/model 2026-04-13T12:59:52,927 copying build/lib/evalscope/api/model/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-13T12:59:52,929 copying build/lib/evalscope/api/model/generate_config.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-13T12:59:52,932 copying build/lib/evalscope/api/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-13T12:59:52,934 copying build/lib/evalscope/api/model/model_output.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-13T12:59:52,937 copying build/lib/evalscope/api/model/lazy_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-04-13T12:59:52,939 copying build/lib/evalscope/api/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-04-13T12:59:52,941 creating build/bdist.linux-armv7l/wheel/evalscope/api/filter 2026-04-13T12:59:52,943 copying build/lib/evalscope/api/filter/filter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-04-13T12:59:52,945 copying build/lib/evalscope/api/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-04-13T12:59:52,947 creating build/bdist.linux-armv7l/wheel/evalscope/api/messages 2026-04-13T12:59:52,948 copying build/lib/evalscope/api/messages/utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-13T12:59:52,950 copying build/lib/evalscope/api/messages/content.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-13T12:59:52,952 copying build/lib/evalscope/api/messages/chat_message.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-13T12:59:52,954 copying build/lib/evalscope/api/messages/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-04-13T12:59:52,957 creating build/bdist.linux-armv7l/wheel/evalscope/api/evaluator 2026-04-13T12:59:52,958 copying build/lib/evalscope/api/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-13T12:59:52,960 copying build/lib/evalscope/api/evaluator/cache.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-13T12:59:52,963 copying build/lib/evalscope/api/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-13T12:59:52,964 copying build/lib/evalscope/api/evaluator/state.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-04-13T12:59:52,967 creating build/bdist.linux-armv7l/wheel/evalscope/api/dataset 2026-04-13T12:59:52,968 copying build/lib/evalscope/api/dataset/loader.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-13T12:59:52,971 copying build/lib/evalscope/api/dataset/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-13T12:59:52,973 copying build/lib/evalscope/api/dataset/dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-13T12:59:52,976 copying build/lib/evalscope/api/dataset/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-04-13T12:59:52,978 creating build/bdist.linux-armv7l/wheel/evalscope/api/tool 2026-04-13T12:59:52,979 copying build/lib/evalscope/api/tool/tool_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-13T12:59:52,982 copying build/lib/evalscope/api/tool/utils.py 
-> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-13T12:59:52,984 copying build/lib/evalscope/api/tool/tool_call.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-13T12:59:52,986 copying build/lib/evalscope/api/tool/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-04-13T12:59:52,988 creating build/bdist.linux-armv7l/wheel/evalscope/api/metric 2026-04-13T12:59:52,989 copying build/lib/evalscope/api/metric/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-04-13T12:59:52,991 copying build/lib/evalscope/api/metric/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-04-13T12:59:52,994 copying build/lib/evalscope/api/metric/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-04-13T12:59:52,996 copying build/lib/evalscope/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-04-13T12:59:52,998 creating build/bdist.linux-armv7l/wheel/evalscope/metrics 2026-04-13T12:59:52,999 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/sem_score 2026-04-13T12:59:53,001 copying build/lib/evalscope/metrics/sem_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-04-13T12:59:53,003 copying build/lib/evalscope/metrics/sem_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-04-13T12:59:53,005 copying build/lib/evalscope/metrics/math_parser.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-13T12:59:53,008 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bundled_rouge_score 2026-04-13T12:59:53,009 copying build/lib/evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-04-13T12:59:53,012 copying build/lib/evalscope/metrics/bundled_rouge_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-04-13T12:59:53,014 creating 
build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,015 copying build/lib/evalscope/metrics/t2v_metrics/itmscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,017 copying build/lib/evalscope/metrics/t2v_metrics/vqascore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,019 copying build/lib/evalscope/metrics/t2v_metrics/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,021 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:53,022 copying build/lib/evalscope/metrics/t2v_metrics/models/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:53,025 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:53,026 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:53,028 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:53,030 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:53,031 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:53,033 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:53,036 copying 
build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-04-13T12:59:53,037 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:53,039 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-04-13T12:59:53,042 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:53,043 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:53,045 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:53,047 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:53,049 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:53,052 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:53,053 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:53,056 copying 
build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:53,058 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:53,060 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-04-13T12:59:53,062 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-04-13T12:59:53,064 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:53,066 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:53,069 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:53,071 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:53,073 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:53,076 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-13T12:59:53,078 creating 
build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-13T12:59:53,079 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-13T12:59:53,081 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-13T12:59:53,083 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-04-13T12:59:53,085 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-13T12:59:53,086 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-04-13T12:59:53,089 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-13T12:59:53,090 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-04-13T12:59:53,093 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-04-13T12:59:53,095 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-04-13T12:59:53,097 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-13T12:59:53,099 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-13T12:59:53,101 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:53,103 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,104 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,107 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,109 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,111 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,113 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 
2026-04-13T12:59:53,115 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,118 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,120 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,122 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,124 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,126 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,127 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,129 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,131 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,133 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,135 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,137 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,139 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,140 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-04-13T12:59:53,142 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:53,144 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:53,146 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-04-13T12:59:53,148 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-04-13T12:59:53,151 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,152 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,155 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,158 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,159 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,162 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,165 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,168 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,172 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,176 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,179 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,181 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,183 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,186 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-04-13T12:59:53,188 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,191 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,192 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,195 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,198 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,201 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,204 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,206 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,209 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,212 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,214 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,216 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,219 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-04-13T12:59:53,221 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,225 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,227 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-04-13T12:59:53,230 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,232 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:53,233 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:53,235 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:53,238 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-04-13T12:59:53,240 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,242 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,245 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,246 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,249 
copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,252 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,254 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-04-13T12:59:53,256 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:53,258 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:53,260 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:53,262 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:53,265 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-04-13T12:59:53,266 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-04-13T12:59:53,269 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-04-13T12:59:53,271 copying build/lib/evalscope/metrics/t2v_metrics/models/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:53,273 copying build/lib/evalscope/metrics/t2v_metrics/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-04-13T12:59:53,274 copying build/lib/evalscope/metrics/t2v_metrics/score.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,276 copying build/lib/evalscope/metrics/t2v_metrics/clipscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,278 copying build/lib/evalscope/metrics/t2v_metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-04-13T12:59:53,280 copying build/lib/evalscope/metrics/llm_judge.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-13T12:59:53,283 copying build/lib/evalscope/metrics/rouge_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-13T12:59:53,285 copying build/lib/evalscope/metrics/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-13T12:59:53,288 copying build/lib/evalscope/metrics/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-13T12:59:53,291 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bert_score 2026-04-13T12:59:53,292 copying build/lib/evalscope/metrics/bert_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-04-13T12:59:53,295 copying build/lib/evalscope/metrics/bert_score/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-04-13T12:59:53,298 copying build/lib/evalscope/metrics/bert_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-04-13T12:59:53,300 
creating build/bdist.linux-armv7l/wheel/evalscope/metrics/text_normalizer 2026-04-13T12:59:53,301 copying build/lib/evalscope/metrics/text_normalizer/english.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-13T12:59:53,305 copying build/lib/evalscope/metrics/text_normalizer/english.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-13T12:59:53,307 copying build/lib/evalscope/metrics/text_normalizer/wer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-13T12:59:53,310 copying build/lib/evalscope/metrics/text_normalizer/basic.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-13T12:59:53,312 copying build/lib/evalscope/metrics/text_normalizer/chinese.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-13T12:59:53,315 copying build/lib/evalscope/metrics/text_normalizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-04-13T12:59:53,317 copying build/lib/evalscope/metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-04-13T12:59:53,319 copying build/lib/evalscope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-04-13T12:59:53,321 creating build/bdist.linux-armv7l/wheel/evalscope/backend 2026-04-13T12:59:53,323 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval 2026-04-13T12:59:53,325 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:53,326 copying build/lib/evalscope/backend/rag_eval/cmteb/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:53,329 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,330 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,332 copying 
build/lib/evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,335 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,337 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,339 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,341 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,343 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,345 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-04-13T12:59:53,348 copying build/lib/evalscope/backend/rag_eval/cmteb/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:53,350 copying build/lib/evalscope/backend/rag_eval/cmteb/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:53,352 copying build/lib/evalscope/backend/rag_eval/cmteb/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-04-13T12:59:53,355 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas 2026-04-13T12:59:53,356 copying build/lib/evalscope/backend/rag_eval/ragas/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-04-13T12:59:53,358 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/tasks 
2026-04-13T12:59:53,360 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:53,362 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:53,364 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:53,366 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:53,368 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-04-13T12:59:53,370 copying build/lib/evalscope/backend/rag_eval/ragas/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-04-13T12:59:53,373 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/prompts 2026-04-13T12:59:53,374 copying build/lib/evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/prompts 2026-04-13T12:59:53,376 copying build/lib/evalscope/backend/rag_eval/ragas/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-04-13T12:59:53,378 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/utils 2026-04-13T12:59:53,379 copying build/lib/evalscope/backend/rag_eval/utils/clip.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-13T12:59:53,382 copying build/lib/evalscope/backend/rag_eval/utils/tools.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-13T12:59:53,384 copying build/lib/evalscope/backend/rag_eval/utils/embedding.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-13T12:59:53,386 copying build/lib/evalscope/backend/rag_eval/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-13T12:59:53,387 copying build/lib/evalscope/backend/rag_eval/utils/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-04-13T12:59:53,390 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:53,391 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:53,393 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:53,394 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:53,397 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:53,399 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:53,400 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-04-13T12:59:53,403 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-13T12:59:53,404 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-13T12:59:53,407 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-04-13T12:59:53,409 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:53,411 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:53,413 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-04-13T12:59:53,415 copying build/lib/evalscope/backend/rag_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-04-13T12:59:53,418 copying build/lib/evalscope/backend/rag_eval/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-04-13T12:59:53,420 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass 2026-04-13T12:59:53,422 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass/tasks 2026-04-13T12:59:53,424 copying build/lib/evalscope/backend/opencompass/tasks/eval_datasets.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-04-13T12:59:53,426 copying build/lib/evalscope/backend/opencompass/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-04-13T12:59:53,428 copying build/lib/evalscope/backend/opencompass/tasks/eval_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-04-13T12:59:53,431 copying build/lib/evalscope/backend/opencompass/api_meta_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-04-13T12:59:53,433 copying build/lib/evalscope/backend/opencompass/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-04-13T12:59:53,435 copying build/lib/evalscope/backend/opencompass/backend_manager.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-04-13T12:59:53,439 creating build/bdist.linux-armv7l/wheel/evalscope/backend/vlm_eval_kit 2026-04-13T12:59:53,440 copying build/lib/evalscope/backend/vlm_eval_kit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-04-13T12:59:53,443 copying build/lib/evalscope/backend/vlm_eval_kit/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-04-13T12:59:53,445 copying build/lib/evalscope/backend/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-04-13T12:59:53,447 copying build/lib/evalscope/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-04-13T12:59:53,449 running install_egg_info 2026-04-13T12:59:53,455 Copying evalscope.egg-info to build/bdist.linux-armv7l/wheel/./evalscope-1.6.0-py3.11.egg-info 2026-04-13T12:59:53,470 running install_scripts 2026-04-13T12:59:53,485 creating build/bdist.linux-armv7l/wheel/evalscope-1.6.0.dist-info/WHEEL 2026-04-13T12:59:53,488 creating '/tmp/pip-wheel-85ia55ga/.tmp-ujfnqcvb/evalscope-1.6.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-13T12:59:53,492 adding 'evalscope/__init__.py' 2026-04-13T12:59:53,494 adding 'evalscope/arguments.py' 2026-04-13T12:59:53,496 adding 'evalscope/config.py' 2026-04-13T12:59:53,498 adding 'evalscope/constants.py' 2026-04-13T12:59:53,501 adding 'evalscope/run.py' 2026-04-13T12:59:53,502 adding 'evalscope/version.py' 2026-04-13T12:59:53,504 adding 'evalscope/api/__init__.py' 2026-04-13T12:59:53,506 adding 'evalscope/api/registry.py' 2026-04-13T12:59:53,509 adding 'evalscope/api/benchmark/__init__.py' 2026-04-13T12:59:53,511 adding 'evalscope/api/benchmark/benchmark.py' 2026-04-13T12:59:53,513 adding 'evalscope/api/benchmark/meta.py' 2026-04-13T12:59:53,517 adding 'evalscope/api/benchmark/statistics.py' 2026-04-13T12:59:53,519 adding 'evalscope/api/benchmark/adapters/__init__.py' 
2026-04-13T12:59:53,521 adding 'evalscope/api/benchmark/adapters/agent_adapter.py' 2026-04-13T12:59:53,525 adding 'evalscope/api/benchmark/adapters/default_data_adapter.py' 2026-04-13T12:59:53,527 adding 'evalscope/api/benchmark/adapters/image_edit_adapter.py' 2026-04-13T12:59:53,529 adding 'evalscope/api/benchmark/adapters/multi_choice_adapter.py' 2026-04-13T12:59:53,531 adding 'evalscope/api/benchmark/adapters/ner_adapter.py' 2026-04-13T12:59:53,533 adding 'evalscope/api/benchmark/adapters/text2image_adapter.py' 2026-04-13T12:59:53,535 adding 'evalscope/api/benchmark/adapters/vision_language_adapter.py' 2026-04-13T12:59:53,537 adding 'evalscope/api/dataset/__init__.py' 2026-04-13T12:59:53,540 adding 'evalscope/api/dataset/dataset.py' 2026-04-13T12:59:53,542 adding 'evalscope/api/dataset/loader.py' 2026-04-13T12:59:53,544 adding 'evalscope/api/dataset/utils.py' 2026-04-13T12:59:53,546 adding 'evalscope/api/evaluator/__init__.py' 2026-04-13T12:59:53,549 adding 'evalscope/api/evaluator/cache.py' 2026-04-13T12:59:53,550 adding 'evalscope/api/evaluator/evaluator.py' 2026-04-13T12:59:53,552 adding 'evalscope/api/evaluator/state.py' 2026-04-13T12:59:53,554 adding 'evalscope/api/filter/__init__.py' 2026-04-13T12:59:53,555 adding 'evalscope/api/filter/filter.py' 2026-04-13T12:59:53,557 adding 'evalscope/api/messages/__init__.py' 2026-04-13T12:59:53,559 adding 'evalscope/api/messages/chat_message.py' 2026-04-13T12:59:53,560 adding 'evalscope/api/messages/content.py' 2026-04-13T12:59:53,561 adding 'evalscope/api/messages/utils.py' 2026-04-13T12:59:53,563 adding 'evalscope/api/metric/__init__.py' 2026-04-13T12:59:53,564 adding 'evalscope/api/metric/metric.py' 2026-04-13T12:59:53,566 adding 'evalscope/api/metric/scorer.py' 2026-04-13T12:59:53,567 adding 'evalscope/api/mixin/__init__.py' 2026-04-13T12:59:53,569 adding 'evalscope/api/mixin/llm_judge_mixin.py' 2026-04-13T12:59:53,571 adding 'evalscope/api/mixin/sandbox_mixin.py' 2026-04-13T12:59:53,573 adding 
'evalscope/api/model/__init__.py' 2026-04-13T12:59:53,575 adding 'evalscope/api/model/generate_config.py' 2026-04-13T12:59:53,576 adding 'evalscope/api/model/lazy_model.py' 2026-04-13T12:59:53,578 adding 'evalscope/api/model/model.py' 2026-04-13T12:59:53,580 adding 'evalscope/api/model/model_output.py' 2026-04-13T12:59:53,582 adding 'evalscope/api/tool/__init__.py' 2026-04-13T12:59:53,583 adding 'evalscope/api/tool/tool_call.py' 2026-04-13T12:59:53,585 adding 'evalscope/api/tool/tool_info.py' 2026-04-13T12:59:53,586 adding 'evalscope/api/tool/utils.py' 2026-04-13T12:59:53,588 adding 'evalscope/app/__init__.py' 2026-04-13T12:59:53,589 adding 'evalscope/app/app.py' 2026-04-13T12:59:53,591 adding 'evalscope/app/arguments.py' 2026-04-13T12:59:53,592 adding 'evalscope/app/constants.py' 2026-04-13T12:59:53,593 adding 'evalscope/app/ui/__init__.py' 2026-04-13T12:59:53,595 adding 'evalscope/app/ui/app_ui.py' 2026-04-13T12:59:53,597 adding 'evalscope/app/ui/multi_model.py' 2026-04-13T12:59:53,598 adding 'evalscope/app/ui/sidebar.py' 2026-04-13T12:59:53,600 adding 'evalscope/app/ui/single_model.py' 2026-04-13T12:59:53,601 adding 'evalscope/app/ui/visualization.py' 2026-04-13T12:59:53,604 adding 'evalscope/app/utils/data_utils.py' 2026-04-13T12:59:53,605 adding 'evalscope/app/utils/env_utils.py' 2026-04-13T12:59:53,606 adding 'evalscope/app/utils/localization.py' 2026-04-13T12:59:53,608 adding 'evalscope/app/utils/text_utils.py' 2026-04-13T12:59:53,610 adding 'evalscope/app/utils/visualization.py' 2026-04-13T12:59:53,612 adding 'evalscope/backend/__init__.py' 2026-04-13T12:59:53,613 adding 'evalscope/backend/base.py' 2026-04-13T12:59:53,615 adding 'evalscope/backend/opencompass/__init__.py' 2026-04-13T12:59:53,616 adding 'evalscope/backend/opencompass/api_meta_template.py' 2026-04-13T12:59:53,618 adding 'evalscope/backend/opencompass/backend_manager.py' 2026-04-13T12:59:53,620 adding 'evalscope/backend/opencompass/tasks/__init__.py' 2026-04-13T12:59:53,622 adding 
'evalscope/backend/opencompass/tasks/eval_api.py' 2026-04-13T12:59:53,623 adding 'evalscope/backend/opencompass/tasks/eval_datasets.py' 2026-04-13T12:59:53,625 adding 'evalscope/backend/rag_eval/__init__.py' 2026-04-13T12:59:53,627 adding 'evalscope/backend/rag_eval/backend_manager.py' 2026-04-13T12:59:53,629 adding 'evalscope/backend/rag_eval/clip_benchmark/__init__.py' 2026-04-13T12:59:53,630 adding 'evalscope/backend/rag_eval/clip_benchmark/arguments.py' 2026-04-13T12:59:53,632 adding 'evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py' 2026-04-13T12:59:53,634 adding 'evalscope/backend/rag_eval/clip_benchmark/task_template.py' 2026-04-13T12:59:53,636 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py' 2026-04-13T12:59:53,637 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py' 2026-04-13T12:59:53,639 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py' 2026-04-13T12:59:53,641 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py' 2026-04-13T12:59:53,643 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py' 2026-04-13T12:59:53,644 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt' 2026-04-13T12:59:53,646 adding 'evalscope/backend/rag_eval/cmteb/__init__.py' 2026-04-13T12:59:53,647 adding 'evalscope/backend/rag_eval/cmteb/arguments.py' 2026-04-13T12:59:53,648 adding 'evalscope/backend/rag_eval/cmteb/base.py' 2026-04-13T12:59:53,650 adding 'evalscope/backend/rag_eval/cmteb/task_template.py' 2026-04-13T12:59:53,652 adding 'evalscope/backend/rag_eval/cmteb/tasks/Classification.py' 2026-04-13T12:59:53,653 adding 'evalscope/backend/rag_eval/cmteb/tasks/Clustering.py' 2026-04-13T12:59:53,655 adding 'evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py' 2026-04-13T12:59:53,656 adding 'evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py' 2026-04-13T12:59:53,658 adding 
'evalscope/backend/rag_eval/cmteb/tasks/Reranking.py' 2026-04-13T12:59:53,659 adding 'evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py' 2026-04-13T12:59:53,661 adding 'evalscope/backend/rag_eval/cmteb/tasks/STS.py' 2026-04-13T12:59:53,662 adding 'evalscope/backend/rag_eval/cmteb/tasks/__init__.py' 2026-04-13T12:59:53,664 adding 'evalscope/backend/rag_eval/ragas/__init__.py' 2026-04-13T12:59:53,665 adding 'evalscope/backend/rag_eval/ragas/arguments.py' 2026-04-13T12:59:53,666 adding 'evalscope/backend/rag_eval/ragas/task_template.py' 2026-04-13T12:59:53,668 adding 'evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py' 2026-04-13T12:59:53,670 adding 'evalscope/backend/rag_eval/ragas/tasks/__init__.py' 2026-04-13T12:59:53,671 adding 'evalscope/backend/rag_eval/ragas/tasks/build_distribution.py' 2026-04-13T12:59:53,673 adding 'evalscope/backend/rag_eval/ragas/tasks/build_transform.py' 2026-04-13T12:59:53,674 adding 'evalscope/backend/rag_eval/ragas/tasks/testset_generation.py' 2026-04-13T12:59:53,676 adding 'evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py' 2026-04-13T12:59:53,677 adding 'evalscope/backend/rag_eval/utils/__init__.py' 2026-04-13T12:59:53,679 adding 'evalscope/backend/rag_eval/utils/clip.py' 2026-04-13T12:59:53,681 adding 'evalscope/backend/rag_eval/utils/embedding.py' 2026-04-13T12:59:53,682 adding 'evalscope/backend/rag_eval/utils/llm.py' 2026-04-13T12:59:53,683 adding 'evalscope/backend/rag_eval/utils/tools.py' 2026-04-13T12:59:53,685 adding 'evalscope/backend/vlm_eval_kit/__init__.py' 2026-04-13T12:59:53,686 adding 'evalscope/backend/vlm_eval_kit/backend_manager.py' 2026-04-13T12:59:53,691 adding 'evalscope/benchmarks/__init__.py' 2026-04-13T12:59:53,696 adding 'evalscope/benchmarks/_meta/a_okvqa.json' 2026-04-13T12:59:53,698 adding 'evalscope/benchmarks/_meta/aa_lcr.json' 2026-04-13T12:59:53,700 adding 'evalscope/benchmarks/_meta/ai2d.json' 2026-04-13T12:59:53,701 adding 'evalscope/benchmarks/_meta/aime24.json' 
2026-04-13T12:59:53,703 adding 'evalscope/benchmarks/_meta/aime25.json' 2026-04-13T12:59:53,705 adding 'evalscope/benchmarks/_meta/aime26.json' 2026-04-13T12:59:53,707 adding 'evalscope/benchmarks/_meta/alpaca_eval.json' 2026-04-13T12:59:53,709 adding 'evalscope/benchmarks/_meta/amc.json' 2026-04-13T12:59:53,711 adding 'evalscope/benchmarks/_meta/anat_em.json' 2026-04-13T12:59:53,713 adding 'evalscope/benchmarks/_meta/arc.json' 2026-04-13T12:59:53,714 adding 'evalscope/benchmarks/_meta/arena_hard.json' 2026-04-13T12:59:53,717 adding 'evalscope/benchmarks/_meta/bbh.json' 2026-04-13T12:59:53,719 adding 'evalscope/benchmarks/_meta/bc2gm.json' 2026-04-13T12:59:53,721 adding 'evalscope/benchmarks/_meta/bc4chemd.json' 2026-04-13T12:59:53,723 adding 'evalscope/benchmarks/_meta/bc5cdr.json' 2026-04-13T12:59:53,726 adding 'evalscope/benchmarks/_meta/bfcl_v3.json' 2026-04-13T12:59:53,729 adding 'evalscope/benchmarks/_meta/bfcl_v4.json' 2026-04-13T12:59:53,731 adding 'evalscope/benchmarks/_meta/biomix_qa.json' 2026-04-13T12:59:53,733 adding 'evalscope/benchmarks/_meta/blink.json' 2026-04-13T12:59:53,736 adding 'evalscope/benchmarks/_meta/broad_twitter_corpus.json' 2026-04-13T12:59:53,737 adding 'evalscope/benchmarks/_meta/cc_bench.json' 2026-04-13T12:59:53,740 adding 'evalscope/benchmarks/_meta/ceval.json' 2026-04-13T12:59:53,742 adding 'evalscope/benchmarks/_meta/chartqa.json' 2026-04-13T12:59:53,744 adding 'evalscope/benchmarks/_meta/chinese_simpleqa.json' 2026-04-13T12:59:53,746 adding 'evalscope/benchmarks/_meta/cl_bench.json' 2026-04-13T12:59:53,750 adding 'evalscope/benchmarks/_meta/cmmlu.json' 2026-04-13T12:59:53,753 adding 'evalscope/benchmarks/_meta/cmmmu.json' 2026-04-13T12:59:53,756 adding 'evalscope/benchmarks/_meta/cmmu.json' 2026-04-13T12:59:53,758 adding 'evalscope/benchmarks/_meta/coin_flip.json' 2026-04-13T12:59:53,760 adding 'evalscope/benchmarks/_meta/commonsense_qa.json' 2026-04-13T12:59:53,762 adding 'evalscope/benchmarks/_meta/competition_math.json' 
2026-04-13T12:59:53,764 adding 'evalscope/benchmarks/_meta/conll2003.json' 2026-04-13T12:59:53,766 adding 'evalscope/benchmarks/_meta/conllpp.json' 2026-04-13T12:59:53,769 adding 'evalscope/benchmarks/_meta/copious.json' 2026-04-13T12:59:53,772 adding 'evalscope/benchmarks/_meta/cross_ner.json' 2026-04-13T12:59:53,773 adding 'evalscope/benchmarks/_meta/data_collection.json' 2026-04-13T12:59:53,775 adding 'evalscope/benchmarks/_meta/docmath.json' 2026-04-13T12:59:53,777 adding 'evalscope/benchmarks/_meta/docvqa.json' 2026-04-13T12:59:53,779 adding 'evalscope/benchmarks/_meta/drivel_binary.json' 2026-04-13T12:59:53,781 adding 'evalscope/benchmarks/_meta/drivel_multilabel.json' 2026-04-13T12:59:53,783 adding 'evalscope/benchmarks/_meta/drivel_selection.json' 2026-04-13T12:59:53,784 adding 'evalscope/benchmarks/_meta/drivel_writing.json' 2026-04-13T12:59:53,786 adding 'evalscope/benchmarks/_meta/drop.json' 2026-04-13T12:59:53,788 adding 'evalscope/benchmarks/_meta/eq_bench.json' 2026-04-13T12:59:53,790 adding 'evalscope/benchmarks/_meta/evalmuse.json' 2026-04-13T12:59:53,792 adding 'evalscope/benchmarks/_meta/fin_ner.json' 2026-04-13T12:59:53,794 adding 'evalscope/benchmarks/_meta/fleurs.json' 2026-04-13T12:59:53,797 adding 'evalscope/benchmarks/_meta/frames.json' 2026-04-13T12:59:53,799 adding 'evalscope/benchmarks/_meta/gedit.json' 2026-04-13T12:59:53,801 adding 'evalscope/benchmarks/_meta/genai_bench.json' 2026-04-13T12:59:53,803 adding 'evalscope/benchmarks/_meta/general_arena.json' 2026-04-13T12:59:53,806 adding 'evalscope/benchmarks/_meta/general_fc.json' 2026-04-13T12:59:53,808 adding 'evalscope/benchmarks/_meta/general_mcq.json' 2026-04-13T12:59:53,810 adding 'evalscope/benchmarks/_meta/general_qa.json' 2026-04-13T12:59:53,812 adding 'evalscope/benchmarks/_meta/general_t2i.json' 2026-04-13T12:59:53,813 adding 'evalscope/benchmarks/_meta/general_vmcq.json' 2026-04-13T12:59:53,815 adding 'evalscope/benchmarks/_meta/general_vqa.json' 2026-04-13T12:59:53,817 adding 
'evalscope/benchmarks/_meta/genia_ner.json' 2026-04-13T12:59:53,819 adding 'evalscope/benchmarks/_meta/gpqa_diamond.json' 2026-04-13T12:59:53,821 adding 'evalscope/benchmarks/_meta/gsm8k.json' 2026-04-13T12:59:53,823 adding 'evalscope/benchmarks/_meta/gsm8k_v.json' 2026-04-13T12:59:53,825 adding 'evalscope/benchmarks/_meta/hallusion_bench.json' 2026-04-13T12:59:53,827 adding 'evalscope/benchmarks/_meta/halueval.json' 2026-04-13T12:59:53,829 adding 'evalscope/benchmarks/_meta/harvey_ner.json' 2026-04-13T12:59:53,831 adding 'evalscope/benchmarks/_meta/health_bench.json' 2026-04-13T12:59:53,833 adding 'evalscope/benchmarks/_meta/hellaswag.json' 2026-04-13T12:59:53,836 adding 'evalscope/benchmarks/_meta/hle.json' 2026-04-13T12:59:53,838 adding 'evalscope/benchmarks/_meta/hmmt25.json' 2026-04-13T12:59:53,839 adding 'evalscope/benchmarks/_meta/hpdv2.json' 2026-04-13T12:59:53,841 adding 'evalscope/benchmarks/_meta/humaneval.json' 2026-04-13T12:59:53,844 adding 'evalscope/benchmarks/_meta/humaneval_plus.json' 2026-04-13T12:59:53,846 adding 'evalscope/benchmarks/_meta/ifbench.json' 2026-04-13T12:59:53,848 adding 'evalscope/benchmarks/_meta/ifeval.json' 2026-04-13T12:59:53,850 adding 'evalscope/benchmarks/_meta/infovqa.json' 2026-04-13T12:59:53,852 adding 'evalscope/benchmarks/_meta/iquiz.json' 2026-04-13T12:59:53,854 adding 'evalscope/benchmarks/_meta/jnlpba.json' 2026-04-13T12:59:53,856 adding 'evalscope/benchmarks/_meta/jnlpba_rare.json' 2026-04-13T12:59:53,857 adding 'evalscope/benchmarks/_meta/librispeech.json' 2026-04-13T12:59:53,859 adding 'evalscope/benchmarks/_meta/live_code_bench.json' 2026-04-13T12:59:53,861 adding 'evalscope/benchmarks/_meta/logi_qa.json' 2026-04-13T12:59:53,863 adding 'evalscope/benchmarks/_meta/longbench_v2.json' 2026-04-13T12:59:53,865 adding 'evalscope/benchmarks/_meta/maritime_bench.json' 2026-04-13T12:59:53,867 adding 'evalscope/benchmarks/_meta/math_500.json' 2026-04-13T12:59:53,869 adding 'evalscope/benchmarks/_meta/math_qa.json' 
2026-04-13T12:59:53,871 adding 'evalscope/benchmarks/_meta/math_verse.json' 2026-04-13T12:59:53,873 adding 'evalscope/benchmarks/_meta/math_vision.json' 2026-04-13T12:59:53,875 adding 'evalscope/benchmarks/_meta/math_vista.json' 2026-04-13T12:59:53,877 adding 'evalscope/benchmarks/_meta/mbpp.json' 2026-04-13T12:59:53,879 adding 'evalscope/benchmarks/_meta/mbpp_plus.json' 2026-04-13T12:59:53,881 adding 'evalscope/benchmarks/_meta/med_mcqa.json' 2026-04-13T12:59:53,883 adding 'evalscope/benchmarks/_meta/mgsm.json' 2026-04-13T12:59:53,885 adding 'evalscope/benchmarks/_meta/mia_bench.json' 2026-04-13T12:59:53,888 adding 'evalscope/benchmarks/_meta/micro_vqa.json' 2026-04-13T12:59:53,889 adding 'evalscope/benchmarks/_meta/minerva_math.json' 2026-04-13T12:59:53,892 adding 'evalscope/benchmarks/_meta/mit_movie_trivia.json' 2026-04-13T12:59:53,894 adding 'evalscope/benchmarks/_meta/mit_restaurant.json' 2026-04-13T12:59:53,896 adding 'evalscope/benchmarks/_meta/mm_bench.json' 2026-04-13T12:59:53,898 adding 'evalscope/benchmarks/_meta/mm_star.json' 2026-04-13T12:59:53,901 adding 'evalscope/benchmarks/_meta/mmlu.json' 2026-04-13T12:59:53,903 adding 'evalscope/benchmarks/_meta/mmlu_pro.json' 2026-04-13T12:59:53,906 adding 'evalscope/benchmarks/_meta/mmlu_redux.json' 2026-04-13T12:59:53,909 adding 'evalscope/benchmarks/_meta/mmmlu.json' 2026-04-13T12:59:53,912 adding 'evalscope/benchmarks/_meta/mmmu.json' 2026-04-13T12:59:53,916 adding 'evalscope/benchmarks/_meta/mmmu_pro.json' 2026-04-13T12:59:53,918 adding 'evalscope/benchmarks/_meta/mri_mcqa.json' 2026-04-13T12:59:53,921 adding 'evalscope/benchmarks/_meta/multi_if.json' 2026-04-13T12:59:53,923 adding 'evalscope/benchmarks/_meta/multi_nerd.json' 2026-04-13T12:59:53,925 adding 'evalscope/benchmarks/_meta/multiple_humaneval.json' 2026-04-13T12:59:53,928 adding 'evalscope/benchmarks/_meta/multiple_mbpp.json' 2026-04-13T12:59:53,930 adding 'evalscope/benchmarks/_meta/music_trivia.json' 2026-04-13T12:59:53,931 adding 
'evalscope/benchmarks/_meta/musr.json' 2026-04-13T12:59:53,934 adding 'evalscope/benchmarks/_meta/ncbi.json' 2026-04-13T12:59:53,936 adding 'evalscope/benchmarks/_meta/needle_haystack.json' 2026-04-13T12:59:53,939 adding 'evalscope/benchmarks/_meta/ocr_bench.json' 2026-04-13T12:59:53,942 adding 'evalscope/benchmarks/_meta/ocr_bench_v2.json' 2026-04-13T12:59:53,945 adding 'evalscope/benchmarks/_meta/olympiad_bench.json' 2026-04-13T12:59:53,947 adding 'evalscope/benchmarks/_meta/omni_bench.json' 2026-04-13T12:59:53,951 adding 'evalscope/benchmarks/_meta/omni_doc_bench.json' 2026-04-13T12:59:53,954 adding 'evalscope/benchmarks/_meta/ontonotes5.json' 2026-04-13T12:59:53,957 adding 'evalscope/benchmarks/_meta/openai_mrcr.json' 2026-04-13T12:59:53,959 adding 'evalscope/benchmarks/_meta/piqa.json' 2026-04-13T12:59:53,962 adding 'evalscope/benchmarks/_meta/poly_math.json' 2026-04-13T12:59:53,964 adding 'evalscope/benchmarks/_meta/pope.json' 2026-04-13T12:59:53,966 adding 'evalscope/benchmarks/_meta/process_bench.json' 2026-04-13T12:59:53,968 adding 'evalscope/benchmarks/_meta/pubmedqa.json' 2026-04-13T12:59:53,970 adding 'evalscope/benchmarks/_meta/qasc.json' 2026-04-13T12:59:53,972 adding 'evalscope/benchmarks/_meta/race.json' 2026-04-13T12:59:53,973 adding 'evalscope/benchmarks/_meta/real_world_qa.json' 2026-04-13T12:59:53,976 adding 'evalscope/benchmarks/_meta/refcoco.json' 2026-04-13T12:59:53,981 adding 'evalscope/benchmarks/_meta/scicode.json' 2026-04-13T12:59:53,984 adding 'evalscope/benchmarks/_meta/science_qa.json' 2026-04-13T12:59:53,986 adding 'evalscope/benchmarks/_meta/sciq.json' 2026-04-13T12:59:53,988 adding 'evalscope/benchmarks/_meta/seed_bench_2_plus.json' 2026-04-13T12:59:53,989 adding 'evalscope/benchmarks/_meta/simple_qa.json' 2026-04-13T12:59:53,991 adding 'evalscope/benchmarks/_meta/simple_vqa.json' 2026-04-13T12:59:53,993 adding 'evalscope/benchmarks/_meta/siqa.json' 2026-04-13T12:59:53,996 adding 'evalscope/benchmarks/_meta/super_gpqa.json' 
2026-04-13T12:59:53,998 adding 'evalscope/benchmarks/_meta/swe_bench_lite.json' 2026-04-13T12:59:54,000 adding 'evalscope/benchmarks/_meta/swe_bench_verified.json' 2026-04-13T12:59:54,002 adding 'evalscope/benchmarks/_meta/swe_bench_verified_mini.json' 2026-04-13T12:59:54,004 adding 'evalscope/benchmarks/_meta/tau2_bench.json' 2026-04-13T12:59:54,006 adding 'evalscope/benchmarks/_meta/tau_bench.json' 2026-04-13T12:59:54,008 adding 'evalscope/benchmarks/_meta/terminal_bench_v2.json' 2026-04-13T12:59:54,009 adding 'evalscope/benchmarks/_meta/tifa160.json' 2026-04-13T12:59:54,012 adding 'evalscope/benchmarks/_meta/tool_bench.json' 2026-04-13T12:59:54,014 adding 'evalscope/benchmarks/_meta/torgo.json' 2026-04-13T12:59:54,016 adding 'evalscope/benchmarks/_meta/trivia_qa.json' 2026-04-13T12:59:54,019 adding 'evalscope/benchmarks/_meta/truthful_qa.json' 2026-04-13T12:59:54,021 adding 'evalscope/benchmarks/_meta/tweebank_ner.json' 2026-04-13T12:59:54,024 adding 'evalscope/benchmarks/_meta/tweet_ner_7.json' 2026-04-13T12:59:54,026 adding 'evalscope/benchmarks/_meta/visulogic.json' 2026-04-13T12:59:54,028 adding 'evalscope/benchmarks/_meta/vstar_bench.json' 2026-04-13T12:59:54,030 adding 'evalscope/benchmarks/_meta/winogrande.json' 2026-04-13T12:59:54,032 adding 'evalscope/benchmarks/_meta/wmt24pp.json' 2026-04-13T12:59:54,034 adding 'evalscope/benchmarks/_meta/wnut2017.json' 2026-04-13T12:59:54,036 adding 'evalscope/benchmarks/_meta/zebralogicbench.json' 2026-04-13T12:59:54,038 adding 'evalscope/benchmarks/_meta/zerobench.json' 2026-04-13T12:59:54,040 adding 'evalscope/benchmarks/a_okvqa/__init__.py' 2026-04-13T12:59:54,042 adding 'evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py' 2026-04-13T12:59:54,043 adding 'evalscope/benchmarks/aa_lcr/__init__.py' 2026-04-13T12:59:54,045 adding 'evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py' 2026-04-13T12:59:54,047 adding 'evalscope/benchmarks/ai2d/__init__.py' 2026-04-13T12:59:54,048 adding 'evalscope/benchmarks/ai2d/ai2d_adapter.py' 
2026-04-13T12:59:54,050 adding 'evalscope/benchmarks/aime/__init__.py' 2026-04-13T12:59:54,051 adding 'evalscope/benchmarks/aime/aime_adapter.py' 2026-04-13T12:59:54,053 adding 'evalscope/benchmarks/aime/grader.py' 2026-04-13T12:59:54,055 adding 'evalscope/benchmarks/aime/math_normalize.py' 2026-04-13T12:59:54,057 adding 'evalscope/benchmarks/alpaca_eval/__init__.py' 2026-04-13T12:59:54,058 adding 'evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py' 2026-04-13T12:59:54,060 adding 'evalscope/benchmarks/amc/__init__.py' 2026-04-13T12:59:54,061 adding 'evalscope/benchmarks/amc/amc_adapter.py' 2026-04-13T12:59:54,063 adding 'evalscope/benchmarks/arc/__init__.py' 2026-04-13T12:59:54,064 adding 'evalscope/benchmarks/arc/arc_adapter.py' 2026-04-13T12:59:54,066 adding 'evalscope/benchmarks/arena_hard/__init__.py' 2026-04-13T12:59:54,068 adding 'evalscope/benchmarks/arena_hard/arena_hard_adapter.py' 2026-04-13T12:59:54,069 adding 'evalscope/benchmarks/arena_hard/requirements.txt' 2026-04-13T12:59:54,071 adding 'evalscope/benchmarks/arena_hard/utils.py' 2026-04-13T12:59:54,073 adding 'evalscope/benchmarks/bbh/__init__.py' 2026-04-13T12:59:54,075 adding 'evalscope/benchmarks/bbh/bbh_adapter.py' 2026-04-13T12:59:54,077 adding 'evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt' 2026-04-13T12:59:54,079 adding 'evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt' 2026-04-13T12:59:54,080 adding 'evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt' 2026-04-13T12:59:54,081 adding 'evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt' 2026-04-13T12:59:54,083 adding 'evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt' 2026-04-13T12:59:54,085 adding 'evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt' 2026-04-13T12:59:54,086 adding 'evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt' 2026-04-13T12:59:54,088 adding 'evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt' 2026-04-13T12:59:54,089 adding 
'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt' 2026-04-13T12:59:54,091 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt' 2026-04-13T12:59:54,092 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt' 2026-04-13T12:59:54,093 adding 'evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt' 2026-04-13T12:59:54,094 adding 'evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt' 2026-04-13T12:59:54,096 adding 'evalscope/benchmarks/bbh/cot_prompts/navigate.txt' 2026-04-13T12:59:54,097 adding 'evalscope/benchmarks/bbh/cot_prompts/object_counting.txt' 2026-04-13T12:59:54,098 adding 'evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt' 2026-04-13T12:59:54,099 adding 'evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt' 2026-04-13T12:59:54,101 adding 'evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt' 2026-04-13T12:59:54,102 adding 'evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt' 2026-04-13T12:59:54,104 adding 'evalscope/benchmarks/bbh/cot_prompts/snarks.txt' 2026-04-13T12:59:54,105 adding 'evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt' 2026-04-13T12:59:54,106 adding 'evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt' 2026-04-13T12:59:54,107 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt' 2026-04-13T12:59:54,109 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt' 2026-04-13T12:59:54,110 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt' 2026-04-13T12:59:54,111 adding 'evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt' 2026-04-13T12:59:54,113 adding 'evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt' 2026-04-13T12:59:54,114 adding 'evalscope/benchmarks/bfcl/__init__.py' 2026-04-13T12:59:54,115 adding 'evalscope/benchmarks/bfcl/requirements.txt' 
2026-04-13T12:59:54,117 adding 'evalscope/benchmarks/bfcl/v3/__init__.py' 2026-04-13T12:59:54,119 adding 'evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py' 2026-04-13T12:59:54,121 adding 'evalscope/benchmarks/bfcl/v3/generation.py' 2026-04-13T12:59:54,122 adding 'evalscope/benchmarks/bfcl/v3/utils.py' 2026-04-13T12:59:54,124 adding 'evalscope/benchmarks/bfcl/v4/__init__.py' 2026-04-13T12:59:54,126 adding 'evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py' 2026-04-13T12:59:54,128 adding 'evalscope/benchmarks/bfcl/v4/utils.py' 2026-04-13T12:59:54,130 adding 'evalscope/benchmarks/biomix_qa/__init__.py' 2026-04-13T12:59:54,131 adding 'evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py' 2026-04-13T12:59:54,133 adding 'evalscope/benchmarks/blink/__init__.py' 2026-04-13T12:59:54,134 adding 'evalscope/benchmarks/blink/blink_adapter.py' 2026-04-13T12:59:54,136 adding 'evalscope/benchmarks/ceval/__init__.py' 2026-04-13T12:59:54,138 adding 'evalscope/benchmarks/ceval/ceval_adapter.py' 2026-04-13T12:59:54,140 adding 'evalscope/benchmarks/chartqa/__init__.py' 2026-04-13T12:59:54,141 adding 'evalscope/benchmarks/chartqa/chartqa_adapter.py' 2026-04-13T12:59:54,143 adding 'evalscope/benchmarks/chartqa/utils.py' 2026-04-13T12:59:54,144 adding 'evalscope/benchmarks/chinese_simple_qa/__init__.py' 2026-04-13T12:59:54,146 adding 'evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py' 2026-04-13T12:59:54,148 adding 'evalscope/benchmarks/cl_bench/__init__.py' 2026-04-13T12:59:54,149 adding 'evalscope/benchmarks/cl_bench/cl_bench_adapter.py' 2026-04-13T12:59:54,151 adding 'evalscope/benchmarks/cl_bench/utils.py' 2026-04-13T12:59:54,152 adding 'evalscope/benchmarks/cmmlu/__init__.py' 2026-04-13T12:59:54,154 adding 'evalscope/benchmarks/cmmlu/cmmlu_adapter.py' 2026-04-13T12:59:54,155 adding 'evalscope/benchmarks/cmmmu/__init__.py' 2026-04-13T12:59:54,157 adding 'evalscope/benchmarks/cmmmu/cmmmu_adapter.py' 2026-04-13T12:59:54,159 adding 'evalscope/benchmarks/cmmmu/utils.py' 
2026-04-13T12:59:54,160 adding 'evalscope/benchmarks/cmmu/__init__.py' 2026-04-13T12:59:54,162 adding 'evalscope/benchmarks/cmmu/cmmu_adapter.py' 2026-04-13T12:59:54,163 adding 'evalscope/benchmarks/cmmu/prompt.py' 2026-04-13T12:59:54,165 adding 'evalscope/benchmarks/coin_flip/__init__.py' 2026-04-13T12:59:54,166 adding 'evalscope/benchmarks/coin_flip/coin_flip_adapter.py' 2026-04-13T12:59:54,168 adding 'evalscope/benchmarks/commonsense_qa/__init__.py' 2026-04-13T12:59:54,169 adding 'evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py' 2026-04-13T12:59:54,171 adding 'evalscope/benchmarks/competition_math/__init__.py' 2026-04-13T12:59:54,172 adding 'evalscope/benchmarks/competition_math/competition_math_adapter.py' 2026-04-13T12:59:54,174 adding 'evalscope/benchmarks/data_collection/__init__.py' 2026-04-13T12:59:54,176 adding 'evalscope/benchmarks/data_collection/data_collection_adapter.py' 2026-04-13T12:59:54,177 adding 'evalscope/benchmarks/docmath/__init__.py' 2026-04-13T12:59:54,179 adding 'evalscope/benchmarks/docmath/docmath_adapter.py' 2026-04-13T12:59:54,180 adding 'evalscope/benchmarks/docmath/utils.py' 2026-04-13T12:59:54,182 adding 'evalscope/benchmarks/docvqa/__init__.py' 2026-04-13T12:59:54,184 adding 'evalscope/benchmarks/docvqa/docvqa_adapter.py' 2026-04-13T12:59:54,185 adding 'evalscope/benchmarks/drivelology/__init__.py' 2026-04-13T12:59:54,187 adding 'evalscope/benchmarks/drivelology/drivelology_binary_adapter.py' 2026-04-13T12:59:54,189 adding 'evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py' 2026-04-13T12:59:54,190 adding 'evalscope/benchmarks/drivelology/drivelology_selection_adapter.py' 2026-04-13T12:59:54,192 adding 'evalscope/benchmarks/drivelology/drivelology_writing_adapter.py' 2026-04-13T12:59:54,194 adding 'evalscope/benchmarks/drop/__init__.py' 2026-04-13T12:59:54,196 adding 'evalscope/benchmarks/drop/drop_adapter.py' 2026-04-13T12:59:54,197 adding 'evalscope/benchmarks/drop/utils.py' 2026-04-13T12:59:54,199 
adding 'evalscope/benchmarks/eq_bench/__init__.py' 2026-04-13T12:59:54,201 adding 'evalscope/benchmarks/eq_bench/answer_validation.py' 2026-04-13T12:59:54,203 adding 'evalscope/benchmarks/eq_bench/eq_bench_adapter.py' 2026-04-13T12:59:54,204 adding 'evalscope/benchmarks/fleurs/__init__.py' 2026-04-13T12:59:54,206 adding 'evalscope/benchmarks/fleurs/fleurs_adapter.py' 2026-04-13T12:59:54,208 adding 'evalscope/benchmarks/frames/__init__.py' 2026-04-13T12:59:54,209 adding 'evalscope/benchmarks/frames/frames_adapter.py' 2026-04-13T12:59:54,211 adding 'evalscope/benchmarks/frames/utils.py' 2026-04-13T12:59:54,213 adding 'evalscope/benchmarks/general_arena/__init__.py' 2026-04-13T12:59:54,215 adding 'evalscope/benchmarks/general_arena/general_arena_adapter.py' 2026-04-13T12:59:54,217 adding 'evalscope/benchmarks/general_arena/requirements.txt' 2026-04-13T12:59:54,218 adding 'evalscope/benchmarks/general_arena/utils.py' 2026-04-13T12:59:54,220 adding 'evalscope/benchmarks/general_fc/__init__.py' 2026-04-13T12:59:54,222 adding 'evalscope/benchmarks/general_fc/general_fc_adapter.py' 2026-04-13T12:59:54,224 adding 'evalscope/benchmarks/general_mcq/__init__.py' 2026-04-13T12:59:54,225 adding 'evalscope/benchmarks/general_mcq/general_mcq_adapter.py' 2026-04-13T12:59:54,227 adding 'evalscope/benchmarks/general_qa/__init__.py' 2026-04-13T12:59:54,228 adding 'evalscope/benchmarks/general_qa/general_qa_adapter.py' 2026-04-13T12:59:54,229 adding 'evalscope/benchmarks/general_vmcq/__init__.py' 2026-04-13T12:59:54,231 adding 'evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py' 2026-04-13T12:59:54,232 adding 'evalscope/benchmarks/general_vqa/__init__.py' 2026-04-13T12:59:54,234 adding 'evalscope/benchmarks/general_vqa/general_vqa_adapter.py' 2026-04-13T12:59:54,235 adding 'evalscope/benchmarks/gpqa/__init__.py' 2026-04-13T12:59:54,237 adding 'evalscope/benchmarks/gpqa/gpqa_adapter.py' 2026-04-13T12:59:54,238 adding 'evalscope/benchmarks/gpqa/prompt.py' 2026-04-13T12:59:54,240 
adding 'evalscope/benchmarks/gsm8k/__init__.py' 2026-04-13T12:59:54,241 adding 'evalscope/benchmarks/gsm8k/gsm8k_adapter.py' 2026-04-13T12:59:54,243 adding 'evalscope/benchmarks/gsm8k_v/__init__.py' 2026-04-13T12:59:54,244 adding 'evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py' 2026-04-13T12:59:54,245 adding 'evalscope/benchmarks/hallusion_bench/__init__.py' 2026-04-13T12:59:54,247 adding 'evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py' 2026-04-13T12:59:54,249 adding 'evalscope/benchmarks/halu_eval/__init__.py' 2026-04-13T12:59:54,250 adding 'evalscope/benchmarks/halu_eval/halu_eval_adapter.py' 2026-04-13T12:59:54,252 adding 'evalscope/benchmarks/halu_eval/halu_eval_instructions.py' 2026-04-13T12:59:54,254 adding 'evalscope/benchmarks/healthbench/__init__.py' 2026-04-13T12:59:54,256 adding 'evalscope/benchmarks/healthbench/healthbench_adapter.py' 2026-04-13T12:59:54,257 adding 'evalscope/benchmarks/healthbench/utils.py' 2026-04-13T12:59:54,259 adding 'evalscope/benchmarks/hellaswag/__init__.py' 2026-04-13T12:59:54,261 adding 'evalscope/benchmarks/hellaswag/hellaswag_adapter.py' 2026-04-13T12:59:54,262 adding 'evalscope/benchmarks/hle/__init__.py' 2026-04-13T12:59:54,264 adding 'evalscope/benchmarks/hle/hle_adapter.py' 2026-04-13T12:59:54,266 adding 'evalscope/benchmarks/hmmt25/hmmt25_adapter.py' 2026-04-13T12:59:54,268 adding 'evalscope/benchmarks/humaneval/__init__.py' 2026-04-13T12:59:54,269 adding 'evalscope/benchmarks/humaneval/humaneval_adapter.py' 2026-04-13T12:59:54,271 adding 'evalscope/benchmarks/humaneval/utils.py' 2026-04-13T12:59:54,273 adding 'evalscope/benchmarks/humanevalplus/__init__.py' 2026-04-13T12:59:54,274 adding 'evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py' 2026-04-13T12:59:54,276 adding 'evalscope/benchmarks/humanevalplus/docker/Dockerfile' 2026-04-13T12:59:54,278 adding 'evalscope/benchmarks/ifbench/__init__.py' 2026-04-13T12:59:54,279 adding 'evalscope/benchmarks/ifbench/evaluation_lib.py' 
2026-04-13T12:59:54,281 adding 'evalscope/benchmarks/ifbench/ifbench_adapter.py' 2026-04-13T12:59:54,288 adding 'evalscope/benchmarks/ifbench/instructions.py' 2026-04-13T12:59:54,290 adding 'evalscope/benchmarks/ifbench/instructions_registry.py' 2026-04-13T12:59:54,293 adding 'evalscope/benchmarks/ifbench/instructions_util.py' 2026-04-13T12:59:54,295 adding 'evalscope/benchmarks/ifbench/requirements.txt' 2026-04-13T12:59:54,296 adding 'evalscope/benchmarks/ifeval/__init__.py' 2026-04-13T12:59:54,298 adding 'evalscope/benchmarks/ifeval/ifeval_adapter.py' 2026-04-13T12:59:54,303 adding 'evalscope/benchmarks/ifeval/instructions.py' 2026-04-13T12:59:54,305 adding 'evalscope/benchmarks/ifeval/instructions_registry.py' 2026-04-13T12:59:54,308 adding 'evalscope/benchmarks/ifeval/instructions_util.py' 2026-04-13T12:59:54,309 adding 'evalscope/benchmarks/ifeval/requirements.txt' 2026-04-13T12:59:54,311 adding 'evalscope/benchmarks/ifeval/utils.py' 2026-04-13T12:59:54,313 adding 'evalscope/benchmarks/image_edit/__init__.py' 2026-04-13T12:59:54,314 adding 'evalscope/benchmarks/image_edit/gedit/__init__.py' 2026-04-13T12:59:54,316 adding 'evalscope/benchmarks/image_edit/gedit/gedit_adapter.py' 2026-04-13T12:59:54,318 adding 'evalscope/benchmarks/image_edit/gedit/utils.py' 2026-04-13T12:59:54,320 adding 'evalscope/benchmarks/image_edit/gedit/vie_prompts.py' 2026-04-13T12:59:54,321 adding 'evalscope/benchmarks/infovqa/__init__.py' 2026-04-13T12:59:54,323 adding 'evalscope/benchmarks/infovqa/infovqa_adapter.py' 2026-04-13T12:59:54,324 adding 'evalscope/benchmarks/iquiz/__init__.py' 2026-04-13T12:59:54,325 adding 'evalscope/benchmarks/iquiz/iquiz_adapter.py' 2026-04-13T12:59:54,327 adding 'evalscope/benchmarks/librispeech/__init__.py' 2026-04-13T12:59:54,328 adding 'evalscope/benchmarks/librispeech/librispeech_adapter.py' 2026-04-13T12:59:54,330 adding 'evalscope/benchmarks/live_code_bench/__init__.py' 2026-04-13T12:59:54,331 adding 
'evalscope/benchmarks/live_code_bench/evaluate_utils.py' 2026-04-13T12:59:54,333 adding 'evalscope/benchmarks/live_code_bench/extract_utils.py' 2026-04-13T12:59:54,334 adding 'evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py' 2026-04-13T12:59:54,336 adding 'evalscope/benchmarks/live_code_bench/load_utils.py' 2026-04-13T12:59:54,337 adding 'evalscope/benchmarks/live_code_bench/pass_k_utils.py' 2026-04-13T12:59:54,339 adding 'evalscope/benchmarks/live_code_bench/prompts.py' 2026-04-13T12:59:54,341 adding 'evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py' 2026-04-13T12:59:54,343 adding 'evalscope/benchmarks/live_code_bench/testing_util.py' 2026-04-13T12:59:54,345 adding 'evalscope/benchmarks/logi_qa/__int__.py' 2026-04-13T12:59:54,346 adding 'evalscope/benchmarks/logi_qa/logi_qa_adapter.py' 2026-04-13T12:59:54,348 adding 'evalscope/benchmarks/longbench_v2/__init__.py' 2026-04-13T12:59:54,349 adding 'evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py' 2026-04-13T12:59:54,351 adding 'evalscope/benchmarks/maritime_bench/__init__.py' 2026-04-13T12:59:54,352 adding 'evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py' 2026-04-13T12:59:54,354 adding 'evalscope/benchmarks/math_500/__init__.py' 2026-04-13T12:59:54,355 adding 'evalscope/benchmarks/math_500/math_500_adapter.py' 2026-04-13T12:59:54,357 adding 'evalscope/benchmarks/math_qa/__init__.py' 2026-04-13T12:59:54,358 adding 'evalscope/benchmarks/math_qa/math_qa_adapter.py' 2026-04-13T12:59:54,360 adding 'evalscope/benchmarks/math_verse/__init__.py' 2026-04-13T12:59:54,362 adding 'evalscope/benchmarks/math_verse/math_verse_adapter.py' 2026-04-13T12:59:54,363 adding 'evalscope/benchmarks/math_vision/__init__.py' 2026-04-13T12:59:54,365 adding 'evalscope/benchmarks/math_vision/math_vision_adapter.py' 2026-04-13T12:59:54,366 adding 'evalscope/benchmarks/math_vista/__init__.py' 2026-04-13T12:59:54,368 adding 'evalscope/benchmarks/math_vista/math_vista_adapter.py' 
2026-04-13T12:59:54,370 adding 'evalscope/benchmarks/mbpp/__init__.py' 2026-04-13T12:59:54,371 adding 'evalscope/benchmarks/mbpp/mbpp_adapter.py' 2026-04-13T12:59:54,373 adding 'evalscope/benchmarks/mbppplus/__init__.py' 2026-04-13T12:59:54,375 adding 'evalscope/benchmarks/mbppplus/mbppplus_adapter.py' 2026-04-13T12:59:54,376 adding 'evalscope/benchmarks/med_mcqa/__init__.py' 2026-04-13T12:59:54,378 adding 'evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py' 2026-04-13T12:59:54,379 adding 'evalscope/benchmarks/mgsm/__init__.py' 2026-04-13T12:59:54,381 adding 'evalscope/benchmarks/mgsm/mgsm_adapter.py' 2026-04-13T12:59:54,383 adding 'evalscope/benchmarks/mia_bench/__init__.py' 2026-04-13T12:59:54,384 adding 'evalscope/benchmarks/mia_bench/mia_bench_adapter.py' 2026-04-13T12:59:54,386 adding 'evalscope/benchmarks/mia_bench/utils.py' 2026-04-13T12:59:54,387 adding 'evalscope/benchmarks/micro_vqa/__init__.py' 2026-04-13T12:59:54,389 adding 'evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py' 2026-04-13T12:59:54,391 adding 'evalscope/benchmarks/minerva_math/__init__.py' 2026-04-13T12:59:54,392 adding 'evalscope/benchmarks/minerva_math/minerva_math_adapter.py' 2026-04-13T12:59:54,394 adding 'evalscope/benchmarks/mm_bench/__init__.py' 2026-04-13T12:59:54,395 adding 'evalscope/benchmarks/mm_bench/mm_bench_adapter.py' 2026-04-13T12:59:54,397 adding 'evalscope/benchmarks/mm_star/__init__.py' 2026-04-13T12:59:54,398 adding 'evalscope/benchmarks/mm_star/mm_star_adapter.py' 2026-04-13T12:59:54,400 adding 'evalscope/benchmarks/mmlu/__init__.py' 2026-04-13T12:59:54,402 adding 'evalscope/benchmarks/mmlu/mmlu_adapter.py' 2026-04-13T12:59:54,403 adding 'evalscope/benchmarks/mmlu_pro/__init__.py' 2026-04-13T12:59:54,405 adding 'evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py' 2026-04-13T12:59:54,406 adding 'evalscope/benchmarks/mmlu_redux/__init__.py' 2026-04-13T12:59:54,408 adding 'evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py' 2026-04-13T12:59:54,410 adding 
'evalscope/benchmarks/mmmlu/__init__.py' 2026-04-13T12:59:54,411 adding 'evalscope/benchmarks/mmmlu/mmmlu_adapter.py' 2026-04-13T12:59:54,413 adding 'evalscope/benchmarks/mmmlu/prompt.py' 2026-04-13T12:59:54,414 adding 'evalscope/benchmarks/mmmu/__init__.py' 2026-04-13T12:59:54,416 adding 'evalscope/benchmarks/mmmu/mmmu_adapter.py' 2026-04-13T12:59:54,418 adding 'evalscope/benchmarks/mmmu_pro/__init__.py' 2026-04-13T12:59:54,419 adding 'evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py' 2026-04-13T12:59:54,421 adding 'evalscope/benchmarks/mri_mcqa/__init__.py' 2026-04-13T12:59:54,422 adding 'evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py' 2026-04-13T12:59:54,424 adding 'evalscope/benchmarks/multi_if/__init__.py' 2026-04-13T12:59:54,432 adding 'evalscope/benchmarks/multi_if/ifeval.py' 2026-04-13T12:59:54,434 adding 'evalscope/benchmarks/multi_if/metrics.py' 2026-04-13T12:59:54,436 adding 'evalscope/benchmarks/multi_if/multi_if_adapter.py' 2026-04-13T12:59:54,437 adding 'evalscope/benchmarks/multi_if/requirements.txt' 2026-04-13T12:59:54,438 adding 'evalscope/benchmarks/multipl_e/__init__.py' 2026-04-13T12:59:54,440 adding 'evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py' 2026-04-13T12:59:54,442 adding 'evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py' 2026-04-13T12:59:54,443 adding 'evalscope/benchmarks/multipl_e/utils.py' 2026-04-13T12:59:54,445 adding 'evalscope/benchmarks/music_trivia/__init__.py' 2026-04-13T12:59:54,446 adding 'evalscope/benchmarks/music_trivia/music_trivia_adapter.py' 2026-04-13T12:59:54,448 adding 'evalscope/benchmarks/musr/__init__.py' 2026-04-13T12:59:54,449 adding 'evalscope/benchmarks/musr/musr_adapter.py' 2026-04-13T12:59:54,451 adding 'evalscope/benchmarks/needle_haystack/__init__.py' 2026-04-13T12:59:54,454 adding 'evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py' 2026-04-13T12:59:54,455 adding 'evalscope/benchmarks/needle_haystack/requirements.txt' 2026-04-13T12:59:54,456 adding 
'evalscope/benchmarks/needle_haystack/utils.py' 2026-04-13T12:59:54,459 adding 'evalscope/benchmarks/ner/__init__.py' 2026-04-13T12:59:54,460 adding 'evalscope/benchmarks/ner/anat_em_adapter.py' 2026-04-13T12:59:54,461 adding 'evalscope/benchmarks/ner/bc2gm_adapter.py' 2026-04-13T12:59:54,463 adding 'evalscope/benchmarks/ner/bc4chemd_adapter.py' 2026-04-13T12:59:54,464 adding 'evalscope/benchmarks/ner/bc5cdr_adapter.py' 2026-04-13T12:59:54,465 adding 'evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py' 2026-04-13T12:59:54,467 adding 'evalscope/benchmarks/ner/conll2003_adapter.py' 2026-04-13T12:59:54,468 adding 'evalscope/benchmarks/ner/conllpp_adapter.py' 2026-04-13T12:59:54,470 adding 'evalscope/benchmarks/ner/copious_adapter.py' 2026-04-13T12:59:54,472 adding 'evalscope/benchmarks/ner/cross_ner_adapter.py' 2026-04-13T12:59:54,473 adding 'evalscope/benchmarks/ner/fin_ner_adapter.py' 2026-04-13T12:59:54,474 adding 'evalscope/benchmarks/ner/genia_ner_adapter.py' 2026-04-13T12:59:54,476 adding 'evalscope/benchmarks/ner/harvey_ner_adapter.py' 2026-04-13T12:59:54,477 adding 'evalscope/benchmarks/ner/jnlpba_adapter.py' 2026-04-13T12:59:54,479 adding 'evalscope/benchmarks/ner/jnlpba_rare_adapter.py' 2026-04-13T12:59:54,480 adding 'evalscope/benchmarks/ner/mit_movie_trivia_adapter.py' 2026-04-13T12:59:54,482 adding 'evalscope/benchmarks/ner/mit_restaurant_adapter.py' 2026-04-13T12:59:54,483 adding 'evalscope/benchmarks/ner/multi_nerd_adapter.py' 2026-04-13T12:59:54,484 adding 'evalscope/benchmarks/ner/ncbi_adapter.py' 2026-04-13T12:59:54,486 adding 'evalscope/benchmarks/ner/ontonotes5_adapter.py' 2026-04-13T12:59:54,487 adding 'evalscope/benchmarks/ner/tweebank_ner_adapter.py' 2026-04-13T12:59:54,488 adding 'evalscope/benchmarks/ner/tweet_ner_7_adapter.py' 2026-04-13T12:59:54,489 adding 'evalscope/benchmarks/ner/wnut2017_adapter.py' 2026-04-13T12:59:54,491 adding 'evalscope/benchmarks/ner/cross_ner_entities/__init__.py' 2026-04-13T12:59:54,492 adding 
'evalscope/benchmarks/ner/cross_ner_entities/ai.py' 2026-04-13T12:59:54,494 adding 'evalscope/benchmarks/ner/cross_ner_entities/literature.py' 2026-04-13T12:59:54,495 adding 'evalscope/benchmarks/ner/cross_ner_entities/music.py' 2026-04-13T12:59:54,496 adding 'evalscope/benchmarks/ner/cross_ner_entities/politics.py' 2026-04-13T12:59:54,498 adding 'evalscope/benchmarks/ner/cross_ner_entities/science.py' 2026-04-13T12:59:54,499 adding 'evalscope/benchmarks/ocr_bench/__init__.py' 2026-04-13T12:59:54,501 adding 'evalscope/benchmarks/ocr_bench/requirements.txt' 2026-04-13T12:59:54,502 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py' 2026-04-13T12:59:54,504 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py' 2026-04-13T12:59:54,505 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py' 2026-04-13T12:59:54,510 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py' 2026-04-13T12:59:54,511 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py' 2026-04-13T12:59:54,513 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py' 2026-04-13T12:59:54,514 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py' 2026-04-13T12:59:54,515 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py' 2026-04-13T12:59:54,517 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py' 2026-04-13T12:59:54,519 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py' 2026-04-13T12:59:54,520 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py' 2026-04-13T12:59:54,522 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py' 2026-04-13T12:59:54,524 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt' 2026-04-13T12:59:54,526 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py' 2026-04-13T12:59:54,529 adding 
'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py' 2026-04-13T12:59:54,531 adding 'evalscope/benchmarks/olympiad_bench/__init__.py' 2026-04-13T12:59:54,533 adding 'evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py' 2026-04-13T12:59:54,534 adding 'evalscope/benchmarks/olympiad_bench/requirements.txt' 2026-04-13T12:59:54,537 adding 'evalscope/benchmarks/olympiad_bench/utils.py' 2026-04-13T12:59:54,539 adding 'evalscope/benchmarks/omni_bench/__init__.py' 2026-04-13T12:59:54,540 adding 'evalscope/benchmarks/omni_bench/omni_bench_adapter.py' 2026-04-13T12:59:54,542 adding 'evalscope/benchmarks/omnidoc_bench/__init__.py' 2026-04-13T12:59:54,545 adding 'evalscope/benchmarks/omnidoc_bench/end2end_eval.py' 2026-04-13T12:59:54,548 adding 'evalscope/benchmarks/omnidoc_bench/metrics.py' 2026-04-13T12:59:54,551 adding 'evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py' 2026-04-13T12:59:54,553 adding 'evalscope/benchmarks/omnidoc_bench/requirements.txt' 2026-04-13T12:59:54,562 adding 'evalscope/benchmarks/omnidoc_bench/utils.py' 2026-04-13T12:59:54,564 adding 'evalscope/benchmarks/openai_mrcr/__init__.py' 2026-04-13T12:59:54,566 adding 'evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py' 2026-04-13T12:59:54,567 adding 'evalscope/benchmarks/openai_mrcr/requirements.txt' 2026-04-13T12:59:54,568 adding 'evalscope/benchmarks/openai_mrcr/utils.py' 2026-04-13T12:59:54,570 adding 'evalscope/benchmarks/piqa/__init__.py' 2026-04-13T12:59:54,572 adding 'evalscope/benchmarks/piqa/piqa_adapter.py' 2026-04-13T12:59:54,573 adding 'evalscope/benchmarks/poly_math/__init__.py' 2026-04-13T12:59:54,575 adding 'evalscope/benchmarks/poly_math/poly_math_adapter.py' 2026-04-13T12:59:54,577 adding 'evalscope/benchmarks/poly_math/utils/instruction.py' 2026-04-13T12:59:54,579 adding 'evalscope/benchmarks/pope/__init__.py' 2026-04-13T12:59:54,580 adding 'evalscope/benchmarks/pope/pope_adapter.py' 2026-04-13T12:59:54,582 adding 
'evalscope/benchmarks/process_bench/__init__.py' 2026-04-13T12:59:54,583 adding 'evalscope/benchmarks/process_bench/process_bench_adapter.py' 2026-04-13T12:59:54,585 adding 'evalscope/benchmarks/pumed_qa/__init__.py' 2026-04-13T12:59:54,587 adding 'evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py' 2026-04-13T12:59:54,588 adding 'evalscope/benchmarks/qasc/__init__.py' 2026-04-13T12:59:54,590 adding 'evalscope/benchmarks/qasc/qasc_adapter.py' 2026-04-13T12:59:54,591 adding 'evalscope/benchmarks/race/__init__.py' 2026-04-13T12:59:54,593 adding 'evalscope/benchmarks/race/race_adapter.py' 2026-04-13T12:59:54,594 adding 'evalscope/benchmarks/real_world_qa/__init__.py' 2026-04-13T12:59:54,596 adding 'evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py' 2026-04-13T12:59:54,597 adding 'evalscope/benchmarks/refcoco/__init__.py' 2026-04-13T12:59:54,599 adding 'evalscope/benchmarks/refcoco/evaluation_lib.py' 2026-04-13T12:59:54,601 adding 'evalscope/benchmarks/refcoco/refcoco_adapter.py' 2026-04-13T12:59:54,602 adding 'evalscope/benchmarks/refcoco/requirements.txt' 2026-04-13T12:59:54,603 adding 'evalscope/benchmarks/refcoco/utils.py' 2026-04-13T12:59:54,605 adding 'evalscope/benchmarks/scicode/__init__.py' 2026-04-13T12:59:54,606 adding 'evalscope/benchmarks/scicode/prompt_templates.py' 2026-04-13T12:59:54,608 adding 'evalscope/benchmarks/scicode/scicode_adapter.py' 2026-04-13T12:59:54,610 adding 'evalscope/benchmarks/scicode/util.py' 2026-04-13T12:59:54,611 adding 'evalscope/benchmarks/scicode/docker/Dockerfile' 2026-04-13T12:59:54,613 adding 'evalscope/benchmarks/scicode/docker/docker_requirements.txt' 2026-04-13T12:59:54,614 adding 'evalscope/benchmarks/scicode/docker/process_data.py' 2026-04-13T12:59:54,615 adding 'evalscope/benchmarks/scicode/docker/test_util.py' 2026-04-13T12:59:54,617 adding 'evalscope/benchmarks/science_qa/__init__.py' 2026-04-13T12:59:54,618 adding 'evalscope/benchmarks/science_qa/science_qa_adapter.py' 2026-04-13T12:59:54,620 adding 
'evalscope/benchmarks/sciq/__init__.py' 2026-04-13T12:59:54,622 adding 'evalscope/benchmarks/sciq/sciq_adapter.py' 2026-04-13T12:59:54,623 adding 'evalscope/benchmarks/seed_bench_2_plus/__init__.py' 2026-04-13T12:59:54,625 adding 'evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py' 2026-04-13T12:59:54,626 adding 'evalscope/benchmarks/simple_qa/__init__.py' 2026-04-13T12:59:54,628 adding 'evalscope/benchmarks/simple_qa/simple_qa_adapter.py' 2026-04-13T12:59:54,630 adding 'evalscope/benchmarks/simple_vqa/__init__.py' 2026-04-13T12:59:54,632 adding 'evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py' 2026-04-13T12:59:54,634 adding 'evalscope/benchmarks/siqa/__init__.py' 2026-04-13T12:59:54,635 adding 'evalscope/benchmarks/siqa/siqa_adapter.py' 2026-04-13T12:59:54,637 adding 'evalscope/benchmarks/super_gpqa/__init__.py' 2026-04-13T12:59:54,638 adding 'evalscope/benchmarks/super_gpqa/prompt.py' 2026-04-13T12:59:54,640 adding 'evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py' 2026-04-13T12:59:54,641 adding 'evalscope/benchmarks/super_gpqa/utils.py' 2026-04-13T12:59:54,643 adding 'evalscope/benchmarks/swe_bench/__init__.py' 2026-04-13T12:59:54,645 adding 'evalscope/benchmarks/swe_bench/build_images.py' 2026-04-13T12:59:54,646 adding 'evalscope/benchmarks/swe_bench/requirements.txt' 2026-04-13T12:59:54,648 adding 'evalscope/benchmarks/swe_bench/swe_bench_adapter.py' 2026-04-13T12:59:54,650 adding 'evalscope/benchmarks/swe_bench/utils.py' 2026-04-13T12:59:54,651 adding 'evalscope/benchmarks/tau_bench/__init__.py' 2026-04-13T12:59:54,653 adding 'evalscope/benchmarks/tau_bench/tau2_bench/__init__.py' 2026-04-13T12:59:54,654 adding 'evalscope/benchmarks/tau_bench/tau2_bench/generation.py' 2026-04-13T12:59:54,655 adding 'evalscope/benchmarks/tau_bench/tau2_bench/requirements.txt' 2026-04-13T12:59:54,657 adding 'evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py' 2026-04-13T12:59:54,658 adding 
'evalscope/benchmarks/tau_bench/tau_bench/__init__.py' 2026-04-13T12:59:54,660 adding 'evalscope/benchmarks/tau_bench/tau_bench/generation.py' 2026-04-13T12:59:54,661 adding 'evalscope/benchmarks/tau_bench/tau_bench/requirements.txt' 2026-04-13T12:59:54,663 adding 'evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py' 2026-04-13T12:59:54,664 adding 'evalscope/benchmarks/terminal_bench/__init__.py' 2026-04-13T12:59:54,665 adding 'evalscope/benchmarks/terminal_bench/requirements.txt' 2026-04-13T12:59:54,667 adding 'evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py' 2026-04-13T12:59:54,668 adding 'evalscope/benchmarks/terminal_bench/utils.py' 2026-04-13T12:59:54,670 adding 'evalscope/benchmarks/text2image/__init__.py' 2026-04-13T12:59:54,672 adding 'evalscope/benchmarks/text2image/evalmuse_adapter.py' 2026-04-13T12:59:54,673 adding 'evalscope/benchmarks/text2image/genai_bench_adapter.py' 2026-04-13T12:59:54,674 adding 'evalscope/benchmarks/text2image/general_t2i_adapter.py' 2026-04-13T12:59:54,675 adding 'evalscope/benchmarks/text2image/hpdv2_adapter.py' 2026-04-13T12:59:54,677 adding 'evalscope/benchmarks/text2image/tifa_adapter.py' 2026-04-13T12:59:54,678 adding 'evalscope/benchmarks/tool_bench/__init__.py' 2026-04-13T12:59:54,680 adding 'evalscope/benchmarks/tool_bench/tool_bench_adapter.py' 2026-04-13T12:59:54,681 adding 'evalscope/benchmarks/tool_bench/utils.py' 2026-04-13T12:59:54,683 adding 'evalscope/benchmarks/torgo/__init__.py' 2026-04-13T12:59:54,684 adding 'evalscope/benchmarks/torgo/requirements.txt' 2026-04-13T12:59:54,686 adding 'evalscope/benchmarks/torgo/torgo_adapter.py' 2026-04-13T12:59:54,687 adding 'evalscope/benchmarks/trivia_qa/__init__.py' 2026-04-13T12:59:54,689 adding 'evalscope/benchmarks/trivia_qa/samples.jsonl' 2026-04-13T12:59:54,690 adding 'evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py' 2026-04-13T12:59:54,692 adding 'evalscope/benchmarks/truthful_qa/__init__.py' 2026-04-13T12:59:54,694 adding 
'evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py' 2026-04-13T12:59:54,696 adding 'evalscope/benchmarks/visu_logic/__init__.py' 2026-04-13T12:59:54,698 adding 'evalscope/benchmarks/visu_logic/visu_logic_adapter.py' 2026-04-13T12:59:54,700 adding 'evalscope/benchmarks/vstar_bench/__init__.py' 2026-04-13T12:59:54,702 adding 'evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py' 2026-04-13T12:59:54,704 adding 'evalscope/benchmarks/winogrande/__init__.py' 2026-04-13T12:59:54,706 adding 'evalscope/benchmarks/winogrande/winogrande_adapter.py' 2026-04-13T12:59:54,708 adding 'evalscope/benchmarks/wmt/__init__.py' 2026-04-13T12:59:54,709 adding 'evalscope/benchmarks/wmt/requirements.txt' 2026-04-13T12:59:54,711 adding 'evalscope/benchmarks/wmt/wmt24_adapter.py' 2026-04-13T12:59:54,715 adding 'evalscope/benchmarks/zebralogicbench/__init__.py' 2026-04-13T12:59:54,717 adding 'evalscope/benchmarks/zebralogicbench/utils.py' 2026-04-13T12:59:54,720 adding 'evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py' 2026-04-13T12:59:54,722 adding 'evalscope/benchmarks/zerobench/__init__.py' 2026-04-13T12:59:54,724 adding 'evalscope/benchmarks/zerobench/zerobench_adapter.py' 2026-04-13T12:59:54,726 adding 'evalscope/cli/__init__.py' 2026-04-13T12:59:54,728 adding 'evalscope/cli/base.py' 2026-04-13T12:59:54,731 adding 'evalscope/cli/benchmark_info.py' 2026-04-13T12:59:54,732 adding 'evalscope/cli/cli.py' 2026-04-13T12:59:54,734 adding 'evalscope/cli/start_app.py' 2026-04-13T12:59:54,736 adding 'evalscope/cli/start_eval.py' 2026-04-13T12:59:54,737 adding 'evalscope/cli/start_perf.py' 2026-04-13T12:59:54,739 adding 'evalscope/cli/start_service.py' 2026-04-13T12:59:54,741 adding 'evalscope/collections/__init__.py' 2026-04-13T12:59:54,743 adding 'evalscope/collections/sampler.py' 2026-04-13T12:59:54,745 adding 'evalscope/collections/schema.py' 2026-04-13T12:59:54,747 adding 'evalscope/evaluator/__init__.py' 2026-04-13T12:59:54,749 adding 
'evalscope/evaluator/batch_reviewer.py' 2026-04-13T12:59:54,753 adding 'evalscope/evaluator/evaluator.py' 2026-04-13T12:59:54,755 adding 'evalscope/filters/__init__.py' 2026-04-13T12:59:54,757 adding 'evalscope/filters/extraction.py' 2026-04-13T12:59:54,758 adding 'evalscope/filters/selection.py' 2026-04-13T12:59:54,761 adding 'evalscope/metrics/__init__.py' 2026-04-13T12:59:54,763 adding 'evalscope/metrics/llm_judge.py' 2026-04-13T12:59:54,766 adding 'evalscope/metrics/math_parser.py' 2026-04-13T12:59:54,770 adding 'evalscope/metrics/metric.py' 2026-04-13T12:59:54,774 adding 'evalscope/metrics/metrics.py' 2026-04-13T12:59:54,776 adding 'evalscope/metrics/rouge_metric.py' 2026-04-13T12:59:54,778 adding 'evalscope/metrics/bert_score/__init__.py' 2026-04-13T12:59:54,781 adding 'evalscope/metrics/bert_score/scorer.py' 2026-04-13T12:59:54,785 adding 'evalscope/metrics/bert_score/utils.py' 2026-04-13T12:59:54,788 adding 'evalscope/metrics/bundled_rouge_score/__init__.py' 2026-04-13T12:59:54,791 adding 'evalscope/metrics/bundled_rouge_score/rouge_scorer.py' 2026-04-13T12:59:54,793 adding 'evalscope/metrics/sem_score/__init__.py' 2026-04-13T12:59:54,794 adding 'evalscope/metrics/sem_score/scorer.py' 2026-04-13T12:59:54,796 adding 'evalscope/metrics/t2v_metrics/__init__.py' 2026-04-13T12:59:54,797 adding 'evalscope/metrics/t2v_metrics/clipscore.py' 2026-04-13T12:59:54,799 adding 'evalscope/metrics/t2v_metrics/constants.py' 2026-04-13T12:59:54,800 adding 'evalscope/metrics/t2v_metrics/itmscore.py' 2026-04-13T12:59:54,801 adding 'evalscope/metrics/t2v_metrics/score.py' 2026-04-13T12:59:54,802 adding 'evalscope/metrics/t2v_metrics/vqascore.py' 2026-04-13T12:59:54,804 adding 'evalscope/metrics/t2v_metrics/models/__init__.py' 2026-04-13T12:59:54,805 adding 'evalscope/metrics/t2v_metrics/models/model.py' 2026-04-13T12:59:54,807 adding 'evalscope/metrics/t2v_metrics/models/utils.py' 2026-04-13T12:59:54,808 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py' 
2026-04-13T12:59:54,810 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py' 2026-04-13T12:59:54,812 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py' 2026-04-13T12:59:54,813 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py' 2026-04-13T12:59:54,815 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py' 2026-04-13T12:59:54,816 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py' 2026-04-13T12:59:54,817 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py' 2026-04-13T12:59:54,819 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py' 2026-04-13T12:59:54,820 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py' 2026-04-13T12:59:54,822 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py' 2026-04-13T12:59:54,824 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py' 2026-04-13T12:59:54,825 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py' 2026-04-13T12:59:54,827 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py' 2026-04-13T12:59:54,829 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py' 2026-04-13T12:59:54,830 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py' 2026-04-13T12:59:54,832 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py' 2026-04-13T12:59:54,833 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py' 2026-04-13T12:59:54,835 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py' 2026-04-13T12:59:54,837 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py' 2026-04-13T12:59:54,838 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py' 2026-04-13T12:59:54,839 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py' 2026-04-13T12:59:54,841 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py' 2026-04-13T12:59:54,842 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py' 2026-04-13T12:59:54,845 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py' 2026-04-13T12:59:54,847 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py' 2026-04-13T12:59:54,848 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py' 2026-04-13T12:59:54,850 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py' 2026-04-13T12:59:54,851 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py' 2026-04-13T12:59:54,854 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py' 2026-04-13T12:59:54,856 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py' 2026-04-13T12:59:54,857 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py' 2026-04-13T12:59:54,858 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py' 2026-04-13T12:59:54,860 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py' 2026-04-13T12:59:54,861 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py' 2026-04-13T12:59:54,863 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py' 2026-04-13T12:59:54,865 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py' 2026-04-13T12:59:54,867 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py' 2026-04-13T12:59:54,869 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py' 2026-04-13T12:59:54,870 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml' 2026-04-13T12:59:54,872 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json' 2026-04-13T12:59:54,873 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json' 2026-04-13T12:59:54,875 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json' 2026-04-13T12:59:54,876 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml' 2026-04-13T12:59:54,878 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml' 2026-04-13T12:59:54,879 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml' 2026-04-13T12:59:54,880 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml' 2026-04-13T12:59:54,882 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml' 2026-04-13T12:59:54,883 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml' 2026-04-13T12:59:54,884 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml' 2026-04-13T12:59:54,886 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml' 2026-04-13T12:59:54,887 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml' 2026-04-13T12:59:54,888 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml' 2026-04-13T12:59:54,889 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml' 2026-04-13T12:59:54,890 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml' 2026-04-13T12:59:54,891 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml' 2026-04-13T12:59:54,892 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml' 2026-04-13T12:59:54,894 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml' 2026-04-13T12:59:54,895 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml' 2026-04-13T12:59:54,896 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml' 2026-04-13T12:59:54,897 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml' 2026-04-13T12:59:54,899 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml' 2026-04-13T12:59:54,901 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py' 2026-04-13T12:59:54,903 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py' 2026-04-13T12:59:54,905 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py' 2026-04-13T12:59:54,907 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py' 2026-04-13T12:59:54,913 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py' 
2026-04-13T12:59:54,915 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py' 2026-04-13T12:59:54,921 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py' 2026-04-13T12:59:54,922 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py' 2026-04-13T12:59:54,924 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py' 2026-04-13T12:59:54,925 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py' 2026-04-13T12:59:54,928 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py' 2026-04-13T12:59:54,930 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py' 2026-04-13T12:59:54,933 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py' 2026-04-13T12:59:54,935 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py' 2026-04-13T12:59:54,939 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py' 2026-04-13T12:59:54,948 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py' 2026-04-13T12:59:54,950 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py' 2026-04-13T12:59:54,952 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py' 2026-04-13T12:59:54,953 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py' 2026-04-13T12:59:54,955 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py' 2026-04-13T12:59:54,957 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py' 2026-04-13T12:59:54,958 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py' 2026-04-13T12:59:54,960 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py' 2026-04-13T12:59:54,961 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py' 2026-04-13T12:59:54,963 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py' 2026-04-13T12:59:54,965 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py' 2026-04-13T12:59:54,970 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py' 2026-04-13T12:59:54,972 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py' 2026-04-13T12:59:54,973 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py' 2026-04-13T12:59:54,975 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py' 2026-04-13T12:59:54,977 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py' 2026-04-13T12:59:54,978 adding 'evalscope/metrics/text_normalizer/__init__.py' 2026-04-13T12:59:54,980 adding 'evalscope/metrics/text_normalizer/basic.py' 2026-04-13T12:59:54,986 adding 'evalscope/metrics/text_normalizer/chinese.py' 2026-04-13T12:59:54,995 adding 'evalscope/metrics/text_normalizer/english.json' 2026-04-13T12:59:54,998 adding 'evalscope/metrics/text_normalizer/english.py' 2026-04-13T12:59:55,000 adding 'evalscope/metrics/text_normalizer/wer.py' 2026-04-13T12:59:55,002 adding 'evalscope/models/__init__.py' 2026-04-13T12:59:55,004 adding 'evalscope/models/anthropic_compatible.py' 2026-04-13T12:59:55,006 adding 
'evalscope/models/image_edit_model.py' 2026-04-13T12:59:55,008 adding 'evalscope/models/mockllm.py' 2026-04-13T12:59:55,009 adding 'evalscope/models/model_apis.py' 2026-04-13T12:59:55,012 adding 'evalscope/models/modelscope.py' 2026-04-13T12:59:55,014 adding 'evalscope/models/openai_compatible.py' 2026-04-13T12:59:55,015 adding 'evalscope/models/text2image_model.py' 2026-04-13T12:59:55,019 adding 'evalscope/models/utils/anthropic.py' 2026-04-13T12:59:55,022 adding 'evalscope/models/utils/openai.py' 2026-04-13T12:59:55,024 adding 'evalscope/perf/__init__.py' 2026-04-13T12:59:55,027 adding 'evalscope/perf/arguments.py' 2026-04-13T12:59:55,029 adding 'evalscope/perf/benchmark.py' 2026-04-13T12:59:55,030 adding 'evalscope/perf/http_client.py' 2026-04-13T12:59:55,032 adding 'evalscope/perf/main.py' 2026-04-13T12:59:55,034 adding 'evalscope/perf/plugin/__init__.py' 2026-04-13T12:59:55,035 adding 'evalscope/perf/plugin/registry.py' 2026-04-13T12:59:55,037 adding 'evalscope/perf/plugin/api/__init__.py' 2026-04-13T12:59:55,038 adding 'evalscope/perf/plugin/api/base.py' 2026-04-13T12:59:55,040 adding 'evalscope/perf/plugin/api/custom_api.py' 2026-04-13T12:59:55,042 adding 'evalscope/perf/plugin/api/dashscope_api.py' 2026-04-13T12:59:55,044 adding 'evalscope/perf/plugin/api/default_api.py' 2026-04-13T12:59:55,046 adding 'evalscope/perf/plugin/api/openai_api.py' 2026-04-13T12:59:55,048 adding 'evalscope/perf/plugin/api/openai_embedding_api.py' 2026-04-13T12:59:55,051 adding 'evalscope/perf/plugin/api/openai_rerank_api.py' 2026-04-13T12:59:55,053 adding 'evalscope/perf/plugin/datasets/__init__.py' 2026-04-13T12:59:55,054 adding 'evalscope/perf/plugin/datasets/base.py' 2026-04-13T12:59:55,056 adding 'evalscope/perf/plugin/datasets/custom.py' 2026-04-13T12:59:55,057 adding 'evalscope/perf/plugin/datasets/embedding_dataset.py' 2026-04-13T12:59:55,059 adding 'evalscope/perf/plugin/datasets/flickr8k.py' 2026-04-13T12:59:55,060 adding 'evalscope/perf/plugin/datasets/kontext_bench.py' 
2026-04-13T12:59:55,061 adding 'evalscope/perf/plugin/datasets/line_by_line.py' 2026-04-13T12:59:55,062 adding 'evalscope/perf/plugin/datasets/longalpaca.py' 2026-04-13T12:59:55,064 adding 'evalscope/perf/plugin/datasets/openqa.py' 2026-04-13T12:59:55,065 adding 'evalscope/perf/plugin/datasets/random_dataset.py' 2026-04-13T12:59:55,067 adding 'evalscope/perf/plugin/datasets/random_vl_dataset.py' 2026-04-13T12:59:55,068 adding 'evalscope/perf/plugin/datasets/rerank_dataset.py' 2026-04-13T12:59:55,070 adding 'evalscope/perf/plugin/datasets/speed_benchmark.py' 2026-04-13T12:59:55,071 adding 'evalscope/perf/plugin/datasets/utils.py' 2026-04-13T12:59:55,073 adding 'evalscope/perf/sla/__init__.py' 2026-04-13T12:59:55,074 adding 'evalscope/perf/sla/sla_criterion.py' 2026-04-13T12:59:55,076 adding 'evalscope/perf/sla/sla_run.py' 2026-04-13T12:59:55,078 adding 'evalscope/perf/utils/__init__.py' 2026-04-13T12:59:55,080 adding 'evalscope/perf/utils/analysis_result.py' 2026-04-13T12:59:55,081 adding 'evalscope/perf/utils/benchmark_util.py' 2026-04-13T12:59:55,084 adding 'evalscope/perf/utils/db_util.py' 2026-04-13T12:59:55,085 adding 'evalscope/perf/utils/handler.py' 2026-04-13T12:59:55,087 adding 'evalscope/perf/utils/local_server.py' 2026-04-13T12:59:55,089 adding 'evalscope/perf/utils/log_utils.py' 2026-04-13T12:59:55,091 adding 'evalscope/perf/utils/rich_display.py' 2026-04-13T12:59:55,093 adding 'evalscope/perf/utils/report/__init__.py' 2026-04-13T12:59:55,096 adding 'evalscope/perf/utils/report/generate_report.py' 2026-04-13T12:59:55,098 adding 'evalscope/perf/utils/report/perf_charts.py' 2026-04-13T12:59:55,100 adding 'evalscope/perf/utils/report/perf_data.py' 2026-04-13T12:59:55,102 adding 'evalscope/report/__init__.py' 2026-04-13T12:59:55,104 adding 'evalscope/report/combinator.py' 2026-04-13T12:59:55,105 adding 'evalscope/report/generator.py' 2026-04-13T12:59:55,107 adding 'evalscope/report/renderer.py' 2026-04-13T12:59:55,109 adding 'evalscope/report/report.py' 
2026-04-13T12:59:55,111 adding 'evalscope/report/template/perf_report.html.j2' 2026-04-13T12:59:55,113 adding 'evalscope/report/template/report.html.j2' 2026-04-13T12:59:55,116 adding 'evalscope/report/template/css/base.css' 2026-04-13T12:59:55,117 adding 'evalscope/report/template/css/perf_extra.css' 2026-04-13T12:59:55,119 adding 'evalscope/report/template/js/eval_extra.js' 2026-04-13T12:59:55,121 adding 'evalscope/report/template/js/i18n_eval.js' 2026-04-13T12:59:55,123 adding 'evalscope/report/template/js/i18n_perf.js' 2026-04-13T12:59:55,124 adding 'evalscope/report/template/js/perf_extra.js' 2026-04-13T12:59:55,126 adding 'evalscope/report/template/js/shared.js' 2026-04-13T12:59:55,128 adding 'evalscope/report/template/partials/brand_logo.html' 2026-04-13T12:59:55,129 adding 'evalscope/report/template/partials/footer.html' 2026-04-13T12:59:55,130 adding 'evalscope/report/template/partials/header_eval.html' 2026-04-13T12:59:55,132 adding 'evalscope/report/template/partials/header_perf.html' 2026-04-13T12:59:55,133 adding 'evalscope/report/template/partials/toc_eval.html' 2026-04-13T12:59:55,135 adding 'evalscope/report/template/partials/toc_perf.html' 2026-04-13T12:59:55,137 adding 'evalscope/sandbox/__init__.py' 2026-04-13T12:59:55,139 adding 'evalscope/sandbox/volcengine.py' 2026-04-13T12:59:55,141 adding 'evalscope/service/__init__.py' 2026-04-13T12:59:55,142 adding 'evalscope/service/app.py' 2026-04-13T12:59:55,144 adding 'evalscope/service/blueprints/__init__.py' 2026-04-13T12:59:55,146 adding 'evalscope/service/blueprints/eval.py' 2026-04-13T12:59:55,147 adding 'evalscope/service/blueprints/perf.py' 2026-04-13T12:59:55,149 adding 'evalscope/service/frontend/__init__.py' 2026-04-13T12:59:55,151 adding 'evalscope/service/frontend/async_client.py' 2026-04-13T12:59:55,152 adding 'evalscope/service/frontend/main.py' 2026-04-13T12:59:55,154 adding 'evalscope/service/frontend/utils.py' 2026-04-13T12:59:55,156 adding 'evalscope/service/utils/__init__.py' 
2026-04-13T12:59:55,157 adding 'evalscope/service/utils/benchmarks.py' 2026-04-13T12:59:55,159 adding 'evalscope/service/utils/log.py' 2026-04-13T12:59:55,161 adding 'evalscope/service/utils/process.py' 2026-04-13T12:59:55,162 adding 'evalscope/summarizer/__init__.py' 2026-04-13T12:59:55,164 adding 'evalscope/summarizer/summarizer.py' 2026-04-13T12:59:55,165 adding 'evalscope/third_party/__init__.py' 2026-04-13T12:59:55,167 adding 'evalscope/third_party/longbench_write/README.md' 2026-04-13T12:59:55,168 adding 'evalscope/third_party/longbench_write/__init__.py' 2026-04-13T12:59:55,170 adding 'evalscope/third_party/longbench_write/default_task.json' 2026-04-13T12:59:55,171 adding 'evalscope/third_party/longbench_write/default_task.yaml' 2026-04-13T12:59:55,173 adding 'evalscope/third_party/longbench_write/eval.py' 2026-04-13T12:59:55,174 adding 'evalscope/third_party/longbench_write/infer.py' 2026-04-13T12:59:55,176 adding 'evalscope/third_party/longbench_write/longbench_write.py' 2026-04-13T12:59:55,177 adding 'evalscope/third_party/longbench_write/utils.py' 2026-04-13T12:59:55,178 adding 'evalscope/third_party/longbench_write/resources/__init__.py' 2026-04-13T12:59:55,180 adding 'evalscope/third_party/longbench_write/resources/judge.txt' 2026-04-13T12:59:55,187 adding 'evalscope/third_party/longbench_write/resources/longbench_write.jsonl' 2026-04-13T12:59:55,191 adding 'evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl' 2026-04-13T12:59:55,192 adding 'evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl' 2026-04-13T12:59:55,194 adding 'evalscope/third_party/longbench_write/tools/__init__.py' 2026-04-13T12:59:55,196 adding 'evalscope/third_party/longbench_write/tools/data_etl.py' 2026-04-13T12:59:55,197 adding 'evalscope/third_party/longbench_write/tools/openai_api.py' 2026-04-13T12:59:55,199 adding 'evalscope/third_party/thinkbench/__init__.py' 2026-04-13T12:59:55,202 adding 'evalscope/third_party/thinkbench/eval.py' 
2026-04-13T12:59:55,203 adding 'evalscope/third_party/thinkbench/infer.py' 2026-04-13T12:59:55,205 adding 'evalscope/third_party/thinkbench/resources/critique_template.txt' 2026-04-13T12:59:55,206 adding 'evalscope/third_party/thinkbench/resources/reformat_template.txt' 2026-04-13T12:59:55,208 adding 'evalscope/third_party/thinkbench/tools/__init__.py' 2026-04-13T12:59:55,209 adding 'evalscope/third_party/thinkbench/tools/llm.py' 2026-04-13T12:59:55,211 adding 'evalscope/third_party/thinkbench/tools/utils.py' 2026-04-13T12:59:55,213 adding 'evalscope/third_party/toolbench_static/README.md' 2026-04-13T12:59:55,214 adding 'evalscope/third_party/toolbench_static/__init__.py' 2026-04-13T12:59:55,215 adding 'evalscope/third_party/toolbench_static/config_default.json' 2026-04-13T12:59:55,216 adding 'evalscope/third_party/toolbench_static/config_default.yaml' 2026-04-13T12:59:55,218 adding 'evalscope/third_party/toolbench_static/eval.py' 2026-04-13T12:59:55,220 adding 'evalscope/third_party/toolbench_static/infer.py' 2026-04-13T12:59:55,221 adding 'evalscope/third_party/toolbench_static/requirements.txt' 2026-04-13T12:59:55,222 adding 'evalscope/third_party/toolbench_static/toolbench_static.py' 2026-04-13T12:59:55,224 adding 'evalscope/third_party/toolbench_static/llm/__init__.py' 2026-04-13T12:59:55,225 adding 'evalscope/third_party/toolbench_static/llm/swift_infer.py' 2026-04-13T12:59:55,228 adding 'evalscope/utils/__init__.py' 2026-04-13T12:59:55,229 adding 'evalscope/utils/argument_utils.py' 2026-04-13T12:59:55,231 adding 'evalscope/utils/chat_service.py' 2026-04-13T12:59:55,234 adding 'evalscope/utils/code_utils.py' 2026-04-13T12:59:55,236 adding 'evalscope/utils/deprecation_utils.py' 2026-04-13T12:59:55,238 adding 'evalscope/utils/function_utils.py' 2026-04-13T12:59:55,239 adding 'evalscope/utils/import_utils.py' 2026-04-13T12:59:55,242 adding 'evalscope/utils/io_utils.py' 2026-04-13T12:59:55,243 adding 'evalscope/utils/json_schema.py' 2026-04-13T12:59:55,245 adding 
'evalscope/utils/logger.py' 2026-04-13T12:59:55,246 adding 'evalscope/utils/model_utils.py' 2026-04-13T12:59:55,248 adding 'evalscope/utils/multi_choices.py' 2026-04-13T12:59:55,250 adding 'evalscope/utils/ner.py' 2026-04-13T12:59:55,252 adding 'evalscope/utils/resource_utils.py' 2026-04-13T12:59:55,254 adding 'evalscope/utils/url_utils.py' 2026-04-13T12:59:55,255 adding 'evalscope/utils/doc_utils/__init__.py' 2026-04-13T12:59:55,259 adding 'evalscope/utils/doc_utils/benchmark_stats.py' 2026-04-13T12:59:55,261 adding 'evalscope/utils/doc_utils/generate_dataset_md.py' 2026-04-13T12:59:55,264 adding 'evalscope/utils/doc_utils/readme_generator.py' 2026-04-13T12:59:55,265 adding 'evalscope/utils/doc_utils/translate_description.py' 2026-04-13T12:59:55,267 adding 'evalscope/utils/tqdm_utils/__init__.py' 2026-04-13T12:59:55,269 adding 'evalscope/utils/tqdm_utils/progress_tracker.py' 2026-04-13T12:59:55,270 adding 'evalscope/utils/tqdm_utils/tqdm_logging.py' 2026-04-13T12:59:55,274 adding 'evalscope-1.6.0.dist-info/licenses/LICENSE' 2026-04-13T12:59:55,278 adding 'evalscope-1.6.0.dist-info/METADATA' 2026-04-13T12:59:55,280 adding 'evalscope-1.6.0.dist-info/WHEEL' 2026-04-13T12:59:55,281 adding 'evalscope-1.6.0.dist-info/entry_points.txt' 2026-04-13T12:59:55,282 adding 'evalscope-1.6.0.dist-info/top_level.txt' 2026-04-13T12:59:55,297 adding 'evalscope-1.6.0.dist-info/RECORD' 2026-04-13T12:59:55,333 removing build/bdist.linux-armv7l/wheel 2026-04-13T12:59:55,716 Building wheel for evalscope (pyproject.toml): finished with status 'done' 2026-04-13T12:59:55,763 Created wheel for evalscope: filename=evalscope-1.6.0-py3-none-any.whl size=2107224 sha256=5f6a31b463e6844867fdddb6488a17d304a770d4a30ad0a3615f853d4650f220 2026-04-13T12:59:55,765 Stored in directory: /tmp/pip-ephem-wheel-cache-wi6h2a8t/wheels/54/ec/f7/ce43f7bbab1bd633532b0283c649bc3d549605d501a818220c 2026-04-13T12:59:55,813 Successfully built evalscope 2026-04-13T12:59:55,870 Removed build tracker: 
'/tmp/pip-build-tracker-29rga0pi'