2026-03-25T09:05:38,937 Created temporary directory: /tmp/pip-ephem-wheel-cache-rsmexxo2 2026-03-25T09:05:38,939 Created temporary directory: /tmp/pip-build-tracker-bzp7w95z 2026-03-25T09:05:38,939 Initialized build tracking at /tmp/pip-build-tracker-bzp7w95z 2026-03-25T09:05:38,940 Created build tracker: /tmp/pip-build-tracker-bzp7w95z 2026-03-25T09:05:38,940 Entered build tracker: /tmp/pip-build-tracker-bzp7w95z 2026-03-25T09:05:38,941 Created temporary directory: /tmp/pip-wheel-kbsho2xm 2026-03-25T09:05:38,944 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-03-25T09:05:38,946 Created temporary directory: /tmp/pip-ephem-wheel-cache-03x8psey 2026-03-25T09:05:38,968 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-03-25T09:05:38,972 2 location(s) to search for versions of evalscope: 2026-03-25T09:05:38,972 * https://pypi.org/simple/evalscope/ 2026-03-25T09:05:38,972 * https://www.piwheels.org/simple/evalscope/ 2026-03-25T09:05:38,972 Fetching project page and analyzing links: https://pypi.org/simple/evalscope/ 2026-03-25T09:05:38,973 Getting page https://pypi.org/simple/evalscope/ 2026-03-25T09:05:38,974 Found index url https://pypi.org/simple 2026-03-25T09:05:39,190 Fetched page https://pypi.org/simple/evalscope/ as application/vnd.pypi.simple.v1+json 2026-03-25T09:05:39,207 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/94/d2/dc5e929802776bf4e662a46d794b765876bb93e2300189cafd113cac74d6/evalscope-0.5.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,208 Found link https://files.pythonhosted.org/packages/e5/70/45a5dad24b1fa535bff194b99a4668e7f5f328be972b51b3b91eafb4cdbb/evalscope-0.5.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0rc0 2026-03-25T09:05:39,209 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ac/eb/341fe367df2bc9a0ae7ef5eb2037a5d549d9bb8c0d7ad84844c9926e0947/evalscope-0.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,210 Found link https://files.pythonhosted.org/packages/3b/3f/585f7f1cf2ce90b234c1cfd654bb26977be4d889c0e5eed0122cb3024c45/evalscope-0.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0 2026-03-25T09:05:39,211 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ad/58/c0ce004159cfac6df9b5736c011576d52bdceb778943de8a022a419d86eb/evalscope-0.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,212 Found link https://files.pythonhosted.org/packages/94/33/7ad2f2285f5b68953ad4466d23dc5de1a2e57e7cc63d5924ab0e84d156ba/evalscope-0.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.2 2026-03-25T09:05:39,213 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/59/a9e1c4cf88018ece1fdd8d8b7fa976e28f9b4b181ef7ceb74a5e2db533ab/evalscope-0.5.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,214 Found link https://files.pythonhosted.org/packages/04/57/9ca7b1fd68f2acc32802b22236c83b597a5690a483d5938d38183b549d22/evalscope-0.5.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.3 2026-03-25T09:05:39,215 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/fa/c8/fcbaf01b7486c3b29b7790167c2cda560f00a04d100cec808ee9a3349ca0/evalscope-0.5.4-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,216 Found link https://files.pythonhosted.org/packages/8c/cc/abd412bad714c0266be1f0159b49a817d45db099c3bd031134d223589e93/evalscope-0.5.4.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.4 2026-03-25T09:05:39,217 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4d/da/d705d457683223f289e8c5d6cadbaad15d15e098692c53ea7e6196a94373/evalscope-0.5.5rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,217 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7e/4c/414dd545a1833245a53797d70a35a78ae1b9cfbcc81b35a4e1763e678437/evalscope-0.5.5rc1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,218 Found link https://files.pythonhosted.org/packages/56/5f/aa7fcf62102694dd66b69e88cb2523094bf04b53f785854e17ee6b7234a7/evalscope-0.5.5rc1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5rc1 2026-03-25T09:05:39,219 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3e/d0/91a7a1f95f3fa19dd8d4e434dc711768abe5c006f32a514a8602b429e049/evalscope-0.5.5-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,220 Found link https://files.pythonhosted.org/packages/04/7e/a7f065d6ebac15fe172d3b0906ff5b26a71df5a9975c0f14978044211cf1/evalscope-0.5.5.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5 2026-03-25T09:05:39,220 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/89/b812b01a5ed91fc079dea052e1341860cd65d25d463c75c90e5a30ab6ae8/evalscope-0.6.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,221 Found link https://files.pythonhosted.org/packages/d5/ad/57a2d5f33c5b7d5066f8a5dcb1d34f14bf246112f7900228b9f2fb41b21b/evalscope-0.6.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0rc0 2026-03-25T09:05:39,222 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/25/3f03d9d924f1b65610724c9f10727b48ef952afdfee8687a461949c88c78/evalscope-0.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,223 Found link https://files.pythonhosted.org/packages/9d/ac/1f432bcc46ccb8348b869b80d2aaabde5e583b370418ba48714083e31068/evalscope-0.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0 2026-03-25T09:05:39,223 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7b/e1/42c9e58b4690f23ef48bce841fc95cbf0744c1579cdc80fa6f33b0453344/evalscope-0.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,224 Found link https://files.pythonhosted.org/packages/cd/d6/1d9d2db9acda6e61d4210074f51e7e3dee4d0212fabdd94999105db23eed/evalscope-0.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.1 2026-03-25T09:05:39,225 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/92/6fff1dfd12a4f73489c451dab56351b6a3c1095a92bb55025c2934fc625d/evalscope-0.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,226 Found link https://files.pythonhosted.org/packages/fe/3c/500d655a27ca80e1aba3fb2b1e8886951942732b869ad1516422d9e6ac97/evalscope-0.7.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.0 2026-03-25T09:05:39,227 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a5/3a/ae22d4d9a44ad37ac887da21b948bf6e784307001a09802d36c5bf04018d/evalscope-0.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,228 Found link https://files.pythonhosted.org/packages/88/47/69d067f0d3d784a7975cc4ea067fdc55c8f785ece6fbe86e5e21edc8b36f/evalscope-0.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.1 2026-03-25T09:05:39,228 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/23/acbcbc2ed6f00d3bb81220054651e6a2b1b02714d42b1aeb018a2f5574c4/evalscope-0.7.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,230 Found link https://files.pythonhosted.org/packages/a4/38/9126b9329cd2ad6ccfd4a73f04402bf71b65921564c3c12cb0d62b3b421d/evalscope-0.7.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.2 2026-03-25T09:05:39,230 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/1f/9598183da3026696adc19a6727a19d57f9abbf4fd7aeb20cdf12faee7693/evalscope-0.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,231 Found link https://files.pythonhosted.org/packages/63/7e/fc44d30e3a83dbc3070396d78279c9e3e8716cbc4cc05811a70f1b463bfb/evalscope-0.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.0 2026-03-25T09:05:39,232 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/37/a9/a9c6cda95a6b837c9303e9a1598999f2c4e605abb507365c6ff70b372a5a/evalscope-0.8.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,233 Found link https://files.pythonhosted.org/packages/63/5a/bbc230bb06a7bc40dd3985dde4615a8a71f111bf95761e40f6f0f8a7e1a6/evalscope-0.8.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.1 2026-03-25T09:05:39,233 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/07/39/a1eb2efed77d21e8253daa37f082669a004c4d288813a2ee9e15398f2e80/evalscope-0.8.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,234 Found link https://files.pythonhosted.org/packages/57/bc/5ff6d538e459d8b3c567577c991eec72ab6adbc19497dc164d96cd634d2f/evalscope-0.8.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.2 2026-03-25T09:05:39,235 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f6/29/81a188c03b272bf7def0c9bb556b9e9465adbc68ecb18907b636f1e8cbd7/evalscope-0.9.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,235 Found link https://files.pythonhosted.org/packages/c0/c2/19da3be1fbd6b548ecdc877d47269e92503518de53acbbfe96120c5c9753/evalscope-0.9.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.9.0 2026-03-25T09:05:39,236 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/6d/45d7407f31d6878494c3b493d7e49a8318b1839161839293c1a2e66aadcf/evalscope-0.10.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,237 Found link https://files.pythonhosted.org/packages/16/b1/b6cef37a0dd0acfa5873ca4763ac6b4ac4b19a0b15ca6bdc8f30d4443682/evalscope-0.10.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.0 2026-03-25T09:05:39,238 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a7/8f/907045290d359b4e07e7acc96ec60173380748b49e9f3c91b7ddd8e8342d/evalscope-0.10.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,239 Found link https://files.pythonhosted.org/packages/e3/d3/dda2ac0513904bff8aa0c2efe77bc851d3acc6d514707db36648e4a903d2/evalscope-0.10.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.1 2026-03-25T09:05:39,240 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d6/e5/852326943d86c85b5ca6b548a5f3753c217b771d93968c61ec2ca46ee0b1/evalscope-0.11.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,241 Found link https://files.pythonhosted.org/packages/5f/3f/e2816b99487b4ead257453a242b5282a85881f9d26fbb5efb21cc5cf88fa/evalscope-0.11.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.11.0 2026-03-25T09:05:39,242 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7d/8e/f9eceecf8bc7d740603f915eef7fab3e9d657a01f5de2c523e531445299c/evalscope-0.12.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,243 Found link https://files.pythonhosted.org/packages/33/82/7765517ff80a73eac7465369767aa45a5be3d5e0fb7c4f4a3ff743811f8c/evalscope-0.12.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.0 2026-03-25T09:05:39,243 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ee/4c/84a5e18985e149eb4283fef2b58a81ff2cec2d099c017684938ec3a3935f/evalscope-0.12.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,244 Found link https://files.pythonhosted.org/packages/b2/24/83b5530319bdb02142289e04640c6008dda1b988043c42feb7f0a5eab3b7/evalscope-0.12.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.1 2026-03-25T09:05:39,245 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/a5/65faf0660cd8ae2660354b002b2b3a586b9419bc894120fea97efd506cb6/evalscope-0.13.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,245 Found link https://files.pythonhosted.org/packages/d0/39/d5eb469a94191760c61d1bfcb235e28be1d2a080d88b44792f53d76c45d1/evalscope-0.13.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.0 2026-03-25T09:05:39,246 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f7/fc/91b7b4379131d2e15ca1f575f533daea589357293e492ef2c93e0aac6b55/evalscope-0.13.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,247 Found link https://files.pythonhosted.org/packages/9e/a3/33b4ce270d5500fe7c8f32fa2160749b607f248141328d4785b6032c8f2a/evalscope-0.13.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.1 2026-03-25T09:05:39,248 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e6/66/83752c305879cf3dea54170398839ab08a046485bc18c41a34f41aca11ab/evalscope-0.13.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,249 Found link https://files.pythonhosted.org/packages/b7/8d/711ae30b80329e2dd7da760c001d9a5b45e4d8e5292f317f1ea10c744c29/evalscope-0.13.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.2 2026-03-25T09:05:39,250 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a8/b4/b22c7e52e6a7381333bdfa0bf92fae0258e7812064b1e208cbab56a62d08/evalscope-0.14.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,251 Found link https://files.pythonhosted.org/packages/40/44/7db2cb90e6ca0c9db92124f10c0273d7c6ef4b81523e1c98c34a88e67faa/evalscope-0.14.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.14.0 2026-03-25T09:05:39,251 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b2/47/976633e0f29b58b8c9f3faacf7373a2da734771a7915ac45d721d96e0ad7/evalscope-0.15.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,252 Found link https://files.pythonhosted.org/packages/b8/e3/bd534d69328afa98bdd497b5eaaf4b7416da9e8f56109d045c332b17d016/evalscope-0.15.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.0 2026-03-25T09:05:39,253 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5a/54/237aee5f0317fa04450c9ad67c3bf28b730460b2e6dc1e65b74b4bf2cd67/evalscope-0.15.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,254 Found link https://files.pythonhosted.org/packages/7a/d4/8b87e83a3a08f87ce5b4325f0cd5ab9bc54d296dc3f3492a1d3216a97a6d/evalscope-0.15.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.1 2026-03-25T09:05:39,255 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/1e/b5/6fec1cbb02a41ab79430eb3fb51eea7709525df0f9753ae2c54fbc4633f1/evalscope-0.16.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,255 Found link https://files.pythonhosted.org/packages/99/69/63997bedfa6fd33af671b539f77c375b111017eff23313f76693e24b8872/evalscope-0.16.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.0 2026-03-25T09:05:39,256 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ea/76/639641578bdc92d25211eb8dae24d1fae19e40cd1649e17946f6ad8a5dc3/evalscope-0.16.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,257 Found link https://files.pythonhosted.org/packages/94/13/616bf9c33b0769db44a2bae32b54d33cf7129874392682aa76326a51e085/evalscope-0.16.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.1 2026-03-25T09:05:39,258 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f1/d3/9e83cc1b5a132342a05ef6ee79018bd7561f90a6406dc9db5c85fb0a281f/evalscope-0.16.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,258 Found link https://files.pythonhosted.org/packages/94/47/edd3faaddd321e464ae72db6e7bf82246dd4ca2f0f67127ca8c427cac664/evalscope-0.16.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.2 2026-03-25T09:05:39,259 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5d/4e/34c56086cdfe7d1ad912241af819b3e67f20373f382016e33ac89dd43dde/evalscope-0.16.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,260 Found link https://files.pythonhosted.org/packages/22/07/14603c038a8019472881f574f1c47bd4193481f256b6dc702c65d8b8f984/evalscope-0.16.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.3 2026-03-25T09:05:39,261 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f3/e2/104156f74779cf2849f53566bba585015492d7320c36a7cb76c7196b0ef5/evalscope-0.17.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,262 Found link https://files.pythonhosted.org/packages/c5/d0/b66d1b97ec67d65b6df54f18682e30cfeb6401604b93c9b1bdd1e97b8d79/evalscope-0.17.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.17.0 2026-03-25T09:05:39,263 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a9/b5/630d2c5dc5c32e9fbad5034e04d8aba6f6461dc08f255df77dd8d463857f/evalscope-0.17.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,264 Found link https://files.pythonhosted.org/packages/38/88/326e48929bb9577a6a36e07afa65bbf6bd870c1b644f82e5713874ae3238/evalscope-0.17.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 0.17.1 2026-03-25T09:05:39,264 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/21/18/9a6c208d2bc119ac67b5537b60851c6bccc99f25229eaa96cbe6e38721bd/evalscope-1.0.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,265 Found link https://files.pythonhosted.org/packages/d8/44/3d727dd28fcc50317c95d12e5bb850ed2863105f812373ac120877875434/evalscope-1.0.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.0 2026-03-25T09:05:39,266 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e1/8a/50456fa7dd77be4c3a0ea0b3d96cb7ae5b2557454fdd35cbf0009a9d792f/evalscope-1.0.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,267 Found link https://files.pythonhosted.org/packages/8d/52/93569134b3d8dea2a0d0bc2134c03056f0ffee1840f7299eb83d475457df/evalscope-1.0.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.1 2026-03-25T09:05:39,268 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/22/f6/32fb0fef08a6c881ac840117455a5697a0d63226db8a24cce5208b720829/evalscope-1.0.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,268 Found link https://files.pythonhosted.org/packages/3d/f5/025baefe432d9af1ed845ab5738b638b7b97f2dd3767e9478b8eee10966f/evalscope-1.0.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.2 2026-03-25T09:05:39,269 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/6e/12/f31dbb18daa7e3c6cfecf856ddc323a303e115271166c100e06af58ea6b6/evalscope-1.1.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,270 Found link https://files.pythonhosted.org/packages/b4/98/7449040e89beaa4556bf35ba1e171b2d4955ff15b2b4c43f2ff55b048aeb/evalscope-1.1.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.1.0 2026-03-25T09:05:39,271 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/79/d1/afc8b23345ad8f11a5e1f8c6c3112a8679604d833bdbc02aa06787952fd1/evalscope-1.1.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,272 Found link https://files.pythonhosted.org/packages/e8/3f/d67b73ce19789e914d6a78740fa7bfd0c07f161bc239b92cd3c26541f2fb/evalscope-1.1.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.1.1 2026-03-25T09:05:39,272 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/82/0a/d984751e4b5751f064209da3745b6aff6cc0f1d9d93f13cab0a1017a8639/evalscope-1.2.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,273 Found link https://files.pythonhosted.org/packages/7f/f9/0a2a069ee4500666ec5c3d10b302fc71d176c17bbe70447336e610953e1a/evalscope-1.2.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.2.0 2026-03-25T09:05:39,274 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/58/823d009dfa49cfdc750ac257e744eb97456d69528a09ac108ee8cab15316/evalscope-1.3.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,275 Found link https://files.pythonhosted.org/packages/42/5a/a309f7ce1fbe2b39e4b0a1f26cfcd7864eaa90e4792a5290e8cdd2ce3b4f/evalscope-1.3.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.3.0 2026-03-25T09:05:39,275 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c6/57/005d3ef07ecdd5163e1bbed3413537b653b637d9c8b62a2bcdd97546607b/evalscope-1.4.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,276 Found link https://files.pythonhosted.org/packages/11/d5/268f610ac7db9c5c2109936f65ac4df8b4bc52106ed4369509d3b3c4f127/evalscope-1.4.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.0 2026-03-25T09:05:39,277 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b8/96/a2b4fabf6fa6cf09ac6669aa01dc483ac53576a8fd7c2c4be21ea281840f/evalscope-1.4.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,278 Found link https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.1 2026-03-25T09:05:39,279 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/bf/0f/97e68e89f7925160df49ea1dbbcef7f3f8e808a51756c199aaaadc75f5a5/evalscope-1.4.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,280 Found link https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.2 2026-03-25T09:05:39,280 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c0/fb/e6b1a396bad204e38591a6d6de1172dac2ce3e0d15b87e812d57e22d0e4f/evalscope-1.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,281 Found link https://files.pythonhosted.org/packages/a7/32/518a920ac8a73c4c6e39f7e443df6da6ea9a3be6567c4a425def866b8f5e/evalscope-1.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.0 2026-03-25T09:05:39,281 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/eb/68/0c870a84e38d5a8d3e7c9df918739a4ba6a45c3ddb624d2792a41a8d3293/evalscope-1.5.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,282 Found link https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.5.1 2026-03-25T09:05:39,283 Fetching project page and analyzing links: https://www.piwheels.org/simple/evalscope/ 2026-03-25T09:05:39,284 Getting page https://www.piwheels.org/simple/evalscope/ 2026-03-25T09:05:39,285 Found index url https://www.piwheels.org/simple 2026-03-25T09:05:39,439 Fetched page https://www.piwheels.org/simple/evalscope/ as text/html 2026-03-25T09:05:39,449 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.5.0-py3-none-any.whl#sha256=933f1aa9915ed658bc3ae6901e0b96efbdbf80db96eca40c2d42453b26530d9b (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,450 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.2-py3-none-any.whl#sha256=222e938fe502394b9935f3c00677cf1372892caa530b6fb48476ae909a91399a (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,451 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.1-py3-none-any.whl#sha256=d74ddb7150b19de1eb995026c697b598cdfc0b75fee3a2d110219256c4241688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,451 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.4.0-py3-none-any.whl#sha256=307c9ed70f562ba776fdf9b0136b35fe4e361b1b974bb0f8ca39e425f4738e6e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,452 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.3.0-py3-none-any.whl#sha256=d9949bacf6c08b5ab341f872c6ee4b31995d724fe963f64ddf3c129e0e39145e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,452 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.2.0-py3-none-any.whl#sha256=5969cf4a3132f6a29f9ef39aa35a8a3be24f114bbcd7a77a19c145bbec432be9 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,453 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.1-py3-none-any.whl#sha256=5bfb8c55f45e1bcd5df5cb0cd4ecceae2ccb93cd09c9477b0b0e6c097ecb1d1f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-03-25T09:05:39,454 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.0-py3-none-any.whl#sha256=ca7c951fa316bb7ec6fb0e38ad6503632067f07f469e37828fe9e39e51591994 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,454 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.2-py3-none-any.whl#sha256=e14016040022bcd666c05ffa806f3713775e9de19a98290bd2a0a36e5c435409 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,455 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.1-py3-none-any.whl#sha256=61b9c14f3409804d84ddf31b749b587c7c7441f9d1aa0d453991b1bd0bbda74c (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,455 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.0-py3-none-any.whl#sha256=ea1044755177db8d9e94cc4ccafbd56e35b173a8425db7d1653dd9f66e1463ad (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,456 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.1-py3-none-any.whl#sha256=aa0054b8aed77684e56d0836d4568080fa4827799a16d62fef6ec13802cd4050 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-03-25T09:05:39,456 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.0-py3-none-any.whl#sha256=e7a2549f9f5ac5b0061d01f8ca99900e06f6340b91d3e546163423b896287862 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,457 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.3-py3-none-any.whl#sha256=75dcbd7cc0a0336f68d407a3925fe065ecaaea0fa9030ceebfaa3f22f0f3b417 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,457 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.2-py3-none-any.whl#sha256=14e00f16b506b723a359799e3cb271370a3da768da3667563612e458982d6847 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,458 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.1-py3-none-any.whl#sha256=296b90c06a33f69c9e7049a768f16929acd1279af0830f47f18fc598560d0e13 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,459 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.0-py3-none-any.whl#sha256=0a6f11a7a4d564d4a1dca5fe424cda19608cf947b9ab739079f5b60842651c7e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,459 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.1-py3-none-any.whl#sha256=504717cfc96a8fdbb0d4bc080d0292e80423891b658d61283385737794c4e95f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,460 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.0-py3-none-any.whl#sha256=263020a7b7f7788e17515c738604ea64904ca0da34d09664d2b7ee16d3522a00 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,460 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.14.0-py3-none-any.whl#sha256=cdf26beb4b188e1dd5e09feeb1832ccf909c9d2a535eb76e702fb3c66fc65688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,461 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.2-py3-none-any.whl#sha256=7d360083cf9dd960996847cde2140085d0830ddc8b12aa8007b4f72d395c5211 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,462 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.1-py3-none-any.whl#sha256=6b26f11daca05d6b56da3cb1b78ea0a8f28de94901b7520171c66e2a16b1c638 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,462 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.0-py3-none-any.whl#sha256=8ca93c4011f04e35df239b5486248e03100a9c9e265664e01330ef4b1cc691f7 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,463 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.1-py3-none-any.whl#sha256=72e5815923789c6cab6c32425477393bf645f670edcc493f8e4dec6e93f3da23 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,463 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.0-py3-none-any.whl#sha256=f26c7317a5bacac8527806723d45bac74563a2608ed761d822ace03be3a5e45f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,464 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.11.0-py3-none-any.whl#sha256=6e7d3242c5bf97b54d24644a1300575fdf41c1d90eef7c1344c20bf0d1518671 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,464 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.1-py3-none-any.whl#sha256=a7e645410333c1aaec5a024be4c02166dfb0d6b4635b020c181ce672d31102d2 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,465 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.0-py3-none-any.whl#sha256=494f1742178def5f86e552c004562c877cd8b8b6f5d4267d29f961f20e8bf569 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,465 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.9.0-py3-none-any.whl#sha256=26fd59fd387850f3e84f995fd38357994460e71d0124bc09954ca8837027ec52 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,466 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.2-py3-none-any.whl#sha256=3c4ed25b5c39d7a706927607e552e6ced6d50532bc442bba74979f70720f4894 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,467 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.1-py3-none-any.whl#sha256=64e9306453082a95b0c0507d6fd1dbf50ed0ab210ebcfebdcb52c534769d1856 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,467 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.0-py3-none-any.whl#sha256=7758550ef3406d9c6096de05909cc1c97ed0c2f7be6edaf3fe5344244e34f233 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,468 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.2-py3-none-any.whl#sha256=21a1f4448ffca926853b7516b6375f801ef6a8067501dde546e7d794fb759f20 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,469 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.1-py3-none-any.whl#sha256=b959f2d9850544f2d96ac765b06ed112cf579a11899618fcc5245407f4e33843 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,469 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.0-py3-none-any.whl#sha256=2deddeb89bfb6fa02844b72a2be24627526136cbd45315e8e4dadb672d53c9fc (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-03-25T09:05:39,470 Skipping link: not a file: https://www.piwheels.org/simple/evalscope/ 2026-03-25T09:05:39,470 Skipping link: not a file: https://pypi.org/simple/evalscope/ 2026-03-25T09:05:39,494 Given no hashes to check 1 links for project 'evalscope': discarding no candidates 2026-03-25T09:05:39,512 Collecting evalscope==1.5.1 2026-03-25T09:05:39,515 Created temporary directory: /tmp/pip-unpack-q0jwkbe0 2026-03-25T09:05:39,647 Downloading evalscope-1.5.1.tar.gz (1.5 MB) 2026-03-25T09:05:41,935 Added evalscope==1.5.1 from https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz to build tracker '/tmp/pip-build-tracker-bzp7w95z' 2026-03-25T09:05:41,941 Created temporary directory: /tmp/pip-build-env-13n2m6lc 2026-03-25T09:05:41,946 Installing build dependencies: started 2026-03-25T09:05:41,947 Running command pip subprocess to install build dependencies 2026-03-25T09:05:43,133 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-03-25T09:05:43,756 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-03-25T09:05:43,780 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-03-25T09:05:45,511 Collecting setuptools>=69 2026-03-25T09:05:45,586 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-03-25T09:05:45,857 Collecting wheel 2026-03-25T09:05:45,991 Using cached https://www.piwheels.org/simple/wheel/wheel-0.46.3-py3-none-any.whl (30 kB) 2026-03-25T09:05:46,277 Collecting packaging>=24.0 2026-03-25T09:05:46,337 Using cached https://www.piwheels.org/simple/packaging/packaging-26.0-py3-none-any.whl (74 kB) 2026-03-25T09:05:49,273 Installing collected packages: setuptools, packaging, wheel 2026-03-25T09:05:52,728 Creating /tmp/pip-build-env-13n2m6lc/overlay/local/bin 2026-03-25T09:05:52,730 changing mode of /tmp/pip-build-env-13n2m6lc/overlay/local/bin/wheel to 755 2026-03-25T09:05:52,750 Successfully installed packaging-26.0 setuptools-82.0.1 wheel-0.46.3 2026-03-25T09:05:53,029 Installing build dependencies: finished with status 'done' 2026-03-25T09:05:53,036 Getting requirements to build wheel: started 2026-03-25T09:05:53,037 Running command Getting requirements to build wheel 2026-03-25T09:05:53,773 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-03-25T09:05:53,773 !! 2026-03-25T09:05:53,774 ******************************************************************************** 2026-03-25T09:05:53,775 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-03-25T09:05:53,776 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-03-25T09:05:53,777 or your builds will no longer be supported. 2026-03-25T09:05:53,778 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:53,779 ******************************************************************************** 2026-03-25T09:05:53,779 !! 2026-03-25T09:05:53,780 corresp(dist, value, root_dir) 2026-03-25T09:05:53,863 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-03-25T09:05:53,864 !! 2026-03-25T09:05:53,865 ******************************************************************************** 2026-03-25T09:05:53,865 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-03-25T09:05:53,866 License :: OSI Approved :: Apache Software License 2026-03-25T09:05:53,867 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:53,867 ******************************************************************************** 2026-03-25T09:05:53,868 !! 2026-03-25T09:05:53,869 dist._finalize_license_expression() 2026-03-25T09:05:53,871 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-03-25T09:05:53,872 !! 2026-03-25T09:05:53,873 ******************************************************************************** 2026-03-25T09:05:53,874 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-03-25T09:05:53,875 License :: OSI Approved :: Apache Software License 2026-03-25T09:05:53,876 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:53,876 ******************************************************************************** 2026-03-25T09:05:53,878 !! 2026-03-25T09:05:53,878 self._finalize_license_expression() 2026-03-25T09:05:53,880 running egg_info 2026-03-25T09:05:53,886 writing evalscope.egg-info/PKG-INFO 2026-03-25T09:05:53,907 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-03-25T09:05:53,909 writing entry points to evalscope.egg-info/entry_points.txt 2026-03-25T09:05:53,920 writing requirements to evalscope.egg-info/requires.txt 2026-03-25T09:05:53,921 writing top-level names to evalscope.egg-info/top_level.txt 2026-03-25T09:05:54,180 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:54,231 reading manifest template 'MANIFEST.in' 2026-03-25T09:05:54,659 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-03-25T09:05:54,664 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-03-25T09:05:54,670 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-03-25T09:05:54,676 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-03-25T09:05:54,677 adding license file 'LICENSE' 2026-03-25T09:05:54,731 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:54,832 Getting requirements to build wheel: finished with status 'done' 2026-03-25T09:05:54,836 Created temporary directory: /tmp/pip-modern-metadata-vv8p0zi7 2026-03-25T09:05:54,838 Preparing metadata (pyproject.toml): started 2026-03-25T09:05:54,839 Running command Preparing metadata (pyproject.toml) 2026-03-25T09:05:55,495 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-03-25T09:05:55,496 !! 2026-03-25T09:05:55,497 ******************************************************************************** 2026-03-25T09:05:55,498 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-03-25T09:05:55,499 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-03-25T09:05:55,499 or your builds will no longer be supported. 2026-03-25T09:05:55,500 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:55,501 ******************************************************************************** 2026-03-25T09:05:55,501 !! 2026-03-25T09:05:55,502 corresp(dist, value, root_dir) 2026-03-25T09:05:55,579 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-03-25T09:05:55,579 !! 2026-03-25T09:05:55,580 ******************************************************************************** 2026-03-25T09:05:55,581 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-03-25T09:05:55,582 License :: OSI Approved :: Apache Software License 2026-03-25T09:05:55,583 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:55,583 ******************************************************************************** 2026-03-25T09:05:55,584 !! 2026-03-25T09:05:55,585 dist._finalize_license_expression() 2026-03-25T09:05:55,587 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-03-25T09:05:55,587 !! 2026-03-25T09:05:55,589 ******************************************************************************** 2026-03-25T09:05:55,589 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-03-25T09:05:55,590 License :: OSI Approved :: Apache Software License 2026-03-25T09:05:55,592 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:55,592 ******************************************************************************** 2026-03-25T09:05:55,594 !! 2026-03-25T09:05:55,594 self._finalize_license_expression() 2026-03-25T09:05:55,595 running dist_info 2026-03-25T09:05:55,604 creating /tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info 2026-03-25T09:05:55,605 writing /tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/PKG-INFO 2026-03-25T09:05:55,625 writing dependency_links to /tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/dependency_links.txt 2026-03-25T09:05:55,627 writing entry points to /tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/entry_points.txt 2026-03-25T09:05:55,637 writing requirements to /tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/requires.txt 2026-03-25T09:05:55,639 writing top-level names to /tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/top_level.txt 2026-03-25T09:05:55,640 writing manifest file '/tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:55,861 reading manifest file '/tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:55,863 reading manifest template 'MANIFEST.in' 2026-03-25T09:05:56,256 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-03-25T09:05:56,259 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-03-25T09:05:56,263 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-03-25T09:05:56,267 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-03-25T09:05:56,267 adding license file 'LICENSE' 2026-03-25T09:05:56,309 writing manifest file '/tmp/pip-modern-metadata-vv8p0zi7/evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:56,312 creating '/tmp/pip-modern-metadata-vv8p0zi7/evalscope-1.5.1.dist-info' 2026-03-25T09:05:56,445 Preparing metadata (pyproject.toml): finished with status 'done' 2026-03-25T09:05:56,453 Source in /tmp/pip-wheel-kbsho2xm/evalscope_e758371f05af4f3d8c32bc4284429391 has version 1.5.1, which satisfies requirement evalscope==1.5.1 from https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz 2026-03-25T09:05:56,454 Removed evalscope==1.5.1 from https://files.pythonhosted.org/packages/2f/74/c4275a5a1746667352246ce10b9076137ab661d0f132796d2db32eca97fe/evalscope-1.5.1.tar.gz from build tracker '/tmp/pip-build-tracker-bzp7w95z' 2026-03-25T09:05:56,466 Created temporary directory: /tmp/pip-unpack-4fuu6t28 2026-03-25T09:05:56,466 Building wheels for collected packages: evalscope 2026-03-25T09:05:56,471 Created temporary directory: /tmp/pip-wheel-1vuoy08l 2026-03-25T09:05:56,471 Destination directory: /tmp/pip-wheel-1vuoy08l 2026-03-25T09:05:56,474 Building wheel for evalscope (pyproject.toml): started 2026-03-25T09:05:56,475 Running command Building wheel for evalscope (pyproject.toml) 2026-03-25T09:05:57,116 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-03-25T09:05:57,116 !! 2026-03-25T09:05:57,118 ******************************************************************************** 2026-03-25T09:05:57,118 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-03-25T09:05:57,119 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-03-25T09:05:57,120 or your builds will no longer be supported. 2026-03-25T09:05:57,121 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:57,122 ******************************************************************************** 2026-03-25T09:05:57,123 !! 2026-03-25T09:05:57,123 corresp(dist, value, root_dir) 2026-03-25T09:05:57,200 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-03-25T09:05:57,200 !! 2026-03-25T09:05:57,202 ******************************************************************************** 2026-03-25T09:05:57,202 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-03-25T09:05:57,204 License :: OSI Approved :: Apache Software License 2026-03-25T09:05:57,205 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:57,205 ******************************************************************************** 2026-03-25T09:05:57,207 !! 2026-03-25T09:05:57,207 dist._finalize_license_expression() 2026-03-25T09:05:57,208 /tmp/pip-build-env-13n2m6lc/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-03-25T09:05:57,209 !! 2026-03-25T09:05:57,210 ******************************************************************************** 2026-03-25T09:05:57,211 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-03-25T09:05:57,212 License :: OSI Approved :: Apache Software License 2026-03-25T09:05:57,214 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-03-25T09:05:57,214 ******************************************************************************** 2026-03-25T09:05:57,216 !! 2026-03-25T09:05:57,216 self._finalize_license_expression() 2026-03-25T09:05:57,217 running bdist_wheel 2026-03-25T09:05:57,229 running build 2026-03-25T09:05:57,229 running build_py 2026-03-25T09:05:57,235 creating build/lib/evalscope 2026-03-25T09:05:57,238 copying evalscope/version.py -> build/lib/evalscope 2026-03-25T09:05:57,240 copying evalscope/arguments.py -> build/lib/evalscope 2026-03-25T09:05:57,243 copying evalscope/run.py -> build/lib/evalscope 2026-03-25T09:05:57,246 copying evalscope/config.py -> build/lib/evalscope 2026-03-25T09:05:57,250 copying evalscope/constants.py -> build/lib/evalscope 2026-03-25T09:05:57,252 copying evalscope/__init__.py -> build/lib/evalscope 2026-03-25T09:05:57,255 creating build/lib/evalscope/models 2026-03-25T09:05:57,256 copying evalscope/models/text2image_model.py -> build/lib/evalscope/models 2026-03-25T09:05:57,258 copying evalscope/models/modelscope.py -> build/lib/evalscope/models 2026-03-25T09:05:57,261 copying evalscope/models/openai_compatible.py -> build/lib/evalscope/models 2026-03-25T09:05:57,263 copying evalscope/models/mockllm.py -> build/lib/evalscope/models 2026-03-25T09:05:57,265 copying evalscope/models/image_edit_model.py -> build/lib/evalscope/models 2026-03-25T09:05:57,268 copying evalscope/models/__init__.py -> build/lib/evalscope/models 2026-03-25T09:05:57,270 copying evalscope/models/anthropic_compatible.py -> build/lib/evalscope/models 2026-03-25T09:05:57,272 copying evalscope/models/model_apis.py -> build/lib/evalscope/models 2026-03-25T09:05:57,275 creating build/lib/evalscope/evaluator 2026-03-25T09:05:57,275 copying evalscope/evaluator/batch_reviewer.py -> build/lib/evalscope/evaluator 2026-03-25T09:05:57,278 copying evalscope/evaluator/evaluator.py -> build/lib/evalscope/evaluator 2026-03-25T09:05:57,281 copying evalscope/evaluator/__init__.py -> build/lib/evalscope/evaluator 2026-03-25T09:05:57,284 creating build/lib/evalscope/benchmarks 2026-03-25T09:05:57,285 copying evalscope/benchmarks/__init__.py -> build/lib/evalscope/benchmarks 2026-03-25T09:05:57,288 creating build/lib/evalscope/third_party 2026-03-25T09:05:57,289 copying evalscope/third_party/__init__.py -> build/lib/evalscope/third_party 2026-03-25T09:05:57,292 creating build/lib/evalscope/utils 2026-03-25T09:05:57,293 copying evalscope/utils/model_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,295 copying evalscope/utils/multi_choices.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,298 copying evalscope/utils/ner.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,300 copying evalscope/utils/json_schema.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,303 copying evalscope/utils/import_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,305 copying evalscope/utils/chat_service.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,308 copying evalscope/utils/function_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,311 copying evalscope/utils/code_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,314 copying evalscope/utils/deprecation_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,316 copying evalscope/utils/logger.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,318 copying evalscope/utils/io_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,321 copying evalscope/utils/argument_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,323 copying evalscope/utils/url_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,325 copying evalscope/utils/resource_utils.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,328 copying evalscope/utils/__init__.py -> build/lib/evalscope/utils 2026-03-25T09:05:57,330 creating build/lib/evalscope/service 2026-03-25T09:05:57,331 copying evalscope/service/app.py -> build/lib/evalscope/service 2026-03-25T09:05:57,333 copying evalscope/service/__init__.py -> build/lib/evalscope/service 2026-03-25T09:05:57,336 creating build/lib/evalscope/metrics 2026-03-25T09:05:57,337 copying evalscope/metrics/metric.py -> build/lib/evalscope/metrics 2026-03-25T09:05:57,339 copying evalscope/metrics/rouge_metric.py -> build/lib/evalscope/metrics 2026-03-25T09:05:57,342 copying evalscope/metrics/llm_judge.py -> build/lib/evalscope/metrics 2026-03-25T09:05:57,344 copying evalscope/metrics/math_parser.py -> build/lib/evalscope/metrics 2026-03-25T09:05:57,346 copying evalscope/metrics/metrics.py -> build/lib/evalscope/metrics 2026-03-25T09:05:57,349 copying evalscope/metrics/__init__.py -> build/lib/evalscope/metrics 2026-03-25T09:05:57,351 creating build/lib/evalscope/perf 2026-03-25T09:05:57,352 copying evalscope/perf/main.py -> build/lib/evalscope/perf 2026-03-25T09:05:57,354 copying evalscope/perf/arguments.py -> build/lib/evalscope/perf 2026-03-25T09:05:57,357 copying evalscope/perf/http_client.py -> build/lib/evalscope/perf 2026-03-25T09:05:57,359 copying evalscope/perf/benchmark.py -> build/lib/evalscope/perf 2026-03-25T09:05:57,361 copying evalscope/perf/__init__.py -> build/lib/evalscope/perf 2026-03-25T09:05:57,363 creating build/lib/evalscope/backend 2026-03-25T09:05:57,364 copying evalscope/backend/base.py -> build/lib/evalscope/backend 2026-03-25T09:05:57,366 copying evalscope/backend/__init__.py -> build/lib/evalscope/backend 2026-03-25T09:05:57,367 creating build/lib/evalscope/filters 2026-03-25T09:05:57,368 copying evalscope/filters/extraction.py -> build/lib/evalscope/filters 2026-03-25T09:05:57,371 copying evalscope/filters/selection.py -> build/lib/evalscope/filters 2026-03-25T09:05:57,372 copying evalscope/filters/__init__.py -> build/lib/evalscope/filters 2026-03-25T09:05:57,374 creating build/lib/evalscope/collections 2026-03-25T09:05:57,375 copying evalscope/collections/schema.py -> build/lib/evalscope/collections 2026-03-25T09:05:57,378 copying evalscope/collections/sampler.py -> build/lib/evalscope/collections 2026-03-25T09:05:57,380 copying evalscope/collections/__init__.py -> build/lib/evalscope/collections 2026-03-25T09:05:57,382 creating build/lib/evalscope/app 2026-03-25T09:05:57,383 copying evalscope/app/arguments.py -> build/lib/evalscope/app 2026-03-25T09:05:57,385 copying evalscope/app/app.py -> build/lib/evalscope/app 2026-03-25T09:05:57,387 copying evalscope/app/constants.py -> build/lib/evalscope/app 2026-03-25T09:05:57,389 copying evalscope/app/__init__.py -> build/lib/evalscope/app 2026-03-25T09:05:57,391 creating build/lib/evalscope/report 2026-03-25T09:05:57,393 copying evalscope/report/generator.py -> build/lib/evalscope/report 2026-03-25T09:05:57,395 copying evalscope/report/combinator.py -> build/lib/evalscope/report 2026-03-25T09:05:57,397 copying evalscope/report/renderer.py -> build/lib/evalscope/report 2026-03-25T09:05:57,400 copying evalscope/report/report.py -> build/lib/evalscope/report 2026-03-25T09:05:57,402 copying evalscope/report/__init__.py -> build/lib/evalscope/report 2026-03-25T09:05:57,404 creating build/lib/evalscope/cli 2026-03-25T09:05:57,405 copying evalscope/cli/cli.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,407 copying evalscope/cli/start_eval.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,409 copying evalscope/cli/start_app.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,411 copying evalscope/cli/benchmark_info.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,414 copying evalscope/cli/base.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,415 copying evalscope/cli/start_perf.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,417 copying evalscope/cli/start_service.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,419 copying evalscope/cli/__init__.py -> build/lib/evalscope/cli 2026-03-25T09:05:57,421 creating build/lib/evalscope/summarizer 2026-03-25T09:05:57,423 copying evalscope/summarizer/summarizer.py -> build/lib/evalscope/summarizer 2026-03-25T09:05:57,425 copying evalscope/summarizer/__init__.py -> build/lib/evalscope/summarizer 2026-03-25T09:05:57,427 creating build/lib/evalscope/api 2026-03-25T09:05:57,428 copying evalscope/api/registry.py -> build/lib/evalscope/api 2026-03-25T09:05:57,430 copying evalscope/api/__init__.py -> build/lib/evalscope/api 2026-03-25T09:05:57,432 creating build/lib/evalscope/sandbox 2026-03-25T09:05:57,433 copying evalscope/sandbox/volcengine.py -> build/lib/evalscope/sandbox 2026-03-25T09:05:57,436 copying evalscope/sandbox/__init__.py -> build/lib/evalscope/sandbox 2026-03-25T09:05:57,438 creating build/lib/evalscope/models/utils 2026-03-25T09:05:57,439 copying evalscope/models/utils/anthropic.py -> build/lib/evalscope/models/utils 2026-03-25T09:05:57,441 copying evalscope/models/utils/openai.py -> build/lib/evalscope/models/utils 2026-03-25T09:05:57,444 creating build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-03-25T09:05:57,445 copying evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-03-25T09:05:57,447 copying evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-03-25T09:05:57,449 creating build/lib/evalscope/benchmarks/pope 2026-03-25T09:05:57,450 copying evalscope/benchmarks/pope/pope_adapter.py -> build/lib/evalscope/benchmarks/pope 2026-03-25T09:05:57,452 copying evalscope/benchmarks/pope/__init__.py -> build/lib/evalscope/benchmarks/pope 2026-03-25T09:05:57,454 creating build/lib/evalscope/benchmarks/super_gpqa 2026-03-25T09:05:57,455 copying evalscope/benchmarks/super_gpqa/utils.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-03-25T09:05:57,457 copying evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-03-25T09:05:57,459 copying evalscope/benchmarks/super_gpqa/prompt.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-03-25T09:05:57,461 copying evalscope/benchmarks/super_gpqa/__init__.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-03-25T09:05:57,463 creating build/lib/evalscope/benchmarks/multipl_e 2026-03-25T09:05:57,464 copying evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-03-25T09:05:57,466 copying evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-03-25T09:05:57,468 copying evalscope/benchmarks/multipl_e/utils.py -> build/lib/evalscope/benchmarks/multipl_e 2026-03-25T09:05:57,471 copying evalscope/benchmarks/multipl_e/__init__.py -> build/lib/evalscope/benchmarks/multipl_e 2026-03-25T09:05:57,473 creating build/lib/evalscope/benchmarks/aime 2026-03-25T09:05:57,474 copying evalscope/benchmarks/aime/math_normalize.py -> build/lib/evalscope/benchmarks/aime 2026-03-25T09:05:57,476 copying evalscope/benchmarks/aime/aime_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-03-25T09:05:57,478 copying evalscope/benchmarks/aime/grader.py -> build/lib/evalscope/benchmarks/aime 2026-03-25T09:05:57,481 copying evalscope/benchmarks/aime/__init__.py -> build/lib/evalscope/benchmarks/aime 2026-03-25T09:05:57,483 creating build/lib/evalscope/benchmarks/ceval 2026-03-25T09:05:57,483 copying evalscope/benchmarks/ceval/ceval_adapter.py -> build/lib/evalscope/benchmarks/ceval 2026-03-25T09:05:57,486 copying evalscope/benchmarks/ceval/__init__.py -> build/lib/evalscope/benchmarks/ceval 2026-03-25T09:05:57,488 creating build/lib/evalscope/benchmarks/mbpp 2026-03-25T09:05:57,489 copying evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/lib/evalscope/benchmarks/mbpp 2026-03-25T09:05:57,491 copying evalscope/benchmarks/mbpp/__init__.py -> build/lib/evalscope/benchmarks/mbpp 2026-03-25T09:05:57,493 creating build/lib/evalscope/benchmarks/sciq 2026-03-25T09:05:57,494 copying evalscope/benchmarks/sciq/sciq_adapter.py -> build/lib/evalscope/benchmarks/sciq 2026-03-25T09:05:57,496 copying evalscope/benchmarks/sciq/__init__.py -> build/lib/evalscope/benchmarks/sciq 2026-03-25T09:05:57,498 creating build/lib/evalscope/benchmarks/mmmu_pro 2026-03-25T09:05:57,499 copying evalscope/benchmarks/mmmu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-03-25T09:05:57,501 copying evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-03-25T09:05:57,503 creating build/lib/evalscope/benchmarks/biomix_qa 2026-03-25T09:05:57,504 copying evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-03-25T09:05:57,506 copying evalscope/benchmarks/biomix_qa/__init__.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-03-25T09:05:57,508 creating build/lib/evalscope/benchmarks/musr 2026-03-25T09:05:57,509 copying evalscope/benchmarks/musr/musr_adapter.py -> build/lib/evalscope/benchmarks/musr 2026-03-25T09:05:57,511 copying evalscope/benchmarks/musr/__init__.py -> build/lib/evalscope/benchmarks/musr 2026-03-25T09:05:57,513 creating build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,514 copying evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,516 copying evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,517 copying evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,519 copying evalscope/benchmarks/text2image/tifa_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,521 copying evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,523 copying evalscope/benchmarks/text2image/__init__.py -> build/lib/evalscope/benchmarks/text2image 2026-03-25T09:05:57,524 creating build/lib/evalscope/benchmarks/science_qa 2026-03-25T09:05:57,525 copying evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/lib/evalscope/benchmarks/science_qa 2026-03-25T09:05:57,527 copying evalscope/benchmarks/science_qa/__init__.py -> build/lib/evalscope/benchmarks/science_qa 2026-03-25T09:05:57,529 creating build/lib/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:05:57,530 copying evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:05:57,533 copying evalscope/benchmarks/omnidoc_bench/utils.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:05:57,536 copying evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:05:57,538 copying evalscope/benchmarks/omnidoc_bench/metrics.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:05:57,541 copying evalscope/benchmarks/omnidoc_bench/__init__.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:05:57,543 creating build/lib/evalscope/benchmarks/coin_flip 2026-03-25T09:05:57,544 copying evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/lib/evalscope/benchmarks/coin_flip 2026-03-25T09:05:57,546 copying evalscope/benchmarks/coin_flip/__init__.py -> build/lib/evalscope/benchmarks/coin_flip 2026-03-25T09:05:57,548 creating build/lib/evalscope/benchmarks/mm_star 2026-03-25T09:05:57,549 copying evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/lib/evalscope/benchmarks/mm_star 2026-03-25T09:05:57,551 copying evalscope/benchmarks/mm_star/__init__.py -> build/lib/evalscope/benchmarks/mm_star 2026-03-25T09:05:57,553 creating build/lib/evalscope/benchmarks/mmlu 2026-03-25T09:05:57,554 copying evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/lib/evalscope/benchmarks/mmlu 2026-03-25T09:05:57,556 copying evalscope/benchmarks/mmlu/__init__.py -> build/lib/evalscope/benchmarks/mmlu 2026-03-25T09:05:57,558 creating build/lib/evalscope/benchmarks/simple_qa 2026-03-25T09:05:57,559 copying evalscope/benchmarks/simple_qa/__init__.py -> build/lib/evalscope/benchmarks/simple_qa 2026-03-25T09:05:57,561 copying evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/lib/evalscope/benchmarks/simple_qa 2026-03-25T09:05:57,563 creating build/lib/evalscope/benchmarks/bbh 2026-03-25T09:05:57,564 copying evalscope/benchmarks/bbh/bbh_adapter.py -> build/lib/evalscope/benchmarks/bbh 2026-03-25T09:05:57,567 copying evalscope/benchmarks/bbh/__init__.py -> build/lib/evalscope/benchmarks/bbh 2026-03-25T09:05:57,569 creating build/lib/evalscope/benchmarks/librispeech 2026-03-25T09:05:57,570 copying evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/lib/evalscope/benchmarks/librispeech 2026-03-25T09:05:57,572 copying evalscope/benchmarks/librispeech/__init__.py -> build/lib/evalscope/benchmarks/librispeech 2026-03-25T09:05:57,573 creating build/lib/evalscope/benchmarks/arc 2026-03-25T09:05:57,574 copying evalscope/benchmarks/arc/arc_adapter.py -> build/lib/evalscope/benchmarks/arc 2026-03-25T09:05:57,576 copying evalscope/benchmarks/arc/__init__.py -> build/lib/evalscope/benchmarks/arc 2026-03-25T09:05:57,578 creating build/lib/evalscope/benchmarks/blink 2026-03-25T09:05:57,579 copying evalscope/benchmarks/blink/blink_adapter.py -> build/lib/evalscope/benchmarks/blink 2026-03-25T09:05:57,581 copying evalscope/benchmarks/blink/__init__.py -> build/lib/evalscope/benchmarks/blink 2026-03-25T09:05:57,583 creating build/lib/evalscope/benchmarks/general_vqa 2026-03-25T09:05:57,583 copying evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/lib/evalscope/benchmarks/general_vqa 2026-03-25T09:05:57,586 copying evalscope/benchmarks/general_vqa/__init__.py -> build/lib/evalscope/benchmarks/general_vqa 2026-03-25T09:05:57,588 creating build/lib/evalscope/benchmarks/omni_bench 2026-03-25T09:05:57,588 copying evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/lib/evalscope/benchmarks/omni_bench 2026-03-25T09:05:57,591 copying evalscope/benchmarks/omni_bench/__init__.py -> build/lib/evalscope/benchmarks/omni_bench 2026-03-25T09:05:57,593 creating build/lib/evalscope/benchmarks/pumed_qa 2026-03-25T09:05:57,593 copying evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-03-25T09:05:57,596 copying evalscope/benchmarks/pumed_qa/__init__.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-03-25T09:05:57,598 creating build/lib/evalscope/benchmarks/humanevalplus 2026-03-25T09:05:57,599 copying evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-03-25T09:05:57,601 copying evalscope/benchmarks/humanevalplus/__init__.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-03-25T09:05:57,603 creating build/lib/evalscope/benchmarks/hle 2026-03-25T09:05:57,604 copying evalscope/benchmarks/hle/hle_adapter.py -> build/lib/evalscope/benchmarks/hle 2026-03-25T09:05:57,606 copying evalscope/benchmarks/hle/__init__.py -> build/lib/evalscope/benchmarks/hle 2026-03-25T09:05:57,608 creating build/lib/evalscope/benchmarks/math_vista 2026-03-25T09:05:57,609 copying evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/lib/evalscope/benchmarks/math_vista 2026-03-25T09:05:57,612 copying evalscope/benchmarks/math_vista/__init__.py -> build/lib/evalscope/benchmarks/math_vista 2026-03-25T09:05:57,614 creating build/lib/evalscope/benchmarks/mbppplus 2026-03-25T09:05:57,615 copying evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/lib/evalscope/benchmarks/mbppplus 2026-03-25T09:05:57,617 copying evalscope/benchmarks/mbppplus/__init__.py -> build/lib/evalscope/benchmarks/mbppplus 2026-03-25T09:05:57,619 creating build/lib/evalscope/benchmarks/logi_qa 2026-03-25T09:05:57,620 copying evalscope/benchmarks/logi_qa/__int__.py -> build/lib/evalscope/benchmarks/logi_qa 2026-03-25T09:05:57,622 copying evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/lib/evalscope/benchmarks/logi_qa 2026-03-25T09:05:57,624 creating build/lib/evalscope/benchmarks/mm_bench 2026-03-25T09:05:57,625 copying evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/lib/evalscope/benchmarks/mm_bench 2026-03-25T09:05:57,627 copying evalscope/benchmarks/mm_bench/__init__.py -> build/lib/evalscope/benchmarks/mm_bench 2026-03-25T09:05:57,629 creating build/lib/evalscope/benchmarks/bfcl 2026-03-25T09:05:57,630 copying evalscope/benchmarks/bfcl/__init__.py -> build/lib/evalscope/benchmarks/bfcl 2026-03-25T09:05:57,632 creating build/lib/evalscope/benchmarks/mmmu 2026-03-25T09:05:57,633 copying evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/lib/evalscope/benchmarks/mmmu 2026-03-25T09:05:57,636 copying evalscope/benchmarks/mmmu/__init__.py -> build/lib/evalscope/benchmarks/mmmu 2026-03-25T09:05:57,638 creating build/lib/evalscope/benchmarks/longbench_v2 2026-03-25T09:05:57,639 copying evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-03-25T09:05:57,641 copying evalscope/benchmarks/longbench_v2/__init__.py -> build/lib/evalscope/benchmarks/longbench_v2 2026-03-25T09:05:57,656 creating build/lib/evalscope/benchmarks/med_mcqa 2026-03-25T09:05:57,657 copying evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-03-25T09:05:57,659 copying evalscope/benchmarks/med_mcqa/__init__.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-03-25T09:05:57,660 creating build/lib/evalscope/benchmarks/frames 2026-03-25T09:05:57,661 copying evalscope/benchmarks/frames/utils.py -> build/lib/evalscope/benchmarks/frames 2026-03-25T09:05:57,663 copying evalscope/benchmarks/frames/frames_adapter.py -> build/lib/evalscope/benchmarks/frames 2026-03-25T09:05:57,665 copying evalscope/benchmarks/frames/__init__.py -> build/lib/evalscope/benchmarks/frames 2026-03-25T09:05:57,667 creating build/lib/evalscope/benchmarks/drivelology 2026-03-25T09:05:57,668 copying evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-03-25T09:05:57,670 copying evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-03-25T09:05:57,672 copying evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-03-25T09:05:57,675 copying evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-03-25T09:05:57,677 copying evalscope/benchmarks/drivelology/__init__.py -> build/lib/evalscope/benchmarks/drivelology 2026-03-25T09:05:57,679 creating build/lib/evalscope/benchmarks/needle_haystack 2026-03-25T09:05:57,680 copying evalscope/benchmarks/needle_haystack/utils.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-03-25T09:05:57,682 copying evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-03-25T09:05:57,684 copying evalscope/benchmarks/needle_haystack/__init__.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-03-25T09:05:57,686 creating build/lib/evalscope/benchmarks/vstar_bench 2026-03-25T09:05:57,687 copying evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-03-25T09:05:57,689 copying evalscope/benchmarks/vstar_bench/__init__.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-03-25T09:05:57,691 creating build/lib/evalscope/benchmarks/mri_mcqa 2026-03-25T09:05:57,692 copying evalscope/benchmarks/mri_mcqa/__init__.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-03-25T09:05:57,693 copying evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-03-25T09:05:57,696 creating build/lib/evalscope/benchmarks/maritime_bench 2026-03-25T09:05:57,697 copying evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-03-25T09:05:57,699 copying evalscope/benchmarks/maritime_bench/__init__.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-03-25T09:05:57,701 creating build/lib/evalscope/benchmarks/alpaca_eval 2026-03-25T09:05:57,702 copying evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-03-25T09:05:57,705 copying evalscope/benchmarks/alpaca_eval/__init__.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-03-25T09:05:57,706 creating build/lib/evalscope/benchmarks/general_fc 2026-03-25T09:05:57,707 copying evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/lib/evalscope/benchmarks/general_fc 2026-03-25T09:05:57,710 copying evalscope/benchmarks/general_fc/__init__.py -> build/lib/evalscope/benchmarks/general_fc 2026-03-25T09:05:57,712 creating build/lib/evalscope/benchmarks/poly_math 2026-03-25T09:05:57,712 copying evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/lib/evalscope/benchmarks/poly_math 2026-03-25T09:05:57,715 copying evalscope/benchmarks/poly_math/__init__.py -> build/lib/evalscope/benchmarks/poly_math 2026-03-25T09:05:57,716 creating build/lib/evalscope/benchmarks/mmlu_redux 2026-03-25T09:05:57,717 copying evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-03-25T09:05:57,720 copying evalscope/benchmarks/mmlu_redux/__init__.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-03-25T09:05:57,721 creating build/lib/evalscope/benchmarks/image_edit 2026-03-25T09:05:57,722 copying evalscope/benchmarks/image_edit/__init__.py -> build/lib/evalscope/benchmarks/image_edit 2026-03-25T09:05:57,724 creating build/lib/evalscope/benchmarks/truthful_qa 2026-03-25T09:05:57,725 copying evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-03-25T09:05:57,727 copying evalscope/benchmarks/truthful_qa/__init__.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-03-25T09:05:57,729 creating build/lib/evalscope/benchmarks/general_mcq 2026-03-25T09:05:57,730 copying evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/lib/evalscope/benchmarks/general_mcq 2026-03-25T09:05:57,732 copying evalscope/benchmarks/general_mcq/__init__.py -> build/lib/evalscope/benchmarks/general_mcq 2026-03-25T09:05:57,734 creating build/lib/evalscope/benchmarks/ai2d 2026-03-25T09:05:57,735 copying evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/lib/evalscope/benchmarks/ai2d 2026-03-25T09:05:57,737 copying evalscope/benchmarks/ai2d/__init__.py -> build/lib/evalscope/benchmarks/ai2d 2026-03-25T09:05:57,739 creating build/lib/evalscope/benchmarks/aa_lcr 2026-03-25T09:05:57,740 copying evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-03-25T09:05:57,742 copying evalscope/benchmarks/aa_lcr/__init__.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-03-25T09:05:57,744 creating build/lib/evalscope/benchmarks/piqa 2026-03-25T09:05:57,745 copying evalscope/benchmarks/piqa/piqa_adapter.py -> build/lib/evalscope/benchmarks/piqa 2026-03-25T09:05:57,747 copying evalscope/benchmarks/piqa/__init__.py -> build/lib/evalscope/benchmarks/piqa 2026-03-25T09:05:57,749 creating build/lib/evalscope/benchmarks/a_okvqa 2026-03-25T09:05:57,750 copying evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-03-25T09:05:57,752 copying evalscope/benchmarks/a_okvqa/__init__.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-03-25T09:05:57,753 creating build/lib/evalscope/benchmarks/general_arena 2026-03-25T09:05:57,754 copying evalscope/benchmarks/general_arena/utils.py -> build/lib/evalscope/benchmarks/general_arena 2026-03-25T09:05:57,757 copying evalscope/benchmarks/general_arena/__init__.py -> build/lib/evalscope/benchmarks/general_arena 2026-03-25T09:05:57,758 copying evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/lib/evalscope/benchmarks/general_arena 2026-03-25T09:05:57,761 creating build/lib/evalscope/benchmarks/qasc 2026-03-25T09:05:57,762 copying evalscope/benchmarks/qasc/__init__.py -> build/lib/evalscope/benchmarks/qasc 2026-03-25T09:05:57,764 copying evalscope/benchmarks/qasc/qasc_adapter.py -> build/lib/evalscope/benchmarks/qasc 2026-03-25T09:05:57,766 creating build/lib/evalscope/benchmarks/mmlu_pro 2026-03-25T09:05:57,767 copying evalscope/benchmarks/mmlu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-03-25T09:05:57,769 copying evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-03-25T09:05:57,771 creating build/lib/evalscope/benchmarks/mgsm 2026-03-25T09:05:57,772 copying evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/lib/evalscope/benchmarks/mgsm 2026-03-25T09:05:57,774 copying evalscope/benchmarks/mgsm/__init__.py -> build/lib/evalscope/benchmarks/mgsm 2026-03-25T09:05:57,776 creating build/lib/evalscope/benchmarks/real_world_qa 2026-03-25T09:05:57,777 copying evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-03-25T09:05:57,779 copying evalscope/benchmarks/real_world_qa/__init__.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-03-25T09:05:57,781 creating build/lib/evalscope/benchmarks/cmmu 2026-03-25T09:05:57,782 copying evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmu 2026-03-25T09:05:57,784 copying evalscope/benchmarks/cmmu/prompt.py -> build/lib/evalscope/benchmarks/cmmu 2026-03-25T09:05:57,786 copying evalscope/benchmarks/cmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmu 2026-03-25T09:05:57,788 creating build/lib/evalscope/benchmarks/gsm8k 2026-03-25T09:05:57,789 copying evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/lib/evalscope/benchmarks/gsm8k 2026-03-25T09:05:57,791 copying evalscope/benchmarks/gsm8k/__init__.py -> build/lib/evalscope/benchmarks/gsm8k 2026-03-25T09:05:57,793 creating build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,794 copying evalscope/benchmarks/ner/bc2gm_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,796 copying evalscope/benchmarks/ner/copious_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,799 copying evalscope/benchmarks/ner/fin_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,800 copying evalscope/benchmarks/ner/anat_em_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,803 copying evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,805 copying evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,807 copying evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,809 copying evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,811 copying evalscope/benchmarks/ner/wnut2017_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,813 copying evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,815 copying evalscope/benchmarks/ner/genia_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,817 copying evalscope/benchmarks/ner/ncbi_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,819 copying evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,821 copying evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,823 copying evalscope/benchmarks/ner/cross_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,825 copying evalscope/benchmarks/ner/conll2003_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,827 copying evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,829 copying evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,831 copying evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,833 copying evalscope/benchmarks/ner/conllpp_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,835 copying evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,838 copying evalscope/benchmarks/ner/jnlpba_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,840 copying evalscope/benchmarks/ner/__init__.py -> build/lib/evalscope/benchmarks/ner 2026-03-25T09:05:57,842 creating build/lib/evalscope/benchmarks/cmmmu 2026-03-25T09:05:57,843 copying evalscope/benchmarks/cmmmu/utils.py -> build/lib/evalscope/benchmarks/cmmmu 2026-03-25T09:05:57,845 copying evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmmu 2026-03-25T09:05:57,848 copying evalscope/benchmarks/cmmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmmu 2026-03-25T09:05:57,850 creating build/lib/evalscope/benchmarks/trivia_qa 2026-03-25T09:05:57,851 copying evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-03-25T09:05:57,853 copying evalscope/benchmarks/trivia_qa/__init__.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-03-25T09:05:57,855 creating build/lib/evalscope/benchmarks/visu_logic 2026-03-25T09:05:57,855 copying evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/lib/evalscope/benchmarks/visu_logic 2026-03-25T09:05:57,857 copying evalscope/benchmarks/visu_logic/__init__.py -> build/lib/evalscope/benchmarks/visu_logic 2026-03-25T09:05:57,859 creating build/lib/evalscope/benchmarks/zebralogicbench 2026-03-25T09:05:57,860 copying evalscope/benchmarks/zebralogicbench/utils.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-03-25T09:05:57,862 copying evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-03-25T09:05:57,865 copying evalscope/benchmarks/zebralogicbench/__init__.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-03-25T09:05:57,867 creating build/lib/evalscope/benchmarks/healthbench 2026-03-25T09:05:57,868 copying evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/lib/evalscope/benchmarks/healthbench 2026-03-25T09:05:57,870 copying evalscope/benchmarks/healthbench/utils.py -> build/lib/evalscope/benchmarks/healthbench 2026-03-25T09:05:57,872 copying evalscope/benchmarks/healthbench/__init__.py -> build/lib/evalscope/benchmarks/healthbench 2026-03-25T09:05:57,874 creating build/lib/evalscope/benchmarks/ocr_bench 2026-03-25T09:05:57,875 copying evalscope/benchmarks/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench 2026-03-25T09:05:57,877 creating build/lib/evalscope/benchmarks/math_500 2026-03-25T09:05:57,878 copying evalscope/benchmarks/math_500/math_500_adapter.py -> build/lib/evalscope/benchmarks/math_500 2026-03-25T09:05:57,880 copying evalscope/benchmarks/math_500/__init__.py -> build/lib/evalscope/benchmarks/math_500 2026-03-25T09:05:57,882 creating build/lib/evalscope/benchmarks/gsm8k_v 2026-03-25T09:05:57,883 copying evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-03-25T09:05:57,885 copying evalscope/benchmarks/gsm8k_v/__init__.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-03-25T09:05:57,887 creating build/lib/evalscope/benchmarks/micro_vqa 2026-03-25T09:05:57,888 copying evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-03-25T09:05:57,891 copying evalscope/benchmarks/micro_vqa/__init__.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-03-25T09:05:57,893 creating build/lib/evalscope/benchmarks/math_vision 2026-03-25T09:05:57,894 copying evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/lib/evalscope/benchmarks/math_vision 2026-03-25T09:05:57,896 copying evalscope/benchmarks/math_vision/__init__.py -> build/lib/evalscope/benchmarks/math_vision 2026-03-25T09:05:57,898 creating build/lib/evalscope/benchmarks/infovqa 2026-03-25T09:05:57,899 copying evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/lib/evalscope/benchmarks/infovqa 2026-03-25T09:05:57,902 copying evalscope/benchmarks/infovqa/__init__.py -> build/lib/evalscope/benchmarks/infovqa 2026-03-25T09:05:57,903 creating build/lib/evalscope/benchmarks/olympiad_bench 2026-03-25T09:05:57,904 copying evalscope/benchmarks/olympiad_bench/utils.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-03-25T09:05:57,907 copying evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-03-25T09:05:57,910 copying evalscope/benchmarks/olympiad_bench/__init__.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-03-25T09:05:57,911 creating build/lib/evalscope/benchmarks/eq_bench 2026-03-25T09:05:57,912 copying evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/lib/evalscope/benchmarks/eq_bench 2026-03-25T09:05:57,915 copying evalscope/benchmarks/eq_bench/answer_validation.py -> build/lib/evalscope/benchmarks/eq_bench 2026-03-25T09:05:57,917 copying evalscope/benchmarks/eq_bench/__init__.py -> build/lib/evalscope/benchmarks/eq_bench 2026-03-25T09:05:57,919 creating build/lib/evalscope/benchmarks/tool_bench 2026-03-25T09:05:57,920 copying evalscope/benchmarks/tool_bench/utils.py -> build/lib/evalscope/benchmarks/tool_bench 2026-03-25T09:05:57,922 copying evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/lib/evalscope/benchmarks/tool_bench 2026-03-25T09:05:57,924 copying evalscope/benchmarks/tool_bench/__init__.py -> build/lib/evalscope/benchmarks/tool_bench 2026-03-25T09:05:57,926 creating build/lib/evalscope/benchmarks/simple_vqa 2026-03-25T09:05:57,927 copying evalscope/benchmarks/simple_vqa/__init__.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-03-25T09:05:57,929 copying evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-03-25T09:05:57,931 creating build/lib/evalscope/benchmarks/music_trivia 2026-03-25T09:05:57,932 copying evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/lib/evalscope/benchmarks/music_trivia 2026-03-25T09:05:57,934 copying evalscope/benchmarks/music_trivia/__init__.py -> build/lib/evalscope/benchmarks/music_trivia 2026-03-25T09:05:57,936 creating build/lib/evalscope/benchmarks/hellaswag 2026-03-25T09:05:57,937 copying evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/lib/evalscope/benchmarks/hellaswag 2026-03-25T09:05:57,939 copying evalscope/benchmarks/hellaswag/__init__.py -> build/lib/evalscope/benchmarks/hellaswag 2026-03-25T09:05:57,941 creating build/lib/evalscope/benchmarks/winogrande 2026-03-25T09:05:57,942 copying evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/lib/evalscope/benchmarks/winogrande 2026-03-25T09:05:57,944 copying evalscope/benchmarks/winogrande/__init__.py -> build/lib/evalscope/benchmarks/winogrande 2026-03-25T09:05:57,946 creating build/lib/evalscope/benchmarks/fleurs 2026-03-25T09:05:57,947 copying evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/lib/evalscope/benchmarks/fleurs 2026-03-25T09:05:57,949 copying evalscope/benchmarks/fleurs/__init__.py -> build/lib/evalscope/benchmarks/fleurs 2026-03-25T09:05:57,951 creating build/lib/evalscope/benchmarks/chinese_simple_qa 2026-03-25T09:05:57,952 copying evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-03-25T09:05:57,955 copying evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-03-25T09:05:57,956 creating build/lib/evalscope/benchmarks/minerva_math 2026-03-25T09:05:57,957 copying evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/lib/evalscope/benchmarks/minerva_math 2026-03-25T09:05:57,960 copying evalscope/benchmarks/minerva_math/__init__.py -> build/lib/evalscope/benchmarks/minerva_math 2026-03-25T09:05:57,961 creating build/lib/evalscope/benchmarks/math_verse 2026-03-25T09:05:57,962 copying evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/lib/evalscope/benchmarks/math_verse 2026-03-25T09:05:57,965 copying evalscope/benchmarks/math_verse/__init__.py -> build/lib/evalscope/benchmarks/math_verse 2026-03-25T09:05:57,967 creating build/lib/evalscope/benchmarks/amc 2026-03-25T09:05:57,968 copying evalscope/benchmarks/amc/amc_adapter.py -> build/lib/evalscope/benchmarks/amc 2026-03-25T09:05:57,970 copying evalscope/benchmarks/amc/__init__.py -> build/lib/evalscope/benchmarks/amc 2026-03-25T09:05:57,972 creating build/lib/evalscope/benchmarks/zerobench 2026-03-25T09:05:57,972 copying evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/lib/evalscope/benchmarks/zerobench 2026-03-25T09:05:57,974 copying evalscope/benchmarks/zerobench/__init__.py -> build/lib/evalscope/benchmarks/zerobench 2026-03-25T09:05:57,976 creating build/lib/evalscope/benchmarks/hmmt25 2026-03-25T09:05:57,977 copying evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/lib/evalscope/benchmarks/hmmt25 2026-03-25T09:05:57,980 creating build/lib/evalscope/benchmarks/chartqa 2026-03-25T09:05:57,981 copying evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/lib/evalscope/benchmarks/chartqa 2026-03-25T09:05:57,983 copying evalscope/benchmarks/chartqa/utils.py -> build/lib/evalscope/benchmarks/chartqa 2026-03-25T09:05:57,984 copying evalscope/benchmarks/chartqa/__init__.py -> build/lib/evalscope/benchmarks/chartqa 2026-03-25T09:05:57,986 creating build/lib/evalscope/benchmarks/gpqa 2026-03-25T09:05:57,987 copying evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/lib/evalscope/benchmarks/gpqa 2026-03-25T09:05:57,989 copying evalscope/benchmarks/gpqa/prompt.py -> build/lib/evalscope/benchmarks/gpqa 2026-03-25T09:05:57,991 copying evalscope/benchmarks/gpqa/__init__.py -> build/lib/evalscope/benchmarks/gpqa 2026-03-25T09:05:57,993 creating build/lib/evalscope/benchmarks/general_vmcq 2026-03-25T09:05:57,994 copying evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-03-25T09:05:57,996 copying evalscope/benchmarks/general_vmcq/__init__.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-03-25T09:05:57,998 creating build/lib/evalscope/benchmarks/scicode 2026-03-25T09:05:57,999 copying evalscope/benchmarks/scicode/prompt_templates.py -> build/lib/evalscope/benchmarks/scicode 2026-03-25T09:05:58,001 copying evalscope/benchmarks/scicode/scicode_adapter.py -> build/lib/evalscope/benchmarks/scicode 2026-03-25T09:05:58,004 copying evalscope/benchmarks/scicode/__init__.py -> build/lib/evalscope/benchmarks/scicode 2026-03-25T09:05:58,005 copying evalscope/benchmarks/scicode/util.py -> build/lib/evalscope/benchmarks/scicode 2026-03-25T09:05:58,007 creating build/lib/evalscope/benchmarks/multi_if 2026-03-25T09:05:58,008 copying evalscope/benchmarks/multi_if/ifeval.py -> build/lib/evalscope/benchmarks/multi_if 2026-03-25T09:05:58,012 copying evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/lib/evalscope/benchmarks/multi_if 2026-03-25T09:05:58,014 copying evalscope/benchmarks/multi_if/metrics.py -> build/lib/evalscope/benchmarks/multi_if 2026-03-25T09:05:58,016 copying evalscope/benchmarks/multi_if/__init__.py -> build/lib/evalscope/benchmarks/multi_if 2026-03-25T09:05:58,019 creating build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,019 copying evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,021 copying evalscope/benchmarks/ifeval/utils.py -> build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,024 copying evalscope/benchmarks/ifeval/instructions_registry.py -> build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,026 copying evalscope/benchmarks/ifeval/instructions_util.py -> build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,028 copying evalscope/benchmarks/ifeval/__init__.py -> build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,030 copying evalscope/benchmarks/ifeval/instructions.py -> build/lib/evalscope/benchmarks/ifeval 2026-03-25T09:05:58,034 creating build/lib/evalscope/benchmarks/halu_eval 2026-03-25T09:05:58,035 copying evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/lib/evalscope/benchmarks/halu_eval 2026-03-25T09:05:58,038 copying evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/lib/evalscope/benchmarks/halu_eval 2026-03-25T09:05:58,040 copying evalscope/benchmarks/halu_eval/__init__.py -> build/lib/evalscope/benchmarks/halu_eval 2026-03-25T09:05:58,042 creating build/lib/evalscope/benchmarks/process_bench 2026-03-25T09:05:58,043 copying evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/lib/evalscope/benchmarks/process_bench 2026-03-25T09:05:58,046 copying evalscope/benchmarks/process_bench/__init__.py -> build/lib/evalscope/benchmarks/process_bench 2026-03-25T09:05:58,047 creating build/lib/evalscope/benchmarks/general_qa 2026-03-25T09:05:58,048 copying evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/lib/evalscope/benchmarks/general_qa 2026-03-25T09:05:58,051 copying evalscope/benchmarks/general_qa/__init__.py -> build/lib/evalscope/benchmarks/general_qa 2026-03-25T09:05:58,053 creating build/lib/evalscope/benchmarks/docmath 2026-03-25T09:05:58,054 copying evalscope/benchmarks/docmath/utils.py -> build/lib/evalscope/benchmarks/docmath 2026-03-25T09:05:58,056 copying evalscope/benchmarks/docmath/docmath_adapter.py -> build/lib/evalscope/benchmarks/docmath 2026-03-25T09:05:58,058 copying evalscope/benchmarks/docmath/__init__.py -> build/lib/evalscope/benchmarks/docmath 2026-03-25T09:05:58,060 creating build/lib/evalscope/benchmarks/terminal_bench 2026-03-25T09:05:58,061 copying evalscope/benchmarks/terminal_bench/utils.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-03-25T09:05:58,063 copying evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-03-25T09:05:58,065 copying evalscope/benchmarks/terminal_bench/__init__.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-03-25T09:05:58,067 creating build/lib/evalscope/benchmarks/race 2026-03-25T09:05:58,068 copying evalscope/benchmarks/race/race_adapter.py -> build/lib/evalscope/benchmarks/race 2026-03-25T09:05:58,070 copying evalscope/benchmarks/race/__init__.py -> build/lib/evalscope/benchmarks/race 2026-03-25T09:05:58,072 creating build/lib/evalscope/benchmarks/competition_math 2026-03-25T09:05:58,073 copying evalscope/benchmarks/competition_math/__init__.py -> build/lib/evalscope/benchmarks/competition_math 2026-03-25T09:05:58,075 copying evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/lib/evalscope/benchmarks/competition_math 2026-03-25T09:05:58,077 creating build/lib/evalscope/benchmarks/cl_bench 2026-03-25T09:05:58,079 copying evalscope/benchmarks/cl_bench/utils.py -> build/lib/evalscope/benchmarks/cl_bench 2026-03-25T09:05:58,081 copying evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/lib/evalscope/benchmarks/cl_bench 2026-03-25T09:05:58,083 copying evalscope/benchmarks/cl_bench/__init__.py -> build/lib/evalscope/benchmarks/cl_bench 2026-03-25T09:05:58,085 creating build/lib/evalscope/benchmarks/arena_hard 2026-03-25T09:05:58,086 copying evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/lib/evalscope/benchmarks/arena_hard 2026-03-25T09:05:58,088 copying evalscope/benchmarks/arena_hard/utils.py -> build/lib/evalscope/benchmarks/arena_hard 2026-03-25T09:05:58,090 copying evalscope/benchmarks/arena_hard/__init__.py -> build/lib/evalscope/benchmarks/arena_hard 2026-03-25T09:05:58,092 creating build/lib/evalscope/benchmarks/iquiz 2026-03-25T09:05:58,093 copying evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/lib/evalscope/benchmarks/iquiz 2026-03-25T09:05:58,095 copying evalscope/benchmarks/iquiz/__init__.py -> build/lib/evalscope/benchmarks/iquiz 2026-03-25T09:05:58,097 creating build/lib/evalscope/benchmarks/swe_bench 2026-03-25T09:05:58,098 copying evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-03-25T09:05:58,101 copying evalscope/benchmarks/swe_bench/utils.py -> build/lib/evalscope/benchmarks/swe_bench 2026-03-25T09:05:58,103 copying evalscope/benchmarks/swe_bench/build_images.py -> build/lib/evalscope/benchmarks/swe_bench 2026-03-25T09:05:58,105 copying evalscope/benchmarks/swe_bench/__init__.py -> build/lib/evalscope/benchmarks/swe_bench 2026-03-25T09:05:58,107 creating build/lib/evalscope/benchmarks/cmmlu 2026-03-25T09:05:58,109 copying evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/lib/evalscope/benchmarks/cmmlu 2026-03-25T09:05:58,111 copying evalscope/benchmarks/cmmlu/__init__.py -> build/lib/evalscope/benchmarks/cmmlu 2026-03-25T09:05:58,113 creating build/lib/evalscope/benchmarks/commonsense_qa 2026-03-25T09:05:58,114 copying evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-03-25T09:05:58,116 copying evalscope/benchmarks/commonsense_qa/__init__.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-03-25T09:05:58,118 creating build/lib/evalscope/benchmarks/torgo 2026-03-25T09:05:58,119 copying evalscope/benchmarks/torgo/torgo_adapter.py -> build/lib/evalscope/benchmarks/torgo 2026-03-25T09:05:58,121 copying evalscope/benchmarks/torgo/__init__.py -> build/lib/evalscope/benchmarks/torgo 2026-03-25T09:05:58,123 creating build/lib/evalscope/benchmarks/wmt 2026-03-25T09:05:58,124 copying evalscope/benchmarks/wmt/wmt24_adapter.py -> build/lib/evalscope/benchmarks/wmt 2026-03-25T09:05:58,126 copying evalscope/benchmarks/wmt/__init__.py -> build/lib/evalscope/benchmarks/wmt 2026-03-25T09:05:58,128 creating build/lib/evalscope/benchmarks/hallusion_bench 2026-03-25T09:05:58,128 copying evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-03-25T09:05:58,131 copying evalscope/benchmarks/hallusion_bench/__init__.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-03-25T09:05:58,133 creating build/lib/evalscope/benchmarks/tau_bench 2026-03-25T09:05:58,134 copying evalscope/benchmarks/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench 2026-03-25T09:05:58,136 creating build/lib/evalscope/benchmarks/docvqa 2026-03-25T09:05:58,137 copying evalscope/benchmarks/docvqa/__init__.py -> build/lib/evalscope/benchmarks/docvqa 2026-03-25T09:05:58,139 copying evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/lib/evalscope/benchmarks/docvqa 2026-03-25T09:05:58,141 creating build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,142 copying evalscope/benchmarks/live_code_bench/extract_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,144 copying evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,146 copying evalscope/benchmarks/live_code_bench/testing_util.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,149 copying evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,151 copying evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,153 copying evalscope/benchmarks/live_code_bench/prompts.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,156 copying evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,158 copying evalscope/benchmarks/live_code_bench/__init__.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,159 copying evalscope/benchmarks/live_code_bench/load_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-03-25T09:05:58,162 creating build/lib/evalscope/benchmarks/math_qa 2026-03-25T09:05:58,163 copying evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/lib/evalscope/benchmarks/math_qa 2026-03-25T09:05:58,165 copying evalscope/benchmarks/math_qa/__init__.py -> build/lib/evalscope/benchmarks/math_qa 2026-03-25T09:05:58,167 creating build/lib/evalscope/benchmarks/drop 2026-03-25T09:05:58,168 copying evalscope/benchmarks/drop/utils.py -> build/lib/evalscope/benchmarks/drop 2026-03-25T09:05:58,170 copying evalscope/benchmarks/drop/drop_adapter.py -> build/lib/evalscope/benchmarks/drop 2026-03-25T09:05:58,173 copying evalscope/benchmarks/drop/__init__.py -> build/lib/evalscope/benchmarks/drop 2026-03-25T09:05:58,175 creating build/lib/evalscope/benchmarks/mmmlu 2026-03-25T09:05:58,176 copying evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/lib/evalscope/benchmarks/mmmlu 2026-03-25T09:05:58,178 copying evalscope/benchmarks/mmmlu/prompt.py -> build/lib/evalscope/benchmarks/mmmlu 2026-03-25T09:05:58,180 copying evalscope/benchmarks/mmmlu/__init__.py -> build/lib/evalscope/benchmarks/mmmlu 2026-03-25T09:05:58,182 creating build/lib/evalscope/benchmarks/refcoco 2026-03-25T09:05:58,183 copying evalscope/benchmarks/refcoco/utils.py -> build/lib/evalscope/benchmarks/refcoco 2026-03-25T09:05:58,185 copying evalscope/benchmarks/refcoco/evaluation_lib.py -> build/lib/evalscope/benchmarks/refcoco 2026-03-25T09:05:58,186 copying evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/lib/evalscope/benchmarks/refcoco 2026-03-25T09:05:58,189 copying evalscope/benchmarks/refcoco/__init__.py -> build/lib/evalscope/benchmarks/refcoco 2026-03-25T09:05:58,190 creating build/lib/evalscope/benchmarks/siqa 2026-03-25T09:05:58,191 copying evalscope/benchmarks/siqa/siqa_adapter.py -> build/lib/evalscope/benchmarks/siqa 2026-03-25T09:05:58,193 copying evalscope/benchmarks/siqa/__init__.py -> build/lib/evalscope/benchmarks/siqa 2026-03-25T09:05:58,195 creating build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,196 copying evalscope/benchmarks/ifbench/evaluation_lib.py -> build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,198 copying evalscope/benchmarks/ifbench/instructions_registry.py -> build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,200 copying evalscope/benchmarks/ifbench/instructions_util.py -> build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,215 copying evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,217 copying evalscope/benchmarks/ifbench/__init__.py -> build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,219 copying evalscope/benchmarks/ifbench/instructions.py -> build/lib/evalscope/benchmarks/ifbench 2026-03-25T09:05:58,223 creating build/lib/evalscope/benchmarks/data_collection 2026-03-25T09:05:58,224 copying evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/lib/evalscope/benchmarks/data_collection 2026-03-25T09:05:58,227 copying evalscope/benchmarks/data_collection/__init__.py -> build/lib/evalscope/benchmarks/data_collection 2026-03-25T09:05:58,229 creating build/lib/evalscope/benchmarks/humaneval 2026-03-25T09:05:58,230 copying evalscope/benchmarks/humaneval/utils.py -> build/lib/evalscope/benchmarks/humaneval 2026-03-25T09:05:58,232 copying evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/lib/evalscope/benchmarks/humaneval 2026-03-25T09:05:58,234 copying evalscope/benchmarks/humaneval/__init__.py -> build/lib/evalscope/benchmarks/humaneval 2026-03-25T09:05:58,236 creating build/lib/evalscope/benchmarks/openai_mrcr 2026-03-25T09:05:58,237 copying evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-03-25T09:05:58,240 copying evalscope/benchmarks/openai_mrcr/utils.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-03-25T09:05:58,242 copying evalscope/benchmarks/openai_mrcr/__init__.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-03-25T09:05:58,244 creating build/lib/evalscope/benchmarks/bfcl/v3 2026-03-25T09:05:58,245 copying evalscope/benchmarks/bfcl/v3/utils.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-03-25T09:05:58,247 copying evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-03-25T09:05:58,250 copying evalscope/benchmarks/bfcl/v3/generation.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-03-25T09:05:58,252 copying evalscope/benchmarks/bfcl/v3/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-03-25T09:05:58,254 creating build/lib/evalscope/benchmarks/bfcl/v4 2026-03-25T09:05:58,255 copying evalscope/benchmarks/bfcl/v4/utils.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-03-25T09:05:58,258 copying evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-03-25T09:05:58,260 copying evalscope/benchmarks/bfcl/v4/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-03-25T09:05:58,262 creating build/lib/evalscope/benchmarks/poly_math/utils 2026-03-25T09:05:58,263 copying evalscope/benchmarks/poly_math/utils/instruction.py -> build/lib/evalscope/benchmarks/poly_math/utils 2026-03-25T09:05:58,266 creating build/lib/evalscope/benchmarks/image_edit/gedit 2026-03-25T09:05:58,267 copying evalscope/benchmarks/image_edit/gedit/utils.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-03-25T09:05:58,269 copying evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-03-25T09:05:58,272 copying evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-03-25T09:05:58,274 copying evalscope/benchmarks/image_edit/gedit/__init__.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-03-25T09:05:58,276 creating build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,277 copying evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,279 copying evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,281 copying evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,283 copying evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,285 copying evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,286 copying evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:05:58,289 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,290 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,292 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,295 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,297 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,299 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,301 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,303 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,306 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,308 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:05:58,310 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-03-25T09:05:58,311 copying evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-03-25T09:05:58,314 copying evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-03-25T09:05:58,316 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:05:58,317 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:05:58,320 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:05:58,324 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:05:58,326 creating build/lib/evalscope/benchmarks/scicode/docker 2026-03-25T09:05:58,327 copying evalscope/benchmarks/scicode/docker/process_data.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-03-25T09:05:58,329 copying evalscope/benchmarks/scicode/docker/test_util.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-03-25T09:05:58,332 creating build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:05:58,333 copying evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:05:58,335 copying evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:05:58,337 copying evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:05:58,339 creating build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:05:58,340 copying evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:05:58,343 copying evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:05:58,345 copying evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:05:58,347 creating build/lib/evalscope/third_party/longbench_write 2026-03-25T09:05:58,348 copying evalscope/third_party/longbench_write/utils.py -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:05:58,350 copying evalscope/third_party/longbench_write/infer.py -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:05:58,352 copying evalscope/third_party/longbench_write/eval.py -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:05:58,354 copying evalscope/third_party/longbench_write/longbench_write.py -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:05:58,356 copying evalscope/third_party/longbench_write/__init__.py -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:05:58,358 creating build/lib/evalscope/third_party/thinkbench 2026-03-25T09:05:58,359 copying evalscope/third_party/thinkbench/infer.py -> build/lib/evalscope/third_party/thinkbench 2026-03-25T09:05:58,361 copying evalscope/third_party/thinkbench/eval.py -> build/lib/evalscope/third_party/thinkbench 2026-03-25T09:05:58,364 copying evalscope/third_party/thinkbench/__init__.py -> build/lib/evalscope/third_party/thinkbench 2026-03-25T09:05:58,366 creating build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:05:58,367 copying evalscope/third_party/toolbench_static/infer.py -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:05:58,371 copying evalscope/third_party/toolbench_static/toolbench_static.py -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:05:58,372 copying evalscope/third_party/toolbench_static/eval.py -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:05:58,375 copying evalscope/third_party/toolbench_static/__init__.py -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:05:58,377 creating build/lib/evalscope/third_party/longbench_write/resources 2026-03-25T09:05:58,378 copying evalscope/third_party/longbench_write/resources/__init__.py -> build/lib/evalscope/third_party/longbench_write/resources 2026-03-25T09:05:58,380 creating build/lib/evalscope/third_party/longbench_write/tools 2026-03-25T09:05:58,382 copying evalscope/third_party/longbench_write/tools/data_etl.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-03-25T09:05:58,384 copying evalscope/third_party/longbench_write/tools/__init__.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-03-25T09:05:58,386 copying evalscope/third_party/longbench_write/tools/openai_api.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-03-25T09:05:58,389 creating build/lib/evalscope/third_party/thinkbench/tools 2026-03-25T09:05:58,390 copying evalscope/third_party/thinkbench/tools/utils.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-03-25T09:05:58,392 copying evalscope/third_party/thinkbench/tools/llm.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-03-25T09:05:58,394 copying evalscope/third_party/thinkbench/tools/__init__.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-03-25T09:05:58,396 creating build/lib/evalscope/third_party/toolbench_static/llm 2026-03-25T09:05:58,397 copying evalscope/third_party/toolbench_static/llm/__init__.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-03-25T09:05:58,399 copying evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-03-25T09:05:58,401 creating build/lib/evalscope/utils/doc_utils 2026-03-25T09:05:58,402 copying evalscope/utils/doc_utils/readme_generator.py -> build/lib/evalscope/utils/doc_utils 2026-03-25T09:05:58,405 copying evalscope/utils/doc_utils/generate_dataset_md.py -> build/lib/evalscope/utils/doc_utils 2026-03-25T09:05:58,408 copying evalscope/utils/doc_utils/translate_description.py -> build/lib/evalscope/utils/doc_utils 2026-03-25T09:05:58,410 copying evalscope/utils/doc_utils/benchmark_stats.py -> build/lib/evalscope/utils/doc_utils 2026-03-25T09:05:58,413 copying evalscope/utils/doc_utils/__init__.py -> build/lib/evalscope/utils/doc_utils 2026-03-25T09:05:58,415 creating build/lib/evalscope/utils/tqdm_utils 2026-03-25T09:05:58,416 copying evalscope/utils/tqdm_utils/tqdm_logging.py -> build/lib/evalscope/utils/tqdm_utils 2026-03-25T09:05:58,419 copying evalscope/utils/tqdm_utils/progress_tracker.py -> build/lib/evalscope/utils/tqdm_utils 2026-03-25T09:05:58,421 copying evalscope/utils/tqdm_utils/__init__.py -> build/lib/evalscope/utils/tqdm_utils 2026-03-25T09:05:58,423 creating build/lib/evalscope/service/utils 2026-03-25T09:05:58,424 copying evalscope/service/utils/log.py -> build/lib/evalscope/service/utils 2026-03-25T09:05:58,426 copying evalscope/service/utils/benchmarks.py -> build/lib/evalscope/service/utils 2026-03-25T09:05:58,429 copying evalscope/service/utils/process.py -> build/lib/evalscope/service/utils 2026-03-25T09:05:58,431 copying evalscope/service/utils/__init__.py -> build/lib/evalscope/service/utils 2026-03-25T09:05:58,433 creating build/lib/evalscope/service/frontend 2026-03-25T09:05:58,434 copying evalscope/service/frontend/utils.py -> build/lib/evalscope/service/frontend 2026-03-25T09:05:58,437 copying evalscope/service/frontend/main.py -> build/lib/evalscope/service/frontend 2026-03-25T09:05:58,439 copying evalscope/service/frontend/__init__.py -> build/lib/evalscope/service/frontend 2026-03-25T09:05:58,441 copying evalscope/service/frontend/async_client.py -> build/lib/evalscope/service/frontend 2026-03-25T09:05:58,443 creating build/lib/evalscope/service/blueprints 2026-03-25T09:05:58,444 copying evalscope/service/blueprints/eval.py -> build/lib/evalscope/service/blueprints 2026-03-25T09:05:58,447 copying evalscope/service/blueprints/perf.py -> build/lib/evalscope/service/blueprints 2026-03-25T09:05:58,449 copying evalscope/service/blueprints/__init__.py -> build/lib/evalscope/service/blueprints 2026-03-25T09:05:58,451 creating build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:05:58,452 copying evalscope/metrics/text_normalizer/basic.py -> build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:05:58,454 copying evalscope/metrics/text_normalizer/english.py -> build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:05:58,457 copying evalscope/metrics/text_normalizer/wer.py -> build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:05:58,459 copying evalscope/metrics/text_normalizer/__init__.py -> build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:05:58,461 copying evalscope/metrics/text_normalizer/chinese.py -> build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:05:58,466 creating build/lib/evalscope/metrics/sem_score 2026-03-25T09:05:58,467 copying evalscope/metrics/sem_score/scorer.py -> build/lib/evalscope/metrics/sem_score 2026-03-25T09:05:58,470 copying evalscope/metrics/sem_score/__init__.py -> build/lib/evalscope/metrics/sem_score 2026-03-25T09:05:58,472 creating build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,472 copying evalscope/metrics/t2v_metrics/clipscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,474 copying evalscope/metrics/t2v_metrics/vqascore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,476 copying evalscope/metrics/t2v_metrics/itmscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,478 copying evalscope/metrics/t2v_metrics/score.py -> build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,480 copying evalscope/metrics/t2v_metrics/constants.py -> build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,482 copying evalscope/metrics/t2v_metrics/__init__.py -> build/lib/evalscope/metrics/t2v_metrics 2026-03-25T09:05:58,483 creating build/lib/evalscope/metrics/bert_score 2026-03-25T09:05:58,484 copying evalscope/metrics/bert_score/utils.py -> build/lib/evalscope/metrics/bert_score 2026-03-25T09:05:58,487 copying evalscope/metrics/bert_score/scorer.py -> build/lib/evalscope/metrics/bert_score 2026-03-25T09:05:58,490 copying evalscope/metrics/bert_score/__init__.py -> build/lib/evalscope/metrics/bert_score 2026-03-25T09:05:58,491 creating build/lib/evalscope/metrics/bundled_rouge_score 2026-03-25T09:05:58,493 copying evalscope/metrics/bundled_rouge_score/__init__.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-03-25T09:05:58,495 copying evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-03-25T09:05:58,497 creating build/lib/evalscope/metrics/t2v_metrics/models 2026-03-25T09:05:58,498 copying evalscope/metrics/t2v_metrics/models/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-03-25T09:05:58,500 copying evalscope/metrics/t2v_metrics/models/model.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-03-25T09:05:58,502 copying evalscope/metrics/t2v_metrics/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-03-25T09:05:58,504 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:05:58,505 copying evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:05:58,508 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:05:58,510 copying evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:05:58,512 copying evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:05:58,514 copying evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:05:58,517 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:05:58,518 copying evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:05:58,520 copying evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:05:58,522 copying evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:05:58,524 copying evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:05:58,526 copying evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:05:58,528 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:05:58,529 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:05:58,531 copying evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:05:58,533 copying evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:05:58,535 copying evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:05:58,537 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-03-25T09:05:58,538 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-03-25T09:05:58,540 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-03-25T09:05:58,541 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-03-25T09:05:58,544 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-03-25T09:05:58,545 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-03-25T09:05:58,547 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-03-25T09:05:58,548 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-03-25T09:05:58,551 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-03-25T09:05:58,552 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-03-25T09:05:58,556 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-03-25T09:05:58,557 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-03-25T09:05:58,559 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-03-25T09:05:58,562 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,563 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,566 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,568 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,571 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,574 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,576 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:05:58,580 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:05:58,581 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:05:58,583 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:05:58,585 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:05:58,587 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:05:58,591 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,592 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,594 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,597 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,599 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,601 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,604 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,606 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:05:58,608 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,609 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,614 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,619 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,622 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,625 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,627 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,630 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,632 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,635 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,636 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:05:58,640 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,641 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,644 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,646 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,648 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,651 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,653 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,655 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,658 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,660 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,662 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,664 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:05:58,668 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:05:58,669 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:05:58,671 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:05:58,674 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:05:58,676 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:05:58,677 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:05:58,679 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:05:58,681 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:05:58,682 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:05:58,685 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:05:58,686 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:05:58,689 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:05:58,690 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:05:58,692 creating build/lib/evalscope/perf/utils 2026-03-25T09:05:58,693 copying evalscope/perf/utils/handler.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,695 copying evalscope/perf/utils/rich_display.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,698 copying evalscope/perf/utils/log_utils.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,700 copying evalscope/perf/utils/analysis_result.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,702 copying evalscope/perf/utils/local_server.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,704 copying evalscope/perf/utils/benchmark_util.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,706 copying evalscope/perf/utils/db_util.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,708 copying evalscope/perf/utils/__init__.py -> build/lib/evalscope/perf/utils 2026-03-25T09:05:58,710 creating build/lib/evalscope/perf/sla 2026-03-25T09:05:58,711 copying evalscope/perf/sla/sla_criterion.py -> build/lib/evalscope/perf/sla 2026-03-25T09:05:58,713 copying evalscope/perf/sla/sla_run.py -> build/lib/evalscope/perf/sla 2026-03-25T09:05:58,715 copying evalscope/perf/sla/__init__.py -> build/lib/evalscope/perf/sla 2026-03-25T09:05:58,717 creating build/lib/evalscope/perf/plugin 2026-03-25T09:05:58,718 copying evalscope/perf/plugin/registry.py -> build/lib/evalscope/perf/plugin 2026-03-25T09:05:58,720 copying evalscope/perf/plugin/__init__.py -> build/lib/evalscope/perf/plugin 2026-03-25T09:05:58,723 creating build/lib/evalscope/perf/utils/report 2026-03-25T09:05:58,724 copying evalscope/perf/utils/report/perf_charts.py -> build/lib/evalscope/perf/utils/report 2026-03-25T09:05:58,726 copying evalscope/perf/utils/report/generate_report.py -> build/lib/evalscope/perf/utils/report 2026-03-25T09:05:58,729 copying evalscope/perf/utils/report/__init__.py -> build/lib/evalscope/perf/utils/report 2026-03-25T09:05:58,730 copying evalscope/perf/utils/report/perf_data.py -> build/lib/evalscope/perf/utils/report 2026-03-25T09:05:58,733 creating build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,734 copying evalscope/perf/plugin/api/custom_api.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,736 copying evalscope/perf/plugin/api/openai_embedding_api.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,739 copying evalscope/perf/plugin/api/openai_rerank_api.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,741 copying evalscope/perf/plugin/api/default_api.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,743 copying evalscope/perf/plugin/api/dashscope_api.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,745 copying evalscope/perf/plugin/api/base.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,747 copying evalscope/perf/plugin/api/__init__.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,749 copying evalscope/perf/plugin/api/openai_api.py -> build/lib/evalscope/perf/plugin/api 2026-03-25T09:05:58,752 creating build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,753 copying evalscope/perf/plugin/datasets/custom.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,754 copying evalscope/perf/plugin/datasets/line_by_line.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,756 copying evalscope/perf/plugin/datasets/speed_benchmark.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,758 copying evalscope/perf/plugin/datasets/utils.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,760 copying evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,762 copying evalscope/perf/plugin/datasets/longalpaca.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,763 copying evalscope/perf/plugin/datasets/rerank_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,766 copying evalscope/perf/plugin/datasets/random_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,768 copying evalscope/perf/plugin/datasets/embedding_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,770 copying evalscope/perf/plugin/datasets/base.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,772 copying evalscope/perf/plugin/datasets/openqa.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,774 copying evalscope/perf/plugin/datasets/flickr8k.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,776 copying evalscope/perf/plugin/datasets/__init__.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,778 copying evalscope/perf/plugin/datasets/kontext_bench.py -> build/lib/evalscope/perf/plugin/datasets 2026-03-25T09:05:58,780 creating build/lib/evalscope/backend/opencompass 2026-03-25T09:05:58,781 copying evalscope/backend/opencompass/api_meta_template.py -> build/lib/evalscope/backend/opencompass 2026-03-25T09:05:58,783 copying evalscope/backend/opencompass/backend_manager.py -> build/lib/evalscope/backend/opencompass 2026-03-25T09:05:58,786 copying evalscope/backend/opencompass/__init__.py -> build/lib/evalscope/backend/opencompass 2026-03-25T09:05:58,788 creating build/lib/evalscope/backend/vlm_eval_kit 2026-03-25T09:05:58,789 copying evalscope/backend/vlm_eval_kit/backend_manager.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-03-25T09:05:58,791 copying evalscope/backend/vlm_eval_kit/__init__.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-03-25T09:05:58,794 creating build/lib/evalscope/backend/rag_eval 2026-03-25T09:05:58,795 copying evalscope/backend/rag_eval/backend_manager.py -> build/lib/evalscope/backend/rag_eval 2026-03-25T09:05:58,797 copying evalscope/backend/rag_eval/__init__.py -> build/lib/evalscope/backend/rag_eval 2026-03-25T09:05:58,799 creating build/lib/evalscope/backend/opencompass/tasks 2026-03-25T09:05:58,800 copying evalscope/backend/opencompass/tasks/eval_datasets.py -> build/lib/evalscope/backend/opencompass/tasks 2026-03-25T09:05:58,803 copying evalscope/backend/opencompass/tasks/eval_api.py -> build/lib/evalscope/backend/opencompass/tasks 2026-03-25T09:05:58,805 copying evalscope/backend/opencompass/tasks/__init__.py -> build/lib/evalscope/backend/opencompass/tasks 2026-03-25T09:05:58,807 creating build/lib/evalscope/backend/rag_eval/utils 2026-03-25T09:05:58,808 copying evalscope/backend/rag_eval/utils/tools.py -> build/lib/evalscope/backend/rag_eval/utils 2026-03-25T09:05:58,810 copying evalscope/backend/rag_eval/utils/embedding.py -> build/lib/evalscope/backend/rag_eval/utils 2026-03-25T09:05:58,812 copying evalscope/backend/rag_eval/utils/llm.py -> build/lib/evalscope/backend/rag_eval/utils 2026-03-25T09:05:58,814 copying evalscope/backend/rag_eval/utils/__init__.py -> build/lib/evalscope/backend/rag_eval/utils 2026-03-25T09:05:58,815 copying evalscope/backend/rag_eval/utils/clip.py -> build/lib/evalscope/backend/rag_eval/utils 2026-03-25T09:05:58,818 creating build/lib/evalscope/backend/rag_eval/cmteb 2026-03-25T09:05:58,819 copying evalscope/backend/rag_eval/cmteb/arguments.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-03-25T09:05:58,821 copying evalscope/backend/rag_eval/cmteb/task_template.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-03-25T09:05:58,823 copying evalscope/backend/rag_eval/cmteb/base.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-03-25T09:05:58,825 copying evalscope/backend/rag_eval/cmteb/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-03-25T09:05:58,827 creating build/lib/evalscope/backend/rag_eval/ragas 2026-03-25T09:05:58,828 copying evalscope/backend/rag_eval/ragas/arguments.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-03-25T09:05:58,830 copying evalscope/backend/rag_eval/ragas/task_template.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-03-25T09:05:58,832 copying evalscope/backend/rag_eval/ragas/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-03-25T09:05:58,834 creating build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:05:58,835 copying evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:05:58,837 copying evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:05:58,839 copying evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:05:58,841 copying evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:05:58,844 creating build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,845 copying evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,848 copying evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,850 copying evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,852 copying evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,854 copying evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,857 copying evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,859 copying evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,861 copying evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:05:58,864 creating build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-03-25T09:05:58,865 copying evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-03-25T09:05:58,867 creating build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:05:58,868 copying evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:05:58,871 copying evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:05:58,873 copying evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:05:58,874 copying evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:05:58,877 copying evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:05:58,879 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-03-25T09:05:58,880 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-03-25T09:05:58,883 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:05:58,884 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:05:58,887 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:05:58,889 copying evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:05:58,891 copying evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:05:58,893 creating build/lib/evalscope/app/utils 2026-03-25T09:05:58,894 copying evalscope/app/utils/text_utils.py -> build/lib/evalscope/app/utils 2026-03-25T09:05:58,897 copying evalscope/app/utils/env_utils.py -> build/lib/evalscope/app/utils 2026-03-25T09:05:58,899 copying evalscope/app/utils/localization.py -> build/lib/evalscope/app/utils 2026-03-25T09:05:58,901 copying evalscope/app/utils/data_utils.py -> build/lib/evalscope/app/utils 2026-03-25T09:05:58,903 copying evalscope/app/utils/visualization.py -> build/lib/evalscope/app/utils 2026-03-25T09:05:58,906 creating build/lib/evalscope/app/ui 2026-03-25T09:05:58,907 copying evalscope/app/ui/app_ui.py -> build/lib/evalscope/app/ui 2026-03-25T09:05:58,909 copying evalscope/app/ui/sidebar.py -> build/lib/evalscope/app/ui 2026-03-25T09:05:58,911 copying evalscope/app/ui/single_model.py -> build/lib/evalscope/app/ui 2026-03-25T09:05:58,913 copying evalscope/app/ui/multi_model.py -> build/lib/evalscope/app/ui 2026-03-25T09:05:58,915 copying evalscope/app/ui/visualization.py -> build/lib/evalscope/app/ui 2026-03-25T09:05:58,917 copying evalscope/app/ui/__init__.py -> build/lib/evalscope/app/ui 2026-03-25T09:05:58,921 creating build/lib/evalscope/api/evaluator 2026-03-25T09:05:58,922 copying evalscope/api/evaluator/state.py -> build/lib/evalscope/api/evaluator 2026-03-25T09:05:58,924 copying evalscope/api/evaluator/evaluator.py -> build/lib/evalscope/api/evaluator 2026-03-25T09:05:58,926 copying evalscope/api/evaluator/__init__.py -> build/lib/evalscope/api/evaluator 2026-03-25T09:05:58,928 copying evalscope/api/evaluator/cache.py -> build/lib/evalscope/api/evaluator 2026-03-25T09:05:58,931 creating build/lib/evalscope/api/model 2026-03-25T09:05:58,932 copying evalscope/api/model/lazy_model.py -> build/lib/evalscope/api/model 2026-03-25T09:05:58,934 copying evalscope/api/model/generate_config.py -> build/lib/evalscope/api/model 2026-03-25T09:05:58,936 copying evalscope/api/model/model_output.py -> build/lib/evalscope/api/model 2026-03-25T09:05:58,939 copying evalscope/api/model/model.py -> build/lib/evalscope/api/model 2026-03-25T09:05:58,941 copying evalscope/api/model/__init__.py -> build/lib/evalscope/api/model 2026-03-25T09:05:58,943 creating build/lib/evalscope/api/metric 2026-03-25T09:05:58,944 copying evalscope/api/metric/metric.py -> build/lib/evalscope/api/metric 2026-03-25T09:05:58,946 copying evalscope/api/metric/scorer.py -> build/lib/evalscope/api/metric 2026-03-25T09:05:58,948 copying evalscope/api/metric/__init__.py -> build/lib/evalscope/api/metric 2026-03-25T09:05:58,950 creating build/lib/evalscope/api/messages 2026-03-25T09:05:58,951 copying evalscope/api/messages/content.py -> build/lib/evalscope/api/messages 2026-03-25T09:05:58,953 copying evalscope/api/messages/utils.py -> build/lib/evalscope/api/messages 2026-03-25T09:05:58,955 copying evalscope/api/messages/chat_message.py -> build/lib/evalscope/api/messages 2026-03-25T09:05:58,957 copying evalscope/api/messages/__init__.py -> build/lib/evalscope/api/messages 2026-03-25T09:05:58,959 creating build/lib/evalscope/api/tool 2026-03-25T09:05:58,960 copying evalscope/api/tool/utils.py -> build/lib/evalscope/api/tool 2026-03-25T09:05:58,962 copying evalscope/api/tool/tool_call.py -> build/lib/evalscope/api/tool 2026-03-25T09:05:58,964 copying evalscope/api/tool/__init__.py -> build/lib/evalscope/api/tool 2026-03-25T09:05:58,966 copying evalscope/api/tool/tool_info.py -> build/lib/evalscope/api/tool 2026-03-25T09:05:58,968 creating build/lib/evalscope/api/mixin 2026-03-25T09:05:58,969 copying evalscope/api/mixin/sandbox_mixin.py -> build/lib/evalscope/api/mixin 2026-03-25T09:05:58,972 copying evalscope/api/mixin/llm_judge_mixin.py -> build/lib/evalscope/api/mixin 2026-03-25T09:05:58,974 copying evalscope/api/mixin/__init__.py -> build/lib/evalscope/api/mixin 2026-03-25T09:05:58,976 creating build/lib/evalscope/api/benchmark 2026-03-25T09:05:58,977 copying evalscope/api/benchmark/benchmark.py -> build/lib/evalscope/api/benchmark 2026-03-25T09:05:58,981 copying evalscope/api/benchmark/meta.py -> build/lib/evalscope/api/benchmark 2026-03-25T09:05:58,983 copying evalscope/api/benchmark/__init__.py -> build/lib/evalscope/api/benchmark 2026-03-25T09:05:58,985 copying evalscope/api/benchmark/statistics.py -> build/lib/evalscope/api/benchmark 2026-03-25T09:05:58,989 creating build/lib/evalscope/api/filter 2026-03-25T09:05:58,990 copying evalscope/api/filter/filter.py -> build/lib/evalscope/api/filter 2026-03-25T09:05:58,992 copying evalscope/api/filter/__init__.py -> build/lib/evalscope/api/filter 2026-03-25T09:05:58,994 creating build/lib/evalscope/api/dataset 2026-03-25T09:05:58,995 copying evalscope/api/dataset/utils.py -> build/lib/evalscope/api/dataset 2026-03-25T09:05:58,998 copying evalscope/api/dataset/loader.py -> build/lib/evalscope/api/dataset 2026-03-25T09:05:59,000 copying evalscope/api/dataset/__init__.py -> build/lib/evalscope/api/dataset 2026-03-25T09:05:59,002 copying evalscope/api/dataset/dataset.py -> build/lib/evalscope/api/dataset 2026-03-25T09:05:59,006 creating build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,007 copying evalscope/api/benchmark/adapters/agent_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,009 copying evalscope/api/benchmark/adapters/ner_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,012 copying evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,014 copying evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,015 copying evalscope/api/benchmark/adapters/text2image_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,018 copying evalscope/api/benchmark/adapters/default_data_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,022 copying evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,024 copying evalscope/api/benchmark/adapters/__init__.py -> build/lib/evalscope/api/benchmark/adapters 2026-03-25T09:05:59,027 running egg_info 2026-03-25T09:05:59,037 writing evalscope.egg-info/PKG-INFO 2026-03-25T09:05:59,058 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-03-25T09:05:59,060 writing entry points to evalscope.egg-info/entry_points.txt 2026-03-25T09:05:59,070 writing requirements to evalscope.egg-info/requires.txt 2026-03-25T09:05:59,071 writing top-level names to evalscope.egg-info/top_level.txt 2026-03-25T09:05:59,257 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:59,306 reading manifest template 'MANIFEST.in' 2026-03-25T09:05:59,712 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-03-25T09:05:59,717 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-03-25T09:05:59,723 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-03-25T09:05:59,729 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-03-25T09:05:59,730 adding license file 'LICENSE' 2026-03-25T09:05:59,783 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-03-25T09:05:59,897 creating build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,898 copying evalscope/benchmarks/_meta/a_okvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,901 copying evalscope/benchmarks/_meta/aa_lcr.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,904 copying evalscope/benchmarks/_meta/ai2d.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,906 copying evalscope/benchmarks/_meta/aime24.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,909 copying evalscope/benchmarks/_meta/aime25.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,911 copying evalscope/benchmarks/_meta/aime26.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,914 copying evalscope/benchmarks/_meta/alpaca_eval.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,917 copying evalscope/benchmarks/_meta/amc.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,920 copying evalscope/benchmarks/_meta/anat_em.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,923 copying evalscope/benchmarks/_meta/arc.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,925 copying evalscope/benchmarks/_meta/arena_hard.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,928 copying evalscope/benchmarks/_meta/bbh.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,931 copying evalscope/benchmarks/_meta/bc2gm.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,934 copying evalscope/benchmarks/_meta/bc4chemd.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,937 copying evalscope/benchmarks/_meta/bc5cdr.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,940 copying evalscope/benchmarks/_meta/bfcl_v3.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,943 copying evalscope/benchmarks/_meta/bfcl_v4.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,946 copying evalscope/benchmarks/_meta/biomix_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,948 copying evalscope/benchmarks/_meta/blink.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,952 copying evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,954 copying evalscope/benchmarks/_meta/cc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,957 copying evalscope/benchmarks/_meta/ceval.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,960 copying evalscope/benchmarks/_meta/chartqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,963 copying evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,966 copying evalscope/benchmarks/_meta/cl_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,969 copying evalscope/benchmarks/_meta/cmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,973 copying evalscope/benchmarks/_meta/cmmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,976 copying evalscope/benchmarks/_meta/cmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,979 copying evalscope/benchmarks/_meta/coin_flip.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,981 copying evalscope/benchmarks/_meta/commonsense_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,984 copying evalscope/benchmarks/_meta/competition_math.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,987 copying evalscope/benchmarks/_meta/conll2003.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,990 copying evalscope/benchmarks/_meta/conllpp.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,993 copying evalscope/benchmarks/_meta/copious.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:05:59,997 copying evalscope/benchmarks/_meta/cross_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,000 copying evalscope/benchmarks/_meta/data_collection.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,003 copying evalscope/benchmarks/_meta/docmath.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,005 copying evalscope/benchmarks/_meta/docvqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,008 copying evalscope/benchmarks/_meta/drivel_binary.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,011 copying evalscope/benchmarks/_meta/drivel_multilabel.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,014 copying evalscope/benchmarks/_meta/drivel_selection.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,017 copying evalscope/benchmarks/_meta/drivel_writing.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,019 copying evalscope/benchmarks/_meta/drop.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,022 copying evalscope/benchmarks/_meta/eq_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,025 copying evalscope/benchmarks/_meta/evalmuse.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,027 copying evalscope/benchmarks/_meta/fin_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,030 copying evalscope/benchmarks/_meta/fleurs.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,033 copying evalscope/benchmarks/_meta/frames.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,036 copying evalscope/benchmarks/_meta/gedit.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,039 copying evalscope/benchmarks/_meta/genai_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,042 copying evalscope/benchmarks/_meta/general_arena.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,045 copying evalscope/benchmarks/_meta/general_fc.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,048 copying evalscope/benchmarks/_meta/general_mcq.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,051 copying evalscope/benchmarks/_meta/general_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,053 copying evalscope/benchmarks/_meta/general_t2i.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,056 copying evalscope/benchmarks/_meta/general_vmcq.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,058 copying evalscope/benchmarks/_meta/general_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,061 copying evalscope/benchmarks/_meta/genia_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,064 copying evalscope/benchmarks/_meta/gpqa_diamond.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,067 copying evalscope/benchmarks/_meta/gsm8k.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,070 copying evalscope/benchmarks/_meta/gsm8k_v.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,073 copying evalscope/benchmarks/_meta/hallusion_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,075 copying evalscope/benchmarks/_meta/halueval.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,078 copying evalscope/benchmarks/_meta/harvey_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,081 copying evalscope/benchmarks/_meta/health_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,084 copying evalscope/benchmarks/_meta/hellaswag.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,086 copying evalscope/benchmarks/_meta/hle.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,089 copying evalscope/benchmarks/_meta/hmmt25.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,092 copying evalscope/benchmarks/_meta/hpdv2.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,094 copying evalscope/benchmarks/_meta/humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,097 copying evalscope/benchmarks/_meta/humaneval_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,099 copying evalscope/benchmarks/_meta/ifbench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,102 copying evalscope/benchmarks/_meta/ifeval.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,105 copying evalscope/benchmarks/_meta/infovqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,108 copying evalscope/benchmarks/_meta/iquiz.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,111 copying evalscope/benchmarks/_meta/jnlpba.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,114 copying evalscope/benchmarks/_meta/jnlpba_rare.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,117 copying evalscope/benchmarks/_meta/librispeech.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,120 copying evalscope/benchmarks/_meta/live_code_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,123 copying evalscope/benchmarks/_meta/logi_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,125 copying evalscope/benchmarks/_meta/longbench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,128 copying evalscope/benchmarks/_meta/maritime_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,131 copying evalscope/benchmarks/_meta/math_500.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,134 copying evalscope/benchmarks/_meta/math_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,137 copying evalscope/benchmarks/_meta/math_verse.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,140 copying evalscope/benchmarks/_meta/math_vision.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,143 copying evalscope/benchmarks/_meta/math_vista.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,146 copying evalscope/benchmarks/_meta/mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,149 copying evalscope/benchmarks/_meta/mbpp_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,152 copying evalscope/benchmarks/_meta/med_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,155 copying evalscope/benchmarks/_meta/mgsm.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,158 copying evalscope/benchmarks/_meta/micro_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,161 copying evalscope/benchmarks/_meta/minerva_math.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,164 copying evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,167 copying evalscope/benchmarks/_meta/mit_restaurant.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,169 copying evalscope/benchmarks/_meta/mm_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,172 copying evalscope/benchmarks/_meta/mm_star.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,175 copying evalscope/benchmarks/_meta/mmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,179 copying evalscope/benchmarks/_meta/mmlu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,182 copying evalscope/benchmarks/_meta/mmlu_redux.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,185 copying evalscope/benchmarks/_meta/mmmlu.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,188 copying evalscope/benchmarks/_meta/mmmu.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,191 copying evalscope/benchmarks/_meta/mmmu_pro.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,194 copying evalscope/benchmarks/_meta/mri_mcqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,197 copying evalscope/benchmarks/_meta/multi_if.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,200 copying evalscope/benchmarks/_meta/multi_nerd.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,203 copying evalscope/benchmarks/_meta/multiple_humaneval.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,206 copying evalscope/benchmarks/_meta/multiple_mbpp.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,209 copying evalscope/benchmarks/_meta/music_trivia.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,212 copying evalscope/benchmarks/_meta/musr.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,215 copying evalscope/benchmarks/_meta/ncbi.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,218 copying evalscope/benchmarks/_meta/needle_haystack.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,221 copying evalscope/benchmarks/_meta/ocr_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,224 copying evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,227 copying evalscope/benchmarks/_meta/olympiad_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,230 copying evalscope/benchmarks/_meta/omni_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,233 copying evalscope/benchmarks/_meta/omni_doc_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,237 copying evalscope/benchmarks/_meta/ontonotes5.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,240 copying evalscope/benchmarks/_meta/openai_mrcr.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,244 copying evalscope/benchmarks/_meta/piqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,246 copying evalscope/benchmarks/_meta/poly_math.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,249 copying evalscope/benchmarks/_meta/pope.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,252 copying evalscope/benchmarks/_meta/process_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,255 copying evalscope/benchmarks/_meta/pubmedqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,258 copying evalscope/benchmarks/_meta/qasc.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,261 copying evalscope/benchmarks/_meta/race.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,264 copying evalscope/benchmarks/_meta/real_world_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,267 copying evalscope/benchmarks/_meta/refcoco.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,270 copying evalscope/benchmarks/_meta/scicode.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,274 copying evalscope/benchmarks/_meta/science_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,277 copying evalscope/benchmarks/_meta/sciq.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,280 copying evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,283 copying evalscope/benchmarks/_meta/simple_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,285 copying evalscope/benchmarks/_meta/simple_vqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,288 copying evalscope/benchmarks/_meta/siqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,290 copying evalscope/benchmarks/_meta/super_gpqa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,294 copying evalscope/benchmarks/_meta/swe_bench_lite.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,297 copying evalscope/benchmarks/_meta/swe_bench_verified.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,300 copying evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,303 copying evalscope/benchmarks/_meta/tau2_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,307 copying evalscope/benchmarks/_meta/tau_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,310 copying evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,312 copying evalscope/benchmarks/_meta/tifa160.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,315 copying evalscope/benchmarks/_meta/tool_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,318 copying evalscope/benchmarks/_meta/torgo.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,321 copying evalscope/benchmarks/_meta/trivia_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,324 copying evalscope/benchmarks/_meta/truthful_qa.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,326 copying evalscope/benchmarks/_meta/tweebank_ner.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,329 copying evalscope/benchmarks/_meta/tweet_ner_7.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,332 copying evalscope/benchmarks/_meta/visulogic.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,335 copying evalscope/benchmarks/_meta/vstar_bench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,338 copying evalscope/benchmarks/_meta/winogrande.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,341 copying evalscope/benchmarks/_meta/wmt24pp.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,344 copying evalscope/benchmarks/_meta/wnut2017.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,347 copying evalscope/benchmarks/_meta/zebralogicbench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,350 copying evalscope/benchmarks/_meta/zerobench.json -> build/lib/evalscope/benchmarks/_meta 2026-03-25T09:06:00,352 copying evalscope/benchmarks/trivia_qa/samples.jsonl -> build/lib/evalscope/benchmarks/trivia_qa 2026-03-25T09:06:00,355 creating build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,356 copying evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,358 copying evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,361 copying evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,363 copying evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,365 copying evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,368 copying evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,370 copying evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,373 copying evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,375 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,378 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,380 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,382 copying evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,384 copying evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,386 copying evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,389 copying evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,391 copying evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,393 copying evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,395 copying evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,397 copying evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,400 copying evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,402 copying evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,404 copying evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,406 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,408 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,411 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,413 copying evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,415 copying evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,417 creating build/lib/evalscope/benchmarks/humanevalplus/docker 2026-03-25T09:06:00,418 copying evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/lib/evalscope/benchmarks/humanevalplus/docker 2026-03-25T09:06:00,420 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:06:00,422 copying evalscope/benchmarks/scicode/docker/Dockerfile -> build/lib/evalscope/benchmarks/scicode/docker 2026-03-25T09:06:00,425 copying evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/lib/evalscope/benchmarks/scicode/docker 2026-03-25T09:06:00,427 copying evalscope/third_party/longbench_write/README.md -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:06:00,429 copying evalscope/third_party/longbench_write/default_task.json -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:06:00,432 copying evalscope/third_party/longbench_write/default_task.yaml -> build/lib/evalscope/third_party/longbench_write 2026-03-25T09:06:00,434 copying evalscope/third_party/toolbench_static/README.md -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:06:00,436 copying evalscope/third_party/toolbench_static/config_default.json -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:06:00,439 copying evalscope/third_party/toolbench_static/config_default.yaml -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:06:00,441 copying evalscope/third_party/toolbench_static/requirements.txt -> build/lib/evalscope/third_party/toolbench_static 2026-03-25T09:06:00,443 copying evalscope/third_party/longbench_write/resources/judge.txt -> build/lib/evalscope/third_party/longbench_write/resources 2026-03-25T09:06:00,445 copying evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-03-25T09:06:00,448 copying evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-03-25T09:06:00,451 copying evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-03-25T09:06:00,454 creating build/lib/evalscope/third_party/thinkbench/resources 2026-03-25T09:06:00,455 copying evalscope/third_party/thinkbench/resources/critique_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-03-25T09:06:00,457 copying evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-03-25T09:06:00,459 copying evalscope/metrics/text_normalizer/english.json -> build/lib/evalscope/metrics/text_normalizer 2026-03-25T09:06:00,463 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-03-25T09:06:00,464 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-03-25T09:06:00,466 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:00,467 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:00,469 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:00,472 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:00,474 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,475 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,477 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,479 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,481 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,483 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,485 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,487 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,490 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,492 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,494 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,496 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,499 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,501 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,503 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,505 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,507 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,509 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,512 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,514 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:00,516 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-03-25T09:06:00,518 creating build/lib/evalscope/report/template 2026-03-25T09:06:00,519 copying evalscope/report/template/perf_report.html.j2 -> build/lib/evalscope/report/template 2026-03-25T09:06:00,522 copying evalscope/report/template/report.html.j2 -> build/lib/evalscope/report/template 2026-03-25T09:06:00,524 creating build/lib/evalscope/report/template/partials 2026-03-25T09:06:00,525 copying evalscope/report/template/partials/footer.html -> build/lib/evalscope/report/template/partials 2026-03-25T09:06:00,528 copying evalscope/report/template/partials/header_eval.html -> build/lib/evalscope/report/template/partials 2026-03-25T09:06:00,530 copying evalscope/report/template/partials/header_perf.html -> build/lib/evalscope/report/template/partials 2026-03-25T09:06:00,532 copying evalscope/report/template/partials/toc_eval.html -> build/lib/evalscope/report/template/partials 2026-03-25T09:06:00,534 copying evalscope/report/template/partials/toc_perf.html -> build/lib/evalscope/report/template/partials 2026-03-25T09:06:00,536 creating build/lib/evalscope/report/template/js 2026-03-25T09:06:00,537 copying evalscope/report/template/js/eval_extra.js -> build/lib/evalscope/report/template/js 2026-03-25T09:06:00,540 copying evalscope/report/template/js/i18n_eval.js -> build/lib/evalscope/report/template/js 2026-03-25T09:06:00,542 copying evalscope/report/template/js/i18n_perf.js -> build/lib/evalscope/report/template/js 2026-03-25T09:06:00,545 copying evalscope/report/template/js/perf_extra.js -> build/lib/evalscope/report/template/js 2026-03-25T09:06:00,547 copying evalscope/report/template/js/shared.js -> build/lib/evalscope/report/template/js 2026-03-25T09:06:00,549 creating build/lib/evalscope/report/template/css 2026-03-25T09:06:00,550 copying evalscope/report/template/css/base.css -> build/lib/evalscope/report/template/css 2026-03-25T09:06:00,553 copying evalscope/report/template/css/perf_extra.css -> build/lib/evalscope/report/template/css 2026-03-25T09:06:00,677 installing to build/bdist.linux-armv7l/wheel 2026-03-25T09:06:00,678 running install 2026-03-25T09:06:00,701 running install_lib 2026-03-25T09:06:00,706 creating build/bdist.linux-armv7l/wheel 2026-03-25T09:06:00,708 creating build/bdist.linux-armv7l/wheel/evalscope 2026-03-25T09:06:00,710 creating build/bdist.linux-armv7l/wheel/evalscope/models 2026-03-25T09:06:00,711 copying build/lib/evalscope/models/text2image_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,713 copying build/lib/evalscope/models/modelscope.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,716 creating build/bdist.linux-armv7l/wheel/evalscope/models/utils 2026-03-25T09:06:00,718 copying build/lib/evalscope/models/utils/anthropic.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-03-25T09:06:00,722 copying build/lib/evalscope/models/utils/openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-03-25T09:06:00,727 copying build/lib/evalscope/models/openai_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,734 copying build/lib/evalscope/models/mockllm.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,735 copying build/lib/evalscope/models/image_edit_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,737 copying build/lib/evalscope/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,739 copying build/lib/evalscope/models/anthropic_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,742 copying build/lib/evalscope/models/model_apis.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-03-25T09:06:00,744 copying build/lib/evalscope/version.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-03-25T09:06:00,746 creating build/bdist.linux-armv7l/wheel/evalscope/evaluator 2026-03-25T09:06:00,747 copying build/lib/evalscope/evaluator/batch_reviewer.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-03-25T09:06:00,750 copying build/lib/evalscope/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-03-25T09:06:00,752 copying build/lib/evalscope/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-03-25T09:06:00,757 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks 2026-03-25T09:06:00,758 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/seed_bench_2_plus 2026-03-25T09:06:00,759 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-03-25T09:06:00,761 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-03-25T09:06:00,764 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pope 2026-03-25T09:06:00,765 copying build/lib/evalscope/benchmarks/pope/pope_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-03-25T09:06:00,767 copying build/lib/evalscope/benchmarks/pope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-03-25T09:06:00,769 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/super_gpqa 2026-03-25T09:06:00,770 copying build/lib/evalscope/benchmarks/super_gpqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-03-25T09:06:00,772 copying build/lib/evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-03-25T09:06:00,774 copying build/lib/evalscope/benchmarks/super_gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-03-25T09:06:00,777 copying build/lib/evalscope/benchmarks/super_gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-03-25T09:06:00,779 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multipl_e 2026-03-25T09:06:00,780 copying build/lib/evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-03-25T09:06:00,782 copying build/lib/evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-03-25T09:06:00,784 copying build/lib/evalscope/benchmarks/multipl_e/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-03-25T09:06:00,786 copying build/lib/evalscope/benchmarks/multipl_e/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-03-25T09:06:00,788 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aime 2026-03-25T09:06:00,789 copying build/lib/evalscope/benchmarks/aime/math_normalize.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-03-25T09:06:00,792 copying build/lib/evalscope/benchmarks/aime/aime_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-03-25T09:06:00,794 copying build/lib/evalscope/benchmarks/aime/grader.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-03-25T09:06:00,796 copying build/lib/evalscope/benchmarks/aime/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-03-25T09:06:00,798 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ceval 2026-03-25T09:06:00,799 copying build/lib/evalscope/benchmarks/ceval/ceval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-03-25T09:06:00,802 copying build/lib/evalscope/benchmarks/ceval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-03-25T09:06:00,804 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbpp 2026-03-25T09:06:00,805 copying build/lib/evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-03-25T09:06:00,808 copying build/lib/evalscope/benchmarks/mbpp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-03-25T09:06:00,810 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/sciq 2026-03-25T09:06:00,811 copying build/lib/evalscope/benchmarks/sciq/sciq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-03-25T09:06:00,813 copying build/lib/evalscope/benchmarks/sciq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-03-25T09:06:00,815 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu_pro 2026-03-25T09:06:00,816 copying build/lib/evalscope/benchmarks/mmmu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-03-25T09:06:00,818 copying build/lib/evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-03-25T09:06:00,821 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/biomix_qa 2026-03-25T09:06:00,822 copying build/lib/evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-03-25T09:06:00,824 copying build/lib/evalscope/benchmarks/biomix_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-03-25T09:06:00,826 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/musr 2026-03-25T09:06:00,827 copying build/lib/evalscope/benchmarks/musr/musr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-03-25T09:06:00,829 copying build/lib/evalscope/benchmarks/musr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-03-25T09:06:00,831 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/text2image 2026-03-25T09:06:00,832 copying build/lib/evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-03-25T09:06:00,834 copying build/lib/evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-03-25T09:06:00,836 copying build/lib/evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-03-25T09:06:00,838 copying build/lib/evalscope/benchmarks/text2image/tifa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-03-25T09:06:00,840 copying build/lib/evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-03-25T09:06:00,841 copying build/lib/evalscope/benchmarks/text2image/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-03-25T09:06:00,844 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/science_qa 2026-03-25T09:06:00,845 copying build/lib/evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-03-25T09:06:00,847 copying build/lib/evalscope/benchmarks/science_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-03-25T09:06:00,849 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omnidoc_bench 2026-03-25T09:06:00,850 copying build/lib/evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-03-25T09:06:00,852 copying build/lib/evalscope/benchmarks/omnidoc_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-03-25T09:06:00,855 copying build/lib/evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-03-25T09:06:00,858 copying build/lib/evalscope/benchmarks/omnidoc_bench/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-03-25T09:06:00,861 copying build/lib/evalscope/benchmarks/omnidoc_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-03-25T09:06:00,863 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/coin_flip 2026-03-25T09:06:00,864 copying build/lib/evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-03-25T09:06:00,866 copying build/lib/evalscope/benchmarks/coin_flip/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-03-25T09:06:00,868 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_star 2026-03-25T09:06:00,869 copying build/lib/evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-03-25T09:06:00,871 copying build/lib/evalscope/benchmarks/mm_star/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-03-25T09:06:00,874 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu 2026-03-25T09:06:00,874 copying build/lib/evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-03-25T09:06:00,877 copying build/lib/evalscope/benchmarks/mmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-03-25T09:06:00,879 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_qa 2026-03-25T09:06:00,880 copying build/lib/evalscope/benchmarks/simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-03-25T09:06:00,882 copying build/lib/evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-03-25T09:06:00,884 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh 2026-03-25T09:06:00,885 copying build/lib/evalscope/benchmarks/bbh/bbh_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-03-25T09:06:00,888 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,889 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,891 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,893 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,895 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,897 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,899 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,901 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,903 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,905 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,907 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,909 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,911 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,913 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,915 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,917 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,919 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,921 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,922 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,924 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,926 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,928 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,930 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,932 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,934 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,935 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,937 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,939 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-03-25T09:06:00,941 copying build/lib/evalscope/benchmarks/bbh/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-03-25T09:06:00,944 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/librispeech 2026-03-25T09:06:00,945 copying build/lib/evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-03-25T09:06:00,947 copying build/lib/evalscope/benchmarks/librispeech/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-03-25T09:06:00,949 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arc 2026-03-25T09:06:00,950 copying build/lib/evalscope/benchmarks/arc/arc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-03-25T09:06:00,953 copying build/lib/evalscope/benchmarks/arc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-03-25T09:06:00,955 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/blink 2026-03-25T09:06:00,956 copying build/lib/evalscope/benchmarks/blink/blink_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-03-25T09:06:00,958 copying build/lib/evalscope/benchmarks/blink/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-03-25T09:06:00,960 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vqa 2026-03-25T09:06:00,961 copying build/lib/evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-03-25T09:06:00,964 copying build/lib/evalscope/benchmarks/general_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-03-25T09:06:00,966 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omni_bench 2026-03-25T09:06:00,967 copying build/lib/evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-03-25T09:06:00,969 copying build/lib/evalscope/benchmarks/omni_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-03-25T09:06:00,971 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pumed_qa 2026-03-25T09:06:00,972 copying build/lib/evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-03-25T09:06:00,975 copying build/lib/evalscope/benchmarks/pumed_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-03-25T09:06:00,977 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus 2026-03-25T09:06:00,978 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus/docker 2026-03-25T09:06:00,980 copying build/lib/evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus/docker 2026-03-25T09:06:00,981 copying build/lib/evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-03-25T09:06:00,984 copying build/lib/evalscope/benchmarks/humanevalplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-03-25T09:06:00,986 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hle 2026-03-25T09:06:00,987 copying build/lib/evalscope/benchmarks/hle/hle_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-03-25T09:06:00,989 copying build/lib/evalscope/benchmarks/hle/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-03-25T09:06:00,991 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vista 2026-03-25T09:06:00,992 copying build/lib/evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-03-25T09:06:00,994 copying build/lib/evalscope/benchmarks/math_vista/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-03-25T09:06:00,996 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbppplus 2026-03-25T09:06:00,997 copying build/lib/evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-03-25T09:06:01,000 copying build/lib/evalscope/benchmarks/mbppplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-03-25T09:06:01,002 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/logi_qa 2026-03-25T09:06:01,003 copying build/lib/evalscope/benchmarks/logi_qa/__int__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-03-25T09:06:01,005 copying build/lib/evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-03-25T09:06:01,007 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_bench 2026-03-25T09:06:01,008 copying build/lib/evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-03-25T09:06:01,011 copying build/lib/evalscope/benchmarks/mm_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-03-25T09:06:01,013 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl 2026-03-25T09:06:01,014 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v3 2026-03-25T09:06:01,016 copying build/lib/evalscope/benchmarks/bfcl/v3/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-03-25T09:06:01,017 copying build/lib/evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-03-25T09:06:01,020 copying build/lib/evalscope/benchmarks/bfcl/v3/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-03-25T09:06:01,022 copying build/lib/evalscope/benchmarks/bfcl/v3/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-03-25T09:06:01,024 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v4 2026-03-25T09:06:01,025 copying build/lib/evalscope/benchmarks/bfcl/v4/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-03-25T09:06:01,028 copying build/lib/evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-03-25T09:06:01,030 copying build/lib/evalscope/benchmarks/bfcl/v4/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-03-25T09:06:01,031 copying build/lib/evalscope/benchmarks/bfcl/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-03-25T09:06:01,033 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu 2026-03-25T09:06:01,034 copying build/lib/evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-03-25T09:06:01,037 copying build/lib/evalscope/benchmarks/mmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-03-25T09:06:01,039 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/longbench_v2 2026-03-25T09:06:01,040 copying build/lib/evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-03-25T09:06:01,042 copying build/lib/evalscope/benchmarks/longbench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/longbench_v2 2026-03-25T09:06:01,044 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/med_mcqa 2026-03-25T09:06:01,045 copying build/lib/evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-03-25T09:06:01,047 copying build/lib/evalscope/benchmarks/med_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-03-25T09:06:01,049 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/frames 2026-03-25T09:06:01,050 copying build/lib/evalscope/benchmarks/frames/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-03-25T09:06:01,052 copying build/lib/evalscope/benchmarks/frames/frames_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-03-25T09:06:01,054 copying build/lib/evalscope/benchmarks/frames/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-03-25T09:06:01,057 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drivelology 2026-03-25T09:06:01,058 copying build/lib/evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-03-25T09:06:01,060 copying build/lib/evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-03-25T09:06:01,062 copying build/lib/evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-03-25T09:06:01,064 copying build/lib/evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-03-25T09:06:01,067 copying build/lib/evalscope/benchmarks/drivelology/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-03-25T09:06:01,069 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/needle_haystack 2026-03-25T09:06:01,070 copying build/lib/evalscope/benchmarks/needle_haystack/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-03-25T09:06:01,072 copying build/lib/evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-03-25T09:06:01,075 copying build/lib/evalscope/benchmarks/needle_haystack/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-03-25T09:06:01,077 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/vstar_bench 2026-03-25T09:06:01,079 copying build/lib/evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-03-25T09:06:01,081 copying build/lib/evalscope/benchmarks/vstar_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-03-25T09:06:01,083 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mri_mcqa 2026-03-25T09:06:01,084 copying build/lib/evalscope/benchmarks/mri_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-03-25T09:06:01,085 copying build/lib/evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-03-25T09:06:01,087 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/maritime_bench 2026-03-25T09:06:01,088 copying build/lib/evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-03-25T09:06:01,090 copying build/lib/evalscope/benchmarks/maritime_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-03-25T09:06:01,095 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/_meta 2026-03-25T09:06:01,095 copying build/lib/evalscope/benchmarks/_meta/tool_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,098 copying build/lib/evalscope/benchmarks/_meta/broad_twitter_corpus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,101 copying build/lib/evalscope/benchmarks/_meta/piqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,103 copying build/lib/evalscope/benchmarks/_meta/halueval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,105 copying build/lib/evalscope/benchmarks/_meta/health_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,108 copying build/lib/evalscope/benchmarks/_meta/multi_nerd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,110 copying build/lib/evalscope/benchmarks/_meta/gedit.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,113 copying build/lib/evalscope/benchmarks/_meta/live_code_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,116 copying build/lib/evalscope/benchmarks/_meta/chartqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,118 copying build/lib/evalscope/benchmarks/_meta/math_vision.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,121 copying build/lib/evalscope/benchmarks/_meta/infovqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,123 copying build/lib/evalscope/benchmarks/_meta/aa_lcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,125 copying build/lib/evalscope/benchmarks/_meta/data_collection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,128 copying build/lib/evalscope/benchmarks/_meta/winogrande.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,130 copying build/lib/evalscope/benchmarks/_meta/general_vmcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,133 copying build/lib/evalscope/benchmarks/_meta/omni_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,135 copying build/lib/evalscope/benchmarks/_meta/drivel_binary.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,138 copying build/lib/evalscope/benchmarks/_meta/ontonotes5.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,141 copying build/lib/evalscope/benchmarks/_meta/drop.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,144 copying build/lib/evalscope/benchmarks/_meta/ocr_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,146 copying build/lib/evalscope/benchmarks/_meta/tweet_ner_7.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,149 copying build/lib/evalscope/benchmarks/_meta/bfcl_v4.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,151 copying build/lib/evalscope/benchmarks/_meta/tifa160.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,154 copying build/lib/evalscope/benchmarks/_meta/amc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,156 copying build/lib/evalscope/benchmarks/_meta/siqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,159 copying build/lib/evalscope/benchmarks/_meta/mbpp_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,161 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified_mini.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,163 copying build/lib/evalscope/benchmarks/_meta/blink.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,166 copying build/lib/evalscope/benchmarks/_meta/frames.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,168 copying build/lib/evalscope/benchmarks/_meta/mmmu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,171 copying build/lib/evalscope/benchmarks/_meta/jnlpba_rare.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,174 copying build/lib/evalscope/benchmarks/_meta/poly_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,177 copying build/lib/evalscope/benchmarks/_meta/mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,180 copying build/lib/evalscope/benchmarks/_meta/ifeval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,183 copying build/lib/evalscope/benchmarks/_meta/iquiz.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,185 copying build/lib/evalscope/benchmarks/_meta/evalmuse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,188 copying build/lib/evalscope/benchmarks/_meta/pope.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,190 copying build/lib/evalscope/benchmarks/_meta/needle_haystack.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,193 copying build/lib/evalscope/benchmarks/_meta/science_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,195 copying build/lib/evalscope/benchmarks/_meta/truthful_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,198 copying build/lib/evalscope/benchmarks/_meta/swe_bench_verified.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,200 copying build/lib/evalscope/benchmarks/_meta/genia_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,203 copying build/lib/evalscope/benchmarks/_meta/librispeech.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,205 copying build/lib/evalscope/benchmarks/_meta/zebralogicbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,207 copying build/lib/evalscope/benchmarks/_meta/harvey_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,210 copying build/lib/evalscope/benchmarks/_meta/commonsense_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,212 copying build/lib/evalscope/benchmarks/_meta/simple_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,215 copying build/lib/evalscope/benchmarks/_meta/math_verse.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,217 copying build/lib/evalscope/benchmarks/_meta/anat_em.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,220 copying build/lib/evalscope/benchmarks/_meta/torgo.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,222 copying build/lib/evalscope/benchmarks/_meta/scicode.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,225 copying build/lib/evalscope/benchmarks/_meta/aime24.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,228 copying build/lib/evalscope/benchmarks/_meta/process_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,230 copying build/lib/evalscope/benchmarks/_meta/humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,233 copying build/lib/evalscope/benchmarks/_meta/tau_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,235 copying build/lib/evalscope/benchmarks/_meta/mit_movie_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,238 copying build/lib/evalscope/benchmarks/_meta/bc2gm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,240 copying build/lib/evalscope/benchmarks/_meta/math_500.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,243 copying build/lib/evalscope/benchmarks/_meta/bc4chemd.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,245 copying build/lib/evalscope/benchmarks/_meta/mgsm.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,248 copying build/lib/evalscope/benchmarks/_meta/general_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,250 copying build/lib/evalscope/benchmarks/_meta/seed_bench_2_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,253 copying build/lib/evalscope/benchmarks/_meta/cl_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,255 copying build/lib/evalscope/benchmarks/_meta/coin_flip.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,258 copying build/lib/evalscope/benchmarks/_meta/drivel_writing.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,260 copying build/lib/evalscope/benchmarks/_meta/cc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,264 copying build/lib/evalscope/benchmarks/_meta/cmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,267 copying build/lib/evalscope/benchmarks/_meta/hellaswag.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,269 copying build/lib/evalscope/benchmarks/_meta/gsm8k_v.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,271 copying build/lib/evalscope/benchmarks/_meta/ai2d.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,274 copying build/lib/evalscope/benchmarks/_meta/ncbi.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,276 copying build/lib/evalscope/benchmarks/_meta/competition_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,279 copying build/lib/evalscope/benchmarks/_meta/hle.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,282 copying build/lib/evalscope/benchmarks/_meta/real_world_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,284 copying build/lib/evalscope/benchmarks/_meta/general_t2i.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,286 copying build/lib/evalscope/benchmarks/_meta/aime25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,289 copying build/lib/evalscope/benchmarks/_meta/mri_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,291 copying build/lib/evalscope/benchmarks/_meta/minerva_math.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,293 copying build/lib/evalscope/benchmarks/_meta/med_mcqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,296 copying build/lib/evalscope/benchmarks/_meta/logi_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,298 copying build/lib/evalscope/benchmarks/_meta/eq_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,300 copying build/lib/evalscope/benchmarks/_meta/omni_doc_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,303 copying build/lib/evalscope/benchmarks/_meta/mm_star.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,306 copying build/lib/evalscope/benchmarks/_meta/mmlu_pro.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,309 copying build/lib/evalscope/benchmarks/_meta/math_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,311 copying build/lib/evalscope/benchmarks/_meta/cross_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,314 copying build/lib/evalscope/benchmarks/_meta/bfcl_v3.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,316 copying build/lib/evalscope/benchmarks/_meta/wmt24pp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,319 copying build/lib/evalscope/benchmarks/_meta/gpqa_diamond.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,321 copying build/lib/evalscope/benchmarks/_meta/a_okvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,323 copying build/lib/evalscope/benchmarks/_meta/mm_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,326 copying build/lib/evalscope/benchmarks/_meta/general_arena.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,328 copying build/lib/evalscope/benchmarks/_meta/fin_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,331 copying build/lib/evalscope/benchmarks/_meta/multiple_mbpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,333 copying build/lib/evalscope/benchmarks/_meta/qasc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,335 copying build/lib/evalscope/benchmarks/_meta/cmmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,338 copying build/lib/evalscope/benchmarks/_meta/ceval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,341 copying build/lib/evalscope/benchmarks/_meta/visulogic.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,343 copying build/lib/evalscope/benchmarks/_meta/swe_bench_lite.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,346 copying build/lib/evalscope/benchmarks/_meta/terminal_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,348 copying build/lib/evalscope/benchmarks/_meta/openai_mrcr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,351 copying build/lib/evalscope/benchmarks/_meta/tau2_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,353 copying build/lib/evalscope/benchmarks/_meta/cmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,356 copying build/lib/evalscope/benchmarks/_meta/zerobench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,358 copying build/lib/evalscope/benchmarks/_meta/pubmedqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,360 copying build/lib/evalscope/benchmarks/_meta/general_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,362 copying build/lib/evalscope/benchmarks/_meta/trivia_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,365 copying build/lib/evalscope/benchmarks/_meta/olympiad_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,367 copying build/lib/evalscope/benchmarks/_meta/drivel_selection.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,370 copying build/lib/evalscope/benchmarks/_meta/ifbench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,372 copying build/lib/evalscope/benchmarks/_meta/docvqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,374 copying build/lib/evalscope/benchmarks/_meta/maritime_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,376 copying build/lib/evalscope/benchmarks/_meta/super_gpqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,379 copying build/lib/evalscope/benchmarks/_meta/simple_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,381 copying build/lib/evalscope/benchmarks/_meta/mmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,384 copying build/lib/evalscope/benchmarks/_meta/humaneval_plus.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,387 copying build/lib/evalscope/benchmarks/_meta/mit_restaurant.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,390 copying build/lib/evalscope/benchmarks/_meta/tweebank_ner.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,393 copying build/lib/evalscope/benchmarks/_meta/general_fc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,397 copying build/lib/evalscope/benchmarks/_meta/arena_hard.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,400 copying build/lib/evalscope/benchmarks/_meta/docmath.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,403 copying build/lib/evalscope/benchmarks/_meta/micro_vqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,407 copying build/lib/evalscope/benchmarks/_meta/aime26.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,410 copying build/lib/evalscope/benchmarks/_meta/refcoco.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,413 copying build/lib/evalscope/benchmarks/_meta/gsm8k.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,416 copying build/lib/evalscope/benchmarks/_meta/mmlu_redux.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,419 copying build/lib/evalscope/benchmarks/_meta/race.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,422 copying build/lib/evalscope/benchmarks/_meta/conllpp.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,425 copying build/lib/evalscope/benchmarks/_meta/bbh.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,428 copying build/lib/evalscope/benchmarks/_meta/wnut2017.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,431 copying build/lib/evalscope/benchmarks/_meta/fleurs.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,434 copying build/lib/evalscope/benchmarks/_meta/conll2003.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,437 copying build/lib/evalscope/benchmarks/_meta/genai_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,441 copying build/lib/evalscope/benchmarks/_meta/multiple_humaneval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,444 copying build/lib/evalscope/benchmarks/_meta/mmmlu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,448 copying build/lib/evalscope/benchmarks/_meta/alpaca_eval.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,451 copying build/lib/evalscope/benchmarks/_meta/hallusion_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,454 copying build/lib/evalscope/benchmarks/_meta/ocr_bench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,458 copying build/lib/evalscope/benchmarks/_meta/mmmu.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,462 copying build/lib/evalscope/benchmarks/_meta/bc5cdr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,465 copying build/lib/evalscope/benchmarks/_meta/drivel_multilabel.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,467 copying build/lib/evalscope/benchmarks/_meta/musr.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,470 copying build/lib/evalscope/benchmarks/_meta/general_mcq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,473 copying build/lib/evalscope/benchmarks/_meta/chinese_simpleqa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,475 copying build/lib/evalscope/benchmarks/_meta/hpdv2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,478 copying build/lib/evalscope/benchmarks/_meta/multi_if.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,481 copying build/lib/evalscope/benchmarks/_meta/longbench_v2.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,484 copying build/lib/evalscope/benchmarks/_meta/biomix_qa.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,487 copying build/lib/evalscope/benchmarks/_meta/math_vista.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,489 copying build/lib/evalscope/benchmarks/_meta/arc.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,491 copying build/lib/evalscope/benchmarks/_meta/copious.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,495 copying build/lib/evalscope/benchmarks/_meta/hmmt25.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,497 copying build/lib/evalscope/benchmarks/_meta/sciq.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,499 copying build/lib/evalscope/benchmarks/_meta/music_trivia.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,502 copying build/lib/evalscope/benchmarks/_meta/vstar_bench.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,504 copying build/lib/evalscope/benchmarks/_meta/jnlpba.json -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/_meta 2026-03-25T09:06:01,507 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/alpaca_eval 2026-03-25T09:06:01,508 copying build/lib/evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-03-25T09:06:01,510 copying build/lib/evalscope/benchmarks/alpaca_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-03-25T09:06:01,512 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_fc 2026-03-25T09:06:01,513 copying build/lib/evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-03-25T09:06:01,515 copying build/lib/evalscope/benchmarks/general_fc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-03-25T09:06:01,517 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math 2026-03-25T09:06:01,519 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math/utils 2026-03-25T09:06:01,520 copying build/lib/evalscope/benchmarks/poly_math/utils/instruction.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math/utils 2026-03-25T09:06:01,522 copying build/lib/evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-03-25T09:06:01,524 copying build/lib/evalscope/benchmarks/poly_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-03-25T09:06:01,526 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_redux 2026-03-25T09:06:01,527 copying build/lib/evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-03-25T09:06:01,529 copying build/lib/evalscope/benchmarks/mmlu_redux/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-03-25T09:06:01,531 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit 2026-03-25T09:06:01,533 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit/gedit 2026-03-25T09:06:01,534 copying build/lib/evalscope/benchmarks/image_edit/gedit/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-03-25T09:06:01,536 copying build/lib/evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-03-25T09:06:01,538 copying build/lib/evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-03-25T09:06:01,541 copying build/lib/evalscope/benchmarks/image_edit/gedit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-03-25T09:06:01,542 copying build/lib/evalscope/benchmarks/image_edit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit 2026-03-25T09:06:01,544 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/truthful_qa 2026-03-25T09:06:01,545 copying build/lib/evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-03-25T09:06:01,548 copying build/lib/evalscope/benchmarks/truthful_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-03-25T09:06:01,550 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_mcq 2026-03-25T09:06:01,551 copying build/lib/evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-03-25T09:06:01,553 copying build/lib/evalscope/benchmarks/general_mcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-03-25T09:06:01,555 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ai2d 2026-03-25T09:06:01,556 copying build/lib/evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-03-25T09:06:01,558 copying build/lib/evalscope/benchmarks/ai2d/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-03-25T09:06:01,560 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aa_lcr 2026-03-25T09:06:01,561 copying build/lib/evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-03-25T09:06:01,563 copying build/lib/evalscope/benchmarks/aa_lcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-03-25T09:06:01,565 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/piqa 2026-03-25T09:06:01,566 copying build/lib/evalscope/benchmarks/piqa/piqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-03-25T09:06:01,568 copying build/lib/evalscope/benchmarks/piqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-03-25T09:06:01,570 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/a_okvqa 2026-03-25T09:06:01,571 copying build/lib/evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-03-25T09:06:01,573 copying build/lib/evalscope/benchmarks/a_okvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-03-25T09:06:01,575 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_arena 2026-03-25T09:06:01,576 copying build/lib/evalscope/benchmarks/general_arena/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-03-25T09:06:01,578 copying build/lib/evalscope/benchmarks/general_arena/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-03-25T09:06:01,579 copying build/lib/evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-03-25T09:06:01,582 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/qasc 2026-03-25T09:06:01,583 copying build/lib/evalscope/benchmarks/qasc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-03-25T09:06:01,585 copying build/lib/evalscope/benchmarks/qasc/qasc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-03-25T09:06:01,587 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_pro 2026-03-25T09:06:01,588 copying build/lib/evalscope/benchmarks/mmlu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-03-25T09:06:01,590 copying build/lib/evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-03-25T09:06:01,592 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mgsm 2026-03-25T09:06:01,593 copying build/lib/evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-03-25T09:06:01,595 copying build/lib/evalscope/benchmarks/mgsm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-03-25T09:06:01,597 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/real_world_qa 2026-03-25T09:06:01,598 copying build/lib/evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-03-25T09:06:01,600 copying build/lib/evalscope/benchmarks/real_world_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-03-25T09:06:01,602 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmu 2026-03-25T09:06:01,603 copying build/lib/evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-03-25T09:06:01,606 copying build/lib/evalscope/benchmarks/cmmu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-03-25T09:06:01,607 copying build/lib/evalscope/benchmarks/cmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-03-25T09:06:01,609 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k 2026-03-25T09:06:01,611 copying build/lib/evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-03-25T09:06:01,612 copying build/lib/evalscope/benchmarks/gsm8k/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-03-25T09:06:01,615 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner 2026-03-25T09:06:01,616 copying build/lib/evalscope/benchmarks/ner/bc2gm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,618 copying build/lib/evalscope/benchmarks/ner/copious_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,620 copying build/lib/evalscope/benchmarks/ner/fin_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,621 copying build/lib/evalscope/benchmarks/ner/anat_em_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,624 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,625 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,626 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,628 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,630 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,631 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,633 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-03-25T09:06:01,635 copying build/lib/evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,637 copying build/lib/evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,639 copying build/lib/evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,642 copying build/lib/evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,644 copying build/lib/evalscope/benchmarks/ner/wnut2017_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,646 copying build/lib/evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,648 copying build/lib/evalscope/benchmarks/ner/genia_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,650 copying build/lib/evalscope/benchmarks/ner/ncbi_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,651 copying build/lib/evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,653 copying build/lib/evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,655 copying build/lib/evalscope/benchmarks/ner/cross_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,657 copying build/lib/evalscope/benchmarks/ner/conll2003_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,659 copying build/lib/evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,661 copying build/lib/evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,663 copying build/lib/evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,665 copying build/lib/evalscope/benchmarks/ner/conllpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,667 copying build/lib/evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,669 copying build/lib/evalscope/benchmarks/ner/jnlpba_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,671 copying build/lib/evalscope/benchmarks/ner/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-03-25T09:06:01,672 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmmu 2026-03-25T09:06:01,674 copying build/lib/evalscope/benchmarks/cmmmu/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-03-25T09:06:01,676 copying build/lib/evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-03-25T09:06:01,678 copying build/lib/evalscope/benchmarks/cmmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-03-25T09:06:01,680 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/trivia_qa 2026-03-25T09:06:01,682 copying build/lib/evalscope/benchmarks/trivia_qa/samples.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-03-25T09:06:01,683 copying build/lib/evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-03-25T09:06:01,686 copying build/lib/evalscope/benchmarks/trivia_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-03-25T09:06:01,688 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/visu_logic 2026-03-25T09:06:01,689 copying build/lib/evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-03-25T09:06:01,691 copying build/lib/evalscope/benchmarks/visu_logic/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-03-25T09:06:01,693 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zebralogicbench 2026-03-25T09:06:01,694 copying build/lib/evalscope/benchmarks/zebralogicbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-03-25T09:06:01,697 copying build/lib/evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-03-25T09:06:01,699 copying build/lib/evalscope/benchmarks/zebralogicbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-03-25T09:06:01,701 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/healthbench 2026-03-25T09:06:01,702 copying build/lib/evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-03-25T09:06:01,705 copying build/lib/evalscope/benchmarks/healthbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-03-25T09:06:01,707 copying build/lib/evalscope/benchmarks/healthbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-03-25T09:06:01,709 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench 2026-03-25T09:06:01,711 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,712 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,714 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,717 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,719 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,721 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,723 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,724 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,728 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:06:01,729 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:06:01,731 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:06:01,734 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:06:01,735 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-03-25T09:06:01,737 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,739 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-03-25T09:06:01,741 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench 2026-03-25T09:06:01,742 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-03-25T09:06:01,745 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-03-25T09:06:01,746 copying build/lib/evalscope/benchmarks/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-03-25T09:06:01,748 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_500 2026-03-25T09:06:01,749 copying build/lib/evalscope/benchmarks/math_500/math_500_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-03-25T09:06:01,751 copying build/lib/evalscope/benchmarks/math_500/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-03-25T09:06:01,753 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k_v 2026-03-25T09:06:01,754 copying build/lib/evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-03-25T09:06:01,756 copying build/lib/evalscope/benchmarks/gsm8k_v/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-03-25T09:06:01,758 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/micro_vqa 2026-03-25T09:06:01,759 copying build/lib/evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-03-25T09:06:01,761 copying build/lib/evalscope/benchmarks/micro_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-03-25T09:06:01,763 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vision 2026-03-25T09:06:01,764 copying build/lib/evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-03-25T09:06:01,767 copying build/lib/evalscope/benchmarks/math_vision/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-03-25T09:06:01,769 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/infovqa 2026-03-25T09:06:01,769 copying build/lib/evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-03-25T09:06:01,771 copying build/lib/evalscope/benchmarks/infovqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-03-25T09:06:01,773 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/olympiad_bench 2026-03-25T09:06:01,774 copying build/lib/evalscope/benchmarks/olympiad_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-03-25T09:06:01,777 copying build/lib/evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-03-25T09:06:01,779 copying build/lib/evalscope/benchmarks/olympiad_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-03-25T09:06:01,781 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/eq_bench 2026-03-25T09:06:01,782 copying build/lib/evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-03-25T09:06:01,784 copying build/lib/evalscope/benchmarks/eq_bench/answer_validation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-03-25T09:06:01,786 copying build/lib/evalscope/benchmarks/eq_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-03-25T09:06:01,789 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tool_bench 2026-03-25T09:06:01,789 copying build/lib/evalscope/benchmarks/tool_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-03-25T09:06:01,792 copying build/lib/evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-03-25T09:06:01,794 copying build/lib/evalscope/benchmarks/tool_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-03-25T09:06:01,796 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_vqa 2026-03-25T09:06:01,797 copying build/lib/evalscope/benchmarks/simple_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-03-25T09:06:01,799 copying build/lib/evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-03-25T09:06:01,801 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/music_trivia 2026-03-25T09:06:01,802 copying build/lib/evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-03-25T09:06:01,804 copying build/lib/evalscope/benchmarks/music_trivia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-03-25T09:06:01,806 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hellaswag 2026-03-25T09:06:01,807 copying build/lib/evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-03-25T09:06:01,809 copying build/lib/evalscope/benchmarks/hellaswag/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-03-25T09:06:01,812 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/winogrande 2026-03-25T09:06:01,813 copying build/lib/evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-03-25T09:06:01,815 copying build/lib/evalscope/benchmarks/winogrande/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-03-25T09:06:01,817 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/fleurs 2026-03-25T09:06:01,817 copying build/lib/evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-03-25T09:06:01,820 copying build/lib/evalscope/benchmarks/fleurs/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-03-25T09:06:01,822 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chinese_simple_qa 2026-03-25T09:06:01,822 copying build/lib/evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-03-25T09:06:01,825 copying build/lib/evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-03-25T09:06:01,827 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minerva_math 2026-03-25T09:06:01,827 copying build/lib/evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-03-25T09:06:01,830 copying build/lib/evalscope/benchmarks/minerva_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-03-25T09:06:01,831 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_verse 2026-03-25T09:06:01,832 copying build/lib/evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-03-25T09:06:01,835 copying build/lib/evalscope/benchmarks/math_verse/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-03-25T09:06:01,836 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/amc 2026-03-25T09:06:01,837 copying build/lib/evalscope/benchmarks/amc/amc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-03-25T09:06:01,839 copying build/lib/evalscope/benchmarks/amc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-03-25T09:06:01,841 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zerobench 2026-03-25T09:06:01,842 copying build/lib/evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-03-25T09:06:01,844 copying build/lib/evalscope/benchmarks/zerobench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-03-25T09:06:01,846 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hmmt25 2026-03-25T09:06:01,847 copying build/lib/evalscope/benchmarks/hmmt25/hmmt25_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hmmt25 2026-03-25T09:06:01,850 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chartqa 2026-03-25T09:06:01,851 copying build/lib/evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-03-25T09:06:01,853 copying build/lib/evalscope/benchmarks/chartqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-03-25T09:06:01,855 copying build/lib/evalscope/benchmarks/chartqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-03-25T09:06:01,857 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gpqa 2026-03-25T09:06:01,858 copying build/lib/evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-03-25T09:06:01,860 copying build/lib/evalscope/benchmarks/gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-03-25T09:06:01,862 copying build/lib/evalscope/benchmarks/gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-03-25T09:06:01,864 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vmcq 2026-03-25T09:06:01,865 copying build/lib/evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-03-25T09:06:01,867 copying build/lib/evalscope/benchmarks/general_vmcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-03-25T09:06:01,869 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode 2026-03-25T09:06:01,870 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode/docker 2026-03-25T09:06:01,871 copying build/lib/evalscope/benchmarks/scicode/docker/process_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-03-25T09:06:01,874 copying build/lib/evalscope/benchmarks/scicode/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-03-25T09:06:01,875 copying build/lib/evalscope/benchmarks/scicode/docker/test_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-03-25T09:06:01,877 copying build/lib/evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-03-25T09:06:01,879 copying build/lib/evalscope/benchmarks/scicode/prompt_templates.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-03-25T09:06:01,881 copying build/lib/evalscope/benchmarks/scicode/scicode_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-03-25T09:06:01,883 copying build/lib/evalscope/benchmarks/scicode/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-03-25T09:06:01,884 copying build/lib/evalscope/benchmarks/scicode/util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-03-25T09:06:01,886 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multi_if 2026-03-25T09:06:01,887 copying build/lib/evalscope/benchmarks/multi_if/ifeval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-03-25T09:06:01,891 copying build/lib/evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-03-25T09:06:01,893 copying build/lib/evalscope/benchmarks/multi_if/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-03-25T09:06:01,895 copying build/lib/evalscope/benchmarks/multi_if/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-03-25T09:06:01,898 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifeval 2026-03-25T09:06:01,899 copying build/lib/evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-03-25T09:06:01,901 copying build/lib/evalscope/benchmarks/ifeval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-03-25T09:06:01,903 copying build/lib/evalscope/benchmarks/ifeval/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-03-25T09:06:01,905 copying build/lib/evalscope/benchmarks/ifeval/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-03-25T09:06:01,908 copying build/lib/evalscope/benchmarks/ifeval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-03-25T09:06:01,909 copying build/lib/evalscope/benchmarks/ifeval/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-03-25T09:06:01,913 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/halu_eval 2026-03-25T09:06:01,914 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-03-25T09:06:01,916 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-03-25T09:06:01,919 copying build/lib/evalscope/benchmarks/halu_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-03-25T09:06:01,921 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/process_bench 2026-03-25T09:06:01,922 copying build/lib/evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-03-25T09:06:01,924 copying build/lib/evalscope/benchmarks/process_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-03-25T09:06:01,926 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_qa 2026-03-25T09:06:01,927 copying build/lib/evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-03-25T09:06:01,929 copying build/lib/evalscope/benchmarks/general_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-03-25T09:06:01,931 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docmath 2026-03-25T09:06:01,932 copying build/lib/evalscope/benchmarks/docmath/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-03-25T09:06:01,935 copying build/lib/evalscope/benchmarks/docmath/docmath_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-03-25T09:06:01,937 copying build/lib/evalscope/benchmarks/docmath/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-03-25T09:06:01,939 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/terminal_bench 2026-03-25T09:06:01,940 copying build/lib/evalscope/benchmarks/terminal_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-03-25T09:06:01,942 copying build/lib/evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-03-25T09:06:01,944 copying build/lib/evalscope/benchmarks/terminal_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-03-25T09:06:01,946 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/race 2026-03-25T09:06:01,947 copying build/lib/evalscope/benchmarks/race/race_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-03-25T09:06:01,949 copying build/lib/evalscope/benchmarks/race/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-03-25T09:06:01,952 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/competition_math 2026-03-25T09:06:01,953 copying build/lib/evalscope/benchmarks/competition_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-03-25T09:06:01,954 copying build/lib/evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-03-25T09:06:01,957 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cl_bench 2026-03-25T09:06:01,958 copying build/lib/evalscope/benchmarks/cl_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-03-25T09:06:01,960 copying build/lib/evalscope/benchmarks/cl_bench/cl_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-03-25T09:06:01,962 copying build/lib/evalscope/benchmarks/cl_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cl_bench 2026-03-25T09:06:01,964 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arena_hard 2026-03-25T09:06:01,965 copying build/lib/evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-03-25T09:06:01,968 copying build/lib/evalscope/benchmarks/arena_hard/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-03-25T09:06:01,970 copying build/lib/evalscope/benchmarks/arena_hard/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-03-25T09:06:01,972 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/iquiz 2026-03-25T09:06:01,973 copying build/lib/evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-03-25T09:06:01,975 copying build/lib/evalscope/benchmarks/iquiz/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-03-25T09:06:01,977 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench 2026-03-25T09:06:01,978 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-03-25T09:06:01,980 copying build/lib/evalscope/benchmarks/swe_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-03-25T09:06:01,983 copying build/lib/evalscope/benchmarks/swe_bench/build_images.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-03-25T09:06:01,985 copying build/lib/evalscope/benchmarks/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-03-25T09:06:01,986 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmlu 2026-03-25T09:06:01,987 copying build/lib/evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-03-25T09:06:01,990 copying build/lib/evalscope/benchmarks/cmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-03-25T09:06:01,992 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/commonsense_qa 2026-03-25T09:06:01,993 copying build/lib/evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-03-25T09:06:01,995 copying build/lib/evalscope/benchmarks/commonsense_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-03-25T09:06:01,997 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/torgo 2026-03-25T09:06:01,998 copying build/lib/evalscope/benchmarks/torgo/torgo_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-03-25T09:06:02,000 copying build/lib/evalscope/benchmarks/torgo/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-03-25T09:06:02,002 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/wmt 2026-03-25T09:06:02,003 copying build/lib/evalscope/benchmarks/wmt/wmt24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-03-25T09:06:02,005 copying build/lib/evalscope/benchmarks/wmt/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-03-25T09:06:02,007 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hallusion_bench 2026-03-25T09:06:02,008 copying build/lib/evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-03-25T09:06:02,010 copying build/lib/evalscope/benchmarks/hallusion_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-03-25T09:06:02,013 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench 2026-03-25T09:06:02,014 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:06:02,015 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:06:02,018 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:06:02,020 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-03-25T09:06:02,023 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:06:02,024 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:06:02,026 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:06:02,028 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-03-25T09:06:02,030 copying build/lib/evalscope/benchmarks/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench 2026-03-25T09:06:02,032 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docvqa 2026-03-25T09:06:02,033 copying build/lib/evalscope/benchmarks/docvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-03-25T09:06:02,034 copying build/lib/evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-03-25T09:06:02,036 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,037 copying build/lib/evalscope/benchmarks/live_code_bench/extract_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,040 copying build/lib/evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,042 copying build/lib/evalscope/benchmarks/live_code_bench/testing_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,044 copying build/lib/evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,046 copying build/lib/evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,048 copying build/lib/evalscope/benchmarks/live_code_bench/prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,051 copying build/lib/evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,052 copying build/lib/evalscope/benchmarks/live_code_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,054 copying build/lib/evalscope/benchmarks/live_code_bench/load_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-03-25T09:06:02,056 copying build/lib/evalscope/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks 2026-03-25T09:06:02,058 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_qa 2026-03-25T09:06:02,059 copying build/lib/evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-03-25T09:06:02,061 copying build/lib/evalscope/benchmarks/math_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-03-25T09:06:02,063 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drop 2026-03-25T09:06:02,064 copying build/lib/evalscope/benchmarks/drop/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-03-25T09:06:02,066 copying build/lib/evalscope/benchmarks/drop/drop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-03-25T09:06:02,069 copying build/lib/evalscope/benchmarks/drop/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-03-25T09:06:02,071 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmlu 2026-03-25T09:06:02,072 copying build/lib/evalscope/benchmarks/mmmlu/mmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-03-25T09:06:02,074 copying build/lib/evalscope/benchmarks/mmmlu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-03-25T09:06:02,076 copying build/lib/evalscope/benchmarks/mmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmlu 2026-03-25T09:06:02,078 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/refcoco 2026-03-25T09:06:02,079 copying build/lib/evalscope/benchmarks/refcoco/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-03-25T09:06:02,081 copying build/lib/evalscope/benchmarks/refcoco/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-03-25T09:06:02,082 copying build/lib/evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-03-25T09:06:02,084 copying build/lib/evalscope/benchmarks/refcoco/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-03-25T09:06:02,086 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/siqa 2026-03-25T09:06:02,087 copying build/lib/evalscope/benchmarks/siqa/siqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-03-25T09:06:02,089 copying build/lib/evalscope/benchmarks/siqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-03-25T09:06:02,091 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifbench 2026-03-25T09:06:02,092 copying build/lib/evalscope/benchmarks/ifbench/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-03-25T09:06:02,094 copying build/lib/evalscope/benchmarks/ifbench/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-03-25T09:06:02,096 copying build/lib/evalscope/benchmarks/ifbench/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-03-25T09:06:02,099 copying build/lib/evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-03-25T09:06:02,100 copying build/lib/evalscope/benchmarks/ifbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-03-25T09:06:02,102 copying build/lib/evalscope/benchmarks/ifbench/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-03-25T09:06:02,106 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/data_collection 2026-03-25T09:06:02,107 copying build/lib/evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-03-25T09:06:02,109 copying build/lib/evalscope/benchmarks/data_collection/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-03-25T09:06:02,111 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humaneval 2026-03-25T09:06:02,112 copying build/lib/evalscope/benchmarks/humaneval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-03-25T09:06:02,115 copying build/lib/evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-03-25T09:06:02,117 copying build/lib/evalscope/benchmarks/humaneval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-03-25T09:06:02,119 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/openai_mrcr 2026-03-25T09:06:02,120 copying build/lib/evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-03-25T09:06:02,123 copying build/lib/evalscope/benchmarks/openai_mrcr/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-03-25T09:06:02,124 copying build/lib/evalscope/benchmarks/openai_mrcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-03-25T09:06:02,127 creating build/bdist.linux-armv7l/wheel/evalscope/third_party 2026-03-25T09:06:02,128 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write 2026-03-25T09:06:02,130 copying build/lib/evalscope/third_party/longbench_write/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,132 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/resources 2026-03-25T09:06:02,133 copying build/lib/evalscope/third_party/longbench_write/resources/judge.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-03-25T09:06:02,135 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-03-25T09:06:02,138 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-03-25T09:06:02,141 copying build/lib/evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-03-25T09:06:02,143 copying build/lib/evalscope/third_party/longbench_write/resources/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-03-25T09:06:02,145 copying build/lib/evalscope/third_party/longbench_write/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,146 copying build/lib/evalscope/third_party/longbench_write/default_task.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,148 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/tools 2026-03-25T09:06:02,149 copying build/lib/evalscope/third_party/longbench_write/tools/data_etl.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-03-25T09:06:02,152 copying build/lib/evalscope/third_party/longbench_write/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-03-25T09:06:02,153 copying build/lib/evalscope/third_party/longbench_write/tools/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-03-25T09:06:02,155 copying build/lib/evalscope/third_party/longbench_write/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,157 copying build/lib/evalscope/third_party/longbench_write/default_task.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,160 copying build/lib/evalscope/third_party/longbench_write/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,162 copying build/lib/evalscope/third_party/longbench_write/longbench_write.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,164 copying build/lib/evalscope/third_party/longbench_write/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-03-25T09:06:02,166 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench 2026-03-25T09:06:02,167 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/resources 2026-03-25T09:06:02,168 copying build/lib/evalscope/third_party/thinkbench/resources/critique_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-03-25T09:06:02,170 copying build/lib/evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-03-25T09:06:02,173 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/tools 2026-03-25T09:06:02,174 copying build/lib/evalscope/third_party/thinkbench/tools/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-03-25T09:06:02,176 copying build/lib/evalscope/third_party/thinkbench/tools/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-03-25T09:06:02,177 copying build/lib/evalscope/third_party/thinkbench/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-03-25T09:06:02,179 copying build/lib/evalscope/third_party/thinkbench/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-03-25T09:06:02,181 copying build/lib/evalscope/third_party/thinkbench/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-03-25T09:06:02,183 copying build/lib/evalscope/third_party/thinkbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-03-25T09:06:02,185 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static 2026-03-25T09:06:02,186 copying build/lib/evalscope/third_party/toolbench_static/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,188 copying build/lib/evalscope/third_party/toolbench_static/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,190 copying build/lib/evalscope/third_party/toolbench_static/config_default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,192 copying build/lib/evalscope/third_party/toolbench_static/config_default.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,193 copying build/lib/evalscope/third_party/toolbench_static/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,195 copying build/lib/evalscope/third_party/toolbench_static/toolbench_static.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,197 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static/llm 2026-03-25T09:06:02,199 copying build/lib/evalscope/third_party/toolbench_static/llm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-03-25T09:06:02,200 copying build/lib/evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-03-25T09:06:02,202 copying build/lib/evalscope/third_party/toolbench_static/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,204 copying build/lib/evalscope/third_party/toolbench_static/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-03-25T09:06:02,206 copying build/lib/evalscope/third_party/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party 2026-03-25T09:06:02,208 creating build/bdist.linux-armv7l/wheel/evalscope/utils 2026-03-25T09:06:02,209 copying build/lib/evalscope/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,211 copying build/lib/evalscope/utils/multi_choices.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,213 copying build/lib/evalscope/utils/ner.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,216 copying build/lib/evalscope/utils/json_schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,218 copying build/lib/evalscope/utils/import_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,220 creating build/bdist.linux-armv7l/wheel/evalscope/utils/doc_utils 2026-03-25T09:06:02,222 copying build/lib/evalscope/utils/doc_utils/readme_generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-03-25T09:06:02,224 copying build/lib/evalscope/utils/doc_utils/generate_dataset_md.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-03-25T09:06:02,227 copying build/lib/evalscope/utils/doc_utils/translate_description.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-03-25T09:06:02,229 copying build/lib/evalscope/utils/doc_utils/benchmark_stats.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-03-25T09:06:02,232 copying build/lib/evalscope/utils/doc_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/doc_utils 2026-03-25T09:06:02,233 copying build/lib/evalscope/utils/chat_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,235 copying build/lib/evalscope/utils/function_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,238 copying build/lib/evalscope/utils/code_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,240 copying build/lib/evalscope/utils/deprecation_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,242 copying build/lib/evalscope/utils/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,244 copying build/lib/evalscope/utils/io_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,246 copying build/lib/evalscope/utils/argument_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,249 creating build/bdist.linux-armv7l/wheel/evalscope/utils/tqdm_utils 2026-03-25T09:06:02,250 copying build/lib/evalscope/utils/tqdm_utils/tqdm_logging.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-03-25T09:06:02,252 copying build/lib/evalscope/utils/tqdm_utils/progress_tracker.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-03-25T09:06:02,254 copying build/lib/evalscope/utils/tqdm_utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils/tqdm_utils 2026-03-25T09:06:02,256 copying build/lib/evalscope/utils/url_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,258 copying build/lib/evalscope/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,260 copying build/lib/evalscope/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-03-25T09:06:02,262 copying build/lib/evalscope/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-03-25T09:06:02,265 creating build/bdist.linux-armv7l/wheel/evalscope/service 2026-03-25T09:06:02,266 creating build/bdist.linux-armv7l/wheel/evalscope/service/utils 2026-03-25T09:06:02,267 copying build/lib/evalscope/service/utils/log.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-03-25T09:06:02,269 copying build/lib/evalscope/service/utils/benchmarks.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-03-25T09:06:02,271 copying build/lib/evalscope/service/utils/process.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-03-25T09:06:02,273 copying build/lib/evalscope/service/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/utils 2026-03-25T09:06:02,276 creating build/bdist.linux-armv7l/wheel/evalscope/service/frontend 2026-03-25T09:06:02,277 copying build/lib/evalscope/service/frontend/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-03-25T09:06:02,279 copying build/lib/evalscope/service/frontend/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-03-25T09:06:02,281 copying build/lib/evalscope/service/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-03-25T09:06:02,283 copying build/lib/evalscope/service/frontend/async_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-03-25T09:06:02,285 copying build/lib/evalscope/service/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-03-25T09:06:02,288 creating build/bdist.linux-armv7l/wheel/evalscope/service/blueprints 2026-03-25T09:06:02,289 copying build/lib/evalscope/service/blueprints/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-03-25T09:06:02,291 copying build/lib/evalscope/service/blueprints/perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-03-25T09:06:02,293 copying build/lib/evalscope/service/blueprints/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/blueprints 2026-03-25T09:06:02,295 copying build/lib/evalscope/service/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-03-25T09:06:02,297 creating build/bdist.linux-armv7l/wheel/evalscope/metrics 2026-03-25T09:06:02,298 copying build/lib/evalscope/metrics/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-03-25T09:06:02,301 copying build/lib/evalscope/metrics/rouge_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-03-25T09:06:02,303 copying build/lib/evalscope/metrics/llm_judge.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-03-25T09:06:02,306 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/text_normalizer 2026-03-25T09:06:02,307 copying build/lib/evalscope/metrics/text_normalizer/english.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-03-25T09:06:02,310 copying build/lib/evalscope/metrics/text_normalizer/basic.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-03-25T09:06:02,312 copying build/lib/evalscope/metrics/text_normalizer/english.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-03-25T09:06:02,314 copying build/lib/evalscope/metrics/text_normalizer/wer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-03-25T09:06:02,317 copying build/lib/evalscope/metrics/text_normalizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-03-25T09:06:02,318 copying build/lib/evalscope/metrics/text_normalizer/chinese.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-03-25T09:06:02,321 copying build/lib/evalscope/metrics/math_parser.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-03-25T09:06:02,324 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/sem_score 2026-03-25T09:06:02,325 copying build/lib/evalscope/metrics/sem_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-03-25T09:06:02,328 copying build/lib/evalscope/metrics/sem_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-03-25T09:06:02,330 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,331 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models 2026-03-25T09:06:02,332 copying build/lib/evalscope/metrics/t2v_metrics/models/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-03-25T09:06:02,335 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:06:02,336 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:06:02,339 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:06:02,341 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:06:02,344 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-03-25T09:06:02,345 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-03-25T09:06:02,347 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-03-25T09:06:02,348 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-03-25T09:06:02,351 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-03-25T09:06:02,352 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-03-25T09:06:02,355 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-03-25T09:06:02,356 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-03-25T09:06:02,358 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-03-25T09:06:02,359 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-03-25T09:06:02,361 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-03-25T09:06:02,363 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-03-25T09:06:02,365 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,366 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,369 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,371 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,374 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,377 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,380 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,382 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,385 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,387 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,389 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,392 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,394 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-03-25T09:06:02,397 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,399 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,402 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,403 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,405 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,407 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,410 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,412 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,414 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,416 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,419 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,423 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,425 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,430 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-03-25T09:06:02,433 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,436 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,439 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-03-25T09:06:02,444 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:06:02,445 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:06:02,447 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:06:02,449 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:06:02,452 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-03-25T09:06:02,454 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-03-25T09:06:02,456 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:02,457 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:02,460 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:02,461 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-03-25T09:06:02,464 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,465 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,467 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,469 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,471 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,473 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,474 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,476 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,478 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,480 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,481 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,483 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,485 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,487 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,488 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,490 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,492 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,494 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,496 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,498 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-03-25T09:06:02,500 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-03-25T09:06:02,503 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,504 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,507 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:06:02,508 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:06:02,510 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:06:02,512 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-03-25T09:06:02,514 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,516 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,518 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,520 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,523 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,525 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-03-25T09:06:02,526 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-03-25T09:06:02,528 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:06:02,531 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-03-25T09:06:02,533 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:06:02,535 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:06:02,536 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:06:02,539 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:06:02,540 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:06:02,542 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-03-25T09:06:02,544 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:06:02,546 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:06:02,548 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:06:02,550 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:06:02,552 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-03-25T09:06:02,554 copying build/lib/evalscope/metrics/t2v_metrics/models/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-03-25T09:06:02,556 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:06:02,557 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:06:02,560 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:06:02,561 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:06:02,563 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:06:02,565 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-03-25T09:06:02,566 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:06:02,569 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:06:02,571 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-03-25T09:06:02,572 copying build/lib/evalscope/metrics/t2v_metrics/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-03-25T09:06:02,574 copying build/lib/evalscope/metrics/t2v_metrics/clipscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,576 copying build/lib/evalscope/metrics/t2v_metrics/vqascore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,578 copying build/lib/evalscope/metrics/t2v_metrics/itmscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,580 copying build/lib/evalscope/metrics/t2v_metrics/score.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,582 copying build/lib/evalscope/metrics/t2v_metrics/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,583 copying build/lib/evalscope/metrics/t2v_metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-03-25T09:06:02,586 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bert_score 2026-03-25T09:06:02,587 copying build/lib/evalscope/metrics/bert_score/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-03-25T09:06:02,589 copying build/lib/evalscope/metrics/bert_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-03-25T09:06:02,591 copying build/lib/evalscope/metrics/bert_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-03-25T09:06:02,593 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bundled_rouge_score 2026-03-25T09:06:02,594 copying build/lib/evalscope/metrics/bundled_rouge_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-03-25T09:06:02,596 copying build/lib/evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-03-25T09:06:02,598 copying build/lib/evalscope/metrics/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-03-25T09:06:02,601 copying build/lib/evalscope/metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-03-25T09:06:02,603 creating build/bdist.linux-armv7l/wheel/evalscope/perf 2026-03-25T09:06:02,604 copying build/lib/evalscope/perf/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-03-25T09:06:02,608 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils 2026-03-25T09:06:02,609 copying build/lib/evalscope/perf/utils/handler.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,611 copying build/lib/evalscope/perf/utils/rich_display.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,613 copying build/lib/evalscope/perf/utils/log_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,615 copying build/lib/evalscope/perf/utils/analysis_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,616 copying build/lib/evalscope/perf/utils/local_server.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,618 copying build/lib/evalscope/perf/utils/benchmark_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,621 copying build/lib/evalscope/perf/utils/db_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,624 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils/report 2026-03-25T09:06:02,625 copying build/lib/evalscope/perf/utils/report/perf_charts.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-03-25T09:06:02,627 copying build/lib/evalscope/perf/utils/report/generate_report.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-03-25T09:06:02,630 copying build/lib/evalscope/perf/utils/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-03-25T09:06:02,631 copying build/lib/evalscope/perf/utils/report/perf_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils/report 2026-03-25T09:06:02,633 copying build/lib/evalscope/perf/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-03-25T09:06:02,635 copying build/lib/evalscope/perf/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-03-25T09:06:02,638 creating build/bdist.linux-armv7l/wheel/evalscope/perf/sla 2026-03-25T09:06:02,639 copying build/lib/evalscope/perf/sla/sla_criterion.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-03-25T09:06:02,642 copying build/lib/evalscope/perf/sla/sla_run.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-03-25T09:06:02,644 copying build/lib/evalscope/perf/sla/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-03-25T09:06:02,646 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin 2026-03-25T09:06:02,648 copying build/lib/evalscope/perf/plugin/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-03-25T09:06:02,650 copying build/lib/evalscope/perf/plugin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-03-25T09:06:02,652 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/api 2026-03-25T09:06:02,653 copying build/lib/evalscope/perf/plugin/api/custom_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,656 copying build/lib/evalscope/perf/plugin/api/openai_embedding_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,658 copying build/lib/evalscope/perf/plugin/api/openai_rerank_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,660 copying build/lib/evalscope/perf/plugin/api/default_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,662 copying build/lib/evalscope/perf/plugin/api/dashscope_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,664 copying build/lib/evalscope/perf/plugin/api/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,666 copying build/lib/evalscope/perf/plugin/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,667 copying build/lib/evalscope/perf/plugin/api/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-03-25T09:06:02,670 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/datasets 2026-03-25T09:06:02,671 copying build/lib/evalscope/perf/plugin/datasets/custom.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,673 copying build/lib/evalscope/perf/plugin/datasets/line_by_line.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,675 copying build/lib/evalscope/perf/plugin/datasets/speed_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,677 copying build/lib/evalscope/perf/plugin/datasets/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,679 copying build/lib/evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,681 copying build/lib/evalscope/perf/plugin/datasets/longalpaca.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,683 copying build/lib/evalscope/perf/plugin/datasets/rerank_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,685 copying build/lib/evalscope/perf/plugin/datasets/random_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,688 copying build/lib/evalscope/perf/plugin/datasets/embedding_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,690 copying build/lib/evalscope/perf/plugin/datasets/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,692 copying build/lib/evalscope/perf/plugin/datasets/openqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,694 copying build/lib/evalscope/perf/plugin/datasets/flickr8k.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,696 copying build/lib/evalscope/perf/plugin/datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,698 copying build/lib/evalscope/perf/plugin/datasets/kontext_bench.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-03-25T09:06:02,700 copying build/lib/evalscope/perf/http_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-03-25T09:06:02,702 copying build/lib/evalscope/perf/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-03-25T09:06:02,704 copying build/lib/evalscope/perf/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-03-25T09:06:02,706 creating build/bdist.linux-armv7l/wheel/evalscope/backend 2026-03-25T09:06:02,708 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass 2026-03-25T09:06:02,709 copying build/lib/evalscope/backend/opencompass/api_meta_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-03-25T09:06:02,711 copying build/lib/evalscope/backend/opencompass/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-03-25T09:06:02,713 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass/tasks 2026-03-25T09:06:02,714 copying build/lib/evalscope/backend/opencompass/tasks/eval_datasets.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-03-25T09:06:02,717 copying build/lib/evalscope/backend/opencompass/tasks/eval_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-03-25T09:06:02,719 copying build/lib/evalscope/backend/opencompass/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-03-25T09:06:02,720 copying build/lib/evalscope/backend/opencompass/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-03-25T09:06:02,722 creating build/bdist.linux-armv7l/wheel/evalscope/backend/vlm_eval_kit 2026-03-25T09:06:02,723 copying build/lib/evalscope/backend/vlm_eval_kit/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-03-25T09:06:02,725 copying build/lib/evalscope/backend/vlm_eval_kit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-03-25T09:06:02,727 copying build/lib/evalscope/backend/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-03-25T09:06:02,729 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval 2026-03-25T09:06:02,731 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/utils 2026-03-25T09:06:02,732 copying build/lib/evalscope/backend/rag_eval/utils/tools.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-03-25T09:06:02,734 copying build/lib/evalscope/backend/rag_eval/utils/embedding.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-03-25T09:06:02,736 copying build/lib/evalscope/backend/rag_eval/utils/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-03-25T09:06:02,738 copying build/lib/evalscope/backend/rag_eval/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-03-25T09:06:02,740 copying build/lib/evalscope/backend/rag_eval/utils/clip.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-03-25T09:06:02,743 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb 2026-03-25T09:06:02,744 copying build/lib/evalscope/backend/rag_eval/cmteb/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-03-25T09:06:02,746 copying build/lib/evalscope/backend/rag_eval/cmteb/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-03-25T09:06:02,748 copying build/lib/evalscope/backend/rag_eval/cmteb/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-03-25T09:06:02,751 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,752 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,754 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,756 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,759 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,761 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,763 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,765 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,767 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-03-25T09:06:02,769 copying build/lib/evalscope/backend/rag_eval/cmteb/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-03-25T09:06:02,771 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas 2026-03-25T09:06:02,772 copying build/lib/evalscope/backend/rag_eval/ragas/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-03-25T09:06:02,774 copying build/lib/evalscope/backend/rag_eval/ragas/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-03-25T09:06:02,776 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/prompts 2026-03-25T09:06:02,777 copying build/lib/evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/prompts 2026-03-25T09:06:02,779 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:06:02,780 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:06:02,783 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:06:02,784 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:06:02,786 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:06:02,789 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-03-25T09:06:02,790 copying build/lib/evalscope/backend/rag_eval/ragas/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-03-25T09:06:02,792 copying build/lib/evalscope/backend/rag_eval/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-03-25T09:06:02,794 copying build/lib/evalscope/backend/rag_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-03-25T09:06:02,796 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:06:02,798 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/utils 2026-03-25T09:06:02,799 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-03-25T09:06:02,802 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-03-25T09:06:02,803 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:06:02,805 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:06:02,808 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:06:02,811 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:06:02,811 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:06:02,814 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:06:02,816 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:06:02,818 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-03-25T09:06:02,819 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-03-25T09:06:02,821 copying build/lib/evalscope/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-03-25T09:06:02,823 creating build/bdist.linux-armv7l/wheel/evalscope/filters 2026-03-25T09:06:02,824 copying build/lib/evalscope/filters/extraction.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-03-25T09:06:02,826 copying build/lib/evalscope/filters/selection.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-03-25T09:06:02,827 copying build/lib/evalscope/filters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-03-25T09:06:02,830 creating build/bdist.linux-armv7l/wheel/evalscope/collections 2026-03-25T09:06:02,831 copying build/lib/evalscope/collections/schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-03-25T09:06:02,833 copying build/lib/evalscope/collections/sampler.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-03-25T09:06:02,835 copying build/lib/evalscope/collections/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-03-25T09:06:02,837 copying build/lib/evalscope/run.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-03-25T09:06:02,839 copying build/lib/evalscope/config.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-03-25T09:06:02,842 creating build/bdist.linux-armv7l/wheel/evalscope/app 2026-03-25T09:06:02,844 creating build/bdist.linux-armv7l/wheel/evalscope/app/utils 2026-03-25T09:06:02,845 copying build/lib/evalscope/app/utils/text_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-03-25T09:06:02,847 copying build/lib/evalscope/app/utils/env_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-03-25T09:06:02,849 copying build/lib/evalscope/app/utils/localization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-03-25T09:06:02,851 copying build/lib/evalscope/app/utils/data_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-03-25T09:06:02,854 copying build/lib/evalscope/app/utils/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-03-25T09:06:02,856 copying build/lib/evalscope/app/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-03-25T09:06:02,858 creating build/bdist.linux-armv7l/wheel/evalscope/app/ui 2026-03-25T09:06:02,859 copying build/lib/evalscope/app/ui/app_ui.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-03-25T09:06:02,862 copying build/lib/evalscope/app/ui/sidebar.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-03-25T09:06:02,864 copying build/lib/evalscope/app/ui/single_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-03-25T09:06:02,866 copying build/lib/evalscope/app/ui/multi_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-03-25T09:06:02,869 copying build/lib/evalscope/app/ui/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-03-25T09:06:02,872 copying build/lib/evalscope/app/ui/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-03-25T09:06:02,874 copying build/lib/evalscope/app/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-03-25T09:06:02,876 copying build/lib/evalscope/app/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-03-25T09:06:02,877 copying build/lib/evalscope/app/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-03-25T09:06:02,880 creating build/bdist.linux-armv7l/wheel/evalscope/report 2026-03-25T09:06:02,881 copying build/lib/evalscope/report/generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-03-25T09:06:02,884 creating build/bdist.linux-armv7l/wheel/evalscope/report/template 2026-03-25T09:06:02,885 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/partials 2026-03-25T09:06:02,886 copying build/lib/evalscope/report/template/partials/toc_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-03-25T09:06:02,888 copying build/lib/evalscope/report/template/partials/footer.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-03-25T09:06:02,890 copying build/lib/evalscope/report/template/partials/header_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-03-25T09:06:02,892 copying build/lib/evalscope/report/template/partials/header_perf.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-03-25T09:06:02,894 copying build/lib/evalscope/report/template/partials/toc_eval.html -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/partials 2026-03-25T09:06:02,897 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/js 2026-03-25T09:06:02,898 copying build/lib/evalscope/report/template/js/shared.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-03-25T09:06:02,901 copying build/lib/evalscope/report/template/js/i18n_perf.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-03-25T09:06:02,904 copying build/lib/evalscope/report/template/js/perf_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-03-25T09:06:02,906 copying build/lib/evalscope/report/template/js/eval_extra.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-03-25T09:06:02,909 copying build/lib/evalscope/report/template/js/i18n_eval.js -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/js 2026-03-25T09:06:02,912 copying build/lib/evalscope/report/template/perf_report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-03-25T09:06:02,915 copying build/lib/evalscope/report/template/report.html.j2 -> build/bdist.linux-armv7l/wheel/./evalscope/report/template 2026-03-25T09:06:02,918 creating build/bdist.linux-armv7l/wheel/evalscope/report/template/css 2026-03-25T09:06:02,920 copying build/lib/evalscope/report/template/css/base.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-03-25T09:06:02,923 copying build/lib/evalscope/report/template/css/perf_extra.css -> build/bdist.linux-armv7l/wheel/./evalscope/report/template/css 2026-03-25T09:06:02,926 copying build/lib/evalscope/report/combinator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-03-25T09:06:02,929 copying build/lib/evalscope/report/renderer.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-03-25T09:06:02,932 copying build/lib/evalscope/report/report.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-03-25T09:06:02,935 copying build/lib/evalscope/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-03-25T09:06:02,938 creating build/bdist.linux-armv7l/wheel/evalscope/cli 2026-03-25T09:06:02,940 copying build/lib/evalscope/cli/cli.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,942 copying build/lib/evalscope/cli/start_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,944 copying build/lib/evalscope/cli/start_app.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,947 copying build/lib/evalscope/cli/benchmark_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,950 copying build/lib/evalscope/cli/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,952 copying build/lib/evalscope/cli/start_perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,954 copying build/lib/evalscope/cli/start_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,957 copying build/lib/evalscope/cli/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-03-25T09:06:02,959 creating build/bdist.linux-armv7l/wheel/evalscope/summarizer 2026-03-25T09:06:02,961 copying build/lib/evalscope/summarizer/summarizer.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-03-25T09:06:02,964 copying build/lib/evalscope/summarizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-03-25T09:06:02,966 copying build/lib/evalscope/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-03-25T09:06:02,969 copying build/lib/evalscope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-03-25T09:06:02,972 creating build/bdist.linux-armv7l/wheel/evalscope/api 2026-03-25T09:06:02,975 creating build/bdist.linux-armv7l/wheel/evalscope/api/evaluator 2026-03-25T09:06:02,976 copying build/lib/evalscope/api/evaluator/state.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-03-25T09:06:02,979 copying build/lib/evalscope/api/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-03-25T09:06:02,982 copying build/lib/evalscope/api/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-03-25T09:06:02,984 copying build/lib/evalscope/api/evaluator/cache.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-03-25T09:06:02,988 creating build/bdist.linux-armv7l/wheel/evalscope/api/model 2026-03-25T09:06:02,990 copying build/lib/evalscope/api/model/lazy_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-03-25T09:06:02,993 copying build/lib/evalscope/api/model/generate_config.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-03-25T09:06:02,995 copying build/lib/evalscope/api/model/model_output.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-03-25T09:06:02,998 copying build/lib/evalscope/api/model/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-03-25T09:06:03,000 copying build/lib/evalscope/api/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-03-25T09:06:03,002 creating build/bdist.linux-armv7l/wheel/evalscope/api/metric 2026-03-25T09:06:03,004 copying build/lib/evalscope/api/metric/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-03-25T09:06:03,006 copying build/lib/evalscope/api/metric/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-03-25T09:06:03,008 copying build/lib/evalscope/api/metric/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-03-25T09:06:03,010 creating build/bdist.linux-armv7l/wheel/evalscope/api/messages 2026-03-25T09:06:03,011 copying build/lib/evalscope/api/messages/content.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-03-25T09:06:03,013 copying build/lib/evalscope/api/messages/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-03-25T09:06:03,015 copying build/lib/evalscope/api/messages/chat_message.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-03-25T09:06:03,018 copying build/lib/evalscope/api/messages/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-03-25T09:06:03,020 creating build/bdist.linux-armv7l/wheel/evalscope/api/tool 2026-03-25T09:06:03,021 copying build/lib/evalscope/api/tool/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-03-25T09:06:03,024 copying build/lib/evalscope/api/tool/tool_call.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-03-25T09:06:03,025 copying build/lib/evalscope/api/tool/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-03-25T09:06:03,027 copying build/lib/evalscope/api/tool/tool_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-03-25T09:06:03,030 creating build/bdist.linux-armv7l/wheel/evalscope/api/mixin 2026-03-25T09:06:03,031 copying build/lib/evalscope/api/mixin/sandbox_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-03-25T09:06:03,033 copying build/lib/evalscope/api/mixin/llm_judge_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-03-25T09:06:03,035 copying build/lib/evalscope/api/mixin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-03-25T09:06:03,038 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark 2026-03-25T09:06:03,039 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark/adapters 2026-03-25T09:06:03,040 copying build/lib/evalscope/api/benchmark/adapters/agent_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,042 copying build/lib/evalscope/api/benchmark/adapters/ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,045 copying build/lib/evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,047 copying build/lib/evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,048 copying build/lib/evalscope/api/benchmark/adapters/text2image_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,050 copying build/lib/evalscope/api/benchmark/adapters/default_data_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,054 copying build/lib/evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,056 copying build/lib/evalscope/api/benchmark/adapters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-03-25T09:06:03,057 copying build/lib/evalscope/api/benchmark/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-03-25T09:06:03,060 copying build/lib/evalscope/api/benchmark/meta.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-03-25T09:06:03,062 copying build/lib/evalscope/api/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-03-25T09:06:03,064 copying build/lib/evalscope/api/benchmark/statistics.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-03-25T09:06:03,066 copying build/lib/evalscope/api/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-03-25T09:06:03,069 creating build/bdist.linux-armv7l/wheel/evalscope/api/filter 2026-03-25T09:06:03,070 copying build/lib/evalscope/api/filter/filter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-03-25T09:06:03,072 copying build/lib/evalscope/api/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-03-25T09:06:03,074 copying build/lib/evalscope/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-03-25T09:06:03,076 creating build/bdist.linux-armv7l/wheel/evalscope/api/dataset 2026-03-25T09:06:03,077 copying build/lib/evalscope/api/dataset/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-03-25T09:06:03,080 copying build/lib/evalscope/api/dataset/loader.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-03-25T09:06:03,083 copying build/lib/evalscope/api/dataset/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-03-25T09:06:03,085 copying build/lib/evalscope/api/dataset/dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-03-25T09:06:03,087 creating build/bdist.linux-armv7l/wheel/evalscope/sandbox 2026-03-25T09:06:03,088 copying build/lib/evalscope/sandbox/volcengine.py -> build/bdist.linux-armv7l/wheel/./evalscope/sandbox 2026-03-25T09:06:03,091 copying build/lib/evalscope/sandbox/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/sandbox 2026-03-25T09:06:03,093 running install_egg_info 2026-03-25T09:06:03,097 Copying evalscope.egg-info to build/bdist.linux-armv7l/wheel/./evalscope-1.5.1-py3.11.egg-info 2026-03-25T09:06:03,111 running install_scripts 2026-03-25T09:06:03,124 creating build/bdist.linux-armv7l/wheel/evalscope-1.5.1.dist-info/WHEEL 2026-03-25T09:06:03,127 creating '/tmp/pip-wheel-1vuoy08l/.tmp-36yvb0na/evalscope-1.5.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-03-25T09:06:03,129 adding 'evalscope/__init__.py' 2026-03-25T09:06:03,131 adding 'evalscope/arguments.py' 2026-03-25T09:06:03,134 adding 'evalscope/config.py' 2026-03-25T09:06:03,135 adding 'evalscope/constants.py' 2026-03-25T09:06:03,137 adding 'evalscope/run.py' 2026-03-25T09:06:03,139 adding 'evalscope/version.py' 2026-03-25T09:06:03,140 adding 'evalscope/api/__init__.py' 2026-03-25T09:06:03,142 adding 'evalscope/api/registry.py' 2026-03-25T09:06:03,144 adding 'evalscope/api/benchmark/__init__.py' 2026-03-25T09:06:03,146 adding 'evalscope/api/benchmark/benchmark.py' 2026-03-25T09:06:03,148 adding 'evalscope/api/benchmark/meta.py' 2026-03-25T09:06:03,150 adding 'evalscope/api/benchmark/statistics.py' 2026-03-25T09:06:03,152 adding 'evalscope/api/benchmark/adapters/__init__.py' 2026-03-25T09:06:03,154 adding 'evalscope/api/benchmark/adapters/agent_adapter.py' 2026-03-25T09:06:03,158 adding 'evalscope/api/benchmark/adapters/default_data_adapter.py' 2026-03-25T09:06:03,159 adding 'evalscope/api/benchmark/adapters/image_edit_adapter.py' 2026-03-25T09:06:03,160 adding 'evalscope/api/benchmark/adapters/multi_choice_adapter.py' 2026-03-25T09:06:03,162 adding 'evalscope/api/benchmark/adapters/ner_adapter.py' 2026-03-25T09:06:03,164 adding 'evalscope/api/benchmark/adapters/text2image_adapter.py' 2026-03-25T09:06:03,165 adding 'evalscope/api/benchmark/adapters/vision_language_adapter.py' 2026-03-25T09:06:03,167 adding 'evalscope/api/dataset/__init__.py' 2026-03-25T09:06:03,169 adding 'evalscope/api/dataset/dataset.py' 2026-03-25T09:06:03,171 adding 'evalscope/api/dataset/loader.py' 2026-03-25T09:06:03,172 adding 'evalscope/api/dataset/utils.py' 2026-03-25T09:06:03,174 adding 'evalscope/api/evaluator/__init__.py' 2026-03-25T09:06:03,176 adding 'evalscope/api/evaluator/cache.py' 2026-03-25T09:06:03,177 adding 'evalscope/api/evaluator/evaluator.py' 2026-03-25T09:06:03,179 adding 'evalscope/api/evaluator/state.py' 2026-03-25T09:06:03,181 adding 'evalscope/api/filter/__init__.py' 2026-03-25T09:06:03,183 adding 'evalscope/api/filter/filter.py' 2026-03-25T09:06:03,184 adding 'evalscope/api/messages/__init__.py' 2026-03-25T09:06:03,186 adding 'evalscope/api/messages/chat_message.py' 2026-03-25T09:06:03,188 adding 'evalscope/api/messages/content.py' 2026-03-25T09:06:03,189 adding 'evalscope/api/messages/utils.py' 2026-03-25T09:06:03,191 adding 'evalscope/api/metric/__init__.py' 2026-03-25T09:06:03,192 adding 'evalscope/api/metric/metric.py' 2026-03-25T09:06:03,193 adding 'evalscope/api/metric/scorer.py' 2026-03-25T09:06:03,195 adding 'evalscope/api/mixin/__init__.py' 2026-03-25T09:06:03,197 adding 'evalscope/api/mixin/llm_judge_mixin.py' 2026-03-25T09:06:03,199 adding 'evalscope/api/mixin/sandbox_mixin.py' 2026-03-25T09:06:03,201 adding 'evalscope/api/model/__init__.py' 2026-03-25T09:06:03,203 adding 'evalscope/api/model/generate_config.py' 2026-03-25T09:06:03,204 adding 'evalscope/api/model/lazy_model.py' 2026-03-25T09:06:03,206 adding 'evalscope/api/model/model.py' 2026-03-25T09:06:03,208 adding 'evalscope/api/model/model_output.py' 2026-03-25T09:06:03,210 adding 'evalscope/api/tool/__init__.py' 2026-03-25T09:06:03,211 adding 'evalscope/api/tool/tool_call.py' 2026-03-25T09:06:03,212 adding 'evalscope/api/tool/tool_info.py' 2026-03-25T09:06:03,214 adding 'evalscope/api/tool/utils.py' 2026-03-25T09:06:03,215 adding 'evalscope/app/__init__.py' 2026-03-25T09:06:03,216 adding 'evalscope/app/app.py' 2026-03-25T09:06:03,217 adding 'evalscope/app/arguments.py' 2026-03-25T09:06:03,218 adding 'evalscope/app/constants.py' 2026-03-25T09:06:03,220 adding 'evalscope/app/ui/__init__.py' 2026-03-25T09:06:03,221 adding 'evalscope/app/ui/app_ui.py' 2026-03-25T09:06:03,223 adding 'evalscope/app/ui/multi_model.py' 2026-03-25T09:06:03,225 adding 'evalscope/app/ui/sidebar.py' 2026-03-25T09:06:03,226 adding 'evalscope/app/ui/single_model.py' 2026-03-25T09:06:03,228 adding 'evalscope/app/ui/visualization.py' 2026-03-25T09:06:03,230 adding 'evalscope/app/utils/data_utils.py' 2026-03-25T09:06:03,231 adding 'evalscope/app/utils/env_utils.py' 2026-03-25T09:06:03,232 adding 'evalscope/app/utils/localization.py' 2026-03-25T09:06:03,234 adding 'evalscope/app/utils/text_utils.py' 2026-03-25T09:06:03,235 adding 'evalscope/app/utils/visualization.py' 2026-03-25T09:06:03,237 adding 'evalscope/backend/__init__.py' 2026-03-25T09:06:03,238 adding 'evalscope/backend/base.py' 2026-03-25T09:06:03,240 adding 'evalscope/backend/opencompass/__init__.py' 2026-03-25T09:06:03,241 adding 'evalscope/backend/opencompass/api_meta_template.py' 2026-03-25T09:06:03,243 adding 'evalscope/backend/opencompass/backend_manager.py' 2026-03-25T09:06:03,245 adding 'evalscope/backend/opencompass/tasks/__init__.py' 2026-03-25T09:06:03,246 adding 'evalscope/backend/opencompass/tasks/eval_api.py' 2026-03-25T09:06:03,247 adding 'evalscope/backend/opencompass/tasks/eval_datasets.py' 2026-03-25T09:06:03,249 adding 'evalscope/backend/rag_eval/__init__.py' 2026-03-25T09:06:03,250 adding 'evalscope/backend/rag_eval/backend_manager.py' 2026-03-25T09:06:03,252 adding 'evalscope/backend/rag_eval/clip_benchmark/__init__.py' 2026-03-25T09:06:03,254 adding 'evalscope/backend/rag_eval/clip_benchmark/arguments.py' 2026-03-25T09:06:03,256 adding 'evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py' 2026-03-25T09:06:03,257 adding 'evalscope/backend/rag_eval/clip_benchmark/task_template.py' 2026-03-25T09:06:03,259 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py' 2026-03-25T09:06:03,260 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py' 2026-03-25T09:06:03,262 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py' 2026-03-25T09:06:03,263 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py' 2026-03-25T09:06:03,266 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py' 2026-03-25T09:06:03,267 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt' 2026-03-25T09:06:03,269 adding 'evalscope/backend/rag_eval/cmteb/__init__.py' 2026-03-25T09:06:03,270 adding 'evalscope/backend/rag_eval/cmteb/arguments.py' 2026-03-25T09:06:03,271 adding 'evalscope/backend/rag_eval/cmteb/base.py' 2026-03-25T09:06:03,273 adding 'evalscope/backend/rag_eval/cmteb/task_template.py' 2026-03-25T09:06:03,275 adding 'evalscope/backend/rag_eval/cmteb/tasks/Classification.py' 2026-03-25T09:06:03,277 adding 'evalscope/backend/rag_eval/cmteb/tasks/Clustering.py' 2026-03-25T09:06:03,278 adding 'evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py' 2026-03-25T09:06:03,279 adding 'evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py' 2026-03-25T09:06:03,281 adding 'evalscope/backend/rag_eval/cmteb/tasks/Reranking.py' 2026-03-25T09:06:03,282 adding 'evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py' 2026-03-25T09:06:03,284 adding 'evalscope/backend/rag_eval/cmteb/tasks/STS.py' 2026-03-25T09:06:03,285 adding 'evalscope/backend/rag_eval/cmteb/tasks/__init__.py' 2026-03-25T09:06:03,287 adding 'evalscope/backend/rag_eval/ragas/__init__.py' 2026-03-25T09:06:03,288 adding 'evalscope/backend/rag_eval/ragas/arguments.py' 2026-03-25T09:06:03,289 adding 'evalscope/backend/rag_eval/ragas/task_template.py' 2026-03-25T09:06:03,291 adding 'evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py' 2026-03-25T09:06:03,293 adding 'evalscope/backend/rag_eval/ragas/tasks/__init__.py' 2026-03-25T09:06:03,294 adding 'evalscope/backend/rag_eval/ragas/tasks/build_distribution.py' 2026-03-25T09:06:03,295 adding 'evalscope/backend/rag_eval/ragas/tasks/build_transform.py' 2026-03-25T09:06:03,297 adding 'evalscope/backend/rag_eval/ragas/tasks/testset_generation.py' 2026-03-25T09:06:03,298 adding 'evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py' 2026-03-25T09:06:03,300 adding 'evalscope/backend/rag_eval/utils/__init__.py' 2026-03-25T09:06:03,301 adding 'evalscope/backend/rag_eval/utils/clip.py' 2026-03-25T09:06:03,303 adding 'evalscope/backend/rag_eval/utils/embedding.py' 2026-03-25T09:06:03,304 adding 'evalscope/backend/rag_eval/utils/llm.py' 2026-03-25T09:06:03,305 adding 'evalscope/backend/rag_eval/utils/tools.py' 2026-03-25T09:06:03,307 adding 'evalscope/backend/vlm_eval_kit/__init__.py' 2026-03-25T09:06:03,308 adding 'evalscope/backend/vlm_eval_kit/backend_manager.py' 2026-03-25T09:06:03,312 adding 'evalscope/benchmarks/__init__.py' 2026-03-25T09:06:03,318 adding 'evalscope/benchmarks/_meta/a_okvqa.json' 2026-03-25T09:06:03,320 adding 'evalscope/benchmarks/_meta/aa_lcr.json' 2026-03-25T09:06:03,321 adding 'evalscope/benchmarks/_meta/ai2d.json' 2026-03-25T09:06:03,323 adding 'evalscope/benchmarks/_meta/aime24.json' 2026-03-25T09:06:03,325 adding 'evalscope/benchmarks/_meta/aime25.json' 2026-03-25T09:06:03,327 adding 'evalscope/benchmarks/_meta/aime26.json' 2026-03-25T09:06:03,328 adding 'evalscope/benchmarks/_meta/alpaca_eval.json' 2026-03-25T09:06:03,330 adding 'evalscope/benchmarks/_meta/amc.json' 2026-03-25T09:06:03,332 adding 'evalscope/benchmarks/_meta/anat_em.json' 2026-03-25T09:06:03,334 adding 'evalscope/benchmarks/_meta/arc.json' 2026-03-25T09:06:03,336 adding 'evalscope/benchmarks/_meta/arena_hard.json' 2026-03-25T09:06:03,338 adding 'evalscope/benchmarks/_meta/bbh.json' 2026-03-25T09:06:03,340 adding 'evalscope/benchmarks/_meta/bc2gm.json' 2026-03-25T09:06:03,343 adding 'evalscope/benchmarks/_meta/bc4chemd.json' 2026-03-25T09:06:03,345 adding 'evalscope/benchmarks/_meta/bc5cdr.json' 2026-03-25T09:06:03,347 adding 'evalscope/benchmarks/_meta/bfcl_v3.json' 2026-03-25T09:06:03,350 adding 'evalscope/benchmarks/_meta/bfcl_v4.json' 2026-03-25T09:06:03,352 adding 'evalscope/benchmarks/_meta/biomix_qa.json' 2026-03-25T09:06:03,354 adding 'evalscope/benchmarks/_meta/blink.json' 2026-03-25T09:06:03,356 adding 'evalscope/benchmarks/_meta/broad_twitter_corpus.json' 2026-03-25T09:06:03,358 adding 'evalscope/benchmarks/_meta/cc_bench.json' 2026-03-25T09:06:03,361 adding 'evalscope/benchmarks/_meta/ceval.json' 2026-03-25T09:06:03,363 adding 'evalscope/benchmarks/_meta/chartqa.json' 2026-03-25T09:06:03,365 adding 'evalscope/benchmarks/_meta/chinese_simpleqa.json' 2026-03-25T09:06:03,368 adding 'evalscope/benchmarks/_meta/cl_bench.json' 2026-03-25T09:06:03,371 adding 'evalscope/benchmarks/_meta/cmmlu.json' 2026-03-25T09:06:03,375 adding 'evalscope/benchmarks/_meta/cmmmu.json' 2026-03-25T09:06:03,378 adding 'evalscope/benchmarks/_meta/cmmu.json' 2026-03-25T09:06:03,380 adding 'evalscope/benchmarks/_meta/coin_flip.json' 2026-03-25T09:06:03,381 adding 'evalscope/benchmarks/_meta/commonsense_qa.json' 2026-03-25T09:06:03,383 adding 'evalscope/benchmarks/_meta/competition_math.json' 2026-03-25T09:06:03,386 adding 'evalscope/benchmarks/_meta/conll2003.json' 2026-03-25T09:06:03,388 adding 'evalscope/benchmarks/_meta/conllpp.json' 2026-03-25T09:06:03,391 adding 'evalscope/benchmarks/_meta/copious.json' 2026-03-25T09:06:03,394 adding 'evalscope/benchmarks/_meta/cross_ner.json' 2026-03-25T09:06:03,396 adding 'evalscope/benchmarks/_meta/data_collection.json' 2026-03-25T09:06:03,397 adding 'evalscope/benchmarks/_meta/docmath.json' 2026-03-25T09:06:03,399 adding 'evalscope/benchmarks/_meta/docvqa.json' 2026-03-25T09:06:03,401 adding 'evalscope/benchmarks/_meta/drivel_binary.json' 2026-03-25T09:06:03,403 adding 'evalscope/benchmarks/_meta/drivel_multilabel.json' 2026-03-25T09:06:03,405 adding 'evalscope/benchmarks/_meta/drivel_selection.json' 2026-03-25T09:06:03,407 adding 'evalscope/benchmarks/_meta/drivel_writing.json' 2026-03-25T09:06:03,409 adding 'evalscope/benchmarks/_meta/drop.json' 2026-03-25T09:06:03,411 adding 'evalscope/benchmarks/_meta/eq_bench.json' 2026-03-25T09:06:03,413 adding 'evalscope/benchmarks/_meta/evalmuse.json' 2026-03-25T09:06:03,415 adding 'evalscope/benchmarks/_meta/fin_ner.json' 2026-03-25T09:06:03,417 adding 'evalscope/benchmarks/_meta/fleurs.json' 2026-03-25T09:06:03,419 adding 'evalscope/benchmarks/_meta/frames.json' 2026-03-25T09:06:03,421 adding 'evalscope/benchmarks/_meta/gedit.json' 2026-03-25T09:06:03,423 adding 'evalscope/benchmarks/_meta/genai_bench.json' 2026-03-25T09:06:03,425 adding 'evalscope/benchmarks/_meta/general_arena.json' 2026-03-25T09:06:03,429 adding 'evalscope/benchmarks/_meta/general_fc.json' 2026-03-25T09:06:03,430 adding 'evalscope/benchmarks/_meta/general_mcq.json' 2026-03-25T09:06:03,432 adding 'evalscope/benchmarks/_meta/general_qa.json' 2026-03-25T09:06:03,433 adding 'evalscope/benchmarks/_meta/general_t2i.json' 2026-03-25T09:06:03,435 adding 'evalscope/benchmarks/_meta/general_vmcq.json' 2026-03-25T09:06:03,436 adding 'evalscope/benchmarks/_meta/general_vqa.json' 2026-03-25T09:06:03,439 adding 'evalscope/benchmarks/_meta/genia_ner.json' 2026-03-25T09:06:03,440 adding 'evalscope/benchmarks/_meta/gpqa_diamond.json' 2026-03-25T09:06:03,442 adding 'evalscope/benchmarks/_meta/gsm8k.json' 2026-03-25T09:06:03,444 adding 'evalscope/benchmarks/_meta/gsm8k_v.json' 2026-03-25T09:06:03,446 adding 'evalscope/benchmarks/_meta/hallusion_bench.json' 2026-03-25T09:06:03,448 adding 'evalscope/benchmarks/_meta/halueval.json' 2026-03-25T09:06:03,450 adding 'evalscope/benchmarks/_meta/harvey_ner.json' 2026-03-25T09:06:03,453 adding 'evalscope/benchmarks/_meta/health_bench.json' 2026-03-25T09:06:03,454 adding 'evalscope/benchmarks/_meta/hellaswag.json' 2026-03-25T09:06:03,457 adding 'evalscope/benchmarks/_meta/hle.json' 2026-03-25T09:06:03,459 adding 'evalscope/benchmarks/_meta/hmmt25.json' 2026-03-25T09:06:03,461 adding 'evalscope/benchmarks/_meta/hpdv2.json' 2026-03-25T09:06:03,463 adding 'evalscope/benchmarks/_meta/humaneval.json' 2026-03-25T09:06:03,465 adding 'evalscope/benchmarks/_meta/humaneval_plus.json' 2026-03-25T09:06:03,467 adding 'evalscope/benchmarks/_meta/ifbench.json' 2026-03-25T09:06:03,469 adding 'evalscope/benchmarks/_meta/ifeval.json' 2026-03-25T09:06:03,471 adding 'evalscope/benchmarks/_meta/infovqa.json' 2026-03-25T09:06:03,473 adding 'evalscope/benchmarks/_meta/iquiz.json' 2026-03-25T09:06:03,475 adding 'evalscope/benchmarks/_meta/jnlpba.json' 2026-03-25T09:06:03,477 adding 'evalscope/benchmarks/_meta/jnlpba_rare.json' 2026-03-25T09:06:03,479 adding 'evalscope/benchmarks/_meta/librispeech.json' 2026-03-25T09:06:03,481 adding 'evalscope/benchmarks/_meta/live_code_bench.json' 2026-03-25T09:06:03,483 adding 'evalscope/benchmarks/_meta/logi_qa.json' 2026-03-25T09:06:03,485 adding 'evalscope/benchmarks/_meta/longbench_v2.json' 2026-03-25T09:06:03,487 adding 'evalscope/benchmarks/_meta/maritime_bench.json' 2026-03-25T09:06:03,489 adding 'evalscope/benchmarks/_meta/math_500.json' 2026-03-25T09:06:03,490 adding 'evalscope/benchmarks/_meta/math_qa.json' 2026-03-25T09:06:03,493 adding 'evalscope/benchmarks/_meta/math_verse.json' 2026-03-25T09:06:03,495 adding 'evalscope/benchmarks/_meta/math_vision.json' 2026-03-25T09:06:03,497 adding 'evalscope/benchmarks/_meta/math_vista.json' 2026-03-25T09:06:03,499 adding 'evalscope/benchmarks/_meta/mbpp.json' 2026-03-25T09:06:03,501 adding 'evalscope/benchmarks/_meta/mbpp_plus.json' 2026-03-25T09:06:03,503 adding 'evalscope/benchmarks/_meta/med_mcqa.json' 2026-03-25T09:06:03,505 adding 'evalscope/benchmarks/_meta/mgsm.json' 2026-03-25T09:06:03,507 adding 'evalscope/benchmarks/_meta/micro_vqa.json' 2026-03-25T09:06:03,509 adding 'evalscope/benchmarks/_meta/minerva_math.json' 2026-03-25T09:06:03,511 adding 'evalscope/benchmarks/_meta/mit_movie_trivia.json' 2026-03-25T09:06:03,513 adding 'evalscope/benchmarks/_meta/mit_restaurant.json' 2026-03-25T09:06:03,515 adding 'evalscope/benchmarks/_meta/mm_bench.json' 2026-03-25T09:06:03,517 adding 'evalscope/benchmarks/_meta/mm_star.json' 2026-03-25T09:06:03,521 adding 'evalscope/benchmarks/_meta/mmlu.json' 2026-03-25T09:06:03,523 adding 'evalscope/benchmarks/_meta/mmlu_pro.json' 2026-03-25T09:06:03,526 adding 'evalscope/benchmarks/_meta/mmlu_redux.json' 2026-03-25T09:06:03,528 adding 'evalscope/benchmarks/_meta/mmmlu.json' 2026-03-25T09:06:03,531 adding 'evalscope/benchmarks/_meta/mmmu.json' 2026-03-25T09:06:03,535 adding 'evalscope/benchmarks/_meta/mmmu_pro.json' 2026-03-25T09:06:03,537 adding 'evalscope/benchmarks/_meta/mri_mcqa.json' 2026-03-25T09:06:03,540 adding 'evalscope/benchmarks/_meta/multi_if.json' 2026-03-25T09:06:03,542 adding 'evalscope/benchmarks/_meta/multi_nerd.json' 2026-03-25T09:06:03,544 adding 'evalscope/benchmarks/_meta/multiple_humaneval.json' 2026-03-25T09:06:03,546 adding 'evalscope/benchmarks/_meta/multiple_mbpp.json' 2026-03-25T09:06:03,548 adding 'evalscope/benchmarks/_meta/music_trivia.json' 2026-03-25T09:06:03,550 adding 'evalscope/benchmarks/_meta/musr.json' 2026-03-25T09:06:03,552 adding 'evalscope/benchmarks/_meta/ncbi.json' 2026-03-25T09:06:03,554 adding 'evalscope/benchmarks/_meta/needle_haystack.json' 2026-03-25T09:06:03,556 adding 'evalscope/benchmarks/_meta/ocr_bench.json' 2026-03-25T09:06:03,560 adding 'evalscope/benchmarks/_meta/ocr_bench_v2.json' 2026-03-25T09:06:03,563 adding 'evalscope/benchmarks/_meta/olympiad_bench.json' 2026-03-25T09:06:03,565 adding 'evalscope/benchmarks/_meta/omni_bench.json' 2026-03-25T09:06:03,569 adding 'evalscope/benchmarks/_meta/omni_doc_bench.json' 2026-03-25T09:06:03,571 adding 'evalscope/benchmarks/_meta/ontonotes5.json' 2026-03-25T09:06:03,574 adding 'evalscope/benchmarks/_meta/openai_mrcr.json' 2026-03-25T09:06:03,576 adding 'evalscope/benchmarks/_meta/piqa.json' 2026-03-25T09:06:03,579 adding 'evalscope/benchmarks/_meta/poly_math.json' 2026-03-25T09:06:03,581 adding 'evalscope/benchmarks/_meta/pope.json' 2026-03-25T09:06:03,583 adding 'evalscope/benchmarks/_meta/process_bench.json' 2026-03-25T09:06:03,585 adding 'evalscope/benchmarks/_meta/pubmedqa.json' 2026-03-25T09:06:03,586 adding 'evalscope/benchmarks/_meta/qasc.json' 2026-03-25T09:06:03,588 adding 'evalscope/benchmarks/_meta/race.json' 2026-03-25T09:06:03,590 adding 'evalscope/benchmarks/_meta/real_world_qa.json' 2026-03-25T09:06:03,593 adding 'evalscope/benchmarks/_meta/refcoco.json' 2026-03-25T09:06:03,599 adding 'evalscope/benchmarks/_meta/scicode.json' 2026-03-25T09:06:03,601 adding 'evalscope/benchmarks/_meta/science_qa.json' 2026-03-25T09:06:03,603 adding 'evalscope/benchmarks/_meta/sciq.json' 2026-03-25T09:06:03,605 adding 'evalscope/benchmarks/_meta/seed_bench_2_plus.json' 2026-03-25T09:06:03,607 adding 'evalscope/benchmarks/_meta/simple_qa.json' 2026-03-25T09:06:03,609 adding 'evalscope/benchmarks/_meta/simple_vqa.json' 2026-03-25T09:06:03,611 adding 'evalscope/benchmarks/_meta/siqa.json' 2026-03-25T09:06:03,614 adding 'evalscope/benchmarks/_meta/super_gpqa.json' 2026-03-25T09:06:03,616 adding 'evalscope/benchmarks/_meta/swe_bench_lite.json' 2026-03-25T09:06:03,618 adding 'evalscope/benchmarks/_meta/swe_bench_verified.json' 2026-03-25T09:06:03,619 adding 'evalscope/benchmarks/_meta/swe_bench_verified_mini.json' 2026-03-25T09:06:03,622 adding 'evalscope/benchmarks/_meta/tau2_bench.json' 2026-03-25T09:06:03,624 adding 'evalscope/benchmarks/_meta/tau_bench.json' 2026-03-25T09:06:03,625 adding 'evalscope/benchmarks/_meta/terminal_bench_v2.json' 2026-03-25T09:06:03,627 adding 'evalscope/benchmarks/_meta/tifa160.json' 2026-03-25T09:06:03,629 adding 'evalscope/benchmarks/_meta/tool_bench.json' 2026-03-25T09:06:03,631 adding 'evalscope/benchmarks/_meta/torgo.json' 2026-03-25T09:06:03,633 adding 'evalscope/benchmarks/_meta/trivia_qa.json' 2026-03-25T09:06:03,635 adding 'evalscope/benchmarks/_meta/truthful_qa.json' 2026-03-25T09:06:03,637 adding 'evalscope/benchmarks/_meta/tweebank_ner.json' 2026-03-25T09:06:03,639 adding 'evalscope/benchmarks/_meta/tweet_ner_7.json' 2026-03-25T09:06:03,641 adding 'evalscope/benchmarks/_meta/visulogic.json' 2026-03-25T09:06:03,643 adding 'evalscope/benchmarks/_meta/vstar_bench.json' 2026-03-25T09:06:03,645 adding 'evalscope/benchmarks/_meta/winogrande.json' 2026-03-25T09:06:03,648 adding 'evalscope/benchmarks/_meta/wmt24pp.json' 2026-03-25T09:06:03,650 adding 'evalscope/benchmarks/_meta/wnut2017.json' 2026-03-25T09:06:03,652 adding 'evalscope/benchmarks/_meta/zebralogicbench.json' 2026-03-25T09:06:03,654 adding 'evalscope/benchmarks/_meta/zerobench.json' 2026-03-25T09:06:03,655 adding 'evalscope/benchmarks/a_okvqa/__init__.py' 2026-03-25T09:06:03,657 adding 'evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py' 2026-03-25T09:06:03,658 adding 'evalscope/benchmarks/aa_lcr/__init__.py' 2026-03-25T09:06:03,660 adding 'evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py' 2026-03-25T09:06:03,661 adding 'evalscope/benchmarks/ai2d/__init__.py' 2026-03-25T09:06:03,663 adding 'evalscope/benchmarks/ai2d/ai2d_adapter.py' 2026-03-25T09:06:03,664 adding 'evalscope/benchmarks/aime/__init__.py' 2026-03-25T09:06:03,666 adding 'evalscope/benchmarks/aime/aime_adapter.py' 2026-03-25T09:06:03,668 adding 'evalscope/benchmarks/aime/grader.py' 2026-03-25T09:06:03,670 adding 'evalscope/benchmarks/aime/math_normalize.py' 2026-03-25T09:06:03,673 adding 'evalscope/benchmarks/alpaca_eval/__init__.py' 2026-03-25T09:06:03,675 adding 'evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py' 2026-03-25T09:06:03,676 adding 'evalscope/benchmarks/amc/__init__.py' 2026-03-25T09:06:03,678 adding 'evalscope/benchmarks/amc/amc_adapter.py' 2026-03-25T09:06:03,679 adding 'evalscope/benchmarks/arc/__init__.py' 2026-03-25T09:06:03,681 adding 'evalscope/benchmarks/arc/arc_adapter.py' 2026-03-25T09:06:03,682 adding 'evalscope/benchmarks/arena_hard/__init__.py' 2026-03-25T09:06:03,684 adding 'evalscope/benchmarks/arena_hard/arena_hard_adapter.py' 2026-03-25T09:06:03,686 adding 'evalscope/benchmarks/arena_hard/utils.py' 2026-03-25T09:06:03,687 adding 'evalscope/benchmarks/bbh/__init__.py' 2026-03-25T09:06:03,689 adding 'evalscope/benchmarks/bbh/bbh_adapter.py' 2026-03-25T09:06:03,691 adding 'evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt' 2026-03-25T09:06:03,693 adding 'evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt' 2026-03-25T09:06:03,694 adding 'evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt' 2026-03-25T09:06:03,695 adding 'evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt' 2026-03-25T09:06:03,697 adding 'evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt' 2026-03-25T09:06:03,698 adding 'evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt' 2026-03-25T09:06:03,700 adding 'evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt' 2026-03-25T09:06:03,701 adding 'evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt' 2026-03-25T09:06:03,702 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt' 2026-03-25T09:06:03,704 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt' 2026-03-25T09:06:03,705 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt' 2026-03-25T09:06:03,707 adding 'evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt' 2026-03-25T09:06:03,708 adding 'evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt' 2026-03-25T09:06:03,709 adding 'evalscope/benchmarks/bbh/cot_prompts/navigate.txt' 2026-03-25T09:06:03,710 adding 'evalscope/benchmarks/bbh/cot_prompts/object_counting.txt' 2026-03-25T09:06:03,712 adding 'evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt' 2026-03-25T09:06:03,713 adding 'evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt' 2026-03-25T09:06:03,714 adding 'evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt' 2026-03-25T09:06:03,715 adding 'evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt' 2026-03-25T09:06:03,717 adding 'evalscope/benchmarks/bbh/cot_prompts/snarks.txt' 2026-03-25T09:06:03,718 adding 'evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt' 2026-03-25T09:06:03,719 adding 'evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt' 2026-03-25T09:06:03,720 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt' 2026-03-25T09:06:03,722 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt' 2026-03-25T09:06:03,723 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt' 2026-03-25T09:06:03,724 adding 'evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt' 2026-03-25T09:06:03,725 adding 'evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt' 2026-03-25T09:06:03,727 adding 'evalscope/benchmarks/bfcl/__init__.py' 2026-03-25T09:06:03,729 adding 'evalscope/benchmarks/bfcl/v3/__init__.py' 2026-03-25T09:06:03,731 adding 'evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py' 2026-03-25T09:06:03,733 adding 'evalscope/benchmarks/bfcl/v3/generation.py' 2026-03-25T09:06:03,734 adding 'evalscope/benchmarks/bfcl/v3/utils.py' 2026-03-25T09:06:03,735 adding 'evalscope/benchmarks/bfcl/v4/__init__.py' 2026-03-25T09:06:03,737 adding 'evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py' 2026-03-25T09:06:03,740 adding 'evalscope/benchmarks/bfcl/v4/utils.py' 2026-03-25T09:06:03,741 adding 'evalscope/benchmarks/biomix_qa/__init__.py' 2026-03-25T09:06:03,743 adding 'evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py' 2026-03-25T09:06:03,744 adding 'evalscope/benchmarks/blink/__init__.py' 2026-03-25T09:06:03,746 adding 'evalscope/benchmarks/blink/blink_adapter.py' 2026-03-25T09:06:03,747 adding 'evalscope/benchmarks/ceval/__init__.py' 2026-03-25T09:06:03,749 adding 'evalscope/benchmarks/ceval/ceval_adapter.py' 2026-03-25T09:06:03,751 adding 'evalscope/benchmarks/chartqa/__init__.py' 2026-03-25T09:06:03,753 adding 'evalscope/benchmarks/chartqa/chartqa_adapter.py' 2026-03-25T09:06:03,754 adding 'evalscope/benchmarks/chartqa/utils.py' 2026-03-25T09:06:03,755 adding 'evalscope/benchmarks/chinese_simple_qa/__init__.py' 2026-03-25T09:06:03,757 adding 'evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py' 2026-03-25T09:06:03,759 adding 'evalscope/benchmarks/cl_bench/__init__.py' 2026-03-25T09:06:03,761 adding 'evalscope/benchmarks/cl_bench/cl_bench_adapter.py' 2026-03-25T09:06:03,762 adding 'evalscope/benchmarks/cl_bench/utils.py' 2026-03-25T09:06:03,764 adding 'evalscope/benchmarks/cmmlu/__init__.py' 2026-03-25T09:06:03,765 adding 'evalscope/benchmarks/cmmlu/cmmlu_adapter.py' 2026-03-25T09:06:03,767 adding 'evalscope/benchmarks/cmmmu/__init__.py' 2026-03-25T09:06:03,769 adding 'evalscope/benchmarks/cmmmu/cmmmu_adapter.py' 2026-03-25T09:06:03,771 adding 'evalscope/benchmarks/cmmmu/utils.py' 2026-03-25T09:06:03,773 adding 'evalscope/benchmarks/cmmu/__init__.py' 2026-03-25T09:06:03,774 adding 'evalscope/benchmarks/cmmu/cmmu_adapter.py' 2026-03-25T09:06:03,776 adding 'evalscope/benchmarks/cmmu/prompt.py' 2026-03-25T09:06:03,777 adding 'evalscope/benchmarks/coin_flip/__init__.py' 2026-03-25T09:06:03,779 adding 'evalscope/benchmarks/coin_flip/coin_flip_adapter.py' 2026-03-25T09:06:03,781 adding 'evalscope/benchmarks/commonsense_qa/__init__.py' 2026-03-25T09:06:03,782 adding 'evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py' 2026-03-25T09:06:03,784 adding 'evalscope/benchmarks/competition_math/__init__.py' 2026-03-25T09:06:03,785 adding 'evalscope/benchmarks/competition_math/competition_math_adapter.py' 2026-03-25T09:06:03,787 adding 'evalscope/benchmarks/data_collection/__init__.py' 2026-03-25T09:06:03,789 adding 'evalscope/benchmarks/data_collection/data_collection_adapter.py' 2026-03-25T09:06:03,790 adding 'evalscope/benchmarks/docmath/__init__.py' 2026-03-25T09:06:03,792 adding 'evalscope/benchmarks/docmath/docmath_adapter.py' 2026-03-25T09:06:03,793 adding 'evalscope/benchmarks/docmath/utils.py' 2026-03-25T09:06:03,795 adding 'evalscope/benchmarks/docvqa/__init__.py' 2026-03-25T09:06:03,796 adding 'evalscope/benchmarks/docvqa/docvqa_adapter.py' 2026-03-25T09:06:03,798 adding 'evalscope/benchmarks/drivelology/__init__.py' 2026-03-25T09:06:03,800 adding 'evalscope/benchmarks/drivelology/drivelology_binary_adapter.py' 2026-03-25T09:06:03,802 adding 'evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py' 2026-03-25T09:06:03,803 adding 'evalscope/benchmarks/drivelology/drivelology_selection_adapter.py' 2026-03-25T09:06:03,805 adding 'evalscope/benchmarks/drivelology/drivelology_writing_adapter.py' 2026-03-25T09:06:03,807 adding 'evalscope/benchmarks/drop/__init__.py' 2026-03-25T09:06:03,809 adding 'evalscope/benchmarks/drop/drop_adapter.py' 2026-03-25T09:06:03,811 adding 'evalscope/benchmarks/drop/utils.py' 2026-03-25T09:06:03,812 adding 'evalscope/benchmarks/eq_bench/__init__.py' 2026-03-25T09:06:03,814 adding 'evalscope/benchmarks/eq_bench/answer_validation.py' 2026-03-25T09:06:03,816 adding 'evalscope/benchmarks/eq_bench/eq_bench_adapter.py' 2026-03-25T09:06:03,818 adding 'evalscope/benchmarks/fleurs/__init__.py' 2026-03-25T09:06:03,819 adding 'evalscope/benchmarks/fleurs/fleurs_adapter.py' 2026-03-25T09:06:03,821 adding 'evalscope/benchmarks/frames/__init__.py' 2026-03-25T09:06:03,823 adding 'evalscope/benchmarks/frames/frames_adapter.py' 2026-03-25T09:06:03,824 adding 'evalscope/benchmarks/frames/utils.py' 2026-03-25T09:06:03,825 adding 'evalscope/benchmarks/general_arena/__init__.py' 2026-03-25T09:06:03,829 adding 'evalscope/benchmarks/general_arena/general_arena_adapter.py' 2026-03-25T09:06:03,830 adding 'evalscope/benchmarks/general_arena/utils.py' 2026-03-25T09:06:03,832 adding 'evalscope/benchmarks/general_fc/__init__.py' 2026-03-25T09:06:03,834 adding 'evalscope/benchmarks/general_fc/general_fc_adapter.py' 2026-03-25T09:06:03,836 adding 'evalscope/benchmarks/general_mcq/__init__.py' 2026-03-25T09:06:03,837 adding 'evalscope/benchmarks/general_mcq/general_mcq_adapter.py' 2026-03-25T09:06:03,839 adding 'evalscope/benchmarks/general_qa/__init__.py' 2026-03-25T09:06:03,841 adding 'evalscope/benchmarks/general_qa/general_qa_adapter.py' 2026-03-25T09:06:03,842 adding 'evalscope/benchmarks/general_vmcq/__init__.py' 2026-03-25T09:06:03,844 adding 'evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py' 2026-03-25T09:06:03,846 adding 'evalscope/benchmarks/general_vqa/__init__.py' 2026-03-25T09:06:03,848 adding 'evalscope/benchmarks/general_vqa/general_vqa_adapter.py' 2026-03-25T09:06:03,850 adding 'evalscope/benchmarks/gpqa/__init__.py' 2026-03-25T09:06:03,851 adding 'evalscope/benchmarks/gpqa/gpqa_adapter.py' 2026-03-25T09:06:03,853 adding 'evalscope/benchmarks/gpqa/prompt.py' 2026-03-25T09:06:03,855 adding 'evalscope/benchmarks/gsm8k/__init__.py' 2026-03-25T09:06:03,856 adding 'evalscope/benchmarks/gsm8k/gsm8k_adapter.py' 2026-03-25T09:06:03,858 adding 'evalscope/benchmarks/gsm8k_v/__init__.py' 2026-03-25T09:06:03,859 adding 'evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py' 2026-03-25T09:06:03,861 adding 'evalscope/benchmarks/hallusion_bench/__init__.py' 2026-03-25T09:06:03,863 adding 'evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py' 2026-03-25T09:06:03,864 adding 'evalscope/benchmarks/halu_eval/__init__.py' 2026-03-25T09:06:03,866 adding 'evalscope/benchmarks/halu_eval/halu_eval_adapter.py' 2026-03-25T09:06:03,868 adding 'evalscope/benchmarks/halu_eval/halu_eval_instructions.py' 2026-03-25T09:06:03,870 adding 'evalscope/benchmarks/healthbench/__init__.py' 2026-03-25T09:06:03,872 adding 'evalscope/benchmarks/healthbench/healthbench_adapter.py' 2026-03-25T09:06:03,873 adding 'evalscope/benchmarks/healthbench/utils.py' 2026-03-25T09:06:03,875 adding 'evalscope/benchmarks/hellaswag/__init__.py' 2026-03-25T09:06:03,876 adding 'evalscope/benchmarks/hellaswag/hellaswag_adapter.py' 2026-03-25T09:06:03,877 adding 'evalscope/benchmarks/hle/__init__.py' 2026-03-25T09:06:03,879 adding 'evalscope/benchmarks/hle/hle_adapter.py' 2026-03-25T09:06:03,881 adding 'evalscope/benchmarks/hmmt25/hmmt25_adapter.py' 2026-03-25T09:06:03,883 adding 'evalscope/benchmarks/humaneval/__init__.py' 2026-03-25T09:06:03,884 adding 'evalscope/benchmarks/humaneval/humaneval_adapter.py' 2026-03-25T09:06:03,886 adding 'evalscope/benchmarks/humaneval/utils.py' 2026-03-25T09:06:03,888 adding 'evalscope/benchmarks/humanevalplus/__init__.py' 2026-03-25T09:06:03,889 adding 'evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py' 2026-03-25T09:06:03,891 adding 'evalscope/benchmarks/humanevalplus/docker/Dockerfile' 2026-03-25T09:06:03,893 adding 'evalscope/benchmarks/ifbench/__init__.py' 2026-03-25T09:06:03,895 adding 'evalscope/benchmarks/ifbench/evaluation_lib.py' 2026-03-25T09:06:03,896 adding 'evalscope/benchmarks/ifbench/ifbench_adapter.py' 2026-03-25T09:06:03,903 adding 'evalscope/benchmarks/ifbench/instructions.py' 2026-03-25T09:06:03,905 adding 'evalscope/benchmarks/ifbench/instructions_registry.py' 2026-03-25T09:06:03,908 adding 'evalscope/benchmarks/ifbench/instructions_util.py' 2026-03-25T09:06:03,910 adding 'evalscope/benchmarks/ifeval/__init__.py' 2026-03-25T09:06:03,911 adding 'evalscope/benchmarks/ifeval/ifeval_adapter.py' 2026-03-25T09:06:03,916 adding 'evalscope/benchmarks/ifeval/instructions.py' 2026-03-25T09:06:03,918 adding 'evalscope/benchmarks/ifeval/instructions_registry.py' 2026-03-25T09:06:03,921 adding 'evalscope/benchmarks/ifeval/instructions_util.py' 2026-03-25T09:06:03,923 adding 'evalscope/benchmarks/ifeval/utils.py' 2026-03-25T09:06:03,925 adding 'evalscope/benchmarks/image_edit/__init__.py' 2026-03-25T09:06:03,926 adding 'evalscope/benchmarks/image_edit/gedit/__init__.py' 2026-03-25T09:06:03,928 adding 'evalscope/benchmarks/image_edit/gedit/gedit_adapter.py' 2026-03-25T09:06:03,930 adding 'evalscope/benchmarks/image_edit/gedit/utils.py' 2026-03-25T09:06:03,932 adding 'evalscope/benchmarks/image_edit/gedit/vie_prompts.py' 2026-03-25T09:06:03,933 adding 'evalscope/benchmarks/infovqa/__init__.py' 2026-03-25T09:06:03,935 adding 'evalscope/benchmarks/infovqa/infovqa_adapter.py' 2026-03-25T09:06:03,936 adding 'evalscope/benchmarks/iquiz/__init__.py' 2026-03-25T09:06:03,938 adding 'evalscope/benchmarks/iquiz/iquiz_adapter.py' 2026-03-25T09:06:03,940 adding 'evalscope/benchmarks/librispeech/__init__.py' 2026-03-25T09:06:03,941 adding 'evalscope/benchmarks/librispeech/librispeech_adapter.py' 2026-03-25T09:06:03,943 adding 'evalscope/benchmarks/live_code_bench/__init__.py' 2026-03-25T09:06:03,944 adding 'evalscope/benchmarks/live_code_bench/evaluate_utils.py' 2026-03-25T09:06:03,946 adding 'evalscope/benchmarks/live_code_bench/extract_utils.py' 2026-03-25T09:06:03,947 adding 'evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py' 2026-03-25T09:06:03,949 adding 'evalscope/benchmarks/live_code_bench/load_utils.py' 2026-03-25T09:06:03,950 adding 'evalscope/benchmarks/live_code_bench/pass_k_utils.py' 2026-03-25T09:06:03,952 adding 'evalscope/benchmarks/live_code_bench/prompts.py' 2026-03-25T09:06:03,954 adding 'evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py' 2026-03-25T09:06:03,956 adding 'evalscope/benchmarks/live_code_bench/testing_util.py' 2026-03-25T09:06:03,958 adding 'evalscope/benchmarks/logi_qa/__int__.py' 2026-03-25T09:06:03,959 adding 'evalscope/benchmarks/logi_qa/logi_qa_adapter.py' 2026-03-25T09:06:03,961 adding 'evalscope/benchmarks/longbench_v2/__init__.py' 2026-03-25T09:06:03,962 adding 'evalscope/benchmarks/longbench_v2/longbench_v2_adapter.py' 2026-03-25T09:06:03,964 adding 'evalscope/benchmarks/maritime_bench/__init__.py' 2026-03-25T09:06:03,965 adding 'evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py' 2026-03-25T09:06:03,967 adding 'evalscope/benchmarks/math_500/__init__.py' 2026-03-25T09:06:03,968 adding 'evalscope/benchmarks/math_500/math_500_adapter.py' 2026-03-25T09:06:03,970 adding 'evalscope/benchmarks/math_qa/__init__.py' 2026-03-25T09:06:03,971 adding 'evalscope/benchmarks/math_qa/math_qa_adapter.py' 2026-03-25T09:06:03,972 adding 'evalscope/benchmarks/math_verse/__init__.py' 2026-03-25T09:06:03,974 adding 'evalscope/benchmarks/math_verse/math_verse_adapter.py' 2026-03-25T09:06:03,975 adding 'evalscope/benchmarks/math_vision/__init__.py' 2026-03-25T09:06:03,977 adding 'evalscope/benchmarks/math_vision/math_vision_adapter.py' 2026-03-25T09:06:03,978 adding 'evalscope/benchmarks/math_vista/__init__.py' 2026-03-25T09:06:03,980 adding 'evalscope/benchmarks/math_vista/math_vista_adapter.py' 2026-03-25T09:06:03,982 adding 'evalscope/benchmarks/mbpp/__init__.py' 2026-03-25T09:06:03,983 adding 'evalscope/benchmarks/mbpp/mbpp_adapter.py' 2026-03-25T09:06:03,985 adding 'evalscope/benchmarks/mbppplus/__init__.py' 2026-03-25T09:06:03,986 adding 'evalscope/benchmarks/mbppplus/mbppplus_adapter.py' 2026-03-25T09:06:03,988 adding 'evalscope/benchmarks/med_mcqa/__init__.py' 2026-03-25T09:06:03,989 adding 'evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py' 2026-03-25T09:06:03,991 adding 'evalscope/benchmarks/mgsm/__init__.py' 2026-03-25T09:06:03,992 adding 'evalscope/benchmarks/mgsm/mgsm_adapter.py' 2026-03-25T09:06:03,994 adding 'evalscope/benchmarks/micro_vqa/__init__.py' 2026-03-25T09:06:03,995 adding 'evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py' 2026-03-25T09:06:03,997 adding 'evalscope/benchmarks/minerva_math/__init__.py' 2026-03-25T09:06:03,998 adding 'evalscope/benchmarks/minerva_math/minerva_math_adapter.py' 2026-03-25T09:06:04,000 adding 'evalscope/benchmarks/mm_bench/__init__.py' 2026-03-25T09:06:04,001 adding 'evalscope/benchmarks/mm_bench/mm_bench_adapter.py' 2026-03-25T09:06:04,003 adding 'evalscope/benchmarks/mm_star/__init__.py' 2026-03-25T09:06:04,005 adding 'evalscope/benchmarks/mm_star/mm_star_adapter.py' 2026-03-25T09:06:04,006 adding 'evalscope/benchmarks/mmlu/__init__.py' 2026-03-25T09:06:04,008 adding 'evalscope/benchmarks/mmlu/mmlu_adapter.py' 2026-03-25T09:06:04,010 adding 'evalscope/benchmarks/mmlu_pro/__init__.py' 2026-03-25T09:06:04,011 adding 'evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py' 2026-03-25T09:06:04,013 adding 'evalscope/benchmarks/mmlu_redux/__init__.py' 2026-03-25T09:06:04,014 adding 'evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py' 2026-03-25T09:06:04,016 adding 'evalscope/benchmarks/mmmlu/__init__.py' 2026-03-25T09:06:04,018 adding 'evalscope/benchmarks/mmmlu/mmmlu_adapter.py' 2026-03-25T09:06:04,019 adding 'evalscope/benchmarks/mmmlu/prompt.py' 2026-03-25T09:06:04,021 adding 'evalscope/benchmarks/mmmu/__init__.py' 2026-03-25T09:06:04,023 adding 'evalscope/benchmarks/mmmu/mmmu_adapter.py' 2026-03-25T09:06:04,024 adding 'evalscope/benchmarks/mmmu_pro/__init__.py' 2026-03-25T09:06:04,026 adding 'evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py' 2026-03-25T09:06:04,027 adding 'evalscope/benchmarks/mri_mcqa/__init__.py' 2026-03-25T09:06:04,029 adding 'evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py' 2026-03-25T09:06:04,031 adding 'evalscope/benchmarks/multi_if/__init__.py' 2026-03-25T09:06:04,039 adding 'evalscope/benchmarks/multi_if/ifeval.py' 2026-03-25T09:06:04,041 adding 'evalscope/benchmarks/multi_if/metrics.py' 2026-03-25T09:06:04,042 adding 'evalscope/benchmarks/multi_if/multi_if_adapter.py' 2026-03-25T09:06:04,044 adding 'evalscope/benchmarks/multipl_e/__init__.py' 2026-03-25T09:06:04,045 adding 'evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py' 2026-03-25T09:06:04,047 adding 'evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py' 2026-03-25T09:06:04,049 adding 'evalscope/benchmarks/multipl_e/utils.py' 2026-03-25T09:06:04,050 adding 'evalscope/benchmarks/music_trivia/__init__.py' 2026-03-25T09:06:04,051 adding 'evalscope/benchmarks/music_trivia/music_trivia_adapter.py' 2026-03-25T09:06:04,053 adding 'evalscope/benchmarks/musr/__init__.py' 2026-03-25T09:06:04,054 adding 'evalscope/benchmarks/musr/musr_adapter.py' 2026-03-25T09:06:04,056 adding 'evalscope/benchmarks/needle_haystack/__init__.py' 2026-03-25T09:06:04,058 adding 'evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py' 2026-03-25T09:06:04,060 adding 'evalscope/benchmarks/needle_haystack/utils.py' 2026-03-25T09:06:04,062 adding 'evalscope/benchmarks/ner/__init__.py' 2026-03-25T09:06:04,063 adding 'evalscope/benchmarks/ner/anat_em_adapter.py' 2026-03-25T09:06:04,065 adding 'evalscope/benchmarks/ner/bc2gm_adapter.py' 2026-03-25T09:06:04,066 adding 'evalscope/benchmarks/ner/bc4chemd_adapter.py' 2026-03-25T09:06:04,067 adding 'evalscope/benchmarks/ner/bc5cdr_adapter.py' 2026-03-25T09:06:04,069 adding 'evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py' 2026-03-25T09:06:04,070 adding 'evalscope/benchmarks/ner/conll2003_adapter.py' 2026-03-25T09:06:04,071 adding 'evalscope/benchmarks/ner/conllpp_adapter.py' 2026-03-25T09:06:04,073 adding 'evalscope/benchmarks/ner/copious_adapter.py' 2026-03-25T09:06:04,074 adding 'evalscope/benchmarks/ner/cross_ner_adapter.py' 2026-03-25T09:06:04,075 adding 'evalscope/benchmarks/ner/fin_ner_adapter.py' 2026-03-25T09:06:04,077 adding 'evalscope/benchmarks/ner/genia_ner_adapter.py' 2026-03-25T09:06:04,078 adding 'evalscope/benchmarks/ner/harvey_ner_adapter.py' 2026-03-25T09:06:04,080 adding 'evalscope/benchmarks/ner/jnlpba_adapter.py' 2026-03-25T09:06:04,081 adding 'evalscope/benchmarks/ner/jnlpba_rare_adapter.py' 2026-03-25T09:06:04,082 adding 'evalscope/benchmarks/ner/mit_movie_trivia_adapter.py' 2026-03-25T09:06:04,084 adding 'evalscope/benchmarks/ner/mit_restaurant_adapter.py' 2026-03-25T09:06:04,085 adding 'evalscope/benchmarks/ner/multi_nerd_adapter.py' 2026-03-25T09:06:04,086 adding 'evalscope/benchmarks/ner/ncbi_adapter.py' 2026-03-25T09:06:04,088 adding 'evalscope/benchmarks/ner/ontonotes5_adapter.py' 2026-03-25T09:06:04,089 adding 'evalscope/benchmarks/ner/tweebank_ner_adapter.py' 2026-03-25T09:06:04,091 adding 'evalscope/benchmarks/ner/tweet_ner_7_adapter.py' 2026-03-25T09:06:04,092 adding 'evalscope/benchmarks/ner/wnut2017_adapter.py' 2026-03-25T09:06:04,094 adding 'evalscope/benchmarks/ner/cross_ner_entities/__init__.py' 2026-03-25T09:06:04,095 adding 'evalscope/benchmarks/ner/cross_ner_entities/ai.py' 2026-03-25T09:06:04,096 adding 'evalscope/benchmarks/ner/cross_ner_entities/literature.py' 2026-03-25T09:06:04,098 adding 'evalscope/benchmarks/ner/cross_ner_entities/music.py' 2026-03-25T09:06:04,099 adding 'evalscope/benchmarks/ner/cross_ner_entities/politics.py' 2026-03-25T09:06:04,100 adding 'evalscope/benchmarks/ner/cross_ner_entities/science.py' 2026-03-25T09:06:04,102 adding 'evalscope/benchmarks/ocr_bench/__init__.py' 2026-03-25T09:06:04,103 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py' 2026-03-25T09:06:04,105 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py' 2026-03-25T09:06:04,107 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py' 2026-03-25T09:06:04,111 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py' 2026-03-25T09:06:04,112 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py' 2026-03-25T09:06:04,114 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py' 2026-03-25T09:06:04,115 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py' 2026-03-25T09:06:04,117 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py' 2026-03-25T09:06:04,118 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py' 2026-03-25T09:06:04,120 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py' 2026-03-25T09:06:04,122 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py' 2026-03-25T09:06:04,123 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py' 2026-03-25T09:06:04,124 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt' 2026-03-25T09:06:04,127 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py' 2026-03-25T09:06:04,129 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py' 2026-03-25T09:06:04,131 adding 'evalscope/benchmarks/olympiad_bench/__init__.py' 2026-03-25T09:06:04,132 adding 'evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py' 2026-03-25T09:06:04,135 adding 'evalscope/benchmarks/olympiad_bench/utils.py' 2026-03-25T09:06:04,137 adding 'evalscope/benchmarks/omni_bench/__init__.py' 2026-03-25T09:06:04,138 adding 'evalscope/benchmarks/omni_bench/omni_bench_adapter.py' 2026-03-25T09:06:04,140 adding 'evalscope/benchmarks/omnidoc_bench/__init__.py' 2026-03-25T09:06:04,142 adding 'evalscope/benchmarks/omnidoc_bench/end2end_eval.py' 2026-03-25T09:06:04,144 adding 'evalscope/benchmarks/omnidoc_bench/metrics.py' 2026-03-25T09:06:04,146 adding 'evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py' 2026-03-25T09:06:04,154 adding 'evalscope/benchmarks/omnidoc_bench/utils.py' 2026-03-25T09:06:04,156 adding 'evalscope/benchmarks/openai_mrcr/__init__.py' 2026-03-25T09:06:04,158 adding 'evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py' 2026-03-25T09:06:04,159 adding 'evalscope/benchmarks/openai_mrcr/utils.py' 2026-03-25T09:06:04,161 adding 'evalscope/benchmarks/piqa/__init__.py' 2026-03-25T09:06:04,162 adding 'evalscope/benchmarks/piqa/piqa_adapter.py' 2026-03-25T09:06:04,164 adding 'evalscope/benchmarks/poly_math/__init__.py' 2026-03-25T09:06:04,165 adding 'evalscope/benchmarks/poly_math/poly_math_adapter.py' 2026-03-25T09:06:04,168 adding 'evalscope/benchmarks/poly_math/utils/instruction.py' 2026-03-25T09:06:04,169 adding 'evalscope/benchmarks/pope/__init__.py' 2026-03-25T09:06:04,171 adding 'evalscope/benchmarks/pope/pope_adapter.py' 2026-03-25T09:06:04,173 adding 'evalscope/benchmarks/process_bench/__init__.py' 2026-03-25T09:06:04,174 adding 'evalscope/benchmarks/process_bench/process_bench_adapter.py' 2026-03-25T09:06:04,176 adding 'evalscope/benchmarks/pumed_qa/__init__.py' 2026-03-25T09:06:04,178 adding 'evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py' 2026-03-25T09:06:04,179 adding 'evalscope/benchmarks/qasc/__init__.py' 2026-03-25T09:06:04,181 adding 'evalscope/benchmarks/qasc/qasc_adapter.py' 2026-03-25T09:06:04,182 adding 'evalscope/benchmarks/race/__init__.py' 2026-03-25T09:06:04,184 adding 'evalscope/benchmarks/race/race_adapter.py' 2026-03-25T09:06:04,185 adding 'evalscope/benchmarks/real_world_qa/__init__.py' 2026-03-25T09:06:04,187 adding 'evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py' 2026-03-25T09:06:04,189 adding 'evalscope/benchmarks/refcoco/__init__.py' 2026-03-25T09:06:04,190 adding 'evalscope/benchmarks/refcoco/evaluation_lib.py' 2026-03-25T09:06:04,192 adding 'evalscope/benchmarks/refcoco/refcoco_adapter.py' 2026-03-25T09:06:04,193 adding 'evalscope/benchmarks/refcoco/utils.py' 2026-03-25T09:06:04,195 adding 'evalscope/benchmarks/scicode/__init__.py' 2026-03-25T09:06:04,196 adding 'evalscope/benchmarks/scicode/prompt_templates.py' 2026-03-25T09:06:04,198 adding 'evalscope/benchmarks/scicode/scicode_adapter.py' 2026-03-25T09:06:04,199 adding 'evalscope/benchmarks/scicode/util.py' 2026-03-25T09:06:04,201 adding 'evalscope/benchmarks/scicode/docker/Dockerfile' 2026-03-25T09:06:04,202 adding 'evalscope/benchmarks/scicode/docker/docker_requirements.txt' 2026-03-25T09:06:04,204 adding 'evalscope/benchmarks/scicode/docker/process_data.py' 2026-03-25T09:06:04,205 adding 'evalscope/benchmarks/scicode/docker/test_util.py' 2026-03-25T09:06:04,207 adding 'evalscope/benchmarks/science_qa/__init__.py' 2026-03-25T09:06:04,208 adding 'evalscope/benchmarks/science_qa/science_qa_adapter.py' 2026-03-25T09:06:04,210 adding 'evalscope/benchmarks/sciq/__init__.py' 2026-03-25T09:06:04,211 adding 'evalscope/benchmarks/sciq/sciq_adapter.py' 2026-03-25T09:06:04,212 adding 'evalscope/benchmarks/seed_bench_2_plus/__init__.py' 2026-03-25T09:06:04,214 adding 'evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py' 2026-03-25T09:06:04,215 adding 'evalscope/benchmarks/simple_qa/__init__.py' 2026-03-25T09:06:04,217 adding 'evalscope/benchmarks/simple_qa/simple_qa_adapter.py' 2026-03-25T09:06:04,219 adding 'evalscope/benchmarks/simple_vqa/__init__.py' 2026-03-25T09:06:04,221 adding 'evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py' 2026-03-25T09:06:04,222 adding 'evalscope/benchmarks/siqa/__init__.py' 2026-03-25T09:06:04,223 adding 'evalscope/benchmarks/siqa/siqa_adapter.py' 2026-03-25T09:06:04,225 adding 'evalscope/benchmarks/super_gpqa/__init__.py' 2026-03-25T09:06:04,227 adding 'evalscope/benchmarks/super_gpqa/prompt.py' 2026-03-25T09:06:04,229 adding 'evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py' 2026-03-25T09:06:04,230 adding 'evalscope/benchmarks/super_gpqa/utils.py' 2026-03-25T09:06:04,232 adding 'evalscope/benchmarks/swe_bench/__init__.py' 2026-03-25T09:06:04,233 adding 'evalscope/benchmarks/swe_bench/build_images.py' 2026-03-25T09:06:04,235 adding 'evalscope/benchmarks/swe_bench/swe_bench_adapter.py' 2026-03-25T09:06:04,237 adding 'evalscope/benchmarks/swe_bench/utils.py' 2026-03-25T09:06:04,238 adding 'evalscope/benchmarks/tau_bench/__init__.py' 2026-03-25T09:06:04,240 adding 'evalscope/benchmarks/tau_bench/tau2_bench/__init__.py' 2026-03-25T09:06:04,241 adding 'evalscope/benchmarks/tau_bench/tau2_bench/generation.py' 2026-03-25T09:06:04,243 adding 'evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py' 2026-03-25T09:06:04,245 adding 'evalscope/benchmarks/tau_bench/tau_bench/__init__.py' 2026-03-25T09:06:04,246 adding 'evalscope/benchmarks/tau_bench/tau_bench/generation.py' 2026-03-25T09:06:04,249 adding 'evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py' 2026-03-25T09:06:04,250 adding 'evalscope/benchmarks/terminal_bench/__init__.py' 2026-03-25T09:06:04,252 adding 'evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py' 2026-03-25T09:06:04,253 adding 'evalscope/benchmarks/terminal_bench/utils.py' 2026-03-25T09:06:04,255 adding 'evalscope/benchmarks/text2image/__init__.py' 2026-03-25T09:06:04,257 adding 'evalscope/benchmarks/text2image/evalmuse_adapter.py' 2026-03-25T09:06:04,258 adding 'evalscope/benchmarks/text2image/genai_bench_adapter.py' 2026-03-25T09:06:04,259 adding 'evalscope/benchmarks/text2image/general_t2i_adapter.py' 2026-03-25T09:06:04,261 adding 'evalscope/benchmarks/text2image/hpdv2_adapter.py' 2026-03-25T09:06:04,262 adding 'evalscope/benchmarks/text2image/tifa_adapter.py' 2026-03-25T09:06:04,264 adding 'evalscope/benchmarks/tool_bench/__init__.py' 2026-03-25T09:06:04,265 adding 'evalscope/benchmarks/tool_bench/tool_bench_adapter.py' 2026-03-25T09:06:04,267 adding 'evalscope/benchmarks/tool_bench/utils.py' 2026-03-25T09:06:04,269 adding 'evalscope/benchmarks/torgo/__init__.py' 2026-03-25T09:06:04,270 adding 'evalscope/benchmarks/torgo/torgo_adapter.py' 2026-03-25T09:06:04,272 adding 'evalscope/benchmarks/trivia_qa/__init__.py' 2026-03-25T09:06:04,273 adding 'evalscope/benchmarks/trivia_qa/samples.jsonl' 2026-03-25T09:06:04,275 adding 'evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py' 2026-03-25T09:06:04,277 adding 'evalscope/benchmarks/truthful_qa/__init__.py' 2026-03-25T09:06:04,278 adding 'evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py' 2026-03-25T09:06:04,280 adding 'evalscope/benchmarks/visu_logic/__init__.py' 2026-03-25T09:06:04,281 adding 'evalscope/benchmarks/visu_logic/visu_logic_adapter.py' 2026-03-25T09:06:04,283 adding 'evalscope/benchmarks/vstar_bench/__init__.py' 2026-03-25T09:06:04,285 adding 'evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py' 2026-03-25T09:06:04,286 adding 'evalscope/benchmarks/winogrande/__init__.py' 2026-03-25T09:06:04,287 adding 'evalscope/benchmarks/winogrande/winogrande_adapter.py' 2026-03-25T09:06:04,289 adding 'evalscope/benchmarks/wmt/__init__.py' 2026-03-25T09:06:04,291 adding 'evalscope/benchmarks/wmt/wmt24_adapter.py' 2026-03-25T09:06:04,292 adding 'evalscope/benchmarks/zebralogicbench/__init__.py' 2026-03-25T09:06:04,294 adding 'evalscope/benchmarks/zebralogicbench/utils.py' 2026-03-25T09:06:04,296 adding 'evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py' 2026-03-25T09:06:04,297 adding 'evalscope/benchmarks/zerobench/__init__.py' 2026-03-25T09:06:04,299 adding 'evalscope/benchmarks/zerobench/zerobench_adapter.py' 2026-03-25T09:06:04,300 adding 'evalscope/cli/__init__.py' 2026-03-25T09:06:04,301 adding 'evalscope/cli/base.py' 2026-03-25T09:06:04,303 adding 'evalscope/cli/benchmark_info.py' 2026-03-25T09:06:04,305 adding 'evalscope/cli/cli.py' 2026-03-25T09:06:04,306 adding 'evalscope/cli/start_app.py' 2026-03-25T09:06:04,307 adding 'evalscope/cli/start_eval.py' 2026-03-25T09:06:04,308 adding 'evalscope/cli/start_perf.py' 2026-03-25T09:06:04,309 adding 'evalscope/cli/start_service.py' 2026-03-25T09:06:04,311 adding 'evalscope/collections/__init__.py' 2026-03-25T09:06:04,312 adding 'evalscope/collections/sampler.py' 2026-03-25T09:06:04,314 adding 'evalscope/collections/schema.py' 2026-03-25T09:06:04,316 adding 'evalscope/evaluator/__init__.py' 2026-03-25T09:06:04,317 adding 'evalscope/evaluator/batch_reviewer.py' 2026-03-25T09:06:04,320 adding 'evalscope/evaluator/evaluator.py' 2026-03-25T09:06:04,321 adding 'evalscope/filters/__init__.py' 2026-03-25T09:06:04,323 adding 'evalscope/filters/extraction.py' 2026-03-25T09:06:04,324 adding 'evalscope/filters/selection.py' 2026-03-25T09:06:04,326 adding 'evalscope/metrics/__init__.py' 2026-03-25T09:06:04,328 adding 'evalscope/metrics/llm_judge.py' 2026-03-25T09:06:04,330 adding 'evalscope/metrics/math_parser.py' 2026-03-25T09:06:04,333 adding 'evalscope/metrics/metric.py' 2026-03-25T09:06:04,336 adding 'evalscope/metrics/metrics.py' 2026-03-25T09:06:04,337 adding 'evalscope/metrics/rouge_metric.py' 2026-03-25T09:06:04,339 adding 'evalscope/metrics/bert_score/__init__.py' 2026-03-25T09:06:04,341 adding 'evalscope/metrics/bert_score/scorer.py' 2026-03-25T09:06:04,344 adding 'evalscope/metrics/bert_score/utils.py' 2026-03-25T09:06:04,346 adding 'evalscope/metrics/bundled_rouge_score/__init__.py' 2026-03-25T09:06:04,348 adding 'evalscope/metrics/bundled_rouge_score/rouge_scorer.py' 2026-03-25T09:06:04,350 adding 'evalscope/metrics/sem_score/__init__.py' 2026-03-25T09:06:04,351 adding 'evalscope/metrics/sem_score/scorer.py' 2026-03-25T09:06:04,353 adding 'evalscope/metrics/t2v_metrics/__init__.py' 2026-03-25T09:06:04,354 adding 'evalscope/metrics/t2v_metrics/clipscore.py' 2026-03-25T09:06:04,356 adding 'evalscope/metrics/t2v_metrics/constants.py' 2026-03-25T09:06:04,357 adding 'evalscope/metrics/t2v_metrics/itmscore.py' 2026-03-25T09:06:04,358 adding 'evalscope/metrics/t2v_metrics/score.py' 2026-03-25T09:06:04,359 adding 'evalscope/metrics/t2v_metrics/vqascore.py' 2026-03-25T09:06:04,361 adding 'evalscope/metrics/t2v_metrics/models/__init__.py' 2026-03-25T09:06:04,362 adding 'evalscope/metrics/t2v_metrics/models/model.py' 2026-03-25T09:06:04,364 adding 'evalscope/metrics/t2v_metrics/models/utils.py' 2026-03-25T09:06:04,365 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py' 2026-03-25T09:06:04,367 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py' 2026-03-25T09:06:04,369 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py' 2026-03-25T09:06:04,370 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py' 2026-03-25T09:06:04,371 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py' 2026-03-25T09:06:04,373 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py' 2026-03-25T09:06:04,374 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py' 2026-03-25T09:06:04,376 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py' 2026-03-25T09:06:04,377 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py' 2026-03-25T09:06:04,379 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py' 2026-03-25T09:06:04,380 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py' 2026-03-25T09:06:04,382 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py' 2026-03-25T09:06:04,383 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py' 2026-03-25T09:06:04,385 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py' 2026-03-25T09:06:04,386 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py' 2026-03-25T09:06:04,387 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py' 2026-03-25T09:06:04,389 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py' 2026-03-25T09:06:04,391 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py' 2026-03-25T09:06:04,393 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py' 2026-03-25T09:06:04,394 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py' 2026-03-25T09:06:04,395 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py' 2026-03-25T09:06:04,397 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py' 2026-03-25T09:06:04,398 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py' 2026-03-25T09:06:04,400 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py' 2026-03-25T09:06:04,402 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py' 2026-03-25T09:06:04,403 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py' 2026-03-25T09:06:04,405 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py' 2026-03-25T09:06:04,407 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py' 2026-03-25T09:06:04,410 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py' 2026-03-25T09:06:04,411 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py' 2026-03-25T09:06:04,412 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py' 2026-03-25T09:06:04,414 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py' 2026-03-25T09:06:04,415 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py' 2026-03-25T09:06:04,417 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py' 2026-03-25T09:06:04,419 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py' 2026-03-25T09:06:04,421 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py' 2026-03-25T09:06:04,422 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py' 2026-03-25T09:06:04,424 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py' 2026-03-25T09:06:04,426 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml' 2026-03-25T09:06:04,428 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json' 2026-03-25T09:06:04,429 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json' 2026-03-25T09:06:04,430 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json' 2026-03-25T09:06:04,432 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml' 2026-03-25T09:06:04,433 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml' 2026-03-25T09:06:04,434 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml' 2026-03-25T09:06:04,436 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml' 2026-03-25T09:06:04,437 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml' 2026-03-25T09:06:04,438 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml' 2026-03-25T09:06:04,439 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml' 2026-03-25T09:06:04,441 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml' 2026-03-25T09:06:04,442 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml' 2026-03-25T09:06:04,443 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml' 2026-03-25T09:06:04,444 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml' 2026-03-25T09:06:04,446 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml' 2026-03-25T09:06:04,447 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml' 2026-03-25T09:06:04,448 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml' 2026-03-25T09:06:04,449 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml' 2026-03-25T09:06:04,450 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml' 2026-03-25T09:06:04,452 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml' 2026-03-25T09:06:04,453 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml' 2026-03-25T09:06:04,454 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml' 2026-03-25T09:06:04,456 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py' 2026-03-25T09:06:04,458 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py' 2026-03-25T09:06:04,460 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py' 2026-03-25T09:06:04,462 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py' 2026-03-25T09:06:04,468 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py' 2026-03-25T09:06:04,471 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py' 2026-03-25T09:06:04,476 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py' 2026-03-25T09:06:04,477 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py' 2026-03-25T09:06:04,479 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py' 2026-03-25T09:06:04,481 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py' 2026-03-25T09:06:04,483 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py' 2026-03-25T09:06:04,485 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py' 2026-03-25T09:06:04,489 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py' 2026-03-25T09:06:04,491 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py' 2026-03-25T09:06:04,495 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py' 2026-03-25T09:06:04,503 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py' 2026-03-25T09:06:04,505 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py' 2026-03-25T09:06:04,507 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py' 2026-03-25T09:06:04,508 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py' 2026-03-25T09:06:04,510 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py' 2026-03-25T09:06:04,512 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py' 2026-03-25T09:06:04,513 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py' 2026-03-25T09:06:04,515 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py' 2026-03-25T09:06:04,516 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py' 2026-03-25T09:06:04,518 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py' 2026-03-25T09:06:04,520 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py' 2026-03-25T09:06:04,524 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py' 2026-03-25T09:06:04,526 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py' 2026-03-25T09:06:04,528 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py' 2026-03-25T09:06:04,529 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py' 2026-03-25T09:06:04,531 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py' 2026-03-25T09:06:04,533 adding 'evalscope/metrics/text_normalizer/__init__.py' 2026-03-25T09:06:04,534 adding 'evalscope/metrics/text_normalizer/basic.py' 2026-03-25T09:06:04,540 adding 'evalscope/metrics/text_normalizer/chinese.py' 2026-03-25T09:06:04,549 adding 'evalscope/metrics/text_normalizer/english.json' 2026-03-25T09:06:04,552 adding 'evalscope/metrics/text_normalizer/english.py' 2026-03-25T09:06:04,553 adding 'evalscope/metrics/text_normalizer/wer.py' 2026-03-25T09:06:04,555 adding 'evalscope/models/__init__.py' 2026-03-25T09:06:04,557 adding 'evalscope/models/anthropic_compatible.py' 2026-03-25T09:06:04,558 adding 'evalscope/models/image_edit_model.py' 2026-03-25T09:06:04,560 adding 'evalscope/models/mockllm.py' 2026-03-25T09:06:04,561 adding 'evalscope/models/model_apis.py' 2026-03-25T09:06:04,563 adding 'evalscope/models/modelscope.py' 2026-03-25T09:06:04,565 adding 'evalscope/models/openai_compatible.py' 2026-03-25T09:06:04,567 adding 'evalscope/models/text2image_model.py' 2026-03-25T09:06:04,570 adding 'evalscope/models/utils/anthropic.py' 2026-03-25T09:06:04,574 adding 'evalscope/models/utils/openai.py' 2026-03-25T09:06:04,576 adding 'evalscope/perf/__init__.py' 2026-03-25T09:06:04,578 adding 'evalscope/perf/arguments.py' 2026-03-25T09:06:04,580 adding 'evalscope/perf/benchmark.py' 2026-03-25T09:06:04,581 adding 'evalscope/perf/http_client.py' 2026-03-25T09:06:04,583 adding 'evalscope/perf/main.py' 2026-03-25T09:06:04,585 adding 'evalscope/perf/plugin/__init__.py' 2026-03-25T09:06:04,586 adding 'evalscope/perf/plugin/registry.py' 2026-03-25T09:06:04,587 adding 'evalscope/perf/plugin/api/__init__.py' 2026-03-25T09:06:04,589 adding 'evalscope/perf/plugin/api/base.py' 2026-03-25T09:06:04,590 adding 'evalscope/perf/plugin/api/custom_api.py' 2026-03-25T09:06:04,592 adding 'evalscope/perf/plugin/api/dashscope_api.py' 2026-03-25T09:06:04,593 adding 'evalscope/perf/plugin/api/default_api.py' 2026-03-25T09:06:04,595 adding 'evalscope/perf/plugin/api/openai_api.py' 2026-03-25T09:06:04,597 adding 'evalscope/perf/plugin/api/openai_embedding_api.py' 2026-03-25T09:06:04,598 adding 'evalscope/perf/plugin/api/openai_rerank_api.py' 2026-03-25T09:06:04,600 adding 'evalscope/perf/plugin/datasets/__init__.py' 2026-03-25T09:06:04,602 adding 'evalscope/perf/plugin/datasets/base.py' 2026-03-25T09:06:04,603 adding 'evalscope/perf/plugin/datasets/custom.py' 2026-03-25T09:06:04,604 adding 'evalscope/perf/plugin/datasets/embedding_dataset.py' 2026-03-25T09:06:04,606 adding 'evalscope/perf/plugin/datasets/flickr8k.py' 2026-03-25T09:06:04,607 adding 'evalscope/perf/plugin/datasets/kontext_bench.py' 2026-03-25T09:06:04,608 adding 'evalscope/perf/plugin/datasets/line_by_line.py' 2026-03-25T09:06:04,609 adding 'evalscope/perf/plugin/datasets/longalpaca.py' 2026-03-25T09:06:04,611 adding 'evalscope/perf/plugin/datasets/openqa.py' 2026-03-25T09:06:04,612 adding 'evalscope/perf/plugin/datasets/random_dataset.py' 2026-03-25T09:06:04,613 adding 'evalscope/perf/plugin/datasets/random_vl_dataset.py' 2026-03-25T09:06:04,615 adding 'evalscope/perf/plugin/datasets/rerank_dataset.py' 2026-03-25T09:06:04,616 adding 'evalscope/perf/plugin/datasets/speed_benchmark.py' 2026-03-25T09:06:04,618 adding 'evalscope/perf/plugin/datasets/utils.py' 2026-03-25T09:06:04,619 adding 'evalscope/perf/sla/__init__.py' 2026-03-25T09:06:04,621 adding 'evalscope/perf/sla/sla_criterion.py' 2026-03-25T09:06:04,623 adding 'evalscope/perf/sla/sla_run.py' 2026-03-25T09:06:04,625 adding 'evalscope/perf/utils/__init__.py' 2026-03-25T09:06:04,626 adding 'evalscope/perf/utils/analysis_result.py' 2026-03-25T09:06:04,628 adding 'evalscope/perf/utils/benchmark_util.py' 2026-03-25T09:06:04,630 adding 'evalscope/perf/utils/db_util.py' 2026-03-25T09:06:04,631 adding 'evalscope/perf/utils/handler.py' 2026-03-25T09:06:04,632 adding 'evalscope/perf/utils/local_server.py' 2026-03-25T09:06:04,634 adding 'evalscope/perf/utils/log_utils.py' 2026-03-25T09:06:04,636 adding 'evalscope/perf/utils/rich_display.py' 2026-03-25T09:06:04,638 adding 'evalscope/perf/utils/report/__init__.py' 2026-03-25T09:06:04,640 adding 'evalscope/perf/utils/report/generate_report.py' 2026-03-25T09:06:04,642 adding 'evalscope/perf/utils/report/perf_charts.py' 2026-03-25T09:06:04,644 adding 'evalscope/perf/utils/report/perf_data.py' 2026-03-25T09:06:04,645 adding 'evalscope/report/__init__.py' 2026-03-25T09:06:04,647 adding 'evalscope/report/combinator.py' 2026-03-25T09:06:04,648 adding 'evalscope/report/generator.py' 2026-03-25T09:06:04,651 adding 'evalscope/report/renderer.py' 2026-03-25T09:06:04,653 adding 'evalscope/report/report.py' 2026-03-25T09:06:04,655 adding 'evalscope/report/template/perf_report.html.j2' 2026-03-25T09:06:04,657 adding 'evalscope/report/template/report.html.j2' 2026-03-25T09:06:04,660 adding 'evalscope/report/template/css/base.css' 2026-03-25T09:06:04,661 adding 'evalscope/report/template/css/perf_extra.css' 2026-03-25T09:06:04,663 adding 'evalscope/report/template/js/eval_extra.js' 2026-03-25T09:06:04,665 adding 'evalscope/report/template/js/i18n_eval.js' 2026-03-25T09:06:04,666 adding 'evalscope/report/template/js/i18n_perf.js' 2026-03-25T09:06:04,668 adding 'evalscope/report/template/js/perf_extra.js' 2026-03-25T09:06:04,669 adding 'evalscope/report/template/js/shared.js' 2026-03-25T09:06:04,671 adding 'evalscope/report/template/partials/footer.html' 2026-03-25T09:06:04,672 adding 'evalscope/report/template/partials/header_eval.html' 2026-03-25T09:06:04,673 adding 'evalscope/report/template/partials/header_perf.html' 2026-03-25T09:06:04,675 adding 'evalscope/report/template/partials/toc_eval.html' 2026-03-25T09:06:04,676 adding 'evalscope/report/template/partials/toc_perf.html' 2026-03-25T09:06:04,677 adding 'evalscope/sandbox/__init__.py' 2026-03-25T09:06:04,679 adding 'evalscope/sandbox/volcengine.py' 2026-03-25T09:06:04,681 adding 'evalscope/service/__init__.py' 2026-03-25T09:06:04,682 adding 'evalscope/service/app.py' 2026-03-25T09:06:04,684 adding 'evalscope/service/blueprints/__init__.py' 2026-03-25T09:06:04,686 adding 'evalscope/service/blueprints/eval.py' 2026-03-25T09:06:04,687 adding 'evalscope/service/blueprints/perf.py' 2026-03-25T09:06:04,689 adding 'evalscope/service/frontend/__init__.py' 2026-03-25T09:06:04,690 adding 'evalscope/service/frontend/async_client.py' 2026-03-25T09:06:04,692 adding 'evalscope/service/frontend/main.py' 2026-03-25T09:06:04,694 adding 'evalscope/service/frontend/utils.py' 2026-03-25T09:06:04,695 adding 'evalscope/service/utils/__init__.py' 2026-03-25T09:06:04,697 adding 'evalscope/service/utils/benchmarks.py' 2026-03-25T09:06:04,698 adding 'evalscope/service/utils/log.py' 2026-03-25T09:06:04,699 adding 'evalscope/service/utils/process.py' 2026-03-25T09:06:04,701 adding 'evalscope/summarizer/__init__.py' 2026-03-25T09:06:04,702 adding 'evalscope/summarizer/summarizer.py' 2026-03-25T09:06:04,704 adding 'evalscope/third_party/__init__.py' 2026-03-25T09:06:04,706 adding 'evalscope/third_party/longbench_write/README.md' 2026-03-25T09:06:04,707 adding 'evalscope/third_party/longbench_write/__init__.py' 2026-03-25T09:06:04,708 adding 'evalscope/third_party/longbench_write/default_task.json' 2026-03-25T09:06:04,709 adding 'evalscope/third_party/longbench_write/default_task.yaml' 2026-03-25T09:06:04,711 adding 'evalscope/third_party/longbench_write/eval.py' 2026-03-25T09:06:04,713 adding 'evalscope/third_party/longbench_write/infer.py' 2026-03-25T09:06:04,714 adding 'evalscope/third_party/longbench_write/longbench_write.py' 2026-03-25T09:06:04,715 adding 'evalscope/third_party/longbench_write/utils.py' 2026-03-25T09:06:04,717 adding 'evalscope/third_party/longbench_write/resources/__init__.py' 2026-03-25T09:06:04,718 adding 'evalscope/third_party/longbench_write/resources/judge.txt' 2026-03-25T09:06:04,725 adding 'evalscope/third_party/longbench_write/resources/longbench_write.jsonl' 2026-03-25T09:06:04,729 adding 'evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl' 2026-03-25T09:06:04,731 adding 'evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl' 2026-03-25T09:06:04,732 adding 'evalscope/third_party/longbench_write/tools/__init__.py' 2026-03-25T09:06:04,734 adding 'evalscope/third_party/longbench_write/tools/data_etl.py' 2026-03-25T09:06:04,736 adding 'evalscope/third_party/longbench_write/tools/openai_api.py' 2026-03-25T09:06:04,738 adding 'evalscope/third_party/thinkbench/__init__.py' 2026-03-25T09:06:04,740 adding 'evalscope/third_party/thinkbench/eval.py' 2026-03-25T09:06:04,742 adding 'evalscope/third_party/thinkbench/infer.py' 2026-03-25T09:06:04,743 adding 'evalscope/third_party/thinkbench/resources/critique_template.txt' 2026-03-25T09:06:04,745 adding 'evalscope/third_party/thinkbench/resources/reformat_template.txt' 2026-03-25T09:06:04,746 adding 'evalscope/third_party/thinkbench/tools/__init__.py' 2026-03-25T09:06:04,747 adding 'evalscope/third_party/thinkbench/tools/llm.py' 2026-03-25T09:06:04,748 adding 'evalscope/third_party/thinkbench/tools/utils.py' 2026-03-25T09:06:04,751 adding 'evalscope/third_party/toolbench_static/README.md' 2026-03-25T09:06:04,752 adding 'evalscope/third_party/toolbench_static/__init__.py' 2026-03-25T09:06:04,753 adding 'evalscope/third_party/toolbench_static/config_default.json' 2026-03-25T09:06:04,754 adding 'evalscope/third_party/toolbench_static/config_default.yaml' 2026-03-25T09:06:04,756 adding 'evalscope/third_party/toolbench_static/eval.py' 2026-03-25T09:06:04,758 adding 'evalscope/third_party/toolbench_static/infer.py' 2026-03-25T09:06:04,759 adding 'evalscope/third_party/toolbench_static/requirements.txt' 2026-03-25T09:06:04,760 adding 'evalscope/third_party/toolbench_static/toolbench_static.py' 2026-03-25T09:06:04,762 adding 'evalscope/third_party/toolbench_static/llm/__init__.py' 2026-03-25T09:06:04,763 adding 'evalscope/third_party/toolbench_static/llm/swift_infer.py' 2026-03-25T09:06:04,765 adding 'evalscope/utils/__init__.py' 2026-03-25T09:06:04,766 adding 'evalscope/utils/argument_utils.py' 2026-03-25T09:06:04,768 adding 'evalscope/utils/chat_service.py' 2026-03-25T09:06:04,771 adding 'evalscope/utils/code_utils.py' 2026-03-25T09:06:04,772 adding 'evalscope/utils/deprecation_utils.py' 2026-03-25T09:06:04,774 adding 'evalscope/utils/function_utils.py' 2026-03-25T09:06:04,776 adding 'evalscope/utils/import_utils.py' 2026-03-25T09:06:04,778 adding 'evalscope/utils/io_utils.py' 2026-03-25T09:06:04,780 adding 'evalscope/utils/json_schema.py' 2026-03-25T09:06:04,782 adding 'evalscope/utils/logger.py' 2026-03-25T09:06:04,783 adding 'evalscope/utils/model_utils.py' 2026-03-25T09:06:04,785 adding 'evalscope/utils/multi_choices.py' 2026-03-25T09:06:04,787 adding 'evalscope/utils/ner.py' 2026-03-25T09:06:04,788 adding 'evalscope/utils/resource_utils.py' 2026-03-25T09:06:04,790 adding 'evalscope/utils/url_utils.py' 2026-03-25T09:06:04,791 adding 'evalscope/utils/doc_utils/__init__.py' 2026-03-25T09:06:04,795 adding 'evalscope/utils/doc_utils/benchmark_stats.py' 2026-03-25T09:06:04,797 adding 'evalscope/utils/doc_utils/generate_dataset_md.py' 2026-03-25T09:06:04,800 adding 'evalscope/utils/doc_utils/readme_generator.py' 2026-03-25T09:06:04,802 adding 'evalscope/utils/doc_utils/translate_description.py' 2026-03-25T09:06:04,803 adding 'evalscope/utils/tqdm_utils/__init__.py' 2026-03-25T09:06:04,805 adding 'evalscope/utils/tqdm_utils/progress_tracker.py' 2026-03-25T09:06:04,807 adding 'evalscope/utils/tqdm_utils/tqdm_logging.py' 2026-03-25T09:06:04,810 adding 'evalscope-1.5.1.dist-info/licenses/LICENSE' 2026-03-25T09:06:04,815 adding 'evalscope-1.5.1.dist-info/METADATA' 2026-03-25T09:06:04,816 adding 'evalscope-1.5.1.dist-info/WHEEL' 2026-03-25T09:06:04,817 adding 'evalscope-1.5.1.dist-info/entry_points.txt' 2026-03-25T09:06:04,818 adding 'evalscope-1.5.1.dist-info/top_level.txt' 2026-03-25T09:06:04,833 adding 'evalscope-1.5.1.dist-info/RECORD' 2026-03-25T09:06:04,866 removing build/bdist.linux-armv7l/wheel 2026-03-25T09:06:05,256 Building wheel for evalscope (pyproject.toml): finished with status 'done' 2026-03-25T09:06:05,614 Created wheel for evalscope: filename=evalscope-1.5.1-py3-none-any.whl size=2088001 sha256=a95eabf175191595bfeebe4e6face613a6c137a65067e8fd7dca613567bba440 2026-03-25T09:06:05,615 Stored in directory: /tmp/pip-ephem-wheel-cache-03x8psey/wheels/54/ac/82/cce750b22e67d58f08ee1e3b08c84fa66b0c356d10a5aabe7a 2026-03-25T09:06:05,662 Successfully built evalscope 2026-03-25T09:06:05,714 Removed build tracker: '/tmp/pip-build-tracker-bzp7w95z'