2026-02-08T20:41:33,830 Created temporary directory: /tmp/pip-ephem-wheel-cache-7k81_u12 2026-02-08T20:41:33,831 Created temporary directory: /tmp/pip-build-tracker-g2uz737x 2026-02-08T20:41:33,832 Initialized build tracking at /tmp/pip-build-tracker-g2uz737x 2026-02-08T20:41:33,832 Created build tracker: /tmp/pip-build-tracker-g2uz737x 2026-02-08T20:41:33,833 Entered build tracker: /tmp/pip-build-tracker-g2uz737x 2026-02-08T20:41:33,834 Created temporary directory: /tmp/pip-wheel-q98ytt1_ 2026-02-08T20:41:33,836 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-02-08T20:41:33,839 Created temporary directory: /tmp/pip-ephem-wheel-cache-2yi8tsav 2026-02-08T20:41:33,862 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-02-08T20:41:33,866 2 location(s) to search for versions of evalscope: 2026-02-08T20:41:33,866 * https://pypi.org/simple/evalscope/ 2026-02-08T20:41:33,866 * https://www.piwheels.org/simple/evalscope/ 2026-02-08T20:41:33,866 Fetching project page and analyzing links: https://pypi.org/simple/evalscope/ 2026-02-08T20:41:33,867 Getting page https://pypi.org/simple/evalscope/ 2026-02-08T20:41:33,869 Found index url https://pypi.org/simple 2026-02-08T20:41:34,087 Fetched page https://pypi.org/simple/evalscope/ as application/vnd.pypi.simple.v1+json 2026-02-08T20:41:34,104 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/94/d2/dc5e929802776bf4e662a46d794b765876bb93e2300189cafd113cac74d6/evalscope-0.5.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,105 Found link https://files.pythonhosted.org/packages/e5/70/45a5dad24b1fa535bff194b99a4668e7f5f328be972b51b3b91eafb4cdbb/evalscope-0.5.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0rc0 2026-02-08T20:41:34,106 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ac/eb/341fe367df2bc9a0ae7ef5eb2037a5d549d9bb8c0d7ad84844c9926e0947/evalscope-0.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,107 Found link https://files.pythonhosted.org/packages/3b/3f/585f7f1cf2ce90b234c1cfd654bb26977be4d889c0e5eed0122cb3024c45/evalscope-0.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0 2026-02-08T20:41:34,107 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ad/58/c0ce004159cfac6df9b5736c011576d52bdceb778943de8a022a419d86eb/evalscope-0.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,108 Found link https://files.pythonhosted.org/packages/94/33/7ad2f2285f5b68953ad4466d23dc5de1a2e57e7cc63d5924ab0e84d156ba/evalscope-0.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.2 2026-02-08T20:41:34,109 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/59/a9e1c4cf88018ece1fdd8d8b7fa976e28f9b4b181ef7ceb74a5e2db533ab/evalscope-0.5.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,110 Found link https://files.pythonhosted.org/packages/04/57/9ca7b1fd68f2acc32802b22236c83b597a5690a483d5938d38183b549d22/evalscope-0.5.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.3 2026-02-08T20:41:34,111 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/fa/c8/fcbaf01b7486c3b29b7790167c2cda560f00a04d100cec808ee9a3349ca0/evalscope-0.5.4-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,111 Found link https://files.pythonhosted.org/packages/8c/cc/abd412bad714c0266be1f0159b49a817d45db099c3bd031134d223589e93/evalscope-0.5.4.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.4 2026-02-08T20:41:34,112 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4d/da/d705d457683223f289e8c5d6cadbaad15d15e098692c53ea7e6196a94373/evalscope-0.5.5rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,113 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7e/4c/414dd545a1833245a53797d70a35a78ae1b9cfbcc81b35a4e1763e678437/evalscope-0.5.5rc1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,114 Found link https://files.pythonhosted.org/packages/56/5f/aa7fcf62102694dd66b69e88cb2523094bf04b53f785854e17ee6b7234a7/evalscope-0.5.5rc1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5rc1 2026-02-08T20:41:34,114 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3e/d0/91a7a1f95f3fa19dd8d4e434dc711768abe5c006f32a514a8602b429e049/evalscope-0.5.5-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,115 Found link https://files.pythonhosted.org/packages/04/7e/a7f065d6ebac15fe172d3b0906ff5b26a71df5a9975c0f14978044211cf1/evalscope-0.5.5.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5 2026-02-08T20:41:34,116 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/89/b812b01a5ed91fc079dea052e1341860cd65d25d463c75c90e5a30ab6ae8/evalscope-0.6.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,117 Found link https://files.pythonhosted.org/packages/d5/ad/57a2d5f33c5b7d5066f8a5dcb1d34f14bf246112f7900228b9f2fb41b21b/evalscope-0.6.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0rc0 2026-02-08T20:41:34,117 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/25/3f03d9d924f1b65610724c9f10727b48ef952afdfee8687a461949c88c78/evalscope-0.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,118 Found link https://files.pythonhosted.org/packages/9d/ac/1f432bcc46ccb8348b869b80d2aaabde5e583b370418ba48714083e31068/evalscope-0.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0 2026-02-08T20:41:34,119 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7b/e1/42c9e58b4690f23ef48bce841fc95cbf0744c1579cdc80fa6f33b0453344/evalscope-0.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,120 Found link https://files.pythonhosted.org/packages/cd/d6/1d9d2db9acda6e61d4210074f51e7e3dee4d0212fabdd94999105db23eed/evalscope-0.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.1 2026-02-08T20:41:34,120 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/92/6fff1dfd12a4f73489c451dab56351b6a3c1095a92bb55025c2934fc625d/evalscope-0.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,121 Found link https://files.pythonhosted.org/packages/fe/3c/500d655a27ca80e1aba3fb2b1e8886951942732b869ad1516422d9e6ac97/evalscope-0.7.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.0 2026-02-08T20:41:34,122 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a5/3a/ae22d4d9a44ad37ac887da21b948bf6e784307001a09802d36c5bf04018d/evalscope-0.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,123 Found link https://files.pythonhosted.org/packages/88/47/69d067f0d3d784a7975cc4ea067fdc55c8f785ece6fbe86e5e21edc8b36f/evalscope-0.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.1 2026-02-08T20:41:34,123 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/23/acbcbc2ed6f00d3bb81220054651e6a2b1b02714d42b1aeb018a2f5574c4/evalscope-0.7.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,125 Found link https://files.pythonhosted.org/packages/a4/38/9126b9329cd2ad6ccfd4a73f04402bf71b65921564c3c12cb0d62b3b421d/evalscope-0.7.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.2 2026-02-08T20:41:34,125 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/1f/9598183da3026696adc19a6727a19d57f9abbf4fd7aeb20cdf12faee7693/evalscope-0.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,126 Found link https://files.pythonhosted.org/packages/63/7e/fc44d30e3a83dbc3070396d78279c9e3e8716cbc4cc05811a70f1b463bfb/evalscope-0.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.0 2026-02-08T20:41:34,127 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/37/a9/a9c6cda95a6b837c9303e9a1598999f2c4e605abb507365c6ff70b372a5a/evalscope-0.8.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,128 Found link https://files.pythonhosted.org/packages/63/5a/bbc230bb06a7bc40dd3985dde4615a8a71f111bf95761e40f6f0f8a7e1a6/evalscope-0.8.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.1 2026-02-08T20:41:34,128 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/07/39/a1eb2efed77d21e8253daa37f082669a004c4d288813a2ee9e15398f2e80/evalscope-0.8.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,129 Found link https://files.pythonhosted.org/packages/57/bc/5ff6d538e459d8b3c567577c991eec72ab6adbc19497dc164d96cd634d2f/evalscope-0.8.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.2 2026-02-08T20:41:34,130 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f6/29/81a188c03b272bf7def0c9bb556b9e9465adbc68ecb18907b636f1e8cbd7/evalscope-0.9.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,131 Found link https://files.pythonhosted.org/packages/c0/c2/19da3be1fbd6b548ecdc877d47269e92503518de53acbbfe96120c5c9753/evalscope-0.9.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.9.0 2026-02-08T20:41:34,131 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/6d/45d7407f31d6878494c3b493d7e49a8318b1839161839293c1a2e66aadcf/evalscope-0.10.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,132 Found link https://files.pythonhosted.org/packages/16/b1/b6cef37a0dd0acfa5873ca4763ac6b4ac4b19a0b15ca6bdc8f30d4443682/evalscope-0.10.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.0 2026-02-08T20:41:34,133 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a7/8f/907045290d359b4e07e7acc96ec60173380748b49e9f3c91b7ddd8e8342d/evalscope-0.10.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,134 Found link https://files.pythonhosted.org/packages/e3/d3/dda2ac0513904bff8aa0c2efe77bc851d3acc6d514707db36648e4a903d2/evalscope-0.10.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.1 2026-02-08T20:41:34,134 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d6/e5/852326943d86c85b5ca6b548a5f3753c217b771d93968c61ec2ca46ee0b1/evalscope-0.11.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,135 Found link https://files.pythonhosted.org/packages/5f/3f/e2816b99487b4ead257453a242b5282a85881f9d26fbb5efb21cc5cf88fa/evalscope-0.11.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.11.0 2026-02-08T20:41:34,136 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7d/8e/f9eceecf8bc7d740603f915eef7fab3e9d657a01f5de2c523e531445299c/evalscope-0.12.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,137 Found link https://files.pythonhosted.org/packages/33/82/7765517ff80a73eac7465369767aa45a5be3d5e0fb7c4f4a3ff743811f8c/evalscope-0.12.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.0 2026-02-08T20:41:34,138 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ee/4c/84a5e18985e149eb4283fef2b58a81ff2cec2d099c017684938ec3a3935f/evalscope-0.12.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,138 Found link https://files.pythonhosted.org/packages/b2/24/83b5530319bdb02142289e04640c6008dda1b988043c42feb7f0a5eab3b7/evalscope-0.12.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.1 2026-02-08T20:41:34,139 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/a5/65faf0660cd8ae2660354b002b2b3a586b9419bc894120fea97efd506cb6/evalscope-0.13.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,140 Found link https://files.pythonhosted.org/packages/d0/39/d5eb469a94191760c61d1bfcb235e28be1d2a080d88b44792f53d76c45d1/evalscope-0.13.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.0 2026-02-08T20:41:34,140 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f7/fc/91b7b4379131d2e15ca1f575f533daea589357293e492ef2c93e0aac6b55/evalscope-0.13.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,141 Found link https://files.pythonhosted.org/packages/9e/a3/33b4ce270d5500fe7c8f32fa2160749b607f248141328d4785b6032c8f2a/evalscope-0.13.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.1 2026-02-08T20:41:34,142 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e6/66/83752c305879cf3dea54170398839ab08a046485bc18c41a34f41aca11ab/evalscope-0.13.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,143 Found link https://files.pythonhosted.org/packages/b7/8d/711ae30b80329e2dd7da760c001d9a5b45e4d8e5292f317f1ea10c744c29/evalscope-0.13.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.2 2026-02-08T20:41:34,143 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a8/b4/b22c7e52e6a7381333bdfa0bf92fae0258e7812064b1e208cbab56a62d08/evalscope-0.14.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,144 Found link https://files.pythonhosted.org/packages/40/44/7db2cb90e6ca0c9db92124f10c0273d7c6ef4b81523e1c98c34a88e67faa/evalscope-0.14.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.14.0 2026-02-08T20:41:34,145 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b2/47/976633e0f29b58b8c9f3faacf7373a2da734771a7915ac45d721d96e0ad7/evalscope-0.15.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,146 Found link https://files.pythonhosted.org/packages/b8/e3/bd534d69328afa98bdd497b5eaaf4b7416da9e8f56109d045c332b17d016/evalscope-0.15.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.0 2026-02-08T20:41:34,146 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5a/54/237aee5f0317fa04450c9ad67c3bf28b730460b2e6dc1e65b74b4bf2cd67/evalscope-0.15.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,147 Found link https://files.pythonhosted.org/packages/7a/d4/8b87e83a3a08f87ce5b4325f0cd5ab9bc54d296dc3f3492a1d3216a97a6d/evalscope-0.15.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.1 2026-02-08T20:41:34,148 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/1e/b5/6fec1cbb02a41ab79430eb3fb51eea7709525df0f9753ae2c54fbc4633f1/evalscope-0.16.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,149 Found link https://files.pythonhosted.org/packages/99/69/63997bedfa6fd33af671b539f77c375b111017eff23313f76693e24b8872/evalscope-0.16.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.0 2026-02-08T20:41:34,149 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ea/76/639641578bdc92d25211eb8dae24d1fae19e40cd1649e17946f6ad8a5dc3/evalscope-0.16.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,150 Found link https://files.pythonhosted.org/packages/94/13/616bf9c33b0769db44a2bae32b54d33cf7129874392682aa76326a51e085/evalscope-0.16.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.1 2026-02-08T20:41:34,151 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f1/d3/9e83cc1b5a132342a05ef6ee79018bd7561f90a6406dc9db5c85fb0a281f/evalscope-0.16.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,152 Found link https://files.pythonhosted.org/packages/94/47/edd3faaddd321e464ae72db6e7bf82246dd4ca2f0f67127ca8c427cac664/evalscope-0.16.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.2 2026-02-08T20:41:34,152 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5d/4e/34c56086cdfe7d1ad912241af819b3e67f20373f382016e33ac89dd43dde/evalscope-0.16.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,153 Found link https://files.pythonhosted.org/packages/22/07/14603c038a8019472881f574f1c47bd4193481f256b6dc702c65d8b8f984/evalscope-0.16.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.3 2026-02-08T20:41:34,154 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f3/e2/104156f74779cf2849f53566bba585015492d7320c36a7cb76c7196b0ef5/evalscope-0.17.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,154 Found link https://files.pythonhosted.org/packages/c5/d0/b66d1b97ec67d65b6df54f18682e30cfeb6401604b93c9b1bdd1e97b8d79/evalscope-0.17.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.17.0 2026-02-08T20:41:34,155 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a9/b5/630d2c5dc5c32e9fbad5034e04d8aba6f6461dc08f255df77dd8d463857f/evalscope-0.17.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,156 Found link https://files.pythonhosted.org/packages/38/88/326e48929bb9577a6a36e07afa65bbf6bd870c1b644f82e5713874ae3238/evalscope-0.17.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 0.17.1 2026-02-08T20:41:34,157 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/21/18/9a6c208d2bc119ac67b5537b60851c6bccc99f25229eaa96cbe6e38721bd/evalscope-1.0.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,157 Found link https://files.pythonhosted.org/packages/d8/44/3d727dd28fcc50317c95d12e5bb850ed2863105f812373ac120877875434/evalscope-1.0.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.0 2026-02-08T20:41:34,158 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e1/8a/50456fa7dd77be4c3a0ea0b3d96cb7ae5b2557454fdd35cbf0009a9d792f/evalscope-1.0.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,159 Found link https://files.pythonhosted.org/packages/8d/52/93569134b3d8dea2a0d0bc2134c03056f0ffee1840f7299eb83d475457df/evalscope-1.0.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.1 2026-02-08T20:41:34,160 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/22/f6/32fb0fef08a6c881ac840117455a5697a0d63226db8a24cce5208b720829/evalscope-1.0.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,161 Found link https://files.pythonhosted.org/packages/3d/f5/025baefe432d9af1ed845ab5738b638b7b97f2dd3767e9478b8eee10966f/evalscope-1.0.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.2 2026-02-08T20:41:34,161 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/6e/12/f31dbb18daa7e3c6cfecf856ddc323a303e115271166c100e06af58ea6b6/evalscope-1.1.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,162 Found link https://files.pythonhosted.org/packages/b4/98/7449040e89beaa4556bf35ba1e171b2d4955ff15b2b4c43f2ff55b048aeb/evalscope-1.1.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.1.0 2026-02-08T20:41:34,162 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/79/d1/afc8b23345ad8f11a5e1f8c6c3112a8679604d833bdbc02aa06787952fd1/evalscope-1.1.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,163 Found link https://files.pythonhosted.org/packages/e8/3f/d67b73ce19789e914d6a78740fa7bfd0c07f161bc239b92cd3c26541f2fb/evalscope-1.1.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.1.1 2026-02-08T20:41:34,164 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/82/0a/d984751e4b5751f064209da3745b6aff6cc0f1d9d93f13cab0a1017a8639/evalscope-1.2.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,165 Found link https://files.pythonhosted.org/packages/7f/f9/0a2a069ee4500666ec5c3d10b302fc71d176c17bbe70447336e610953e1a/evalscope-1.2.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.2.0 2026-02-08T20:41:34,165 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/58/823d009dfa49cfdc750ac257e744eb97456d69528a09ac108ee8cab15316/evalscope-1.3.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,166 Found link https://files.pythonhosted.org/packages/42/5a/a309f7ce1fbe2b39e4b0a1f26cfcd7864eaa90e4792a5290e8cdd2ce3b4f/evalscope-1.3.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.3.0 2026-02-08T20:41:34,167 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c6/57/005d3ef07ecdd5163e1bbed3413537b653b637d9c8b62a2bcdd97546607b/evalscope-1.4.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,167 Found link https://files.pythonhosted.org/packages/11/d5/268f610ac7db9c5c2109936f65ac4df8b4bc52106ed4369509d3b3c4f127/evalscope-1.4.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.0 2026-02-08T20:41:34,168 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b8/96/a2b4fabf6fa6cf09ac6669aa01dc483ac53576a8fd7c2c4be21ea281840f/evalscope-1.4.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,169 Found link https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.1 2026-02-08T20:41:34,170 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/bf/0f/97e68e89f7925160df49ea1dbbcef7f3f8e808a51756c199aaaadc75f5a5/evalscope-1.4.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,171 Found link https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.2 2026-02-08T20:41:34,172 Fetching project page and analyzing links: https://www.piwheels.org/simple/evalscope/ 2026-02-08T20:41:34,172 Getting page https://www.piwheels.org/simple/evalscope/ 2026-02-08T20:41:34,173 Found index url https://www.piwheels.org/simple 2026-02-08T20:41:34,332 Fetched page https://www.piwheels.org/simple/evalscope/ as text/html 2026-02-08T20:41:34,342 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.4.1-py3-none-any.whl#sha256=d74ddb7150b19de1eb995026c697b598cdfc0b75fee3a2d110219256c4241688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,342 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.4.0-py3-none-any.whl#sha256=307c9ed70f562ba776fdf9b0136b35fe4e361b1b974bb0f8ca39e425f4738e6e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,343 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.3.0-py3-none-any.whl#sha256=d9949bacf6c08b5ab341f872c6ee4b31995d724fe963f64ddf3c129e0e39145e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,343 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.2.0-py3-none-any.whl#sha256=5969cf4a3132f6a29f9ef39aa35a8a3be24f114bbcd7a77a19c145bbec432be9 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,344 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.1-py3-none-any.whl#sha256=5bfb8c55f45e1bcd5df5cb0cd4ecceae2ccb93cd09c9477b0b0e6c097ecb1d1f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-02-08T20:41:34,344 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.0-py3-none-any.whl#sha256=ca7c951fa316bb7ec6fb0e38ad6503632067f07f469e37828fe9e39e51591994 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,345 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.2-py3-none-any.whl#sha256=e14016040022bcd666c05ffa806f3713775e9de19a98290bd2a0a36e5c435409 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,345 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.1-py3-none-any.whl#sha256=61b9c14f3409804d84ddf31b749b587c7c7441f9d1aa0d453991b1bd0bbda74c (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,346 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.0-py3-none-any.whl#sha256=ea1044755177db8d9e94cc4ccafbd56e35b173a8425db7d1653dd9f66e1463ad (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,346 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.1-py3-none-any.whl#sha256=aa0054b8aed77684e56d0836d4568080fa4827799a16d62fef6ec13802cd4050 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-02-08T20:41:34,347 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.0-py3-none-any.whl#sha256=e7a2549f9f5ac5b0061d01f8ca99900e06f6340b91d3e546163423b896287862 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,348 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.3-py3-none-any.whl#sha256=75dcbd7cc0a0336f68d407a3925fe065ecaaea0fa9030ceebfaa3f22f0f3b417 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,348 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.2-py3-none-any.whl#sha256=14e00f16b506b723a359799e3cb271370a3da768da3667563612e458982d6847 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,349 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.1-py3-none-any.whl#sha256=296b90c06a33f69c9e7049a768f16929acd1279af0830f47f18fc598560d0e13 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,349 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.0-py3-none-any.whl#sha256=0a6f11a7a4d564d4a1dca5fe424cda19608cf947b9ab739079f5b60842651c7e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,350 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.1-py3-none-any.whl#sha256=504717cfc96a8fdbb0d4bc080d0292e80423891b658d61283385737794c4e95f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,350 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.0-py3-none-any.whl#sha256=263020a7b7f7788e17515c738604ea64904ca0da34d09664d2b7ee16d3522a00 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,351 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.14.0-py3-none-any.whl#sha256=cdf26beb4b188e1dd5e09feeb1832ccf909c9d2a535eb76e702fb3c66fc65688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,351 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.2-py3-none-any.whl#sha256=7d360083cf9dd960996847cde2140085d0830ddc8b12aa8007b4f72d395c5211 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,352 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.1-py3-none-any.whl#sha256=6b26f11daca05d6b56da3cb1b78ea0a8f28de94901b7520171c66e2a16b1c638 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,352 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.0-py3-none-any.whl#sha256=8ca93c4011f04e35df239b5486248e03100a9c9e265664e01330ef4b1cc691f7 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,353 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.1-py3-none-any.whl#sha256=72e5815923789c6cab6c32425477393bf645f670edcc493f8e4dec6e93f3da23 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,353 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.0-py3-none-any.whl#sha256=f26c7317a5bacac8527806723d45bac74563a2608ed761d822ace03be3a5e45f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,354 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.11.0-py3-none-any.whl#sha256=6e7d3242c5bf97b54d24644a1300575fdf41c1d90eef7c1344c20bf0d1518671 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,354 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.1-py3-none-any.whl#sha256=a7e645410333c1aaec5a024be4c02166dfb0d6b4635b020c181ce672d31102d2 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,355 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.0-py3-none-any.whl#sha256=494f1742178def5f86e552c004562c877cd8b8b6f5d4267d29f961f20e8bf569 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,355 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.9.0-py3-none-any.whl#sha256=26fd59fd387850f3e84f995fd38357994460e71d0124bc09954ca8837027ec52 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,356 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.2-py3-none-any.whl#sha256=3c4ed25b5c39d7a706927607e552e6ced6d50532bc442bba74979f70720f4894 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,356 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.1-py3-none-any.whl#sha256=64e9306453082a95b0c0507d6fd1dbf50ed0ab210ebcfebdcb52c534769d1856 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,357 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.0-py3-none-any.whl#sha256=7758550ef3406d9c6096de05909cc1c97ed0c2f7be6edaf3fe5344244e34f233 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,357 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.2-py3-none-any.whl#sha256=21a1f4448ffca926853b7516b6375f801ef6a8067501dde546e7d794fb759f20 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,358 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.1-py3-none-any.whl#sha256=b959f2d9850544f2d96ac765b06ed112cf579a11899618fcc5245407f4e33843 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,358 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.0-py3-none-any.whl#sha256=2deddeb89bfb6fa02844b72a2be24627526136cbd45315e8e4dadb672d53c9fc (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-02-08T20:41:34,359 Skipping link: not a file: https://www.piwheels.org/simple/evalscope/ 2026-02-08T20:41:34,359 Skipping link: not a file: https://pypi.org/simple/evalscope/ 2026-02-08T20:41:34,383 Given no hashes to check 1 links for project 'evalscope': discarding no candidates 2026-02-08T20:41:34,401 Collecting evalscope==1.4.2 2026-02-08T20:41:34,403 Created temporary directory: /tmp/pip-unpack-mo44mz16 2026-02-08T20:41:34,538 Downloading evalscope-1.4.2.tar.gz (944 kB) 2026-02-08T20:41:36,104 Added evalscope==1.4.2 from https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz to build tracker '/tmp/pip-build-tracker-g2uz737x' 2026-02-08T20:41:36,110 Created temporary directory: /tmp/pip-build-env-e2ofrwgs 2026-02-08T20:41:36,115 Installing build dependencies: started 2026-02-08T20:41:36,117 Running command pip subprocess to install build dependencies 2026-02-08T20:41:37,279 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-02-08T20:41:37,911 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-02-08T20:41:37,935 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-02-08T20:41:39,677 Collecting setuptools>=69 2026-02-08T20:41:39,697 Using cached setuptools-82.0.0-py3-none-any.whl (1.0 MB) 2026-02-08T20:41:39,972 Collecting wheel 2026-02-08T20:41:39,978 Using cached wheel-0.46.3-py3-none-any.whl (30 kB) 2026-02-08T20:41:40,158 Collecting packaging>=24.0 2026-02-08T20:41:40,163 Using cached packaging-26.0-py3-none-any.whl (74 kB) 2026-02-08T20:41:43,177 Installing collected packages: setuptools, packaging, wheel 2026-02-08T20:41:46,688 Creating /tmp/pip-build-env-e2ofrwgs/overlay/local/bin 2026-02-08T20:41:46,691 changing mode of /tmp/pip-build-env-e2ofrwgs/overlay/local/bin/wheel to 755 2026-02-08T20:41:46,712 Successfully installed packaging-26.0 setuptools-82.0.0 wheel-0.46.3 2026-02-08T20:41:47,008 Installing build dependencies: finished with status 'done' 2026-02-08T20:41:47,014 Getting requirements to build wheel: started 2026-02-08T20:41:47,016 Running command Getting requirements to build wheel 2026-02-08T20:41:47,732 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-02-08T20:41:47,732 !! 2026-02-08T20:41:47,733 ******************************************************************************** 2026-02-08T20:41:47,734 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-02-08T20:41:47,735 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-02-08T20:41:47,736 or your builds will no longer be supported. 2026-02-08T20:41:47,737 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:47,737 ******************************************************************************** 2026-02-08T20:41:47,738 !! 2026-02-08T20:41:47,739 corresp(dist, value, root_dir) 2026-02-08T20:41:47,816 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-02-08T20:41:47,817 !! 2026-02-08T20:41:47,818 ******************************************************************************** 2026-02-08T20:41:47,819 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-02-08T20:41:47,820 License :: OSI Approved :: Apache Software License 2026-02-08T20:41:47,821 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:47,821 ******************************************************************************** 2026-02-08T20:41:47,822 !! 2026-02-08T20:41:47,823 dist._finalize_license_expression() 2026-02-08T20:41:47,824 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-02-08T20:41:47,825 !! 2026-02-08T20:41:47,826 ******************************************************************************** 2026-02-08T20:41:47,827 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-02-08T20:41:47,828 License :: OSI Approved :: Apache Software License 2026-02-08T20:41:47,830 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:47,830 ******************************************************************************** 2026-02-08T20:41:47,831 !! 2026-02-08T20:41:47,832 self._finalize_license_expression() 2026-02-08T20:41:47,832 running egg_info 2026-02-08T20:41:47,837 writing evalscope.egg-info/PKG-INFO 2026-02-08T20:41:47,857 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-02-08T20:41:47,859 writing entry points to evalscope.egg-info/entry_points.txt 2026-02-08T20:41:47,870 writing requirements to evalscope.egg-info/requires.txt 2026-02-08T20:41:47,871 writing top-level names to evalscope.egg-info/top_level.txt 2026-02-08T20:41:48,132 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:48,172 reading manifest template 'MANIFEST.in' 2026-02-08T20:41:48,551 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-02-08T20:41:48,556 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-02-08T20:41:48,561 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-02-08T20:41:48,566 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-02-08T20:41:48,567 adding license file 'LICENSE' 2026-02-08T20:41:48,612 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:48,715 Getting requirements to build wheel: finished with status 'done' 2026-02-08T20:41:48,718 Created temporary directory: /tmp/pip-modern-metadata-hsyasxge 2026-02-08T20:41:48,720 Preparing metadata (pyproject.toml): started 2026-02-08T20:41:48,721 Running command Preparing metadata (pyproject.toml) 2026-02-08T20:41:49,373 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-02-08T20:41:49,373 !! 2026-02-08T20:41:49,375 ******************************************************************************** 2026-02-08T20:41:49,375 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-02-08T20:41:49,376 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-02-08T20:41:49,377 or your builds will no longer be supported. 2026-02-08T20:41:49,378 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:49,378 ******************************************************************************** 2026-02-08T20:41:49,380 !! 2026-02-08T20:41:49,380 corresp(dist, value, root_dir) 2026-02-08T20:41:49,454 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-02-08T20:41:49,455 !! 2026-02-08T20:41:49,456 ******************************************************************************** 2026-02-08T20:41:49,457 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-02-08T20:41:49,458 License :: OSI Approved :: Apache Software License 2026-02-08T20:41:49,459 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:49,459 ******************************************************************************** 2026-02-08T20:41:49,461 !! 2026-02-08T20:41:49,461 dist._finalize_license_expression() 2026-02-08T20:41:49,463 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-02-08T20:41:49,463 !! 2026-02-08T20:41:49,464 ******************************************************************************** 2026-02-08T20:41:49,465 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-02-08T20:41:49,466 License :: OSI Approved :: Apache Software License 2026-02-08T20:41:49,467 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:49,467 ******************************************************************************** 2026-02-08T20:41:49,468 !! 2026-02-08T20:41:49,468 self._finalize_license_expression() 2026-02-08T20:41:49,469 running dist_info 2026-02-08T20:41:49,479 creating /tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info 2026-02-08T20:41:49,480 writing /tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/PKG-INFO 2026-02-08T20:41:49,500 writing dependency_links to /tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/dependency_links.txt 2026-02-08T20:41:49,501 writing entry points to /tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/entry_points.txt 2026-02-08T20:41:49,512 writing requirements to /tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/requires.txt 2026-02-08T20:41:49,513 writing top-level names to /tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/top_level.txt 2026-02-08T20:41:49,515 writing manifest file '/tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:49,704 reading manifest file '/tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:49,706 reading manifest template 'MANIFEST.in' 2026-02-08T20:41:50,025 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-02-08T20:41:50,028 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-02-08T20:41:50,032 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-02-08T20:41:50,035 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-02-08T20:41:50,036 adding license file 'LICENSE' 2026-02-08T20:41:50,070 writing manifest file '/tmp/pip-modern-metadata-hsyasxge/evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:50,072 creating '/tmp/pip-modern-metadata-hsyasxge/evalscope-1.4.2.dist-info' 2026-02-08T20:41:50,203 Preparing metadata (pyproject.toml): finished with status 'done' 2026-02-08T20:41:50,211 Source in /tmp/pip-wheel-q98ytt1_/evalscope_4525d441a62e45da9e51a53e7253afc7 has version 1.4.2, which satisfies requirement evalscope==1.4.2 from https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz 2026-02-08T20:41:50,212 Removed evalscope==1.4.2 from https://files.pythonhosted.org/packages/97/dd/6e1cda8f161363ef806929b4989a0e9a7df733b7b82361ef1897315c12d0/evalscope-1.4.2.tar.gz from build tracker '/tmp/pip-build-tracker-g2uz737x' 2026-02-08T20:41:50,223 Created temporary directory: /tmp/pip-unpack-dlerydqs 2026-02-08T20:41:50,224 Building wheels for collected packages: evalscope 2026-02-08T20:41:50,228 Created temporary directory: /tmp/pip-wheel-431hf6ga 2026-02-08T20:41:50,228 Destination directory: /tmp/pip-wheel-431hf6ga 2026-02-08T20:41:50,231 Building wheel for evalscope (pyproject.toml): started 2026-02-08T20:41:50,232 Running command Building wheel for evalscope (pyproject.toml) 2026-02-08T20:41:50,881 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-02-08T20:41:50,881 !! 2026-02-08T20:41:50,882 ******************************************************************************** 2026-02-08T20:41:50,883 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-02-08T20:41:50,884 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-02-08T20:41:50,885 or your builds will no longer be supported. 2026-02-08T20:41:50,885 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:50,886 ******************************************************************************** 2026-02-08T20:41:50,887 !! 2026-02-08T20:41:50,887 corresp(dist, value, root_dir) 2026-02-08T20:41:50,962 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-02-08T20:41:50,963 !! 2026-02-08T20:41:50,964 ******************************************************************************** 2026-02-08T20:41:50,964 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-02-08T20:41:50,965 License :: OSI Approved :: Apache Software License 2026-02-08T20:41:50,966 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:50,966 ******************************************************************************** 2026-02-08T20:41:50,967 !! 2026-02-08T20:41:50,967 dist._finalize_license_expression() 2026-02-08T20:41:50,971 /tmp/pip-build-env-e2ofrwgs/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:765: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-02-08T20:41:50,971 !! 2026-02-08T20:41:50,972 ******************************************************************************** 2026-02-08T20:41:50,973 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-02-08T20:41:50,974 License :: OSI Approved :: Apache Software License 2026-02-08T20:41:50,975 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-08T20:41:50,975 ******************************************************************************** 2026-02-08T20:41:50,977 !! 2026-02-08T20:41:50,977 self._finalize_license_expression() 2026-02-08T20:41:50,978 running bdist_wheel 2026-02-08T20:41:50,990 running build 2026-02-08T20:41:50,991 running build_py 2026-02-08T20:41:50,997 creating build/lib/evalscope 2026-02-08T20:41:50,999 copying evalscope/version.py -> build/lib/evalscope 2026-02-08T20:41:51,000 copying evalscope/__init__.py -> build/lib/evalscope 2026-02-08T20:41:51,002 copying evalscope/config.py -> build/lib/evalscope 2026-02-08T20:41:51,005 copying evalscope/run.py -> build/lib/evalscope 2026-02-08T20:41:51,007 copying evalscope/arguments.py -> build/lib/evalscope 2026-02-08T20:41:51,009 copying evalscope/constants.py -> build/lib/evalscope 2026-02-08T20:41:51,012 creating build/lib/evalscope/third_party 2026-02-08T20:41:51,013 copying evalscope/third_party/__init__.py -> build/lib/evalscope/third_party 2026-02-08T20:41:51,015 creating build/lib/evalscope/metrics 2026-02-08T20:41:51,016 copying evalscope/metrics/__init__.py -> build/lib/evalscope/metrics 2026-02-08T20:41:51,018 copying evalscope/metrics/metrics.py -> build/lib/evalscope/metrics 2026-02-08T20:41:51,020 copying evalscope/metrics/rouge_metric.py -> build/lib/evalscope/metrics 2026-02-08T20:41:51,023 copying evalscope/metrics/metric.py -> build/lib/evalscope/metrics 2026-02-08T20:41:51,025 copying evalscope/metrics/llm_judge.py -> build/lib/evalscope/metrics 2026-02-08T20:41:51,028 copying evalscope/metrics/math_parser.py -> build/lib/evalscope/metrics 2026-02-08T20:41:51,030 creating build/lib/evalscope/perf 2026-02-08T20:41:51,031 copying evalscope/perf/__init__.py -> build/lib/evalscope/perf 2026-02-08T20:41:51,033 copying evalscope/perf/main.py -> build/lib/evalscope/perf 2026-02-08T20:41:51,036 copying evalscope/perf/http_client.py -> build/lib/evalscope/perf 2026-02-08T20:41:51,038 copying evalscope/perf/arguments.py -> build/lib/evalscope/perf 2026-02-08T20:41:51,040 copying evalscope/perf/benchmark.py -> build/lib/evalscope/perf 2026-02-08T20:41:51,043 creating build/lib/evalscope/cli 2026-02-08T20:41:51,044 copying evalscope/cli/start_eval.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,046 copying evalscope/cli/__init__.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,047 copying evalscope/cli/cli.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,049 copying evalscope/cli/start_service.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,050 copying evalscope/cli/base.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,052 copying evalscope/cli/start_app.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,054 copying evalscope/cli/start_perf.py -> build/lib/evalscope/cli 2026-02-08T20:41:51,056 creating build/lib/evalscope/benchmarks 2026-02-08T20:41:51,057 copying evalscope/benchmarks/__init__.py -> build/lib/evalscope/benchmarks 2026-02-08T20:41:51,059 creating build/lib/evalscope/api 2026-02-08T20:41:51,060 copying evalscope/api/__init__.py -> build/lib/evalscope/api 2026-02-08T20:41:51,062 copying evalscope/api/registry.py -> build/lib/evalscope/api 2026-02-08T20:41:51,064 creating build/lib/evalscope/filters 2026-02-08T20:41:51,065 copying evalscope/filters/__init__.py -> build/lib/evalscope/filters 2026-02-08T20:41:51,067 copying evalscope/filters/extraction.py -> build/lib/evalscope/filters 2026-02-08T20:41:51,069 copying evalscope/filters/selection.py -> build/lib/evalscope/filters 2026-02-08T20:41:51,071 creating build/lib/evalscope/summarizer 2026-02-08T20:41:51,072 copying evalscope/summarizer/__init__.py -> build/lib/evalscope/summarizer 2026-02-08T20:41:51,074 copying evalscope/summarizer/summarizer.py -> build/lib/evalscope/summarizer 2026-02-08T20:41:51,076 creating build/lib/evalscope/collections 2026-02-08T20:41:51,077 copying evalscope/collections/__init__.py -> build/lib/evalscope/collections 2026-02-08T20:41:51,079 copying evalscope/collections/schema.py -> build/lib/evalscope/collections 2026-02-08T20:41:51,081 copying evalscope/collections/sampler.py -> build/lib/evalscope/collections 2026-02-08T20:41:51,084 creating build/lib/evalscope/service 2026-02-08T20:41:51,085 copying evalscope/service/__init__.py -> build/lib/evalscope/service 2026-02-08T20:41:51,087 copying evalscope/service/app.py -> build/lib/evalscope/service 2026-02-08T20:41:51,089 copying evalscope/service/utils.py -> build/lib/evalscope/service 2026-02-08T20:41:51,093 creating build/lib/evalscope/evaluator 2026-02-08T20:41:51,094 copying evalscope/evaluator/__init__.py -> build/lib/evalscope/evaluator 2026-02-08T20:41:51,095 copying evalscope/evaluator/evaluator.py -> build/lib/evalscope/evaluator 2026-02-08T20:41:51,098 creating build/lib/evalscope/backend 2026-02-08T20:41:51,099 copying evalscope/backend/__init__.py -> build/lib/evalscope/backend 2026-02-08T20:41:51,101 copying evalscope/backend/base.py -> build/lib/evalscope/backend 2026-02-08T20:41:51,104 creating build/lib/evalscope/utils 2026-02-08T20:41:51,105 copying evalscope/utils/function_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,107 copying evalscope/utils/code_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,110 copying evalscope/utils/__init__.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,112 copying evalscope/utils/resource_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,114 copying evalscope/utils/io_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,116 copying evalscope/utils/multi_choices.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,119 copying evalscope/utils/chat_service.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,121 copying evalscope/utils/url_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,123 copying evalscope/utils/model_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,125 copying evalscope/utils/argument_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,127 copying evalscope/utils/ner.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,130 copying evalscope/utils/deprecation_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,131 copying evalscope/utils/tqdm_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,133 copying evalscope/utils/json_schema.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,135 copying evalscope/utils/import_utils.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,137 copying evalscope/utils/logger.py -> build/lib/evalscope/utils 2026-02-08T20:41:51,140 creating build/lib/evalscope/app 2026-02-08T20:41:51,141 copying evalscope/app/__init__.py -> build/lib/evalscope/app 2026-02-08T20:41:51,143 copying evalscope/app/app.py -> build/lib/evalscope/app 2026-02-08T20:41:51,144 copying evalscope/app/arguments.py -> build/lib/evalscope/app 2026-02-08T20:41:51,146 copying evalscope/app/constants.py -> build/lib/evalscope/app 2026-02-08T20:41:51,148 creating build/lib/evalscope/models 2026-02-08T20:41:51,149 copying evalscope/models/__init__.py -> build/lib/evalscope/models 2026-02-08T20:41:51,151 copying evalscope/models/openai_compatible.py -> build/lib/evalscope/models 2026-02-08T20:41:51,153 copying evalscope/models/text2image_model.py -> build/lib/evalscope/models 2026-02-08T20:41:51,155 copying evalscope/models/model_apis.py -> build/lib/evalscope/models 2026-02-08T20:41:51,157 copying evalscope/models/mockllm.py -> build/lib/evalscope/models 2026-02-08T20:41:51,159 copying evalscope/models/image_edit_model.py -> build/lib/evalscope/models 2026-02-08T20:41:51,161 copying evalscope/models/modelscope.py -> build/lib/evalscope/models 2026-02-08T20:41:51,164 creating build/lib/evalscope/report 2026-02-08T20:41:51,165 copying evalscope/report/__init__.py -> build/lib/evalscope/report 2026-02-08T20:41:51,167 copying evalscope/report/report.py -> build/lib/evalscope/report 2026-02-08T20:41:51,169 copying evalscope/report/combinator.py -> build/lib/evalscope/report 2026-02-08T20:41:51,171 copying evalscope/report/generator.py -> build/lib/evalscope/report 2026-02-08T20:41:51,174 creating build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:51,175 copying evalscope/third_party/longbench_write/__init__.py -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:51,177 copying evalscope/third_party/longbench_write/infer.py -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:51,179 copying evalscope/third_party/longbench_write/longbench_write.py -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:51,181 copying evalscope/third_party/longbench_write/eval.py -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:51,183 copying evalscope/third_party/longbench_write/utils.py -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:51,186 creating build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:51,187 copying evalscope/third_party/toolbench_static/__init__.py -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:51,188 copying evalscope/third_party/toolbench_static/infer.py -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:51,191 copying evalscope/third_party/toolbench_static/eval.py -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:51,193 copying evalscope/third_party/toolbench_static/toolbench_static.py -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:51,196 creating build/lib/evalscope/third_party/thinkbench 2026-02-08T20:41:51,197 copying evalscope/third_party/thinkbench/__init__.py -> build/lib/evalscope/third_party/thinkbench 2026-02-08T20:41:51,199 copying evalscope/third_party/thinkbench/infer.py -> build/lib/evalscope/third_party/thinkbench 2026-02-08T20:41:51,200 copying evalscope/third_party/thinkbench/eval.py -> build/lib/evalscope/third_party/thinkbench 2026-02-08T20:41:51,203 creating build/lib/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:51,204 copying evalscope/third_party/longbench_write/resources/__init__.py -> build/lib/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:51,207 creating build/lib/evalscope/third_party/longbench_write/tools 2026-02-08T20:41:51,208 copying evalscope/third_party/longbench_write/tools/__init__.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-02-08T20:41:51,210 copying evalscope/third_party/longbench_write/tools/data_etl.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-02-08T20:41:51,212 copying evalscope/third_party/longbench_write/tools/openai_api.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-02-08T20:41:51,214 creating build/lib/evalscope/third_party/toolbench_static/llm 2026-02-08T20:41:51,215 copying evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-02-08T20:41:51,217 copying evalscope/third_party/toolbench_static/llm/__init__.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-02-08T20:41:51,220 creating build/lib/evalscope/third_party/thinkbench/tools 2026-02-08T20:41:51,221 copying evalscope/third_party/thinkbench/tools/__init__.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-02-08T20:41:51,222 copying evalscope/third_party/thinkbench/tools/utils.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-02-08T20:41:51,224 copying evalscope/third_party/thinkbench/tools/llm.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-02-08T20:41:51,226 creating build/lib/evalscope/metrics/bert_score 2026-02-08T20:41:51,227 copying evalscope/metrics/bert_score/__init__.py -> build/lib/evalscope/metrics/bert_score 2026-02-08T20:41:51,229 copying evalscope/metrics/bert_score/utils.py -> build/lib/evalscope/metrics/bert_score 2026-02-08T20:41:51,232 copying evalscope/metrics/bert_score/scorer.py -> build/lib/evalscope/metrics/bert_score 2026-02-08T20:41:51,234 creating build/lib/evalscope/metrics/sem_score 2026-02-08T20:41:51,235 copying evalscope/metrics/sem_score/__init__.py -> build/lib/evalscope/metrics/sem_score 2026-02-08T20:41:51,237 copying evalscope/metrics/sem_score/scorer.py -> build/lib/evalscope/metrics/sem_score 2026-02-08T20:41:51,239 creating build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:51,240 copying evalscope/metrics/text_normalizer/__init__.py -> build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:51,242 copying evalscope/metrics/text_normalizer/basic.py -> build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:51,244 copying evalscope/metrics/text_normalizer/chinese.py -> build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:51,247 copying evalscope/metrics/text_normalizer/wer.py -> build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:51,249 copying evalscope/metrics/text_normalizer/english.py -> build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:51,252 creating build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,253 copying evalscope/metrics/t2v_metrics/itmscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,255 copying evalscope/metrics/t2v_metrics/vqascore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,256 copying evalscope/metrics/t2v_metrics/__init__.py -> build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,258 copying evalscope/metrics/t2v_metrics/score.py -> build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,260 copying evalscope/metrics/t2v_metrics/clipscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,261 copying evalscope/metrics/t2v_metrics/constants.py -> build/lib/evalscope/metrics/t2v_metrics 2026-02-08T20:41:51,264 creating build/lib/evalscope/metrics/bundled_rouge_score 2026-02-08T20:41:51,265 copying evalscope/metrics/bundled_rouge_score/__init__.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-02-08T20:41:51,266 copying evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-02-08T20:41:51,269 creating build/lib/evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:51,270 copying evalscope/metrics/t2v_metrics/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:51,272 copying evalscope/metrics/t2v_metrics/models/model.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:51,273 copying evalscope/metrics/t2v_metrics/models/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:51,276 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:51,276 copying evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:51,278 copying evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:51,280 copying evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:51,282 copying evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:51,284 copying evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:51,286 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:51,287 copying evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:51,289 copying evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:51,291 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:51,293 copying evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:51,295 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:51,296 copying evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:51,298 copying evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:51,300 copying evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:51,302 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:51,304 copying evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:51,307 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:51,308 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:51,309 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:51,312 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:51,313 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:51,316 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:51,317 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:51,320 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:51,321 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:51,323 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-02-08T20:41:51,324 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-02-08T20:41:51,326 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-02-08T20:41:51,327 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-02-08T20:41:51,329 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-02-08T20:41:51,330 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-02-08T20:41:51,332 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-02-08T20:41:51,333 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-02-08T20:41:51,335 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-02-08T20:41:51,337 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-02-08T20:41:51,338 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-02-08T20:41:51,341 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-02-08T20:41:51,342 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-02-08T20:41:51,345 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,346 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,348 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,351 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,352 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,355 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,357 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,359 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:51,362 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,363 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,365 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,367 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,370 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,372 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,375 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:51,378 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:51,379 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:51,381 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:51,383 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:51,385 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:51,388 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:51,389 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:51,391 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:51,394 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:51,397 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,397 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,401 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,403 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,405 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,408 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,410 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,412 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,415 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,418 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,420 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:51,423 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,424 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,426 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,428 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,431 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,433 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,435 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,437 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,440 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,442 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,445 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,447 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:51,450 creating build/lib/evalscope/perf/plugin 2026-02-08T20:41:51,450 copying evalscope/perf/plugin/__init__.py -> build/lib/evalscope/perf/plugin 2026-02-08T20:41:51,452 copying evalscope/perf/plugin/registry.py -> build/lib/evalscope/perf/plugin 2026-02-08T20:41:51,455 creating build/lib/evalscope/perf/utils 2026-02-08T20:41:51,456 copying evalscope/perf/utils/rich_display.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,459 copying evalscope/perf/utils/benchmark_util.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,461 copying evalscope/perf/utils/__init__.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,462 copying evalscope/perf/utils/local_server.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,464 copying evalscope/perf/utils/log_utils.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,466 copying evalscope/perf/utils/db_util.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,468 copying evalscope/perf/utils/handler.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,470 copying evalscope/perf/utils/analysis_result.py -> build/lib/evalscope/perf/utils 2026-02-08T20:41:51,472 creating build/lib/evalscope/perf/sla 2026-02-08T20:41:51,473 copying evalscope/perf/sla/sla_run.py -> build/lib/evalscope/perf/sla 2026-02-08T20:41:51,476 copying evalscope/perf/sla/__init__.py -> build/lib/evalscope/perf/sla 2026-02-08T20:41:51,477 copying evalscope/perf/sla/sla_criterion.py -> build/lib/evalscope/perf/sla 2026-02-08T20:41:51,479 creating build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,480 copying evalscope/perf/plugin/datasets/random_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,483 copying evalscope/perf/plugin/datasets/__init__.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,484 copying evalscope/perf/plugin/datasets/openqa.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,486 copying evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,488 copying evalscope/perf/plugin/datasets/embedding_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,490 copying evalscope/perf/plugin/datasets/line_by_line.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,492 copying evalscope/perf/plugin/datasets/speed_benchmark.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,494 copying evalscope/perf/plugin/datasets/kontext_bench.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,496 copying evalscope/perf/plugin/datasets/base.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,497 copying evalscope/perf/plugin/datasets/flickr8k.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,499 copying evalscope/perf/plugin/datasets/longalpaca.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,501 copying evalscope/perf/plugin/datasets/rerank_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,503 copying evalscope/perf/plugin/datasets/utils.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,505 copying evalscope/perf/plugin/datasets/custom.py -> build/lib/evalscope/perf/plugin/datasets 2026-02-08T20:41:51,507 creating build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,508 copying evalscope/perf/plugin/api/openai_embedding_api.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,511 copying evalscope/perf/plugin/api/__init__.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,513 copying evalscope/perf/plugin/api/dashscope_api.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,515 copying evalscope/perf/plugin/api/openai_rerank_api.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,517 copying evalscope/perf/plugin/api/base.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,519 copying evalscope/perf/plugin/api/default_api.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,521 copying evalscope/perf/plugin/api/custom_api.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,523 copying evalscope/perf/plugin/api/openai_api.py -> build/lib/evalscope/perf/plugin/api 2026-02-08T20:41:51,526 creating build/lib/evalscope/benchmarks/cmmmu 2026-02-08T20:41:51,527 copying evalscope/benchmarks/cmmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmmu 2026-02-08T20:41:51,529 copying evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmmu 2026-02-08T20:41:51,531 copying evalscope/benchmarks/cmmmu/utils.py -> build/lib/evalscope/benchmarks/cmmmu 2026-02-08T20:41:51,534 creating build/lib/evalscope/benchmarks/general_qa 2026-02-08T20:41:51,535 copying evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/lib/evalscope/benchmarks/general_qa 2026-02-08T20:41:51,536 copying evalscope/benchmarks/general_qa/__init__.py -> build/lib/evalscope/benchmarks/general_qa 2026-02-08T20:41:51,538 creating build/lib/evalscope/benchmarks/science_qa 2026-02-08T20:41:51,539 copying evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/lib/evalscope/benchmarks/science_qa 2026-02-08T20:41:51,541 copying evalscope/benchmarks/science_qa/__init__.py -> build/lib/evalscope/benchmarks/science_qa 2026-02-08T20:41:51,543 creating build/lib/evalscope/benchmarks/general_fc 2026-02-08T20:41:51,544 copying evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/lib/evalscope/benchmarks/general_fc 2026-02-08T20:41:51,546 copying evalscope/benchmarks/general_fc/__init__.py -> build/lib/evalscope/benchmarks/general_fc 2026-02-08T20:41:51,548 creating build/lib/evalscope/benchmarks/drivelology 2026-02-08T20:41:51,549 copying evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-02-08T20:41:51,551 copying evalscope/benchmarks/drivelology/__init__.py -> build/lib/evalscope/benchmarks/drivelology 2026-02-08T20:41:51,553 copying evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-02-08T20:41:51,555 copying evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-02-08T20:41:51,557 copying evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-02-08T20:41:51,559 creating build/lib/evalscope/benchmarks/commonsense_qa 2026-02-08T20:41:51,560 copying evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-02-08T20:41:51,562 copying evalscope/benchmarks/commonsense_qa/__init__.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-02-08T20:41:51,564 creating build/lib/evalscope/benchmarks/healthbench 2026-02-08T20:41:51,565 copying evalscope/benchmarks/healthbench/__init__.py -> build/lib/evalscope/benchmarks/healthbench 2026-02-08T20:41:51,567 copying evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/lib/evalscope/benchmarks/healthbench 2026-02-08T20:41:51,569 copying evalscope/benchmarks/healthbench/utils.py -> build/lib/evalscope/benchmarks/healthbench 2026-02-08T20:41:51,571 creating build/lib/evalscope/benchmarks/music_trivia 2026-02-08T20:41:51,572 copying evalscope/benchmarks/music_trivia/__init__.py -> build/lib/evalscope/benchmarks/music_trivia 2026-02-08T20:41:51,574 copying evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/lib/evalscope/benchmarks/music_trivia 2026-02-08T20:41:51,576 creating build/lib/evalscope/benchmarks/mmlu_redux 2026-02-08T20:41:51,577 copying evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-02-08T20:41:51,580 copying evalscope/benchmarks/mmlu_redux/__init__.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-02-08T20:41:51,581 creating build/lib/evalscope/benchmarks/super_gpqa 2026-02-08T20:41:51,582 copying evalscope/benchmarks/super_gpqa/__init__.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-02-08T20:41:51,584 copying evalscope/benchmarks/super_gpqa/prompt.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-02-08T20:41:51,586 copying evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-02-08T20:41:51,588 copying evalscope/benchmarks/super_gpqa/utils.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-02-08T20:41:51,591 creating build/lib/evalscope/benchmarks/image_edit 2026-02-08T20:41:51,592 copying evalscope/benchmarks/image_edit/__init__.py -> build/lib/evalscope/benchmarks/image_edit 2026-02-08T20:41:51,593 creating build/lib/evalscope/benchmarks/pope 2026-02-08T20:41:51,594 copying evalscope/benchmarks/pope/__init__.py -> build/lib/evalscope/benchmarks/pope 2026-02-08T20:41:51,596 copying evalscope/benchmarks/pope/pope_adapter.py -> build/lib/evalscope/benchmarks/pope 2026-02-08T20:41:51,598 creating build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,599 copying evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,601 copying evalscope/benchmarks/text2image/__init__.py -> build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,603 copying evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,604 copying evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,606 copying evalscope/benchmarks/text2image/tifa_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,608 copying evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-02-08T20:41:51,610 creating build/lib/evalscope/benchmarks/math_500 2026-02-08T20:41:51,611 copying evalscope/benchmarks/math_500/__init__.py -> build/lib/evalscope/benchmarks/math_500 2026-02-08T20:41:51,612 copying evalscope/benchmarks/math_500/math_500_adapter.py -> build/lib/evalscope/benchmarks/math_500 2026-02-08T20:41:51,615 creating build/lib/evalscope/benchmarks/truthful_qa 2026-02-08T20:41:51,616 copying evalscope/benchmarks/truthful_qa/__init__.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-02-08T20:41:51,618 copying evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-02-08T20:41:51,620 creating build/lib/evalscope/benchmarks/poly_math 2026-02-08T20:41:51,621 copying evalscope/benchmarks/poly_math/__init__.py -> build/lib/evalscope/benchmarks/poly_math 2026-02-08T20:41:51,622 copying evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/lib/evalscope/benchmarks/poly_math 2026-02-08T20:41:51,625 creating build/lib/evalscope/benchmarks/simple_qa 2026-02-08T20:41:51,626 copying evalscope/benchmarks/simple_qa/__init__.py -> build/lib/evalscope/benchmarks/simple_qa 2026-02-08T20:41:51,628 copying evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/lib/evalscope/benchmarks/simple_qa 2026-02-08T20:41:51,631 creating build/lib/evalscope/benchmarks/math_verse 2026-02-08T20:41:51,632 copying evalscope/benchmarks/math_verse/__init__.py -> build/lib/evalscope/benchmarks/math_verse 2026-02-08T20:41:51,633 copying evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/lib/evalscope/benchmarks/math_verse 2026-02-08T20:41:51,636 creating build/lib/evalscope/benchmarks/competition_math 2026-02-08T20:41:51,637 copying evalscope/benchmarks/competition_math/__init__.py -> build/lib/evalscope/benchmarks/competition_math 2026-02-08T20:41:51,639 copying evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/lib/evalscope/benchmarks/competition_math 2026-02-08T20:41:51,641 creating build/lib/evalscope/benchmarks/med_mcqa 2026-02-08T20:41:51,642 copying evalscope/benchmarks/med_mcqa/__init__.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-02-08T20:41:51,644 copying evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-02-08T20:41:51,646 creating build/lib/evalscope/benchmarks/wmt 2026-02-08T20:41:51,647 copying evalscope/benchmarks/wmt/__init__.py -> build/lib/evalscope/benchmarks/wmt 2026-02-08T20:41:51,648 copying evalscope/benchmarks/wmt/wmt24_adapter.py -> build/lib/evalscope/benchmarks/wmt 2026-02-08T20:41:51,651 creating build/lib/evalscope/benchmarks/zerobench 2026-02-08T20:41:51,652 copying evalscope/benchmarks/zerobench/__init__.py -> build/lib/evalscope/benchmarks/zerobench 2026-02-08T20:41:51,654 copying evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/lib/evalscope/benchmarks/zerobench 2026-02-08T20:41:51,656 creating build/lib/evalscope/benchmarks/eq_bench 2026-02-08T20:41:51,657 copying evalscope/benchmarks/eq_bench/__init__.py -> build/lib/evalscope/benchmarks/eq_bench 2026-02-08T20:41:51,658 copying evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/lib/evalscope/benchmarks/eq_bench 2026-02-08T20:41:51,661 copying evalscope/benchmarks/eq_bench/answer_validation.py -> build/lib/evalscope/benchmarks/eq_bench 2026-02-08T20:41:51,663 creating build/lib/evalscope/benchmarks/mm_bench 2026-02-08T20:41:51,664 copying evalscope/benchmarks/mm_bench/__init__.py -> build/lib/evalscope/benchmarks/mm_bench 2026-02-08T20:41:51,666 copying evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/lib/evalscope/benchmarks/mm_bench 2026-02-08T20:41:51,668 creating build/lib/evalscope/benchmarks/math_vista 2026-02-08T20:41:51,669 copying evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/lib/evalscope/benchmarks/math_vista 2026-02-08T20:41:51,671 copying evalscope/benchmarks/math_vista/__init__.py -> build/lib/evalscope/benchmarks/math_vista 2026-02-08T20:41:51,673 creating build/lib/evalscope/benchmarks/general_mcq 2026-02-08T20:41:51,674 copying evalscope/benchmarks/general_mcq/__init__.py -> build/lib/evalscope/benchmarks/general_mcq 2026-02-08T20:41:51,676 copying evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/lib/evalscope/benchmarks/general_mcq 2026-02-08T20:41:51,678 creating build/lib/evalscope/benchmarks/mm_star 2026-02-08T20:41:51,679 copying evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/lib/evalscope/benchmarks/mm_star 2026-02-08T20:41:51,681 copying evalscope/benchmarks/mm_star/__init__.py -> build/lib/evalscope/benchmarks/mm_star 2026-02-08T20:41:51,683 creating build/lib/evalscope/benchmarks/maritime_bench 2026-02-08T20:41:51,684 copying evalscope/benchmarks/maritime_bench/__init__.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-02-08T20:41:51,686 copying evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-02-08T20:41:51,688 creating build/lib/evalscope/benchmarks/mmmu 2026-02-08T20:41:51,689 copying evalscope/benchmarks/mmmu/__init__.py -> build/lib/evalscope/benchmarks/mmmu 2026-02-08T20:41:51,691 copying evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/lib/evalscope/benchmarks/mmmu 2026-02-08T20:41:51,694 creating build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,695 copying evalscope/benchmarks/live_code_bench/__init__.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,696 copying evalscope/benchmarks/live_code_bench/load_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,698 copying evalscope/benchmarks/live_code_bench/testing_util.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,701 copying evalscope/benchmarks/live_code_bench/extract_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,702 copying evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,705 copying evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,707 copying evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,709 copying evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,711 copying evalscope/benchmarks/live_code_bench/prompts.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:51,714 creating build/lib/evalscope/benchmarks/micro_vqa 2026-02-08T20:41:51,715 copying evalscope/benchmarks/micro_vqa/__init__.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-02-08T20:41:51,717 copying evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-02-08T20:41:51,719 creating build/lib/evalscope/benchmarks/swe_bench 2026-02-08T20:41:51,720 copying evalscope/benchmarks/swe_bench/__init__.py -> build/lib/evalscope/benchmarks/swe_bench 2026-02-08T20:41:51,721 copying evalscope/benchmarks/swe_bench/build_images.py -> build/lib/evalscope/benchmarks/swe_bench 2026-02-08T20:41:51,724 copying evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-02-08T20:41:51,726 copying evalscope/benchmarks/swe_bench/utils.py -> build/lib/evalscope/benchmarks/swe_bench 2026-02-08T20:41:51,729 creating build/lib/evalscope/benchmarks/data_collection 2026-02-08T20:41:51,730 copying evalscope/benchmarks/data_collection/__init__.py -> build/lib/evalscope/benchmarks/data_collection 2026-02-08T20:41:51,731 copying evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/lib/evalscope/benchmarks/data_collection 2026-02-08T20:41:51,734 creating build/lib/evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:51,735 copying evalscope/benchmarks/olympiad_bench/__init__.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:51,736 copying evalscope/benchmarks/olympiad_bench/utils.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:51,739 copying evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:51,742 creating build/lib/evalscope/benchmarks/docvqa 2026-02-08T20:41:51,743 copying evalscope/benchmarks/docvqa/__init__.py -> build/lib/evalscope/benchmarks/docvqa 2026-02-08T20:41:51,744 copying evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/lib/evalscope/benchmarks/docvqa 2026-02-08T20:41:51,746 creating build/lib/evalscope/benchmarks/a_okvqa 2026-02-08T20:41:51,748 copying evalscope/benchmarks/a_okvqa/__init__.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-02-08T20:41:51,749 copying evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-02-08T20:41:51,751 creating build/lib/evalscope/benchmarks/humaneval 2026-02-08T20:41:51,752 copying evalscope/benchmarks/humaneval/__init__.py -> build/lib/evalscope/benchmarks/humaneval 2026-02-08T20:41:51,754 copying evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/lib/evalscope/benchmarks/humaneval 2026-02-08T20:41:51,757 copying evalscope/benchmarks/humaneval/utils.py -> build/lib/evalscope/benchmarks/humaneval 2026-02-08T20:41:51,759 creating build/lib/evalscope/benchmarks/mgsm 2026-02-08T20:41:51,760 copying evalscope/benchmarks/mgsm/__init__.py -> build/lib/evalscope/benchmarks/mgsm 2026-02-08T20:41:51,762 copying evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/lib/evalscope/benchmarks/mgsm 2026-02-08T20:41:51,764 creating build/lib/evalscope/benchmarks/ocr_bench 2026-02-08T20:41:51,765 copying evalscope/benchmarks/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench 2026-02-08T20:41:51,767 creating build/lib/evalscope/benchmarks/qasc 2026-02-08T20:41:51,768 copying evalscope/benchmarks/qasc/__init__.py -> build/lib/evalscope/benchmarks/qasc 2026-02-08T20:41:51,770 copying evalscope/benchmarks/qasc/qasc_adapter.py -> build/lib/evalscope/benchmarks/qasc 2026-02-08T20:41:51,772 creating build/lib/evalscope/benchmarks/trivia_qa 2026-02-08T20:41:51,773 copying evalscope/benchmarks/trivia_qa/__init__.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-02-08T20:41:51,775 copying evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-02-08T20:41:51,777 creating build/lib/evalscope/benchmarks/cmmlu 2026-02-08T20:41:51,778 copying evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/lib/evalscope/benchmarks/cmmlu 2026-02-08T20:41:51,780 copying evalscope/benchmarks/cmmlu/__init__.py -> build/lib/evalscope/benchmarks/cmmlu 2026-02-08T20:41:51,782 creating build/lib/evalscope/benchmarks/infovqa 2026-02-08T20:41:51,783 copying evalscope/benchmarks/infovqa/__init__.py -> build/lib/evalscope/benchmarks/infovqa 2026-02-08T20:41:51,784 copying evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/lib/evalscope/benchmarks/infovqa 2026-02-08T20:41:51,787 creating build/lib/evalscope/benchmarks/general_vqa 2026-02-08T20:41:51,788 copying evalscope/benchmarks/general_vqa/__init__.py -> build/lib/evalscope/benchmarks/general_vqa 2026-02-08T20:41:51,789 copying evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/lib/evalscope/benchmarks/general_vqa 2026-02-08T20:41:51,792 creating build/lib/evalscope/benchmarks/gsm8k_v 2026-02-08T20:41:51,792 copying evalscope/benchmarks/gsm8k_v/__init__.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-02-08T20:41:51,794 copying evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-02-08T20:41:51,796 creating build/lib/evalscope/benchmarks/simple_vqa 2026-02-08T20:41:51,797 copying evalscope/benchmarks/simple_vqa/__init__.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-02-08T20:41:51,799 copying evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-02-08T20:41:51,802 creating build/lib/evalscope/benchmarks/pumed_qa 2026-02-08T20:41:51,803 copying evalscope/benchmarks/pumed_qa/__init__.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-02-08T20:41:51,805 copying evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-02-08T20:41:51,807 creating build/lib/evalscope/benchmarks/general_vmcq 2026-02-08T20:41:51,808 copying evalscope/benchmarks/general_vmcq/__init__.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-02-08T20:41:51,810 copying evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-02-08T20:41:51,813 creating build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,814 copying evalscope/benchmarks/ifbench/instructions_registry.py -> build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,817 copying evalscope/benchmarks/ifbench/evaluation_lib.py -> build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,820 copying evalscope/benchmarks/ifbench/__init__.py -> build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,821 copying evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,824 copying evalscope/benchmarks/ifbench/instructions_util.py -> build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,827 copying evalscope/benchmarks/ifbench/instructions.py -> build/lib/evalscope/benchmarks/ifbench 2026-02-08T20:41:51,831 creating build/lib/evalscope/benchmarks/chartqa 2026-02-08T20:41:51,832 copying evalscope/benchmarks/chartqa/__init__.py -> build/lib/evalscope/benchmarks/chartqa 2026-02-08T20:41:51,834 copying evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/lib/evalscope/benchmarks/chartqa 2026-02-08T20:41:51,836 copying evalscope/benchmarks/chartqa/utils.py -> build/lib/evalscope/benchmarks/chartqa 2026-02-08T20:41:51,839 creating build/lib/evalscope/benchmarks/multi_if 2026-02-08T20:41:51,840 copying evalscope/benchmarks/multi_if/__init__.py -> build/lib/evalscope/benchmarks/multi_if 2026-02-08T20:41:51,842 copying evalscope/benchmarks/multi_if/metrics.py -> build/lib/evalscope/benchmarks/multi_if 2026-02-08T20:41:51,844 copying evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/lib/evalscope/benchmarks/multi_if 2026-02-08T20:41:51,847 copying evalscope/benchmarks/multi_if/ifeval.py -> build/lib/evalscope/benchmarks/multi_if 2026-02-08T20:41:51,851 creating build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-02-08T20:41:51,852 copying evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-02-08T20:41:51,853 copying evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-02-08T20:41:51,856 creating build/lib/evalscope/benchmarks/winogrande 2026-02-08T20:41:51,857 copying evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/lib/evalscope/benchmarks/winogrande 2026-02-08T20:41:51,858 copying evalscope/benchmarks/winogrande/__init__.py -> build/lib/evalscope/benchmarks/winogrande 2026-02-08T20:41:51,860 creating build/lib/evalscope/benchmarks/siqa 2026-02-08T20:41:51,861 copying evalscope/benchmarks/siqa/__init__.py -> build/lib/evalscope/benchmarks/siqa 2026-02-08T20:41:51,863 copying evalscope/benchmarks/siqa/siqa_adapter.py -> build/lib/evalscope/benchmarks/siqa 2026-02-08T20:41:51,865 creating build/lib/evalscope/benchmarks/iquiz 2026-02-08T20:41:51,866 copying evalscope/benchmarks/iquiz/__init__.py -> build/lib/evalscope/benchmarks/iquiz 2026-02-08T20:41:51,867 copying evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/lib/evalscope/benchmarks/iquiz 2026-02-08T20:41:51,870 creating build/lib/evalscope/benchmarks/piqa 2026-02-08T20:41:51,871 copying evalscope/benchmarks/piqa/__init__.py -> build/lib/evalscope/benchmarks/piqa 2026-02-08T20:41:51,872 copying evalscope/benchmarks/piqa/piqa_adapter.py -> build/lib/evalscope/benchmarks/piqa 2026-02-08T20:41:51,874 creating build/lib/evalscope/benchmarks/mbpp 2026-02-08T20:41:51,875 copying evalscope/benchmarks/mbpp/__init__.py -> build/lib/evalscope/benchmarks/mbpp 2026-02-08T20:41:51,877 copying evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/lib/evalscope/benchmarks/mbpp 2026-02-08T20:41:51,879 creating build/lib/evalscope/benchmarks/real_world_qa 2026-02-08T20:41:51,880 copying evalscope/benchmarks/real_world_qa/__init__.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-02-08T20:41:51,882 copying evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-02-08T20:41:51,884 creating build/lib/evalscope/benchmarks/general_arena 2026-02-08T20:41:51,885 copying evalscope/benchmarks/general_arena/__init__.py -> build/lib/evalscope/benchmarks/general_arena 2026-02-08T20:41:51,887 copying evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/lib/evalscope/benchmarks/general_arena 2026-02-08T20:41:51,890 copying evalscope/benchmarks/general_arena/utils.py -> build/lib/evalscope/benchmarks/general_arena 2026-02-08T20:41:51,893 creating build/lib/evalscope/benchmarks/alpaca_eval 2026-02-08T20:41:51,894 copying evalscope/benchmarks/alpaca_eval/__init__.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-02-08T20:41:51,895 copying evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-02-08T20:41:51,898 creating build/lib/evalscope/benchmarks/blink 2026-02-08T20:41:51,899 copying evalscope/benchmarks/blink/__init__.py -> build/lib/evalscope/benchmarks/blink 2026-02-08T20:41:51,901 copying evalscope/benchmarks/blink/blink_adapter.py -> build/lib/evalscope/benchmarks/blink 2026-02-08T20:41:51,903 creating build/lib/evalscope/benchmarks/fleurs 2026-02-08T20:41:51,904 copying evalscope/benchmarks/fleurs/__init__.py -> build/lib/evalscope/benchmarks/fleurs 2026-02-08T20:41:51,905 copying evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/lib/evalscope/benchmarks/fleurs 2026-02-08T20:41:51,908 creating build/lib/evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:51,909 copying evalscope/benchmarks/openai_mrcr/__init__.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:51,911 copying evalscope/benchmarks/openai_mrcr/utils.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:51,913 copying evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:51,915 creating build/lib/evalscope/benchmarks/frames 2026-02-08T20:41:51,916 copying evalscope/benchmarks/frames/__init__.py -> build/lib/evalscope/benchmarks/frames 2026-02-08T20:41:51,918 copying evalscope/benchmarks/frames/frames_adapter.py -> build/lib/evalscope/benchmarks/frames 2026-02-08T20:41:51,920 copying evalscope/benchmarks/frames/utils.py -> build/lib/evalscope/benchmarks/frames 2026-02-08T20:41:51,922 creating build/lib/evalscope/benchmarks/mmmu_pro 2026-02-08T20:41:51,923 copying evalscope/benchmarks/mmmu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-02-08T20:41:51,925 copying evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-02-08T20:41:51,928 creating build/lib/evalscope/benchmarks/docmath 2026-02-08T20:41:51,928 copying evalscope/benchmarks/docmath/__init__.py -> build/lib/evalscope/benchmarks/docmath 2026-02-08T20:41:51,930 copying evalscope/benchmarks/docmath/docmath_adapter.py -> build/lib/evalscope/benchmarks/docmath 2026-02-08T20:41:51,932 copying evalscope/benchmarks/docmath/utils.py -> build/lib/evalscope/benchmarks/docmath 2026-02-08T20:41:51,935 creating build/lib/evalscope/benchmarks/ai2d 2026-02-08T20:41:51,936 copying evalscope/benchmarks/ai2d/__init__.py -> build/lib/evalscope/benchmarks/ai2d 2026-02-08T20:41:51,937 copying evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/lib/evalscope/benchmarks/ai2d 2026-02-08T20:41:51,939 creating build/lib/evalscope/benchmarks/needle_haystack 2026-02-08T20:41:51,940 copying evalscope/benchmarks/needle_haystack/__init__.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-02-08T20:41:51,942 copying evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-02-08T20:41:51,945 copying evalscope/benchmarks/needle_haystack/utils.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-02-08T20:41:51,947 creating build/lib/evalscope/benchmarks/torgo 2026-02-08T20:41:51,948 copying evalscope/benchmarks/torgo/__init__.py -> build/lib/evalscope/benchmarks/torgo 2026-02-08T20:41:51,950 copying evalscope/benchmarks/torgo/torgo_adapter.py -> build/lib/evalscope/benchmarks/torgo 2026-02-08T20:41:51,953 creating build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,954 copying evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,956 copying evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,958 copying evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,959 copying evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,961 copying evalscope/benchmarks/ner/conllpp_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,963 copying evalscope/benchmarks/ner/copious_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,965 copying evalscope/benchmarks/ner/ncbi_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,967 copying evalscope/benchmarks/ner/__init__.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,968 copying evalscope/benchmarks/ner/wnut2017_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,970 copying evalscope/benchmarks/ner/jnlpba_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,972 copying evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,974 copying evalscope/benchmarks/ner/conll2003_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,976 copying evalscope/benchmarks/ner/anat_em_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,977 copying evalscope/benchmarks/ner/genia_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,979 copying evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,981 copying evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,983 copying evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,985 copying evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,986 copying evalscope/benchmarks/ner/fin_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,988 copying evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,990 copying evalscope/benchmarks/ner/bc2gm_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,992 copying evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,993 copying evalscope/benchmarks/ner/cross_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-02-08T20:41:51,996 creating build/lib/evalscope/benchmarks/math_vision 2026-02-08T20:41:51,997 copying evalscope/benchmarks/math_vision/__init__.py -> build/lib/evalscope/benchmarks/math_vision 2026-02-08T20:41:51,998 copying evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/lib/evalscope/benchmarks/math_vision 2026-02-08T20:41:52,001 creating build/lib/evalscope/benchmarks/tau_bench 2026-02-08T20:41:52,002 copying evalscope/benchmarks/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench 2026-02-08T20:41:52,004 creating build/lib/evalscope/benchmarks/musr 2026-02-08T20:41:52,005 copying evalscope/benchmarks/musr/__init__.py -> build/lib/evalscope/benchmarks/musr 2026-02-08T20:41:52,007 copying evalscope/benchmarks/musr/musr_adapter.py -> build/lib/evalscope/benchmarks/musr 2026-02-08T20:41:52,009 creating build/lib/evalscope/benchmarks/chinese_simple_qa 2026-02-08T20:41:52,010 copying evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-02-08T20:41:52,011 copying evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-02-08T20:41:52,014 creating build/lib/evalscope/benchmarks/coin_flip 2026-02-08T20:41:52,015 copying evalscope/benchmarks/coin_flip/__init__.py -> build/lib/evalscope/benchmarks/coin_flip 2026-02-08T20:41:52,017 copying evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/lib/evalscope/benchmarks/coin_flip 2026-02-08T20:41:52,019 creating build/lib/evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:52,020 copying evalscope/benchmarks/zebralogicbench/__init__.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:52,022 copying evalscope/benchmarks/zebralogicbench/utils.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:52,025 copying evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:52,027 creating build/lib/evalscope/benchmarks/logi_qa 2026-02-08T20:41:52,028 copying evalscope/benchmarks/logi_qa/__int__.py -> build/lib/evalscope/benchmarks/logi_qa 2026-02-08T20:41:52,030 copying evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/lib/evalscope/benchmarks/logi_qa 2026-02-08T20:41:52,032 creating build/lib/evalscope/benchmarks/humanevalplus 2026-02-08T20:41:52,033 copying evalscope/benchmarks/humanevalplus/__init__.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-02-08T20:41:52,035 copying evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/lib/evalscope/benchmarks/humanevalplus 2026-02-08T20:41:52,037 creating build/lib/evalscope/benchmarks/mmlu 2026-02-08T20:41:52,038 copying evalscope/benchmarks/mmlu/__init__.py -> build/lib/evalscope/benchmarks/mmlu 2026-02-08T20:41:52,040 copying evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/lib/evalscope/benchmarks/mmlu 2026-02-08T20:41:52,043 creating build/lib/evalscope/benchmarks/vstar_bench 2026-02-08T20:41:52,044 copying evalscope/benchmarks/vstar_bench/__init__.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-02-08T20:41:52,046 copying evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-02-08T20:41:52,048 creating build/lib/evalscope/benchmarks/multipl_e 2026-02-08T20:41:52,049 copying evalscope/benchmarks/multipl_e/__init__.py -> build/lib/evalscope/benchmarks/multipl_e 2026-02-08T20:41:52,051 copying evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-02-08T20:41:52,053 copying evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-02-08T20:41:52,056 copying evalscope/benchmarks/multipl_e/utils.py -> build/lib/evalscope/benchmarks/multipl_e 2026-02-08T20:41:52,058 creating build/lib/evalscope/benchmarks/hellaswag 2026-02-08T20:41:52,059 copying evalscope/benchmarks/hellaswag/__init__.py -> build/lib/evalscope/benchmarks/hellaswag 2026-02-08T20:41:52,061 copying evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/lib/evalscope/benchmarks/hellaswag 2026-02-08T20:41:52,064 creating build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,065 copying evalscope/benchmarks/ifeval/instructions_registry.py -> build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,067 copying evalscope/benchmarks/ifeval/__init__.py -> build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,069 copying evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,071 copying evalscope/benchmarks/ifeval/instructions_util.py -> build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,074 copying evalscope/benchmarks/ifeval/utils.py -> build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,077 copying evalscope/benchmarks/ifeval/instructions.py -> build/lib/evalscope/benchmarks/ifeval 2026-02-08T20:41:52,081 creating build/lib/evalscope/benchmarks/mmlu_pro 2026-02-08T20:41:52,082 copying evalscope/benchmarks/mmlu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-02-08T20:41:52,085 copying evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-02-08T20:41:52,087 creating build/lib/evalscope/benchmarks/hle 2026-02-08T20:41:52,089 copying evalscope/benchmarks/hle/__init__.py -> build/lib/evalscope/benchmarks/hle 2026-02-08T20:41:52,091 copying evalscope/benchmarks/hle/hle_adapter.py -> build/lib/evalscope/benchmarks/hle 2026-02-08T20:41:52,094 creating build/lib/evalscope/benchmarks/halu_eval 2026-02-08T20:41:52,096 copying evalscope/benchmarks/halu_eval/__init__.py -> build/lib/evalscope/benchmarks/halu_eval 2026-02-08T20:41:52,098 copying evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/lib/evalscope/benchmarks/halu_eval 2026-02-08T20:41:52,100 copying evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/lib/evalscope/benchmarks/halu_eval 2026-02-08T20:41:52,104 creating build/lib/evalscope/benchmarks/arena_hard 2026-02-08T20:41:52,105 copying evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/lib/evalscope/benchmarks/arena_hard 2026-02-08T20:41:52,108 copying evalscope/benchmarks/arena_hard/__init__.py -> build/lib/evalscope/benchmarks/arena_hard 2026-02-08T20:41:52,110 copying evalscope/benchmarks/arena_hard/utils.py -> build/lib/evalscope/benchmarks/arena_hard 2026-02-08T20:41:52,113 creating build/lib/evalscope/benchmarks/aime 2026-02-08T20:41:52,115 copying evalscope/benchmarks/aime/grader.py -> build/lib/evalscope/benchmarks/aime 2026-02-08T20:41:52,118 copying evalscope/benchmarks/aime/__init__.py -> build/lib/evalscope/benchmarks/aime 2026-02-08T20:41:52,120 copying evalscope/benchmarks/aime/math_normalize.py -> build/lib/evalscope/benchmarks/aime 2026-02-08T20:41:52,123 copying evalscope/benchmarks/aime/aime24_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-02-08T20:41:52,125 copying evalscope/benchmarks/aime/aime25_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-02-08T20:41:52,129 creating build/lib/evalscope/benchmarks/gsm8k 2026-02-08T20:41:52,130 copying evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/lib/evalscope/benchmarks/gsm8k 2026-02-08T20:41:52,133 copying evalscope/benchmarks/gsm8k/__init__.py -> build/lib/evalscope/benchmarks/gsm8k 2026-02-08T20:41:52,135 creating build/lib/evalscope/benchmarks/hallusion_bench 2026-02-08T20:41:52,137 copying evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-02-08T20:41:52,139 copying evalscope/benchmarks/hallusion_bench/__init__.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-02-08T20:41:52,142 creating build/lib/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:52,143 copying evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:52,146 copying evalscope/benchmarks/omnidoc_bench/__init__.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:52,148 copying evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:52,151 copying evalscope/benchmarks/omnidoc_bench/metrics.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:52,155 copying evalscope/benchmarks/omnidoc_bench/utils.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:52,159 creating build/lib/evalscope/benchmarks/race 2026-02-08T20:41:52,161 copying evalscope/benchmarks/race/__init__.py -> build/lib/evalscope/benchmarks/race 2026-02-08T20:41:52,163 copying evalscope/benchmarks/race/race_adapter.py -> build/lib/evalscope/benchmarks/race 2026-02-08T20:41:52,166 creating build/lib/evalscope/benchmarks/bbh 2026-02-08T20:41:52,167 copying evalscope/benchmarks/bbh/__init__.py -> build/lib/evalscope/benchmarks/bbh 2026-02-08T20:41:52,169 copying evalscope/benchmarks/bbh/bbh_adapter.py -> build/lib/evalscope/benchmarks/bbh 2026-02-08T20:41:52,172 creating build/lib/evalscope/benchmarks/arc 2026-02-08T20:41:52,173 copying evalscope/benchmarks/arc/__init__.py -> build/lib/evalscope/benchmarks/arc 2026-02-08T20:41:52,175 copying evalscope/benchmarks/arc/arc_adapter.py -> build/lib/evalscope/benchmarks/arc 2026-02-08T20:41:52,178 creating build/lib/evalscope/benchmarks/terminal_bench 2026-02-08T20:41:52,179 copying evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-02-08T20:41:52,181 copying evalscope/benchmarks/terminal_bench/__init__.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-02-08T20:41:52,182 copying evalscope/benchmarks/terminal_bench/utils.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-02-08T20:41:52,185 creating build/lib/evalscope/benchmarks/process_bench 2026-02-08T20:41:52,186 copying evalscope/benchmarks/process_bench/__init__.py -> build/lib/evalscope/benchmarks/process_bench 2026-02-08T20:41:52,187 copying evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/lib/evalscope/benchmarks/process_bench 2026-02-08T20:41:52,190 creating build/lib/evalscope/benchmarks/sciq 2026-02-08T20:41:52,191 copying evalscope/benchmarks/sciq/__init__.py -> build/lib/evalscope/benchmarks/sciq 2026-02-08T20:41:52,193 copying evalscope/benchmarks/sciq/sciq_adapter.py -> build/lib/evalscope/benchmarks/sciq 2026-02-08T20:41:52,195 creating build/lib/evalscope/benchmarks/scicode 2026-02-08T20:41:52,196 copying evalscope/benchmarks/scicode/__init__.py -> build/lib/evalscope/benchmarks/scicode 2026-02-08T20:41:52,197 copying evalscope/benchmarks/scicode/util.py -> build/lib/evalscope/benchmarks/scicode 2026-02-08T20:41:52,199 copying evalscope/benchmarks/scicode/scicode_adapter.py -> build/lib/evalscope/benchmarks/scicode 2026-02-08T20:41:52,202 copying evalscope/benchmarks/scicode/prompt_templates.py -> build/lib/evalscope/benchmarks/scicode 2026-02-08T20:41:52,204 creating build/lib/evalscope/benchmarks/cmmu 2026-02-08T20:41:52,205 copying evalscope/benchmarks/cmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmu 2026-02-08T20:41:52,207 copying evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmu 2026-02-08T20:41:52,209 copying evalscope/benchmarks/cmmu/prompt.py -> build/lib/evalscope/benchmarks/cmmu 2026-02-08T20:41:52,211 creating build/lib/evalscope/benchmarks/minerva_math 2026-02-08T20:41:52,212 copying evalscope/benchmarks/minerva_math/__init__.py -> build/lib/evalscope/benchmarks/minerva_math 2026-02-08T20:41:52,214 copying evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/lib/evalscope/benchmarks/minerva_math 2026-02-08T20:41:52,216 creating build/lib/evalscope/benchmarks/visu_logic 2026-02-08T20:41:52,217 copying evalscope/benchmarks/visu_logic/__init__.py -> build/lib/evalscope/benchmarks/visu_logic 2026-02-08T20:41:52,219 copying evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/lib/evalscope/benchmarks/visu_logic 2026-02-08T20:41:52,221 creating build/lib/evalscope/benchmarks/math_qa 2026-02-08T20:41:52,222 copying evalscope/benchmarks/math_qa/__init__.py -> build/lib/evalscope/benchmarks/math_qa 2026-02-08T20:41:52,223 copying evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/lib/evalscope/benchmarks/math_qa 2026-02-08T20:41:52,226 creating build/lib/evalscope/benchmarks/aa_lcr 2026-02-08T20:41:52,226 copying evalscope/benchmarks/aa_lcr/__init__.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-02-08T20:41:52,228 copying evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-02-08T20:41:52,231 creating build/lib/evalscope/benchmarks/bfcl 2026-02-08T20:41:52,232 copying evalscope/benchmarks/bfcl/__init__.py -> build/lib/evalscope/benchmarks/bfcl 2026-02-08T20:41:52,234 creating build/lib/evalscope/benchmarks/tool_bench 2026-02-08T20:41:52,235 copying evalscope/benchmarks/tool_bench/__init__.py -> build/lib/evalscope/benchmarks/tool_bench 2026-02-08T20:41:52,236 copying evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/lib/evalscope/benchmarks/tool_bench 2026-02-08T20:41:52,238 copying evalscope/benchmarks/tool_bench/utils.py -> build/lib/evalscope/benchmarks/tool_bench 2026-02-08T20:41:52,241 creating build/lib/evalscope/benchmarks/omni_bench 2026-02-08T20:41:52,242 copying evalscope/benchmarks/omni_bench/__init__.py -> build/lib/evalscope/benchmarks/omni_bench 2026-02-08T20:41:52,243 copying evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/lib/evalscope/benchmarks/omni_bench 2026-02-08T20:41:52,246 creating build/lib/evalscope/benchmarks/amc 2026-02-08T20:41:52,247 copying evalscope/benchmarks/amc/__init__.py -> build/lib/evalscope/benchmarks/amc 2026-02-08T20:41:52,248 copying evalscope/benchmarks/amc/amc_adapter.py -> build/lib/evalscope/benchmarks/amc 2026-02-08T20:41:52,250 creating build/lib/evalscope/benchmarks/ceval 2026-02-08T20:41:52,252 copying evalscope/benchmarks/ceval/__init__.py -> build/lib/evalscope/benchmarks/ceval 2026-02-08T20:41:52,253 copying evalscope/benchmarks/ceval/ceval_adapter.py -> build/lib/evalscope/benchmarks/ceval 2026-02-08T20:41:52,256 creating build/lib/evalscope/benchmarks/biomix_qa 2026-02-08T20:41:52,257 copying evalscope/benchmarks/biomix_qa/__init__.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-02-08T20:41:52,259 copying evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-02-08T20:41:52,261 creating build/lib/evalscope/benchmarks/gpqa 2026-02-08T20:41:52,262 copying evalscope/benchmarks/gpqa/__init__.py -> build/lib/evalscope/benchmarks/gpqa 2026-02-08T20:41:52,264 copying evalscope/benchmarks/gpqa/prompt.py -> build/lib/evalscope/benchmarks/gpqa 2026-02-08T20:41:52,267 copying evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/lib/evalscope/benchmarks/gpqa 2026-02-08T20:41:52,269 creating build/lib/evalscope/benchmarks/librispeech 2026-02-08T20:41:52,270 copying evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/lib/evalscope/benchmarks/librispeech 2026-02-08T20:41:52,272 copying evalscope/benchmarks/librispeech/__init__.py -> build/lib/evalscope/benchmarks/librispeech 2026-02-08T20:41:52,274 creating build/lib/evalscope/benchmarks/mri_mcqa 2026-02-08T20:41:52,275 copying evalscope/benchmarks/mri_mcqa/__init__.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-02-08T20:41:52,277 copying evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-02-08T20:41:52,279 creating build/lib/evalscope/benchmarks/drop 2026-02-08T20:41:52,280 copying evalscope/benchmarks/drop/drop_adapter.py -> build/lib/evalscope/benchmarks/drop 2026-02-08T20:41:52,282 copying evalscope/benchmarks/drop/__init__.py -> build/lib/evalscope/benchmarks/drop 2026-02-08T20:41:52,284 copying evalscope/benchmarks/drop/utils.py -> build/lib/evalscope/benchmarks/drop 2026-02-08T20:41:52,287 creating build/lib/evalscope/benchmarks/mbppplus 2026-02-08T20:41:52,288 copying evalscope/benchmarks/mbppplus/__init__.py -> build/lib/evalscope/benchmarks/mbppplus 2026-02-08T20:41:52,289 copying evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/lib/evalscope/benchmarks/mbppplus 2026-02-08T20:41:52,292 creating build/lib/evalscope/benchmarks/refcoco 2026-02-08T20:41:52,293 copying evalscope/benchmarks/refcoco/evaluation_lib.py -> build/lib/evalscope/benchmarks/refcoco 2026-02-08T20:41:52,295 copying evalscope/benchmarks/refcoco/__init__.py -> build/lib/evalscope/benchmarks/refcoco 2026-02-08T20:41:52,297 copying evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/lib/evalscope/benchmarks/refcoco 2026-02-08T20:41:52,299 copying evalscope/benchmarks/refcoco/utils.py -> build/lib/evalscope/benchmarks/refcoco 2026-02-08T20:41:52,301 creating build/lib/evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:52,302 copying evalscope/benchmarks/image_edit/gedit/__init__.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:52,304 copying evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:52,307 copying evalscope/benchmarks/image_edit/gedit/utils.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:52,309 copying evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:52,313 creating build/lib/evalscope/benchmarks/poly_math/utils 2026-02-08T20:41:52,314 copying evalscope/benchmarks/poly_math/utils/instruction.py -> build/lib/evalscope/benchmarks/poly_math/utils 2026-02-08T20:41:52,317 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,318 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,320 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,322 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,324 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,326 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,328 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,331 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,333 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,335 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:52,339 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-02-08T20:41:52,340 copying evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-02-08T20:41:52,343 copying evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-02-08T20:41:52,345 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:52,346 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:52,349 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:52,350 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:52,353 creating build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,354 copying evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,356 copying evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,358 copying evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,360 copying evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,362 copying evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,363 copying evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:52,366 creating build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:52,367 copying evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:52,369 copying evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:52,371 copying evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:52,374 creating build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:52,375 copying evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:52,377 copying evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:52,379 copying evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:52,383 creating build/lib/evalscope/benchmarks/scicode/docker 2026-02-08T20:41:52,384 copying evalscope/benchmarks/scicode/docker/test_util.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-02-08T20:41:52,386 copying evalscope/benchmarks/scicode/docker/process_data.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-02-08T20:41:52,389 creating build/lib/evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:52,390 copying evalscope/benchmarks/bfcl/v4/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:52,392 copying evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:52,394 copying evalscope/benchmarks/bfcl/v4/utils.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:52,397 creating build/lib/evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:52,399 copying evalscope/benchmarks/bfcl/v3/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:52,400 copying evalscope/benchmarks/bfcl/v3/generation.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:52,403 copying evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:52,406 copying evalscope/benchmarks/bfcl/v3/utils.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:52,408 creating build/lib/evalscope/api/model 2026-02-08T20:41:52,410 copying evalscope/api/model/__init__.py -> build/lib/evalscope/api/model 2026-02-08T20:41:52,411 copying evalscope/api/model/lazy_model.py -> build/lib/evalscope/api/model 2026-02-08T20:41:52,414 copying evalscope/api/model/model_output.py -> build/lib/evalscope/api/model 2026-02-08T20:41:52,416 copying evalscope/api/model/generate_config.py -> build/lib/evalscope/api/model 2026-02-08T20:41:52,419 copying evalscope/api/model/model.py -> build/lib/evalscope/api/model 2026-02-08T20:41:52,421 creating build/lib/evalscope/api/messages 2026-02-08T20:41:52,422 copying evalscope/api/messages/__init__.py -> build/lib/evalscope/api/messages 2026-02-08T20:41:52,425 copying evalscope/api/messages/chat_message.py -> build/lib/evalscope/api/messages 2026-02-08T20:41:52,427 copying evalscope/api/messages/content.py -> build/lib/evalscope/api/messages 2026-02-08T20:41:52,429 copying evalscope/api/messages/utils.py -> build/lib/evalscope/api/messages 2026-02-08T20:41:52,431 creating build/lib/evalscope/api/metric 2026-02-08T20:41:52,432 copying evalscope/api/metric/__init__.py -> build/lib/evalscope/api/metric 2026-02-08T20:41:52,434 copying evalscope/api/metric/metric.py -> build/lib/evalscope/api/metric 2026-02-08T20:41:52,436 copying evalscope/api/metric/scorer.py -> build/lib/evalscope/api/metric 2026-02-08T20:41:52,438 creating build/lib/evalscope/api/filter 2026-02-08T20:41:52,439 copying evalscope/api/filter/__init__.py -> build/lib/evalscope/api/filter 2026-02-08T20:41:52,441 copying evalscope/api/filter/filter.py -> build/lib/evalscope/api/filter 2026-02-08T20:41:52,443 creating build/lib/evalscope/api/dataset 2026-02-08T20:41:52,444 copying evalscope/api/dataset/__init__.py -> build/lib/evalscope/api/dataset 2026-02-08T20:41:52,446 copying evalscope/api/dataset/loader.py -> build/lib/evalscope/api/dataset 2026-02-08T20:41:52,449 copying evalscope/api/dataset/dataset.py -> build/lib/evalscope/api/dataset 2026-02-08T20:41:52,451 copying evalscope/api/dataset/utils.py -> build/lib/evalscope/api/dataset 2026-02-08T20:41:52,454 creating build/lib/evalscope/api/evaluator 2026-02-08T20:41:52,455 copying evalscope/api/evaluator/__init__.py -> build/lib/evalscope/api/evaluator 2026-02-08T20:41:52,457 copying evalscope/api/evaluator/evaluator.py -> build/lib/evalscope/api/evaluator 2026-02-08T20:41:52,459 copying evalscope/api/evaluator/cache.py -> build/lib/evalscope/api/evaluator 2026-02-08T20:41:52,461 copying evalscope/api/evaluator/state.py -> build/lib/evalscope/api/evaluator 2026-02-08T20:41:52,464 creating build/lib/evalscope/api/tool 2026-02-08T20:41:52,465 copying evalscope/api/tool/__init__.py -> build/lib/evalscope/api/tool 2026-02-08T20:41:52,467 copying evalscope/api/tool/tool_info.py -> build/lib/evalscope/api/tool 2026-02-08T20:41:52,470 copying evalscope/api/tool/tool_call.py -> build/lib/evalscope/api/tool 2026-02-08T20:41:52,472 copying evalscope/api/tool/utils.py -> build/lib/evalscope/api/tool 2026-02-08T20:41:52,474 creating build/lib/evalscope/api/benchmark 2026-02-08T20:41:52,475 copying evalscope/api/benchmark/meta.py -> build/lib/evalscope/api/benchmark 2026-02-08T20:41:52,478 copying evalscope/api/benchmark/__init__.py -> build/lib/evalscope/api/benchmark 2026-02-08T20:41:52,479 copying evalscope/api/benchmark/benchmark.py -> build/lib/evalscope/api/benchmark 2026-02-08T20:41:52,482 creating build/lib/evalscope/api/mixin 2026-02-08T20:41:52,483 copying evalscope/api/mixin/__init__.py -> build/lib/evalscope/api/mixin 2026-02-08T20:41:52,485 copying evalscope/api/mixin/llm_judge_mixin.py -> build/lib/evalscope/api/mixin 2026-02-08T20:41:52,488 copying evalscope/api/mixin/sandbox_mixin.py -> build/lib/evalscope/api/mixin 2026-02-08T20:41:52,491 creating build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,492 copying evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,494 copying evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,496 copying evalscope/api/benchmark/adapters/default_data_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,499 copying evalscope/api/benchmark/adapters/__init__.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,500 copying evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,502 copying evalscope/api/benchmark/adapters/text2image_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,505 copying evalscope/api/benchmark/adapters/ner_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,507 copying evalscope/api/benchmark/adapters/agent_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-02-08T20:41:52,509 creating build/lib/evalscope/service/frontend 2026-02-08T20:41:52,510 copying evalscope/service/frontend/__init__.py -> build/lib/evalscope/service/frontend 2026-02-08T20:41:52,512 copying evalscope/service/frontend/async_client.py -> build/lib/evalscope/service/frontend 2026-02-08T20:41:52,514 copying evalscope/service/frontend/main.py -> build/lib/evalscope/service/frontend 2026-02-08T20:41:52,516 copying evalscope/service/frontend/utils.py -> build/lib/evalscope/service/frontend 2026-02-08T20:41:52,519 creating build/lib/evalscope/backend/rag_eval 2026-02-08T20:41:52,520 copying evalscope/backend/rag_eval/__init__.py -> build/lib/evalscope/backend/rag_eval 2026-02-08T20:41:52,522 copying evalscope/backend/rag_eval/backend_manager.py -> build/lib/evalscope/backend/rag_eval 2026-02-08T20:41:52,525 creating build/lib/evalscope/backend/opencompass 2026-02-08T20:41:52,526 copying evalscope/backend/opencompass/__init__.py -> build/lib/evalscope/backend/opencompass 2026-02-08T20:41:52,528 copying evalscope/backend/opencompass/backend_manager.py -> build/lib/evalscope/backend/opencompass 2026-02-08T20:41:52,530 copying evalscope/backend/opencompass/api_meta_template.py -> build/lib/evalscope/backend/opencompass 2026-02-08T20:41:52,532 creating build/lib/evalscope/backend/vlm_eval_kit 2026-02-08T20:41:52,533 copying evalscope/backend/vlm_eval_kit/__init__.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-02-08T20:41:52,535 copying evalscope/backend/vlm_eval_kit/backend_manager.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-02-08T20:41:52,538 creating build/lib/evalscope/backend/rag_eval/ragas 2026-02-08T20:41:52,539 copying evalscope/backend/rag_eval/ragas/task_template.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-02-08T20:41:52,541 copying evalscope/backend/rag_eval/ragas/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-02-08T20:41:52,543 copying evalscope/backend/rag_eval/ragas/arguments.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-02-08T20:41:52,545 creating build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:52,546 copying evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:52,548 copying evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:52,550 copying evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:52,553 copying evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:52,555 creating build/lib/evalscope/backend/rag_eval/utils 2026-02-08T20:41:52,556 copying evalscope/backend/rag_eval/utils/tools.py -> build/lib/evalscope/backend/rag_eval/utils 2026-02-08T20:41:52,558 copying evalscope/backend/rag_eval/utils/__init__.py -> build/lib/evalscope/backend/rag_eval/utils 2026-02-08T20:41:52,560 copying evalscope/backend/rag_eval/utils/embedding.py -> build/lib/evalscope/backend/rag_eval/utils 2026-02-08T20:41:52,562 copying evalscope/backend/rag_eval/utils/clip.py -> build/lib/evalscope/backend/rag_eval/utils 2026-02-08T20:41:52,564 copying evalscope/backend/rag_eval/utils/llm.py -> build/lib/evalscope/backend/rag_eval/utils 2026-02-08T20:41:52,566 creating build/lib/evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:52,567 copying evalscope/backend/rag_eval/cmteb/task_template.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:52,569 copying evalscope/backend/rag_eval/cmteb/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:52,571 copying evalscope/backend/rag_eval/cmteb/base.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:52,573 copying evalscope/backend/rag_eval/cmteb/arguments.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:52,575 creating build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:52,576 copying evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:52,578 copying evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:52,580 copying evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:52,582 copying evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:52,584 copying evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:52,587 creating build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-02-08T20:41:52,588 copying evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-02-08T20:41:52,590 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:52,591 copying evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:52,593 copying evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:52,595 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:52,597 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:52,600 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-02-08T20:41:52,601 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-02-08T20:41:52,604 creating build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,605 copying evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,608 copying evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,609 copying evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,612 copying evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,614 copying evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,616 copying evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,618 copying evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,620 copying evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:52,623 creating build/lib/evalscope/backend/opencompass/tasks 2026-02-08T20:41:52,624 copying evalscope/backend/opencompass/tasks/eval_api.py -> build/lib/evalscope/backend/opencompass/tasks 2026-02-08T20:41:52,626 copying evalscope/backend/opencompass/tasks/__init__.py -> build/lib/evalscope/backend/opencompass/tasks 2026-02-08T20:41:52,628 copying evalscope/backend/opencompass/tasks/eval_datasets.py -> build/lib/evalscope/backend/opencompass/tasks 2026-02-08T20:41:52,631 creating build/lib/evalscope/app/ui 2026-02-08T20:41:52,631 copying evalscope/app/ui/visualization.py -> build/lib/evalscope/app/ui 2026-02-08T20:41:52,633 copying evalscope/app/ui/sidebar.py -> build/lib/evalscope/app/ui 2026-02-08T20:41:52,635 copying evalscope/app/ui/__init__.py -> build/lib/evalscope/app/ui 2026-02-08T20:41:52,637 copying evalscope/app/ui/multi_model.py -> build/lib/evalscope/app/ui 2026-02-08T20:41:52,639 copying evalscope/app/ui/single_model.py -> build/lib/evalscope/app/ui 2026-02-08T20:41:52,641 copying evalscope/app/ui/app_ui.py -> build/lib/evalscope/app/ui 2026-02-08T20:41:52,644 creating build/lib/evalscope/app/utils 2026-02-08T20:41:52,645 copying evalscope/app/utils/visualization.py -> build/lib/evalscope/app/utils 2026-02-08T20:41:52,647 copying evalscope/app/utils/localization.py -> build/lib/evalscope/app/utils 2026-02-08T20:41:52,649 copying evalscope/app/utils/data_utils.py -> build/lib/evalscope/app/utils 2026-02-08T20:41:52,651 copying evalscope/app/utils/env_utils.py -> build/lib/evalscope/app/utils 2026-02-08T20:41:52,653 copying evalscope/app/utils/text_utils.py -> build/lib/evalscope/app/utils 2026-02-08T20:41:52,656 creating build/lib/evalscope/models/utils 2026-02-08T20:41:52,657 copying evalscope/models/utils/openai.py -> build/lib/evalscope/models/utils 2026-02-08T20:41:52,662 running egg_info 2026-02-08T20:41:52,672 writing evalscope.egg-info/PKG-INFO 2026-02-08T20:41:52,692 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-02-08T20:41:52,694 writing entry points to evalscope.egg-info/entry_points.txt 2026-02-08T20:41:52,704 writing requirements to evalscope.egg-info/requires.txt 2026-02-08T20:41:52,705 writing top-level names to evalscope.egg-info/top_level.txt 2026-02-08T20:41:52,881 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:52,921 reading manifest template 'MANIFEST.in' 2026-02-08T20:41:53,259 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-02-08T20:41:53,264 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-02-08T20:41:53,269 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-02-08T20:41:53,275 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-02-08T20:41:53,276 adding license file 'LICENSE' 2026-02-08T20:41:53,321 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-02-08T20:41:53,420 copying evalscope/third_party/longbench_write/README.md -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:53,423 copying evalscope/third_party/longbench_write/default_task.json -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:53,425 copying evalscope/third_party/longbench_write/default_task.yaml -> build/lib/evalscope/third_party/longbench_write 2026-02-08T20:41:53,427 copying evalscope/third_party/toolbench_static/README.md -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:53,429 copying evalscope/third_party/toolbench_static/config_default.json -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:53,431 copying evalscope/third_party/toolbench_static/config_default.yaml -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:53,434 copying evalscope/third_party/toolbench_static/requirements.txt -> build/lib/evalscope/third_party/toolbench_static 2026-02-08T20:41:53,436 copying evalscope/third_party/longbench_write/resources/judge.txt -> build/lib/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,438 copying evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,442 copying evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,445 copying evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,447 creating build/lib/evalscope/third_party/thinkbench/resources 2026-02-08T20:41:53,448 copying evalscope/third_party/thinkbench/resources/critique_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-02-08T20:41:53,450 copying evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-02-08T20:41:53,452 copying evalscope/metrics/text_normalizer/english.json -> build/lib/evalscope/metrics/text_normalizer 2026-02-08T20:41:53,456 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-02-08T20:41:53,457 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-02-08T20:41:53,459 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,460 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,462 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,464 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,466 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,467 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,469 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,471 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,473 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,476 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,478 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,480 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,482 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,484 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,487 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,489 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,491 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,493 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,495 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,497 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,499 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,502 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,504 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,506 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,508 copying evalscope/benchmarks/trivia_qa/samples.jsonl -> build/lib/evalscope/benchmarks/trivia_qa 2026-02-08T20:41:53,510 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:53,512 creating build/lib/evalscope/benchmarks/humanevalplus/docker 2026-02-08T20:41:53,513 copying evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/lib/evalscope/benchmarks/humanevalplus/docker 2026-02-08T20:41:53,515 creating build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,516 copying evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,519 copying evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,521 copying evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,523 copying evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,525 copying evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,528 copying evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,530 copying evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,533 copying evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,535 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,537 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,539 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,542 copying evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,544 copying evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,546 copying evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,548 copying evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,550 copying evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,552 copying evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,554 copying evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,557 copying evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,559 copying evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,562 copying evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,564 copying evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,566 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,568 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,570 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,573 copying evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,575 copying evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:53,578 copying evalscope/benchmarks/scicode/docker/Dockerfile -> build/lib/evalscope/benchmarks/scicode/docker 2026-02-08T20:41:53,581 copying evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/lib/evalscope/benchmarks/scicode/docker 2026-02-08T20:41:53,583 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-02-08T20:41:53,676 installing to build/bdist.linux-armv7l/wheel 2026-02-08T20:41:53,677 running install 2026-02-08T20:41:53,701 running install_lib 2026-02-08T20:41:53,706 creating build/bdist.linux-armv7l/wheel 2026-02-08T20:41:53,708 creating build/bdist.linux-armv7l/wheel/evalscope 2026-02-08T20:41:53,709 copying build/lib/evalscope/version.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-02-08T20:41:53,711 copying build/lib/evalscope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-02-08T20:41:53,714 creating build/bdist.linux-armv7l/wheel/evalscope/third_party 2026-02-08T20:41:53,715 copying build/lib/evalscope/third_party/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party 2026-02-08T20:41:53,717 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write 2026-02-08T20:41:53,718 copying build/lib/evalscope/third_party/longbench_write/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,720 copying build/lib/evalscope/third_party/longbench_write/default_task.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,722 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,723 copying build/lib/evalscope/third_party/longbench_write/resources/judge.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,725 copying build/lib/evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,728 copying build/lib/evalscope/third_party/longbench_write/resources/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,729 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,732 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-02-08T20:41:53,735 copying build/lib/evalscope/third_party/longbench_write/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,737 copying build/lib/evalscope/third_party/longbench_write/longbench_write.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,739 copying build/lib/evalscope/third_party/longbench_write/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,742 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/tools 2026-02-08T20:41:53,743 copying build/lib/evalscope/third_party/longbench_write/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-02-08T20:41:53,746 copying build/lib/evalscope/third_party/longbench_write/tools/data_etl.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-02-08T20:41:53,749 copying build/lib/evalscope/third_party/longbench_write/tools/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-02-08T20:41:53,751 copying build/lib/evalscope/third_party/longbench_write/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,754 copying build/lib/evalscope/third_party/longbench_write/default_task.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,756 copying build/lib/evalscope/third_party/longbench_write/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-02-08T20:41:53,758 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static 2026-02-08T20:41:53,759 copying build/lib/evalscope/third_party/toolbench_static/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,762 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static/llm 2026-02-08T20:41:53,763 copying build/lib/evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-02-08T20:41:53,765 copying build/lib/evalscope/third_party/toolbench_static/llm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-02-08T20:41:53,767 copying build/lib/evalscope/third_party/toolbench_static/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,769 copying build/lib/evalscope/third_party/toolbench_static/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,771 copying build/lib/evalscope/third_party/toolbench_static/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,773 copying build/lib/evalscope/third_party/toolbench_static/config_default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,774 copying build/lib/evalscope/third_party/toolbench_static/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,776 copying build/lib/evalscope/third_party/toolbench_static/config_default.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,778 copying build/lib/evalscope/third_party/toolbench_static/toolbench_static.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-02-08T20:41:53,780 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench 2026-02-08T20:41:53,782 copying build/lib/evalscope/third_party/thinkbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-02-08T20:41:53,785 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/resources 2026-02-08T20:41:53,786 copying build/lib/evalscope/third_party/thinkbench/resources/critique_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-02-08T20:41:53,788 copying build/lib/evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-02-08T20:41:53,789 copying build/lib/evalscope/third_party/thinkbench/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-02-08T20:41:53,791 copying build/lib/evalscope/third_party/thinkbench/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-02-08T20:41:53,795 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/tools 2026-02-08T20:41:53,796 copying build/lib/evalscope/third_party/thinkbench/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-02-08T20:41:53,797 copying build/lib/evalscope/third_party/thinkbench/tools/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-02-08T20:41:53,799 copying build/lib/evalscope/third_party/thinkbench/tools/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-02-08T20:41:53,801 creating build/bdist.linux-armv7l/wheel/evalscope/metrics 2026-02-08T20:41:53,803 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bert_score 2026-02-08T20:41:53,804 copying build/lib/evalscope/metrics/bert_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-02-08T20:41:53,806 copying build/lib/evalscope/metrics/bert_score/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-02-08T20:41:53,809 copying build/lib/evalscope/metrics/bert_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-02-08T20:41:53,811 copying build/lib/evalscope/metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-02-08T20:41:53,813 copying build/lib/evalscope/metrics/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-02-08T20:41:53,816 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/sem_score 2026-02-08T20:41:53,817 copying build/lib/evalscope/metrics/sem_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-02-08T20:41:53,819 copying build/lib/evalscope/metrics/sem_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-02-08T20:41:53,821 copying build/lib/evalscope/metrics/rouge_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-02-08T20:41:53,823 copying build/lib/evalscope/metrics/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-02-08T20:41:53,826 copying build/lib/evalscope/metrics/llm_judge.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-02-08T20:41:53,829 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/text_normalizer 2026-02-08T20:41:53,830 copying build/lib/evalscope/metrics/text_normalizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-02-08T20:41:53,831 copying build/lib/evalscope/metrics/text_normalizer/basic.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-02-08T20:41:53,833 copying build/lib/evalscope/metrics/text_normalizer/chinese.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-02-08T20:41:53,836 copying build/lib/evalscope/metrics/text_normalizer/wer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-02-08T20:41:53,839 copying build/lib/evalscope/metrics/text_normalizer/english.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-02-08T20:41:53,841 copying build/lib/evalscope/metrics/text_normalizer/english.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-02-08T20:41:53,845 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,846 copying build/lib/evalscope/metrics/t2v_metrics/itmscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,848 copying build/lib/evalscope/metrics/t2v_metrics/vqascore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,849 copying build/lib/evalscope/metrics/t2v_metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,851 copying build/lib/evalscope/metrics/t2v_metrics/score.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,853 copying build/lib/evalscope/metrics/t2v_metrics/clipscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,854 copying build/lib/evalscope/metrics/t2v_metrics/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-02-08T20:41:53,857 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:53,858 copying build/lib/evalscope/metrics/t2v_metrics/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:53,860 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:53,861 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:53,863 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:53,865 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:53,867 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:53,868 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:53,871 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:53,872 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-02-08T20:41:53,875 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:53,877 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:53,878 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-02-08T20:41:53,881 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:53,882 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:53,884 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:53,886 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:53,888 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-02-08T20:41:53,890 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:53,891 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:53,894 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:53,895 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-02-08T20:41:53,897 copying build/lib/evalscope/metrics/t2v_metrics/models/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:53,899 copying build/lib/evalscope/metrics/t2v_metrics/models/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-02-08T20:41:53,901 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:53,902 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:53,905 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:53,906 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:53,908 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-02-08T20:41:53,909 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-02-08T20:41:53,912 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-02-08T20:41:53,913 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-02-08T20:41:53,914 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-02-08T20:41:53,916 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-02-08T20:41:53,918 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-02-08T20:41:53,920 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-02-08T20:41:53,921 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-02-08T20:41:53,924 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-02-08T20:41:53,925 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-02-08T20:41:53,927 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:53,930 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-02-08T20:41:53,931 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-02-08T20:41:53,933 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-02-08T20:41:53,934 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-02-08T20:41:53,937 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,938 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,940 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,942 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,943 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,945 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,947 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,949 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,950 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,952 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,954 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,955 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,957 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,959 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,961 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,962 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,964 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,966 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,968 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,969 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,971 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,973 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,975 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-02-08T20:41:53,977 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-02-08T20:41:53,979 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:53,980 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:53,982 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:53,984 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:53,986 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:53,989 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:53,990 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:53,992 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:53,994 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-02-08T20:41:53,996 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:53,999 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:54,001 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-02-08T20:41:54,004 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,005 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,008 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,009 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,013 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,015 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,017 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,022 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,027 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,032 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,035 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,037 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,040 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-02-08T20:41:54,042 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,044 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,046 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,050 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,051 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,053 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,055 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,057 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,059 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,062 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,064 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,066 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,068 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,070 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,073 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-02-08T20:41:54,075 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,077 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-02-08T20:41:54,081 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:54,082 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:54,084 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:54,086 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:54,088 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-02-08T20:41:54,090 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-02-08T20:41:54,093 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bundled_rouge_score 2026-02-08T20:41:54,094 copying build/lib/evalscope/metrics/bundled_rouge_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-02-08T20:41:54,096 copying build/lib/evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-02-08T20:41:54,098 copying build/lib/evalscope/metrics/math_parser.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-02-08T20:41:54,101 creating build/bdist.linux-armv7l/wheel/evalscope/perf 2026-02-08T20:41:54,102 copying build/lib/evalscope/perf/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-02-08T20:41:54,105 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin 2026-02-08T20:41:54,106 copying build/lib/evalscope/perf/plugin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-02-08T20:41:54,108 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/datasets 2026-02-08T20:41:54,109 copying build/lib/evalscope/perf/plugin/datasets/random_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,111 copying build/lib/evalscope/perf/plugin/datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,113 copying build/lib/evalscope/perf/plugin/datasets/openqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,115 copying build/lib/evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,117 copying build/lib/evalscope/perf/plugin/datasets/embedding_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,119 copying build/lib/evalscope/perf/plugin/datasets/line_by_line.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,121 copying build/lib/evalscope/perf/plugin/datasets/speed_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,123 copying build/lib/evalscope/perf/plugin/datasets/kontext_bench.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,125 copying build/lib/evalscope/perf/plugin/datasets/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,127 copying build/lib/evalscope/perf/plugin/datasets/flickr8k.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,128 copying build/lib/evalscope/perf/plugin/datasets/longalpaca.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,130 copying build/lib/evalscope/perf/plugin/datasets/rerank_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,132 copying build/lib/evalscope/perf/plugin/datasets/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,134 copying build/lib/evalscope/perf/plugin/datasets/custom.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-02-08T20:41:54,137 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/api 2026-02-08T20:41:54,138 copying build/lib/evalscope/perf/plugin/api/openai_embedding_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,140 copying build/lib/evalscope/perf/plugin/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,142 copying build/lib/evalscope/perf/plugin/api/dashscope_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,144 copying build/lib/evalscope/perf/plugin/api/openai_rerank_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,146 copying build/lib/evalscope/perf/plugin/api/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,148 copying build/lib/evalscope/perf/plugin/api/default_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,150 copying build/lib/evalscope/perf/plugin/api/custom_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,153 copying build/lib/evalscope/perf/plugin/api/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-02-08T20:41:54,155 copying build/lib/evalscope/perf/plugin/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-02-08T20:41:54,157 copying build/lib/evalscope/perf/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-02-08T20:41:54,159 copying build/lib/evalscope/perf/http_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-02-08T20:41:54,161 copying build/lib/evalscope/perf/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-02-08T20:41:54,164 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils 2026-02-08T20:41:54,165 copying build/lib/evalscope/perf/utils/rich_display.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,167 copying build/lib/evalscope/perf/utils/benchmark_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,170 copying build/lib/evalscope/perf/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,171 copying build/lib/evalscope/perf/utils/local_server.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,173 copying build/lib/evalscope/perf/utils/log_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,175 copying build/lib/evalscope/perf/utils/db_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,177 copying build/lib/evalscope/perf/utils/handler.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,179 copying build/lib/evalscope/perf/utils/analysis_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-02-08T20:41:54,181 copying build/lib/evalscope/perf/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-02-08T20:41:54,183 creating build/bdist.linux-armv7l/wheel/evalscope/perf/sla 2026-02-08T20:41:54,184 copying build/lib/evalscope/perf/sla/sla_run.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-02-08T20:41:54,187 copying build/lib/evalscope/perf/sla/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-02-08T20:41:54,189 copying build/lib/evalscope/perf/sla/sla_criterion.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-02-08T20:41:54,191 creating build/bdist.linux-armv7l/wheel/evalscope/cli 2026-02-08T20:41:54,192 copying build/lib/evalscope/cli/start_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,194 copying build/lib/evalscope/cli/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,196 copying build/lib/evalscope/cli/cli.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,198 copying build/lib/evalscope/cli/start_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,200 copying build/lib/evalscope/cli/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,202 copying build/lib/evalscope/cli/start_app.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,204 copying build/lib/evalscope/cli/start_perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-02-08T20:41:54,207 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks 2026-02-08T20:41:54,209 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmmu 2026-02-08T20:41:54,210 copying build/lib/evalscope/benchmarks/cmmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-02-08T20:41:54,212 copying build/lib/evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-02-08T20:41:54,215 copying build/lib/evalscope/benchmarks/cmmmu/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-02-08T20:41:54,218 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_qa 2026-02-08T20:41:54,220 copying build/lib/evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-02-08T20:41:54,222 copying build/lib/evalscope/benchmarks/general_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-02-08T20:41:54,225 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/science_qa 2026-02-08T20:41:54,226 copying build/lib/evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-02-08T20:41:54,228 copying build/lib/evalscope/benchmarks/science_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-02-08T20:41:54,230 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_fc 2026-02-08T20:41:54,231 copying build/lib/evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-02-08T20:41:54,233 copying build/lib/evalscope/benchmarks/general_fc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-02-08T20:41:54,235 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drivelology 2026-02-08T20:41:54,236 copying build/lib/evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-02-08T20:41:54,238 copying build/lib/evalscope/benchmarks/drivelology/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-02-08T20:41:54,240 copying build/lib/evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-02-08T20:41:54,242 copying build/lib/evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-02-08T20:41:54,244 copying build/lib/evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-02-08T20:41:54,247 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/commonsense_qa 2026-02-08T20:41:54,248 copying build/lib/evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-02-08T20:41:54,249 copying build/lib/evalscope/benchmarks/commonsense_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-02-08T20:41:54,251 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/healthbench 2026-02-08T20:41:54,252 copying build/lib/evalscope/benchmarks/healthbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-02-08T20:41:54,254 copying build/lib/evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-02-08T20:41:54,256 copying build/lib/evalscope/benchmarks/healthbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-02-08T20:41:54,258 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/music_trivia 2026-02-08T20:41:54,259 copying build/lib/evalscope/benchmarks/music_trivia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-02-08T20:41:54,261 copying build/lib/evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-02-08T20:41:54,263 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_redux 2026-02-08T20:41:54,264 copying build/lib/evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-02-08T20:41:54,266 copying build/lib/evalscope/benchmarks/mmlu_redux/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-02-08T20:41:54,269 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/super_gpqa 2026-02-08T20:41:54,270 copying build/lib/evalscope/benchmarks/super_gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-02-08T20:41:54,272 copying build/lib/evalscope/benchmarks/super_gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-02-08T20:41:54,275 copying build/lib/evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-02-08T20:41:54,278 copying build/lib/evalscope/benchmarks/super_gpqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-02-08T20:41:54,282 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit 2026-02-08T20:41:54,283 copying build/lib/evalscope/benchmarks/image_edit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit 2026-02-08T20:41:54,285 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:54,286 copying build/lib/evalscope/benchmarks/image_edit/gedit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:54,288 copying build/lib/evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:54,291 copying build/lib/evalscope/benchmarks/image_edit/gedit/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:54,294 copying build/lib/evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-02-08T20:41:54,299 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pope 2026-02-08T20:41:54,300 copying build/lib/evalscope/benchmarks/pope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-02-08T20:41:54,303 copying build/lib/evalscope/benchmarks/pope/pope_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-02-08T20:41:54,307 copying build/lib/evalscope/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks 2026-02-08T20:41:54,310 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/text2image 2026-02-08T20:41:54,312 copying build/lib/evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-02-08T20:41:54,315 copying build/lib/evalscope/benchmarks/text2image/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-02-08T20:41:54,317 copying build/lib/evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-02-08T20:41:54,321 copying build/lib/evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-02-08T20:41:54,324 copying build/lib/evalscope/benchmarks/text2image/tifa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-02-08T20:41:54,357 copying build/lib/evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-02-08T20:41:54,362 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_500 2026-02-08T20:41:54,363 copying build/lib/evalscope/benchmarks/math_500/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-02-08T20:41:54,367 copying build/lib/evalscope/benchmarks/math_500/math_500_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-02-08T20:41:54,371 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/truthful_qa 2026-02-08T20:41:54,373 copying build/lib/evalscope/benchmarks/truthful_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-02-08T20:41:54,377 copying build/lib/evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-02-08T20:41:54,383 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math 2026-02-08T20:41:54,385 copying build/lib/evalscope/benchmarks/poly_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-02-08T20:41:54,390 copying build/lib/evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-02-08T20:41:54,398 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math/utils 2026-02-08T20:41:54,401 copying build/lib/evalscope/benchmarks/poly_math/utils/instruction.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math/utils 2026-02-08T20:41:54,411 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_qa 2026-02-08T20:41:54,414 copying build/lib/evalscope/benchmarks/simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-02-08T20:41:54,418 copying build/lib/evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-02-08T20:41:54,423 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_verse 2026-02-08T20:41:54,425 copying build/lib/evalscope/benchmarks/math_verse/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-02-08T20:41:54,428 copying build/lib/evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-02-08T20:41:54,432 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/competition_math 2026-02-08T20:41:54,434 copying build/lib/evalscope/benchmarks/competition_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-02-08T20:41:54,438 copying build/lib/evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-02-08T20:41:54,443 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/med_mcqa 2026-02-08T20:41:54,445 copying build/lib/evalscope/benchmarks/med_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-02-08T20:41:54,449 copying build/lib/evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-02-08T20:41:54,454 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/wmt 2026-02-08T20:41:54,456 copying build/lib/evalscope/benchmarks/wmt/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-02-08T20:41:54,459 copying build/lib/evalscope/benchmarks/wmt/wmt24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-02-08T20:41:54,465 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zerobench 2026-02-08T20:41:54,467 copying build/lib/evalscope/benchmarks/zerobench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-02-08T20:41:54,471 copying build/lib/evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-02-08T20:41:54,476 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/eq_bench 2026-02-08T20:41:54,478 copying build/lib/evalscope/benchmarks/eq_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-02-08T20:41:54,481 copying build/lib/evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-02-08T20:41:54,485 copying build/lib/evalscope/benchmarks/eq_bench/answer_validation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-02-08T20:41:54,489 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_bench 2026-02-08T20:41:54,491 copying build/lib/evalscope/benchmarks/mm_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-02-08T20:41:54,495 copying build/lib/evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-02-08T20:41:54,500 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vista 2026-02-08T20:41:54,502 copying build/lib/evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-02-08T20:41:54,507 copying build/lib/evalscope/benchmarks/math_vista/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-02-08T20:41:54,512 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_mcq 2026-02-08T20:41:54,513 copying build/lib/evalscope/benchmarks/general_mcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-02-08T20:41:54,517 copying build/lib/evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-02-08T20:41:54,521 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_star 2026-02-08T20:41:54,523 copying build/lib/evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-02-08T20:41:54,526 copying build/lib/evalscope/benchmarks/mm_star/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-02-08T20:41:54,528 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/maritime_bench 2026-02-08T20:41:54,529 copying build/lib/evalscope/benchmarks/maritime_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-02-08T20:41:54,532 copying build/lib/evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-02-08T20:41:54,535 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu 2026-02-08T20:41:54,536 copying build/lib/evalscope/benchmarks/mmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-02-08T20:41:54,538 copying build/lib/evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-02-08T20:41:54,542 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,543 copying build/lib/evalscope/benchmarks/live_code_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,546 copying build/lib/evalscope/benchmarks/live_code_bench/load_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,548 copying build/lib/evalscope/benchmarks/live_code_bench/testing_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,552 copying build/lib/evalscope/benchmarks/live_code_bench/extract_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,554 copying build/lib/evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,557 copying build/lib/evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,560 copying build/lib/evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,563 copying build/lib/evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,566 copying build/lib/evalscope/benchmarks/live_code_bench/prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-02-08T20:41:54,570 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/micro_vqa 2026-02-08T20:41:54,571 copying build/lib/evalscope/benchmarks/micro_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-02-08T20:41:54,574 copying build/lib/evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-02-08T20:41:54,577 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench 2026-02-08T20:41:54,578 copying build/lib/evalscope/benchmarks/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-02-08T20:41:54,581 copying build/lib/evalscope/benchmarks/swe_bench/build_images.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-02-08T20:41:54,584 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-02-08T20:41:54,587 copying build/lib/evalscope/benchmarks/swe_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-02-08T20:41:54,590 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/data_collection 2026-02-08T20:41:54,591 copying build/lib/evalscope/benchmarks/data_collection/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-02-08T20:41:54,592 copying build/lib/evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-02-08T20:41:54,595 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:54,596 copying build/lib/evalscope/benchmarks/olympiad_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:54,597 copying build/lib/evalscope/benchmarks/olympiad_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:54,600 copying build/lib/evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-02-08T20:41:54,603 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docvqa 2026-02-08T20:41:54,604 copying build/lib/evalscope/benchmarks/docvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-02-08T20:41:54,606 copying build/lib/evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-02-08T20:41:54,608 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/a_okvqa 2026-02-08T20:41:54,610 copying build/lib/evalscope/benchmarks/a_okvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-02-08T20:41:54,612 copying build/lib/evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-02-08T20:41:54,614 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humaneval 2026-02-08T20:41:54,615 copying build/lib/evalscope/benchmarks/humaneval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-02-08T20:41:54,618 copying build/lib/evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-02-08T20:41:54,620 copying build/lib/evalscope/benchmarks/humaneval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-02-08T20:41:54,623 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mgsm 2026-02-08T20:41:54,624 copying build/lib/evalscope/benchmarks/mgsm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-02-08T20:41:54,625 copying build/lib/evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-02-08T20:41:54,628 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench 2026-02-08T20:41:54,629 copying build/lib/evalscope/benchmarks/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-02-08T20:41:54,631 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,632 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,633 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,636 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,638 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,640 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,642 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,644 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,646 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,648 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-02-08T20:41:54,651 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:54,652 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:54,655 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:54,657 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:54,660 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-02-08T20:41:54,662 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench 2026-02-08T20:41:54,663 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-02-08T20:41:54,665 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-02-08T20:41:54,668 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/qasc 2026-02-08T20:41:54,669 copying build/lib/evalscope/benchmarks/qasc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-02-08T20:41:54,670 copying build/lib/evalscope/benchmarks/qasc/qasc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-02-08T20:41:54,673 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/trivia_qa 2026-02-08T20:41:54,895 copying build/lib/evalscope/benchmarks/trivia_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-02-08T20:41:54,897 copying build/lib/evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-02-08T20:41:54,898 copying build/lib/evalscope/benchmarks/trivia_qa/samples.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-02-08T20:41:54,901 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmlu 2026-02-08T20:41:54,902 copying build/lib/evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-02-08T20:41:54,905 copying build/lib/evalscope/benchmarks/cmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-02-08T20:41:54,907 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/infovqa 2026-02-08T20:41:54,908 copying build/lib/evalscope/benchmarks/infovqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-02-08T20:41:54,910 copying build/lib/evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-02-08T20:41:54,912 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vqa 2026-02-08T20:41:54,913 copying build/lib/evalscope/benchmarks/general_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-02-08T20:41:54,915 copying build/lib/evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-02-08T20:41:54,917 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k_v 2026-02-08T20:41:54,918 copying build/lib/evalscope/benchmarks/gsm8k_v/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-02-08T20:41:54,920 copying build/lib/evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-02-08T20:41:54,923 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_vqa 2026-02-08T20:41:54,924 copying build/lib/evalscope/benchmarks/simple_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-02-08T20:41:54,927 copying build/lib/evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-02-08T20:41:54,929 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pumed_qa 2026-02-08T20:41:54,930 copying build/lib/evalscope/benchmarks/pumed_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-02-08T20:41:54,933 copying build/lib/evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-02-08T20:41:54,936 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vmcq 2026-02-08T20:41:54,938 copying build/lib/evalscope/benchmarks/general_vmcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-02-08T20:41:54,940 copying build/lib/evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-02-08T20:41:54,943 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifbench 2026-02-08T20:41:54,944 copying build/lib/evalscope/benchmarks/ifbench/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-02-08T20:41:54,947 copying build/lib/evalscope/benchmarks/ifbench/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-02-08T20:41:54,951 copying build/lib/evalscope/benchmarks/ifbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-02-08T20:41:54,953 copying build/lib/evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-02-08T20:41:54,956 copying build/lib/evalscope/benchmarks/ifbench/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-02-08T20:41:54,961 copying build/lib/evalscope/benchmarks/ifbench/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-02-08T20:41:54,969 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chartqa 2026-02-08T20:41:54,970 copying build/lib/evalscope/benchmarks/chartqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-02-08T20:41:54,972 copying build/lib/evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-02-08T20:41:54,975 copying build/lib/evalscope/benchmarks/chartqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-02-08T20:41:54,980 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multi_if 2026-02-08T20:41:54,981 copying build/lib/evalscope/benchmarks/multi_if/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-02-08T20:41:54,984 copying build/lib/evalscope/benchmarks/multi_if/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-02-08T20:41:54,987 copying build/lib/evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-02-08T20:41:54,992 copying build/lib/evalscope/benchmarks/multi_if/ifeval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-02-08T20:41:55,005 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/seed_bench_2_plus 2026-02-08T20:41:55,008 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-02-08T20:41:55,014 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-02-08T20:41:55,022 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/winogrande 2026-02-08T20:41:55,245 copying build/lib/evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-02-08T20:41:55,247 copying build/lib/evalscope/benchmarks/winogrande/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-02-08T20:41:55,249 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/siqa 2026-02-08T20:41:55,250 copying build/lib/evalscope/benchmarks/siqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-02-08T20:41:55,251 copying build/lib/evalscope/benchmarks/siqa/siqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-02-08T20:41:55,254 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/iquiz 2026-02-08T20:41:55,255 copying build/lib/evalscope/benchmarks/iquiz/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-02-08T20:41:55,256 copying build/lib/evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-02-08T20:41:55,259 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/piqa 2026-02-08T20:41:55,260 copying build/lib/evalscope/benchmarks/piqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-02-08T20:41:55,262 copying build/lib/evalscope/benchmarks/piqa/piqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-02-08T20:41:55,264 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbpp 2026-02-08T20:41:55,265 copying build/lib/evalscope/benchmarks/mbpp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-02-08T20:41:55,267 copying build/lib/evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-02-08T20:41:55,269 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/real_world_qa 2026-02-08T20:41:55,270 copying build/lib/evalscope/benchmarks/real_world_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-02-08T20:41:55,272 copying build/lib/evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-02-08T20:41:55,274 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_arena 2026-02-08T20:41:55,276 copying build/lib/evalscope/benchmarks/general_arena/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-02-08T20:41:55,277 copying build/lib/evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-02-08T20:41:55,280 copying build/lib/evalscope/benchmarks/general_arena/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-02-08T20:41:55,283 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/alpaca_eval 2026-02-08T20:41:55,284 copying build/lib/evalscope/benchmarks/alpaca_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-02-08T20:41:55,286 copying build/lib/evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-02-08T20:41:55,288 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/blink 2026-02-08T20:41:55,289 copying build/lib/evalscope/benchmarks/blink/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-02-08T20:41:55,291 copying build/lib/evalscope/benchmarks/blink/blink_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-02-08T20:41:55,294 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/fleurs 2026-02-08T20:41:55,296 copying build/lib/evalscope/benchmarks/fleurs/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-02-08T20:41:55,299 copying build/lib/evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-02-08T20:41:55,303 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:55,305 copying build/lib/evalscope/benchmarks/openai_mrcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:55,308 copying build/lib/evalscope/benchmarks/openai_mrcr/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:55,312 copying build/lib/evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-02-08T20:41:55,317 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/frames 2026-02-08T20:41:55,319 copying build/lib/evalscope/benchmarks/frames/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-02-08T20:41:55,322 copying build/lib/evalscope/benchmarks/frames/frames_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-02-08T20:41:55,326 copying build/lib/evalscope/benchmarks/frames/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-02-08T20:41:55,330 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu_pro 2026-02-08T20:41:55,332 copying build/lib/evalscope/benchmarks/mmmu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-02-08T20:41:55,335 copying build/lib/evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-02-08T20:41:55,340 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docmath 2026-02-08T20:41:55,342 copying build/lib/evalscope/benchmarks/docmath/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-02-08T20:41:55,345 copying build/lib/evalscope/benchmarks/docmath/docmath_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-02-08T20:41:55,349 copying build/lib/evalscope/benchmarks/docmath/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-02-08T20:41:55,353 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ai2d 2026-02-08T20:41:55,356 copying build/lib/evalscope/benchmarks/ai2d/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-02-08T20:41:55,359 copying build/lib/evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-02-08T20:41:55,363 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/needle_haystack 2026-02-08T20:41:55,365 copying build/lib/evalscope/benchmarks/needle_haystack/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-02-08T20:41:55,368 copying build/lib/evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-02-08T20:41:55,372 copying build/lib/evalscope/benchmarks/needle_haystack/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-02-08T20:41:55,377 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/torgo 2026-02-08T20:41:55,379 copying build/lib/evalscope/benchmarks/torgo/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-02-08T20:41:55,383 copying build/lib/evalscope/benchmarks/torgo/torgo_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-02-08T20:41:55,388 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner 2026-02-08T20:41:55,390 copying build/lib/evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,394 copying build/lib/evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,396 copying build/lib/evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,398 copying build/lib/evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,400 copying build/lib/evalscope/benchmarks/ner/conllpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,403 copying build/lib/evalscope/benchmarks/ner/copious_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,405 copying build/lib/evalscope/benchmarks/ner/ncbi_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,407 copying build/lib/evalscope/benchmarks/ner/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,409 copying build/lib/evalscope/benchmarks/ner/wnut2017_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,411 copying build/lib/evalscope/benchmarks/ner/jnlpba_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,414 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,415 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,417 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,419 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,421 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,424 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,426 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-02-08T20:41:55,428 copying build/lib/evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,431 copying build/lib/evalscope/benchmarks/ner/conll2003_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,433 copying build/lib/evalscope/benchmarks/ner/anat_em_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,435 copying build/lib/evalscope/benchmarks/ner/genia_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,438 copying build/lib/evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,440 copying build/lib/evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,442 copying build/lib/evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,445 copying build/lib/evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,447 copying build/lib/evalscope/benchmarks/ner/fin_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,449 copying build/lib/evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,451 copying build/lib/evalscope/benchmarks/ner/bc2gm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,454 copying build/lib/evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,456 copying build/lib/evalscope/benchmarks/ner/cross_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-02-08T20:41:55,459 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vision 2026-02-08T20:41:55,460 copying build/lib/evalscope/benchmarks/math_vision/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-02-08T20:41:55,462 copying build/lib/evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-02-08T20:41:55,465 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench 2026-02-08T20:41:55,466 copying build/lib/evalscope/benchmarks/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench 2026-02-08T20:41:55,469 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:55,470 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:55,472 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:55,475 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-02-08T20:41:55,478 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:55,479 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:55,481 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:55,484 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-02-08T20:41:55,487 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/musr 2026-02-08T20:41:55,488 copying build/lib/evalscope/benchmarks/musr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-02-08T20:41:55,490 copying build/lib/evalscope/benchmarks/musr/musr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-02-08T20:41:55,493 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chinese_simple_qa 2026-02-08T20:41:55,494 copying build/lib/evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-02-08T20:41:55,496 copying build/lib/evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-02-08T20:41:55,499 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/coin_flip 2026-02-08T20:41:55,501 copying build/lib/evalscope/benchmarks/coin_flip/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-02-08T20:41:55,503 copying build/lib/evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-02-08T20:41:55,506 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:55,507 copying build/lib/evalscope/benchmarks/zebralogicbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:55,509 copying build/lib/evalscope/benchmarks/zebralogicbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:55,512 copying build/lib/evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-02-08T20:41:55,515 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/logi_qa 2026-02-08T20:41:55,516 copying build/lib/evalscope/benchmarks/logi_qa/__int__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-02-08T20:41:55,518 copying build/lib/evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-02-08T20:41:55,521 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus 2026-02-08T20:41:55,522 copying build/lib/evalscope/benchmarks/humanevalplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-02-08T20:41:55,524 copying build/lib/evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus 2026-02-08T20:41:55,527 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humanevalplus/docker 2026-02-08T20:41:55,528 copying build/lib/evalscope/benchmarks/humanevalplus/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humanevalplus/docker 2026-02-08T20:41:55,530 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu 2026-02-08T20:41:55,531 copying build/lib/evalscope/benchmarks/mmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-02-08T20:41:55,533 copying build/lib/evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-02-08T20:41:55,536 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/vstar_bench 2026-02-08T20:41:55,537 copying build/lib/evalscope/benchmarks/vstar_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-02-08T20:41:55,539 copying build/lib/evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-02-08T20:41:55,541 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multipl_e 2026-02-08T20:41:55,542 copying build/lib/evalscope/benchmarks/multipl_e/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-02-08T20:41:55,544 copying build/lib/evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-02-08T20:41:55,546 copying build/lib/evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-02-08T20:41:55,548 copying build/lib/evalscope/benchmarks/multipl_e/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-02-08T20:41:55,551 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hellaswag 2026-02-08T20:41:55,552 copying build/lib/evalscope/benchmarks/hellaswag/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-02-08T20:41:55,554 copying build/lib/evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-02-08T20:41:55,557 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifeval 2026-02-08T20:41:55,558 copying build/lib/evalscope/benchmarks/ifeval/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-02-08T20:41:55,560 copying build/lib/evalscope/benchmarks/ifeval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-02-08T20:41:55,562 copying build/lib/evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-02-08T20:41:55,564 copying build/lib/evalscope/benchmarks/ifeval/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-02-08T20:41:55,566 copying build/lib/evalscope/benchmarks/ifeval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-02-08T20:41:55,568 copying build/lib/evalscope/benchmarks/ifeval/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-02-08T20:41:55,572 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_pro 2026-02-08T20:41:55,573 copying build/lib/evalscope/benchmarks/mmlu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-02-08T20:41:55,575 copying build/lib/evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-02-08T20:41:55,577 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hle 2026-02-08T20:41:55,578 copying build/lib/evalscope/benchmarks/hle/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-02-08T20:41:55,580 copying build/lib/evalscope/benchmarks/hle/hle_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-02-08T20:41:55,582 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/halu_eval 2026-02-08T20:41:55,583 copying build/lib/evalscope/benchmarks/halu_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-02-08T20:41:55,585 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-02-08T20:41:55,588 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-02-08T20:41:55,590 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arena_hard 2026-02-08T20:41:55,591 copying build/lib/evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-02-08T20:41:55,593 copying build/lib/evalscope/benchmarks/arena_hard/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-02-08T20:41:55,595 copying build/lib/evalscope/benchmarks/arena_hard/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-02-08T20:41:55,598 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aime 2026-02-08T20:41:55,599 copying build/lib/evalscope/benchmarks/aime/grader.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-02-08T20:41:55,601 copying build/lib/evalscope/benchmarks/aime/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-02-08T20:41:55,602 copying build/lib/evalscope/benchmarks/aime/math_normalize.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-02-08T20:41:55,605 copying build/lib/evalscope/benchmarks/aime/aime24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-02-08T20:41:55,606 copying build/lib/evalscope/benchmarks/aime/aime25_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-02-08T20:41:55,609 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k 2026-02-08T20:41:55,610 copying build/lib/evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-02-08T20:41:55,612 copying build/lib/evalscope/benchmarks/gsm8k/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-02-08T20:41:55,615 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hallusion_bench 2026-02-08T20:41:55,616 copying build/lib/evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-02-08T20:41:55,619 copying build/lib/evalscope/benchmarks/hallusion_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-02-08T20:41:55,621 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:55,622 copying build/lib/evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:55,624 copying build/lib/evalscope/benchmarks/omnidoc_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:55,625 copying build/lib/evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:55,628 copying build/lib/evalscope/benchmarks/omnidoc_bench/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:55,630 copying build/lib/evalscope/benchmarks/omnidoc_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-02-08T20:41:55,634 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/race 2026-02-08T20:41:55,635 copying build/lib/evalscope/benchmarks/race/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-02-08T20:41:55,637 copying build/lib/evalscope/benchmarks/race/race_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-02-08T20:41:55,639 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh 2026-02-08T20:41:55,641 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,642 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,644 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,646 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,649 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,651 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,653 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,655 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,657 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,659 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,661 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,663 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,667 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,669 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,671 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,673 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,675 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,677 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,680 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,682 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,684 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,687 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,689 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,691 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,693 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,695 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,697 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,699 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-02-08T20:41:55,701 copying build/lib/evalscope/benchmarks/bbh/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-02-08T20:41:55,702 copying build/lib/evalscope/benchmarks/bbh/bbh_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-02-08T20:41:55,705 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arc 2026-02-08T20:41:55,706 copying build/lib/evalscope/benchmarks/arc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-02-08T20:41:55,709 copying build/lib/evalscope/benchmarks/arc/arc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-02-08T20:41:55,711 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/terminal_bench 2026-02-08T20:41:55,712 copying build/lib/evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-02-08T20:41:55,714 copying build/lib/evalscope/benchmarks/terminal_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-02-08T20:41:55,716 copying build/lib/evalscope/benchmarks/terminal_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-02-08T20:41:55,718 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/process_bench 2026-02-08T20:41:55,720 copying build/lib/evalscope/benchmarks/process_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-02-08T20:41:55,721 copying build/lib/evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-02-08T20:41:55,724 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/sciq 2026-02-08T20:41:55,725 copying build/lib/evalscope/benchmarks/sciq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-02-08T20:41:55,727 copying build/lib/evalscope/benchmarks/sciq/sciq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-02-08T20:41:55,729 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode 2026-02-08T20:41:55,730 copying build/lib/evalscope/benchmarks/scicode/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-02-08T20:41:55,731 copying build/lib/evalscope/benchmarks/scicode/util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-02-08T20:41:55,733 copying build/lib/evalscope/benchmarks/scicode/scicode_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-02-08T20:41:55,735 copying build/lib/evalscope/benchmarks/scicode/prompt_templates.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-02-08T20:41:55,737 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode/docker 2026-02-08T20:41:55,738 copying build/lib/evalscope/benchmarks/scicode/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-02-08T20:41:55,742 copying build/lib/evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-02-08T20:41:55,743 copying build/lib/evalscope/benchmarks/scicode/docker/test_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-02-08T20:41:55,745 copying build/lib/evalscope/benchmarks/scicode/docker/process_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-02-08T20:41:55,748 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmu 2026-02-08T20:41:55,749 copying build/lib/evalscope/benchmarks/cmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-02-08T20:41:55,751 copying build/lib/evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-02-08T20:41:55,753 copying build/lib/evalscope/benchmarks/cmmu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-02-08T20:41:55,756 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minerva_math 2026-02-08T20:41:55,757 copying build/lib/evalscope/benchmarks/minerva_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-02-08T20:41:55,759 copying build/lib/evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-02-08T20:41:55,762 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/visu_logic 2026-02-08T20:41:55,763 copying build/lib/evalscope/benchmarks/visu_logic/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-02-08T20:41:55,765 copying build/lib/evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-02-08T20:41:55,769 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_qa 2026-02-08T20:41:55,770 copying build/lib/evalscope/benchmarks/math_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-02-08T20:41:55,772 copying build/lib/evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-02-08T20:41:55,775 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aa_lcr 2026-02-08T20:41:55,776 copying build/lib/evalscope/benchmarks/aa_lcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-02-08T20:41:55,778 copying build/lib/evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-02-08T20:41:55,781 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl 2026-02-08T20:41:55,782 copying build/lib/evalscope/benchmarks/bfcl/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-02-08T20:41:55,784 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:55,785 copying build/lib/evalscope/benchmarks/bfcl/v4/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:55,787 copying build/lib/evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:55,789 copying build/lib/evalscope/benchmarks/bfcl/v4/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-02-08T20:41:55,792 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:55,793 copying build/lib/evalscope/benchmarks/bfcl/v3/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:55,795 copying build/lib/evalscope/benchmarks/bfcl/v3/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:55,797 copying build/lib/evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:55,799 copying build/lib/evalscope/benchmarks/bfcl/v3/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-02-08T20:41:55,802 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tool_bench 2026-02-08T20:41:55,803 copying build/lib/evalscope/benchmarks/tool_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-02-08T20:41:55,805 copying build/lib/evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-02-08T20:41:55,806 copying build/lib/evalscope/benchmarks/tool_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-02-08T20:41:55,809 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omni_bench 2026-02-08T20:41:55,810 copying build/lib/evalscope/benchmarks/omni_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-02-08T20:41:55,812 copying build/lib/evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-02-08T20:41:55,814 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/amc 2026-02-08T20:41:55,815 copying build/lib/evalscope/benchmarks/amc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-02-08T20:41:55,817 copying build/lib/evalscope/benchmarks/amc/amc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-02-08T20:41:55,819 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ceval 2026-02-08T20:41:55,820 copying build/lib/evalscope/benchmarks/ceval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-02-08T20:41:55,822 copying build/lib/evalscope/benchmarks/ceval/ceval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-02-08T20:41:55,825 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/biomix_qa 2026-02-08T20:41:55,826 copying build/lib/evalscope/benchmarks/biomix_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-02-08T20:41:55,828 copying build/lib/evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-02-08T20:41:55,830 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gpqa 2026-02-08T20:41:55,831 copying build/lib/evalscope/benchmarks/gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-02-08T20:41:55,833 copying build/lib/evalscope/benchmarks/gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-02-08T20:41:55,839 copying build/lib/evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-02-08T20:41:55,841 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/librispeech 2026-02-08T20:41:55,842 copying build/lib/evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-02-08T20:41:55,844 copying build/lib/evalscope/benchmarks/librispeech/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-02-08T20:41:55,846 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mri_mcqa 2026-02-08T20:41:55,847 copying build/lib/evalscope/benchmarks/mri_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-02-08T20:41:55,849 copying build/lib/evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-02-08T20:41:55,851 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drop 2026-02-08T20:41:55,852 copying build/lib/evalscope/benchmarks/drop/drop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-02-08T20:41:55,854 copying build/lib/evalscope/benchmarks/drop/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-02-08T20:41:55,856 copying build/lib/evalscope/benchmarks/drop/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-02-08T20:41:55,858 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbppplus 2026-02-08T20:41:55,860 copying build/lib/evalscope/benchmarks/mbppplus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-02-08T20:41:55,861 copying build/lib/evalscope/benchmarks/mbppplus/mbppplus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbppplus 2026-02-08T20:41:55,864 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/refcoco 2026-02-08T20:41:55,865 copying build/lib/evalscope/benchmarks/refcoco/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-02-08T20:41:55,867 copying build/lib/evalscope/benchmarks/refcoco/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-02-08T20:41:55,869 copying build/lib/evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-02-08T20:41:55,871 copying build/lib/evalscope/benchmarks/refcoco/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-02-08T20:41:55,873 creating build/bdist.linux-armv7l/wheel/evalscope/api 2026-02-08T20:41:55,875 copying build/lib/evalscope/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-02-08T20:41:55,877 creating build/bdist.linux-armv7l/wheel/evalscope/api/model 2026-02-08T20:41:55,878 copying build/lib/evalscope/api/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-02-08T20:41:55,880 copying build/lib/evalscope/api/model/lazy_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-02-08T20:41:55,882 copying build/lib/evalscope/api/model/model_output.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-02-08T20:41:55,884 copying build/lib/evalscope/api/model/generate_config.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-02-08T20:41:55,886 copying build/lib/evalscope/api/model/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-02-08T20:41:55,889 creating build/bdist.linux-armv7l/wheel/evalscope/api/messages 2026-02-08T20:41:55,891 copying build/lib/evalscope/api/messages/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-02-08T20:41:55,892 copying build/lib/evalscope/api/messages/chat_message.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-02-08T20:41:55,895 copying build/lib/evalscope/api/messages/content.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-02-08T20:41:55,897 copying build/lib/evalscope/api/messages/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-02-08T20:41:55,899 creating build/bdist.linux-armv7l/wheel/evalscope/api/metric 2026-02-08T20:41:55,900 copying build/lib/evalscope/api/metric/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-02-08T20:41:55,902 copying build/lib/evalscope/api/metric/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-02-08T20:41:55,904 copying build/lib/evalscope/api/metric/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-02-08T20:41:55,906 copying build/lib/evalscope/api/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-02-08T20:41:55,908 creating build/bdist.linux-armv7l/wheel/evalscope/api/filter 2026-02-08T20:41:55,909 copying build/lib/evalscope/api/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-02-08T20:41:55,911 copying build/lib/evalscope/api/filter/filter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-02-08T20:41:55,914 creating build/bdist.linux-armv7l/wheel/evalscope/api/dataset 2026-02-08T20:41:55,915 copying build/lib/evalscope/api/dataset/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-02-08T20:41:55,916 copying build/lib/evalscope/api/dataset/loader.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-02-08T20:41:55,919 copying build/lib/evalscope/api/dataset/dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-02-08T20:41:55,921 copying build/lib/evalscope/api/dataset/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-02-08T20:41:55,924 creating build/bdist.linux-armv7l/wheel/evalscope/api/evaluator 2026-02-08T20:41:55,925 copying build/lib/evalscope/api/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-02-08T20:41:55,927 copying build/lib/evalscope/api/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-02-08T20:41:55,929 copying build/lib/evalscope/api/evaluator/cache.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-02-08T20:41:55,931 copying build/lib/evalscope/api/evaluator/state.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-02-08T20:41:55,934 creating build/bdist.linux-armv7l/wheel/evalscope/api/tool 2026-02-08T20:41:55,935 copying build/lib/evalscope/api/tool/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-02-08T20:41:55,937 copying build/lib/evalscope/api/tool/tool_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-02-08T20:41:55,940 copying build/lib/evalscope/api/tool/tool_call.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-02-08T20:41:55,942 copying build/lib/evalscope/api/tool/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-02-08T20:41:55,944 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark 2026-02-08T20:41:55,945 copying build/lib/evalscope/api/benchmark/meta.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-02-08T20:41:55,947 copying build/lib/evalscope/api/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-02-08T20:41:55,949 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark/adapters 2026-02-08T20:41:55,950 copying build/lib/evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,952 copying build/lib/evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,954 copying build/lib/evalscope/api/benchmark/adapters/default_data_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,957 copying build/lib/evalscope/api/benchmark/adapters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,959 copying build/lib/evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,961 copying build/lib/evalscope/api/benchmark/adapters/text2image_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,963 copying build/lib/evalscope/api/benchmark/adapters/ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,965 copying build/lib/evalscope/api/benchmark/adapters/agent_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-02-08T20:41:55,967 copying build/lib/evalscope/api/benchmark/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-02-08T20:41:55,970 creating build/bdist.linux-armv7l/wheel/evalscope/api/mixin 2026-02-08T20:41:55,971 copying build/lib/evalscope/api/mixin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-02-08T20:41:55,973 copying build/lib/evalscope/api/mixin/llm_judge_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-02-08T20:41:55,975 copying build/lib/evalscope/api/mixin/sandbox_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-02-08T20:41:55,977 copying build/lib/evalscope/config.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-02-08T20:41:55,980 creating build/bdist.linux-armv7l/wheel/evalscope/filters 2026-02-08T20:41:55,981 copying build/lib/evalscope/filters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-02-08T20:41:55,983 copying build/lib/evalscope/filters/extraction.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-02-08T20:41:55,985 copying build/lib/evalscope/filters/selection.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-02-08T20:41:55,987 creating build/bdist.linux-armv7l/wheel/evalscope/summarizer 2026-02-08T20:41:55,988 copying build/lib/evalscope/summarizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-02-08T20:41:55,990 copying build/lib/evalscope/summarizer/summarizer.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-02-08T20:41:55,993 copying build/lib/evalscope/run.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-02-08T20:41:55,996 creating build/bdist.linux-armv7l/wheel/evalscope/collections 2026-02-08T20:41:55,997 copying build/lib/evalscope/collections/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-02-08T20:41:55,998 copying build/lib/evalscope/collections/schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-02-08T20:41:56,001 copying build/lib/evalscope/collections/sampler.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-02-08T20:41:56,003 creating build/bdist.linux-armv7l/wheel/evalscope/service 2026-02-08T20:41:56,004 copying build/lib/evalscope/service/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-02-08T20:41:56,006 copying build/lib/evalscope/service/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-02-08T20:41:56,009 creating build/bdist.linux-armv7l/wheel/evalscope/service/frontend 2026-02-08T20:41:56,010 copying build/lib/evalscope/service/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-02-08T20:41:56,012 copying build/lib/evalscope/service/frontend/async_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-02-08T20:41:56,014 copying build/lib/evalscope/service/frontend/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-02-08T20:41:56,017 copying build/lib/evalscope/service/frontend/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-02-08T20:41:56,019 copying build/lib/evalscope/service/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-02-08T20:41:56,021 creating build/bdist.linux-armv7l/wheel/evalscope/evaluator 2026-02-08T20:41:56,022 copying build/lib/evalscope/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-02-08T20:41:56,024 copying build/lib/evalscope/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-02-08T20:41:56,027 copying build/lib/evalscope/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-02-08T20:41:56,029 creating build/bdist.linux-armv7l/wheel/evalscope/backend 2026-02-08T20:41:56,031 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval 2026-02-08T20:41:56,033 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas 2026-02-08T20:41:56,034 copying build/lib/evalscope/backend/rag_eval/ragas/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-02-08T20:41:56,037 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:56,038 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:56,040 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:56,042 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:56,043 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:56,046 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-02-08T20:41:56,048 copying build/lib/evalscope/backend/rag_eval/ragas/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-02-08T20:41:56,050 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/prompts 2026-02-08T20:41:56,051 copying build/lib/evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/prompts 2026-02-08T20:41:56,053 copying build/lib/evalscope/backend/rag_eval/ragas/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-02-08T20:41:56,055 copying build/lib/evalscope/backend/rag_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-02-08T20:41:56,057 copying build/lib/evalscope/backend/rag_eval/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-02-08T20:41:56,059 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:56,060 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:56,063 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:56,064 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:56,065 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:56,067 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:56,070 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-02-08T20:41:56,072 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:56,074 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:56,076 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-02-08T20:41:56,079 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/utils 2026-02-08T20:41:56,080 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-02-08T20:41:56,082 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-02-08T20:41:56,084 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/utils 2026-02-08T20:41:56,085 copying build/lib/evalscope/backend/rag_eval/utils/tools.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-02-08T20:41:56,087 copying build/lib/evalscope/backend/rag_eval/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-02-08T20:41:56,089 copying build/lib/evalscope/backend/rag_eval/utils/embedding.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-02-08T20:41:56,091 copying build/lib/evalscope/backend/rag_eval/utils/clip.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-02-08T20:41:56,093 copying build/lib/evalscope/backend/rag_eval/utils/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-02-08T20:41:56,096 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:56,097 copying build/lib/evalscope/backend/rag_eval/cmteb/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:56,099 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,100 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,103 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,105 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,107 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,109 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,111 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,113 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,116 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-02-08T20:41:56,118 copying build/lib/evalscope/backend/rag_eval/cmteb/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:56,120 copying build/lib/evalscope/backend/rag_eval/cmteb/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:56,122 copying build/lib/evalscope/backend/rag_eval/cmteb/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-02-08T20:41:56,123 copying build/lib/evalscope/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-02-08T20:41:56,126 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass 2026-02-08T20:41:56,127 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass/tasks 2026-02-08T20:41:56,128 copying build/lib/evalscope/backend/opencompass/tasks/eval_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-02-08T20:41:56,130 copying build/lib/evalscope/backend/opencompass/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-02-08T20:41:56,132 copying build/lib/evalscope/backend/opencompass/tasks/eval_datasets.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-02-08T20:41:56,134 copying build/lib/evalscope/backend/opencompass/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-02-08T20:41:56,136 copying build/lib/evalscope/backend/opencompass/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-02-08T20:41:56,138 copying build/lib/evalscope/backend/opencompass/api_meta_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-02-08T20:41:56,140 copying build/lib/evalscope/backend/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-02-08T20:41:56,142 creating build/bdist.linux-armv7l/wheel/evalscope/backend/vlm_eval_kit 2026-02-08T20:41:56,143 copying build/lib/evalscope/backend/vlm_eval_kit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-02-08T20:41:56,145 copying build/lib/evalscope/backend/vlm_eval_kit/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-02-08T20:41:56,148 creating build/bdist.linux-armv7l/wheel/evalscope/utils 2026-02-08T20:41:56,149 copying build/lib/evalscope/utils/function_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,152 copying build/lib/evalscope/utils/code_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,155 copying build/lib/evalscope/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,157 copying build/lib/evalscope/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,159 copying build/lib/evalscope/utils/io_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,161 copying build/lib/evalscope/utils/multi_choices.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,164 copying build/lib/evalscope/utils/chat_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,166 copying build/lib/evalscope/utils/url_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,168 copying build/lib/evalscope/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,170 copying build/lib/evalscope/utils/argument_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,172 copying build/lib/evalscope/utils/ner.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,175 copying build/lib/evalscope/utils/deprecation_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,177 copying build/lib/evalscope/utils/tqdm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,178 copying build/lib/evalscope/utils/json_schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,181 copying build/lib/evalscope/utils/import_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,183 copying build/lib/evalscope/utils/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-02-08T20:41:56,186 copying build/lib/evalscope/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-02-08T20:41:56,189 creating build/bdist.linux-armv7l/wheel/evalscope/app 2026-02-08T20:41:56,190 copying build/lib/evalscope/app/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-02-08T20:41:56,192 copying build/lib/evalscope/app/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-02-08T20:41:56,195 creating build/bdist.linux-armv7l/wheel/evalscope/app/ui 2026-02-08T20:41:56,196 copying build/lib/evalscope/app/ui/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-02-08T20:41:56,198 copying build/lib/evalscope/app/ui/sidebar.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-02-08T20:41:56,200 copying build/lib/evalscope/app/ui/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-02-08T20:41:56,201 copying build/lib/evalscope/app/ui/multi_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-02-08T20:41:56,204 copying build/lib/evalscope/app/ui/single_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-02-08T20:41:56,206 copying build/lib/evalscope/app/ui/app_ui.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-02-08T20:41:56,209 copying build/lib/evalscope/app/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-02-08T20:41:56,211 creating build/bdist.linux-armv7l/wheel/evalscope/app/utils 2026-02-08T20:41:56,212 copying build/lib/evalscope/app/utils/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-02-08T20:41:56,215 copying build/lib/evalscope/app/utils/localization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-02-08T20:41:56,217 copying build/lib/evalscope/app/utils/data_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-02-08T20:41:56,219 copying build/lib/evalscope/app/utils/env_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-02-08T20:41:56,221 copying build/lib/evalscope/app/utils/text_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-02-08T20:41:56,223 copying build/lib/evalscope/app/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-02-08T20:41:56,226 creating build/bdist.linux-armv7l/wheel/evalscope/models 2026-02-08T20:41:56,227 copying build/lib/evalscope/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,229 copying build/lib/evalscope/models/openai_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,231 copying build/lib/evalscope/models/text2image_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,233 copying build/lib/evalscope/models/model_apis.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,235 copying build/lib/evalscope/models/mockllm.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,238 creating build/bdist.linux-armv7l/wheel/evalscope/models/utils 2026-02-08T20:41:56,239 copying build/lib/evalscope/models/utils/openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-02-08T20:41:56,242 copying build/lib/evalscope/models/image_edit_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,244 copying build/lib/evalscope/models/modelscope.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-02-08T20:41:56,247 creating build/bdist.linux-armv7l/wheel/evalscope/report 2026-02-08T20:41:56,248 copying build/lib/evalscope/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-02-08T20:41:56,250 copying build/lib/evalscope/report/report.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-02-08T20:41:56,253 copying build/lib/evalscope/report/combinator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-02-08T20:41:56,255 copying build/lib/evalscope/report/generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-02-08T20:41:56,257 running install_egg_info 2026-02-08T20:41:56,262 Copying evalscope.egg-info to build/bdist.linux-armv7l/wheel/./evalscope-1.4.2-py3.11.egg-info 2026-02-08T20:41:56,275 running install_scripts 2026-02-08T20:41:56,289 creating build/bdist.linux-armv7l/wheel/evalscope-1.4.2.dist-info/WHEEL 2026-02-08T20:41:56,292 creating '/tmp/pip-wheel-431hf6ga/.tmp-iefn8r6l/evalscope-1.4.2-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-02-08T20:41:56,294 adding 'evalscope/__init__.py' 2026-02-08T20:41:56,296 adding 'evalscope/arguments.py' 2026-02-08T20:41:56,298 adding 'evalscope/config.py' 2026-02-08T20:41:56,300 adding 'evalscope/constants.py' 2026-02-08T20:41:56,302 adding 'evalscope/run.py' 2026-02-08T20:41:56,303 adding 'evalscope/version.py' 2026-02-08T20:41:56,305 adding 'evalscope/api/__init__.py' 2026-02-08T20:41:56,306 adding 'evalscope/api/registry.py' 2026-02-08T20:41:56,308 adding 'evalscope/api/benchmark/__init__.py' 2026-02-08T20:41:56,310 adding 'evalscope/api/benchmark/benchmark.py' 2026-02-08T20:41:56,312 adding 'evalscope/api/benchmark/meta.py' 2026-02-08T20:41:56,313 adding 'evalscope/api/benchmark/adapters/__init__.py' 2026-02-08T20:41:56,315 adding 'evalscope/api/benchmark/adapters/agent_adapter.py' 2026-02-08T20:41:56,318 adding 'evalscope/api/benchmark/adapters/default_data_adapter.py' 2026-02-08T20:41:56,320 adding 'evalscope/api/benchmark/adapters/image_edit_adapter.py' 2026-02-08T20:41:56,321 adding 'evalscope/api/benchmark/adapters/multi_choice_adapter.py' 2026-02-08T20:41:56,323 adding 'evalscope/api/benchmark/adapters/ner_adapter.py' 2026-02-08T20:41:56,325 adding 'evalscope/api/benchmark/adapters/text2image_adapter.py' 2026-02-08T20:41:56,326 adding 'evalscope/api/benchmark/adapters/vision_language_adapter.py' 2026-02-08T20:41:56,328 adding 'evalscope/api/dataset/__init__.py' 2026-02-08T20:41:56,330 adding 'evalscope/api/dataset/dataset.py' 2026-02-08T20:41:56,332 adding 'evalscope/api/dataset/loader.py' 2026-02-08T20:41:56,333 adding 'evalscope/api/dataset/utils.py' 2026-02-08T20:41:56,335 adding 'evalscope/api/evaluator/__init__.py' 2026-02-08T20:41:56,337 adding 'evalscope/api/evaluator/cache.py' 2026-02-08T20:41:56,339 adding 'evalscope/api/evaluator/evaluator.py' 2026-02-08T20:41:56,340 adding 'evalscope/api/evaluator/state.py' 2026-02-08T20:41:56,342 adding 'evalscope/api/filter/__init__.py' 2026-02-08T20:41:56,343 adding 'evalscope/api/filter/filter.py' 2026-02-08T20:41:56,345 adding 'evalscope/api/messages/__init__.py' 2026-02-08T20:41:56,347 adding 'evalscope/api/messages/chat_message.py' 2026-02-08T20:41:56,348 adding 'evalscope/api/messages/content.py' 2026-02-08T20:41:56,350 adding 'evalscope/api/messages/utils.py' 2026-02-08T20:41:56,351 adding 'evalscope/api/metric/__init__.py' 2026-02-08T20:41:56,353 adding 'evalscope/api/metric/metric.py' 2026-02-08T20:41:56,354 adding 'evalscope/api/metric/scorer.py' 2026-02-08T20:41:56,356 adding 'evalscope/api/mixin/__init__.py' 2026-02-08T20:41:56,357 adding 'evalscope/api/mixin/llm_judge_mixin.py' 2026-02-08T20:41:56,359 adding 'evalscope/api/mixin/sandbox_mixin.py' 2026-02-08T20:41:56,361 adding 'evalscope/api/model/__init__.py' 2026-02-08T20:41:56,363 adding 'evalscope/api/model/generate_config.py' 2026-02-08T20:41:56,364 adding 'evalscope/api/model/lazy_model.py' 2026-02-08T20:41:56,366 adding 'evalscope/api/model/model.py' 2026-02-08T20:41:56,368 adding 'evalscope/api/model/model_output.py' 2026-02-08T20:41:56,370 adding 'evalscope/api/tool/__init__.py' 2026-02-08T20:41:56,371 adding 'evalscope/api/tool/tool_call.py' 2026-02-08T20:41:56,373 adding 'evalscope/api/tool/tool_info.py' 2026-02-08T20:41:56,374 adding 'evalscope/api/tool/utils.py' 2026-02-08T20:41:56,376 adding 'evalscope/app/__init__.py' 2026-02-08T20:41:56,377 adding 'evalscope/app/app.py' 2026-02-08T20:41:56,379 adding 'evalscope/app/arguments.py' 2026-02-08T20:41:56,380 adding 'evalscope/app/constants.py' 2026-02-08T20:41:56,382 adding 'evalscope/app/ui/__init__.py' 2026-02-08T20:41:56,383 adding 'evalscope/app/ui/app_ui.py' 2026-02-08T20:41:56,386 adding 'evalscope/app/ui/multi_model.py' 2026-02-08T20:41:56,387 adding 'evalscope/app/ui/sidebar.py' 2026-02-08T20:41:56,389 adding 'evalscope/app/ui/single_model.py' 2026-02-08T20:41:56,390 adding 'evalscope/app/ui/visualization.py' 2026-02-08T20:41:56,393 adding 'evalscope/app/utils/data_utils.py' 2026-02-08T20:41:56,394 adding 'evalscope/app/utils/env_utils.py' 2026-02-08T20:41:56,395 adding 'evalscope/app/utils/localization.py' 2026-02-08T20:41:56,397 adding 'evalscope/app/utils/text_utils.py' 2026-02-08T20:41:56,398 adding 'evalscope/app/utils/visualization.py' 2026-02-08T20:41:56,400 adding 'evalscope/backend/__init__.py' 2026-02-08T20:41:56,401 adding 'evalscope/backend/base.py' 2026-02-08T20:41:56,403 adding 'evalscope/backend/opencompass/__init__.py' 2026-02-08T20:41:56,404 adding 'evalscope/backend/opencompass/api_meta_template.py' 2026-02-08T20:41:56,406 adding 'evalscope/backend/opencompass/backend_manager.py' 2026-02-08T20:41:56,407 adding 'evalscope/backend/opencompass/tasks/__init__.py' 2026-02-08T20:41:56,409 adding 'evalscope/backend/opencompass/tasks/eval_api.py' 2026-02-08T20:41:56,410 adding 'evalscope/backend/opencompass/tasks/eval_datasets.py' 2026-02-08T20:41:56,412 adding 'evalscope/backend/rag_eval/__init__.py' 2026-02-08T20:41:56,413 adding 'evalscope/backend/rag_eval/backend_manager.py' 2026-02-08T20:41:56,415 adding 'evalscope/backend/rag_eval/clip_benchmark/__init__.py' 2026-02-08T20:41:56,416 adding 'evalscope/backend/rag_eval/clip_benchmark/arguments.py' 2026-02-08T20:41:56,418 adding 'evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py' 2026-02-08T20:41:56,420 adding 'evalscope/backend/rag_eval/clip_benchmark/task_template.py' 2026-02-08T20:41:56,421 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py' 2026-02-08T20:41:56,423 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py' 2026-02-08T20:41:56,424 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py' 2026-02-08T20:41:56,426 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py' 2026-02-08T20:41:56,428 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py' 2026-02-08T20:41:56,430 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt' 2026-02-08T20:41:56,431 adding 'evalscope/backend/rag_eval/cmteb/__init__.py' 2026-02-08T20:41:56,433 adding 'evalscope/backend/rag_eval/cmteb/arguments.py' 2026-02-08T20:41:56,434 adding 'evalscope/backend/rag_eval/cmteb/base.py' 2026-02-08T20:41:56,436 adding 'evalscope/backend/rag_eval/cmteb/task_template.py' 2026-02-08T20:41:56,438 adding 'evalscope/backend/rag_eval/cmteb/tasks/Classification.py' 2026-02-08T20:41:56,440 adding 'evalscope/backend/rag_eval/cmteb/tasks/Clustering.py' 2026-02-08T20:41:56,441 adding 'evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py' 2026-02-08T20:41:56,442 adding 'evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py' 2026-02-08T20:41:56,444 adding 'evalscope/backend/rag_eval/cmteb/tasks/Reranking.py' 2026-02-08T20:41:56,446 adding 'evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py' 2026-02-08T20:41:56,447 adding 'evalscope/backend/rag_eval/cmteb/tasks/STS.py' 2026-02-08T20:41:56,449 adding 'evalscope/backend/rag_eval/cmteb/tasks/__init__.py' 2026-02-08T20:41:56,451 adding 'evalscope/backend/rag_eval/ragas/__init__.py' 2026-02-08T20:41:56,452 adding 'evalscope/backend/rag_eval/ragas/arguments.py' 2026-02-08T20:41:56,453 adding 'evalscope/backend/rag_eval/ragas/task_template.py' 2026-02-08T20:41:56,455 adding 'evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py' 2026-02-08T20:41:56,457 adding 'evalscope/backend/rag_eval/ragas/tasks/__init__.py' 2026-02-08T20:41:56,458 adding 'evalscope/backend/rag_eval/ragas/tasks/build_distribution.py' 2026-02-08T20:41:56,460 adding 'evalscope/backend/rag_eval/ragas/tasks/build_transform.py' 2026-02-08T20:41:56,462 adding 'evalscope/backend/rag_eval/ragas/tasks/testset_generation.py' 2026-02-08T20:41:56,463 adding 'evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py' 2026-02-08T20:41:56,465 adding 'evalscope/backend/rag_eval/utils/__init__.py' 2026-02-08T20:41:56,466 adding 'evalscope/backend/rag_eval/utils/clip.py' 2026-02-08T20:41:56,468 adding 'evalscope/backend/rag_eval/utils/embedding.py' 2026-02-08T20:41:56,469 adding 'evalscope/backend/rag_eval/utils/llm.py' 2026-02-08T20:41:56,471 adding 'evalscope/backend/rag_eval/utils/tools.py' 2026-02-08T20:41:56,472 adding 'evalscope/backend/vlm_eval_kit/__init__.py' 2026-02-08T20:41:56,474 adding 'evalscope/backend/vlm_eval_kit/backend_manager.py' 2026-02-08T20:41:56,478 adding 'evalscope/benchmarks/__init__.py' 2026-02-08T20:41:56,479 adding 'evalscope/benchmarks/a_okvqa/__init__.py' 2026-02-08T20:41:56,481 adding 'evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py' 2026-02-08T20:41:56,482 adding 'evalscope/benchmarks/aa_lcr/__init__.py' 2026-02-08T20:41:56,484 adding 'evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py' 2026-02-08T20:41:56,486 adding 'evalscope/benchmarks/ai2d/__init__.py' 2026-02-08T20:41:56,487 adding 'evalscope/benchmarks/ai2d/ai2d_adapter.py' 2026-02-08T20:41:56,489 adding 'evalscope/benchmarks/aime/__init__.py' 2026-02-08T20:41:56,490 adding 'evalscope/benchmarks/aime/aime24_adapter.py' 2026-02-08T20:41:56,492 adding 'evalscope/benchmarks/aime/aime25_adapter.py' 2026-02-08T20:41:56,494 adding 'evalscope/benchmarks/aime/grader.py' 2026-02-08T20:41:56,495 adding 'evalscope/benchmarks/aime/math_normalize.py' 2026-02-08T20:41:56,497 adding 'evalscope/benchmarks/alpaca_eval/__init__.py' 2026-02-08T20:41:56,498 adding 'evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py' 2026-02-08T20:41:56,500 adding 'evalscope/benchmarks/amc/__init__.py' 2026-02-08T20:41:56,501 adding 'evalscope/benchmarks/amc/amc_adapter.py' 2026-02-08T20:41:56,503 adding 'evalscope/benchmarks/arc/__init__.py' 2026-02-08T20:41:56,504 adding 'evalscope/benchmarks/arc/arc_adapter.py' 2026-02-08T20:41:56,506 adding 'evalscope/benchmarks/arena_hard/__init__.py' 2026-02-08T20:41:56,508 adding 'evalscope/benchmarks/arena_hard/arena_hard_adapter.py' 2026-02-08T20:41:56,509 adding 'evalscope/benchmarks/arena_hard/utils.py' 2026-02-08T20:41:56,511 adding 'evalscope/benchmarks/bbh/__init__.py' 2026-02-08T20:41:56,513 adding 'evalscope/benchmarks/bbh/bbh_adapter.py' 2026-02-08T20:41:56,516 adding 'evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt' 2026-02-08T20:41:56,517 adding 'evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt' 2026-02-08T20:41:56,518 adding 'evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt' 2026-02-08T20:41:56,520 adding 'evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt' 2026-02-08T20:41:56,521 adding 'evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt' 2026-02-08T20:41:56,523 adding 'evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt' 2026-02-08T20:41:56,524 adding 'evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt' 2026-02-08T20:41:56,525 adding 'evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt' 2026-02-08T20:41:56,527 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt' 2026-02-08T20:41:56,528 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt' 2026-02-08T20:41:56,530 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt' 2026-02-08T20:41:56,531 adding 'evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt' 2026-02-08T20:41:56,533 adding 'evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt' 2026-02-08T20:41:56,535 adding 'evalscope/benchmarks/bbh/cot_prompts/navigate.txt' 2026-02-08T20:41:56,536 adding 'evalscope/benchmarks/bbh/cot_prompts/object_counting.txt' 2026-02-08T20:41:56,538 adding 'evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt' 2026-02-08T20:41:56,539 adding 'evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt' 2026-02-08T20:41:56,541 adding 'evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt' 2026-02-08T20:41:56,543 adding 'evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt' 2026-02-08T20:41:56,544 adding 'evalscope/benchmarks/bbh/cot_prompts/snarks.txt' 2026-02-08T20:41:56,546 adding 'evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt' 2026-02-08T20:41:56,547 adding 'evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt' 2026-02-08T20:41:56,548 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt' 2026-02-08T20:41:56,549 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt' 2026-02-08T20:41:56,551 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt' 2026-02-08T20:41:56,552 adding 'evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt' 2026-02-08T20:41:56,554 adding 'evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt' 2026-02-08T20:41:56,555 adding 'evalscope/benchmarks/bfcl/__init__.py' 2026-02-08T20:41:56,557 adding 'evalscope/benchmarks/bfcl/v3/__init__.py' 2026-02-08T20:41:56,559 adding 'evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py' 2026-02-08T20:41:56,561 adding 'evalscope/benchmarks/bfcl/v3/generation.py' 2026-02-08T20:41:56,562 adding 'evalscope/benchmarks/bfcl/v3/utils.py' 2026-02-08T20:41:56,564 adding 'evalscope/benchmarks/bfcl/v4/__init__.py' 2026-02-08T20:41:56,566 adding 'evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py' 2026-02-08T20:41:56,568 adding 'evalscope/benchmarks/bfcl/v4/utils.py' 2026-02-08T20:41:56,570 adding 'evalscope/benchmarks/biomix_qa/__init__.py' 2026-02-08T20:41:56,571 adding 'evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py' 2026-02-08T20:41:56,573 adding 'evalscope/benchmarks/blink/__init__.py' 2026-02-08T20:41:56,574 adding 'evalscope/benchmarks/blink/blink_adapter.py' 2026-02-08T20:41:56,576 adding 'evalscope/benchmarks/ceval/__init__.py' 2026-02-08T20:41:56,577 adding 'evalscope/benchmarks/ceval/ceval_adapter.py' 2026-02-08T20:41:56,579 adding 'evalscope/benchmarks/chartqa/__init__.py' 2026-02-08T20:41:56,581 adding 'evalscope/benchmarks/chartqa/chartqa_adapter.py' 2026-02-08T20:41:56,582 adding 'evalscope/benchmarks/chartqa/utils.py' 2026-02-08T20:41:56,583 adding 'evalscope/benchmarks/chinese_simple_qa/__init__.py' 2026-02-08T20:41:56,585 adding 'evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py' 2026-02-08T20:41:56,587 adding 'evalscope/benchmarks/cmmlu/__init__.py' 2026-02-08T20:41:56,589 adding 'evalscope/benchmarks/cmmlu/cmmlu_adapter.py' 2026-02-08T20:41:56,590 adding 'evalscope/benchmarks/cmmmu/__init__.py' 2026-02-08T20:41:56,592 adding 'evalscope/benchmarks/cmmmu/cmmmu_adapter.py' 2026-02-08T20:41:56,594 adding 'evalscope/benchmarks/cmmmu/utils.py' 2026-02-08T20:41:56,596 adding 'evalscope/benchmarks/cmmu/__init__.py' 2026-02-08T20:41:56,598 adding 'evalscope/benchmarks/cmmu/cmmu_adapter.py' 2026-02-08T20:41:56,600 adding 'evalscope/benchmarks/cmmu/prompt.py' 2026-02-08T20:41:56,602 adding 'evalscope/benchmarks/coin_flip/__init__.py' 2026-02-08T20:41:56,603 adding 'evalscope/benchmarks/coin_flip/coin_flip_adapter.py' 2026-02-08T20:41:56,605 adding 'evalscope/benchmarks/commonsense_qa/__init__.py' 2026-02-08T20:41:56,606 adding 'evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py' 2026-02-08T20:41:56,608 adding 'evalscope/benchmarks/competition_math/__init__.py' 2026-02-08T20:41:56,610 adding 'evalscope/benchmarks/competition_math/competition_math_adapter.py' 2026-02-08T20:41:56,611 adding 'evalscope/benchmarks/data_collection/__init__.py' 2026-02-08T20:41:56,613 adding 'evalscope/benchmarks/data_collection/data_collection_adapter.py' 2026-02-08T20:41:56,615 adding 'evalscope/benchmarks/docmath/__init__.py' 2026-02-08T20:41:56,617 adding 'evalscope/benchmarks/docmath/docmath_adapter.py' 2026-02-08T20:41:56,619 adding 'evalscope/benchmarks/docmath/utils.py' 2026-02-08T20:41:56,621 adding 'evalscope/benchmarks/docvqa/__init__.py' 2026-02-08T20:41:56,622 adding 'evalscope/benchmarks/docvqa/docvqa_adapter.py' 2026-02-08T20:41:56,624 adding 'evalscope/benchmarks/drivelology/__init__.py' 2026-02-08T20:41:56,626 adding 'evalscope/benchmarks/drivelology/drivelology_binary_adapter.py' 2026-02-08T20:41:56,628 adding 'evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py' 2026-02-08T20:41:56,629 adding 'evalscope/benchmarks/drivelology/drivelology_selection_adapter.py' 2026-02-08T20:41:56,631 adding 'evalscope/benchmarks/drivelology/drivelology_writing_adapter.py' 2026-02-08T20:41:56,632 adding 'evalscope/benchmarks/drop/__init__.py' 2026-02-08T20:41:56,634 adding 'evalscope/benchmarks/drop/drop_adapter.py' 2026-02-08T20:41:56,635 adding 'evalscope/benchmarks/drop/utils.py' 2026-02-08T20:41:56,637 adding 'evalscope/benchmarks/eq_bench/__init__.py' 2026-02-08T20:41:56,639 adding 'evalscope/benchmarks/eq_bench/answer_validation.py' 2026-02-08T20:41:56,640 adding 'evalscope/benchmarks/eq_bench/eq_bench_adapter.py' 2026-02-08T20:41:56,642 adding 'evalscope/benchmarks/fleurs/__init__.py' 2026-02-08T20:41:56,644 adding 'evalscope/benchmarks/fleurs/fleurs_adapter.py' 2026-02-08T20:41:56,645 adding 'evalscope/benchmarks/frames/__init__.py' 2026-02-08T20:41:56,647 adding 'evalscope/benchmarks/frames/frames_adapter.py' 2026-02-08T20:41:56,648 adding 'evalscope/benchmarks/frames/utils.py' 2026-02-08T20:41:56,649 adding 'evalscope/benchmarks/general_arena/__init__.py' 2026-02-08T20:41:56,652 adding 'evalscope/benchmarks/general_arena/general_arena_adapter.py' 2026-02-08T20:41:56,654 adding 'evalscope/benchmarks/general_arena/utils.py' 2026-02-08T20:41:56,656 adding 'evalscope/benchmarks/general_fc/__init__.py' 2026-02-08T20:41:56,657 adding 'evalscope/benchmarks/general_fc/general_fc_adapter.py' 2026-02-08T20:41:56,659 adding 'evalscope/benchmarks/general_mcq/__init__.py' 2026-02-08T20:41:56,660 adding 'evalscope/benchmarks/general_mcq/general_mcq_adapter.py' 2026-02-08T20:41:56,662 adding 'evalscope/benchmarks/general_qa/__init__.py' 2026-02-08T20:41:56,663 adding 'evalscope/benchmarks/general_qa/general_qa_adapter.py' 2026-02-08T20:41:56,665 adding 'evalscope/benchmarks/general_vmcq/__init__.py' 2026-02-08T20:41:56,667 adding 'evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py' 2026-02-08T20:41:56,668 adding 'evalscope/benchmarks/general_vqa/__init__.py' 2026-02-08T20:41:56,670 adding 'evalscope/benchmarks/general_vqa/general_vqa_adapter.py' 2026-02-08T20:41:56,672 adding 'evalscope/benchmarks/gpqa/__init__.py' 2026-02-08T20:41:56,673 adding 'evalscope/benchmarks/gpqa/gpqa_adapter.py' 2026-02-08T20:41:56,675 adding 'evalscope/benchmarks/gpqa/prompt.py' 2026-02-08T20:41:56,677 adding 'evalscope/benchmarks/gsm8k/__init__.py' 2026-02-08T20:41:56,678 adding 'evalscope/benchmarks/gsm8k/gsm8k_adapter.py' 2026-02-08T20:41:56,680 adding 'evalscope/benchmarks/gsm8k_v/__init__.py' 2026-02-08T20:41:56,681 adding 'evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py' 2026-02-08T20:41:56,683 adding 'evalscope/benchmarks/hallusion_bench/__init__.py' 2026-02-08T20:41:56,684 adding 'evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py' 2026-02-08T20:41:56,686 adding 'evalscope/benchmarks/halu_eval/__init__.py' 2026-02-08T20:41:56,688 adding 'evalscope/benchmarks/halu_eval/halu_eval_adapter.py' 2026-02-08T20:41:56,689 adding 'evalscope/benchmarks/halu_eval/halu_eval_instructions.py' 2026-02-08T20:41:56,691 adding 'evalscope/benchmarks/healthbench/__init__.py' 2026-02-08T20:41:56,693 adding 'evalscope/benchmarks/healthbench/healthbench_adapter.py' 2026-02-08T20:41:56,695 adding 'evalscope/benchmarks/healthbench/utils.py' 2026-02-08T20:41:56,697 adding 'evalscope/benchmarks/hellaswag/__init__.py' 2026-02-08T20:41:56,698 adding 'evalscope/benchmarks/hellaswag/hellaswag_adapter.py' 2026-02-08T20:41:56,700 adding 'evalscope/benchmarks/hle/__init__.py' 2026-02-08T20:41:56,701 adding 'evalscope/benchmarks/hle/hle_adapter.py' 2026-02-08T20:41:56,703 adding 'evalscope/benchmarks/humaneval/__init__.py' 2026-02-08T20:41:56,705 adding 'evalscope/benchmarks/humaneval/humaneval_adapter.py' 2026-02-08T20:41:56,706 adding 'evalscope/benchmarks/humaneval/utils.py' 2026-02-08T20:41:56,708 adding 'evalscope/benchmarks/humanevalplus/__init__.py' 2026-02-08T20:41:56,709 adding 'evalscope/benchmarks/humanevalplus/humanevalplus_adapter.py' 2026-02-08T20:41:56,711 adding 'evalscope/benchmarks/humanevalplus/docker/Dockerfile' 2026-02-08T20:41:56,712 adding 'evalscope/benchmarks/ifbench/__init__.py' 2026-02-08T20:41:56,714 adding 'evalscope/benchmarks/ifbench/evaluation_lib.py' 2026-02-08T20:41:56,715 adding 'evalscope/benchmarks/ifbench/ifbench_adapter.py' 2026-02-08T20:41:56,723 adding 'evalscope/benchmarks/ifbench/instructions.py' 2026-02-08T20:41:56,725 adding 'evalscope/benchmarks/ifbench/instructions_registry.py' 2026-02-08T20:41:56,728 adding 'evalscope/benchmarks/ifbench/instructions_util.py' 2026-02-08T20:41:56,730 adding 'evalscope/benchmarks/ifeval/__init__.py' 2026-02-08T20:41:56,731 adding 'evalscope/benchmarks/ifeval/ifeval_adapter.py' 2026-02-08T20:41:56,737 adding 'evalscope/benchmarks/ifeval/instructions.py' 2026-02-08T20:41:56,738 adding 'evalscope/benchmarks/ifeval/instructions_registry.py' 2026-02-08T20:41:56,742 adding 'evalscope/benchmarks/ifeval/instructions_util.py' 2026-02-08T20:41:56,743 adding 'evalscope/benchmarks/ifeval/utils.py' 2026-02-08T20:41:56,745 adding 'evalscope/benchmarks/image_edit/__init__.py' 2026-02-08T20:41:56,747 adding 'evalscope/benchmarks/image_edit/gedit/__init__.py' 2026-02-08T20:41:56,748 adding 'evalscope/benchmarks/image_edit/gedit/gedit_adapter.py' 2026-02-08T20:41:56,750 adding 'evalscope/benchmarks/image_edit/gedit/utils.py' 2026-02-08T20:41:56,753 adding 'evalscope/benchmarks/image_edit/gedit/vie_prompts.py' 2026-02-08T20:41:56,754 adding 'evalscope/benchmarks/infovqa/__init__.py' 2026-02-08T20:41:56,756 adding 'evalscope/benchmarks/infovqa/infovqa_adapter.py' 2026-02-08T20:41:56,758 adding 'evalscope/benchmarks/iquiz/__init__.py' 2026-02-08T20:41:56,759 adding 'evalscope/benchmarks/iquiz/iquiz_adapter.py' 2026-02-08T20:41:56,761 adding 'evalscope/benchmarks/librispeech/__init__.py' 2026-02-08T20:41:56,762 adding 'evalscope/benchmarks/librispeech/librispeech_adapter.py' 2026-02-08T20:41:56,764 adding 'evalscope/benchmarks/live_code_bench/__init__.py' 2026-02-08T20:41:56,765 adding 'evalscope/benchmarks/live_code_bench/evaluate_utils.py' 2026-02-08T20:41:56,767 adding 'evalscope/benchmarks/live_code_bench/extract_utils.py' 2026-02-08T20:41:56,768 adding 'evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py' 2026-02-08T20:41:56,770 adding 'evalscope/benchmarks/live_code_bench/load_utils.py' 2026-02-08T20:41:56,771 adding 'evalscope/benchmarks/live_code_bench/pass_k_utils.py' 2026-02-08T20:41:56,773 adding 'evalscope/benchmarks/live_code_bench/prompts.py' 2026-02-08T20:41:56,775 adding 'evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py' 2026-02-08T20:41:56,777 adding 'evalscope/benchmarks/live_code_bench/testing_util.py' 2026-02-08T20:41:56,779 adding 'evalscope/benchmarks/logi_qa/__int__.py' 2026-02-08T20:41:56,781 adding 'evalscope/benchmarks/logi_qa/logi_qa_adapter.py' 2026-02-08T20:41:56,782 adding 'evalscope/benchmarks/maritime_bench/__init__.py' 2026-02-08T20:41:56,784 adding 'evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py' 2026-02-08T20:41:56,785 adding 'evalscope/benchmarks/math_500/__init__.py' 2026-02-08T20:41:56,787 adding 'evalscope/benchmarks/math_500/math_500_adapter.py' 2026-02-08T20:41:56,789 adding 'evalscope/benchmarks/math_qa/__init__.py' 2026-02-08T20:41:56,790 adding 'evalscope/benchmarks/math_qa/math_qa_adapter.py' 2026-02-08T20:41:56,792 adding 'evalscope/benchmarks/math_verse/__init__.py' 2026-02-08T20:41:56,793 adding 'evalscope/benchmarks/math_verse/math_verse_adapter.py' 2026-02-08T20:41:56,795 adding 'evalscope/benchmarks/math_vision/__init__.py' 2026-02-08T20:41:56,797 adding 'evalscope/benchmarks/math_vision/math_vision_adapter.py' 2026-02-08T20:41:56,799 adding 'evalscope/benchmarks/math_vista/__init__.py' 2026-02-08T20:41:56,800 adding 'evalscope/benchmarks/math_vista/math_vista_adapter.py' 2026-02-08T20:41:56,802 adding 'evalscope/benchmarks/mbpp/__init__.py' 2026-02-08T20:41:56,804 adding 'evalscope/benchmarks/mbpp/mbpp_adapter.py' 2026-02-08T20:41:56,806 adding 'evalscope/benchmarks/mbppplus/__init__.py' 2026-02-08T20:41:56,807 adding 'evalscope/benchmarks/mbppplus/mbppplus_adapter.py' 2026-02-08T20:41:56,809 adding 'evalscope/benchmarks/med_mcqa/__init__.py' 2026-02-08T20:41:56,810 adding 'evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py' 2026-02-08T20:41:56,812 adding 'evalscope/benchmarks/mgsm/__init__.py' 2026-02-08T20:41:56,813 adding 'evalscope/benchmarks/mgsm/mgsm_adapter.py' 2026-02-08T20:41:56,815 adding 'evalscope/benchmarks/micro_vqa/__init__.py' 2026-02-08T20:41:56,816 adding 'evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py' 2026-02-08T20:41:56,818 adding 'evalscope/benchmarks/minerva_math/__init__.py' 2026-02-08T20:41:56,819 adding 'evalscope/benchmarks/minerva_math/minerva_math_adapter.py' 2026-02-08T20:41:56,821 adding 'evalscope/benchmarks/mm_bench/__init__.py' 2026-02-08T20:41:56,822 adding 'evalscope/benchmarks/mm_bench/mm_bench_adapter.py' 2026-02-08T20:41:56,824 adding 'evalscope/benchmarks/mm_star/__init__.py' 2026-02-08T20:41:56,825 adding 'evalscope/benchmarks/mm_star/mm_star_adapter.py' 2026-02-08T20:41:56,827 adding 'evalscope/benchmarks/mmlu/__init__.py' 2026-02-08T20:41:56,828 adding 'evalscope/benchmarks/mmlu/mmlu_adapter.py' 2026-02-08T20:41:56,830 adding 'evalscope/benchmarks/mmlu_pro/__init__.py' 2026-02-08T20:41:56,831 adding 'evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py' 2026-02-08T20:41:56,833 adding 'evalscope/benchmarks/mmlu_redux/__init__.py' 2026-02-08T20:41:56,835 adding 'evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py' 2026-02-08T20:41:56,837 adding 'evalscope/benchmarks/mmmu/__init__.py' 2026-02-08T20:41:56,838 adding 'evalscope/benchmarks/mmmu/mmmu_adapter.py' 2026-02-08T20:41:56,840 adding 'evalscope/benchmarks/mmmu_pro/__init__.py' 2026-02-08T20:41:56,841 adding 'evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py' 2026-02-08T20:41:56,843 adding 'evalscope/benchmarks/mri_mcqa/__init__.py' 2026-02-08T20:41:56,845 adding 'evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py' 2026-02-08T20:41:56,846 adding 'evalscope/benchmarks/multi_if/__init__.py' 2026-02-08T20:41:56,855 adding 'evalscope/benchmarks/multi_if/ifeval.py' 2026-02-08T20:41:56,857 adding 'evalscope/benchmarks/multi_if/metrics.py' 2026-02-08T20:41:56,858 adding 'evalscope/benchmarks/multi_if/multi_if_adapter.py' 2026-02-08T20:41:56,860 adding 'evalscope/benchmarks/multipl_e/__init__.py' 2026-02-08T20:41:56,862 adding 'evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py' 2026-02-08T20:41:56,863 adding 'evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py' 2026-02-08T20:41:56,865 adding 'evalscope/benchmarks/multipl_e/utils.py' 2026-02-08T20:41:56,867 adding 'evalscope/benchmarks/music_trivia/__init__.py' 2026-02-08T20:41:56,868 adding 'evalscope/benchmarks/music_trivia/music_trivia_adapter.py' 2026-02-08T20:41:56,870 adding 'evalscope/benchmarks/musr/__init__.py' 2026-02-08T20:41:56,871 adding 'evalscope/benchmarks/musr/musr_adapter.py' 2026-02-08T20:41:56,873 adding 'evalscope/benchmarks/needle_haystack/__init__.py' 2026-02-08T20:41:56,876 adding 'evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py' 2026-02-08T20:41:56,877 adding 'evalscope/benchmarks/needle_haystack/utils.py' 2026-02-08T20:41:56,879 adding 'evalscope/benchmarks/ner/__init__.py' 2026-02-08T20:41:56,881 adding 'evalscope/benchmarks/ner/anat_em_adapter.py' 2026-02-08T20:41:56,882 adding 'evalscope/benchmarks/ner/bc2gm_adapter.py' 2026-02-08T20:41:56,884 adding 'evalscope/benchmarks/ner/bc4chemd_adapter.py' 2026-02-08T20:41:56,885 adding 'evalscope/benchmarks/ner/bc5cdr_adapter.py' 2026-02-08T20:41:56,887 adding 'evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py' 2026-02-08T20:41:56,888 adding 'evalscope/benchmarks/ner/conll2003_adapter.py' 2026-02-08T20:41:56,890 adding 'evalscope/benchmarks/ner/conllpp_adapter.py' 2026-02-08T20:41:56,891 adding 'evalscope/benchmarks/ner/copious_adapter.py' 2026-02-08T20:41:56,893 adding 'evalscope/benchmarks/ner/cross_ner_adapter.py' 2026-02-08T20:41:56,894 adding 'evalscope/benchmarks/ner/fin_ner_adapter.py' 2026-02-08T20:41:56,896 adding 'evalscope/benchmarks/ner/genia_ner_adapter.py' 2026-02-08T20:41:56,897 adding 'evalscope/benchmarks/ner/harvey_ner_adapter.py' 2026-02-08T20:41:56,898 adding 'evalscope/benchmarks/ner/jnlpba_adapter.py' 2026-02-08T20:41:56,900 adding 'evalscope/benchmarks/ner/jnlpba_rare_adapter.py' 2026-02-08T20:41:56,901 adding 'evalscope/benchmarks/ner/mit_movie_trivia_adapter.py' 2026-02-08T20:41:56,902 adding 'evalscope/benchmarks/ner/mit_restaurant_adapter.py' 2026-02-08T20:41:56,904 adding 'evalscope/benchmarks/ner/multi_nerd_adapter.py' 2026-02-08T20:41:56,905 adding 'evalscope/benchmarks/ner/ncbi_adapter.py' 2026-02-08T20:41:56,907 adding 'evalscope/benchmarks/ner/ontonotes5_adapter.py' 2026-02-08T20:41:56,908 adding 'evalscope/benchmarks/ner/tweebank_ner_adapter.py' 2026-02-08T20:41:56,910 adding 'evalscope/benchmarks/ner/tweet_ner_7_adapter.py' 2026-02-08T20:41:56,911 adding 'evalscope/benchmarks/ner/wnut2017_adapter.py' 2026-02-08T20:41:56,913 adding 'evalscope/benchmarks/ner/cross_ner_entities/__init__.py' 2026-02-08T20:41:56,914 adding 'evalscope/benchmarks/ner/cross_ner_entities/ai.py' 2026-02-08T20:41:56,916 adding 'evalscope/benchmarks/ner/cross_ner_entities/literature.py' 2026-02-08T20:41:56,917 adding 'evalscope/benchmarks/ner/cross_ner_entities/music.py' 2026-02-08T20:41:56,919 adding 'evalscope/benchmarks/ner/cross_ner_entities/politics.py' 2026-02-08T20:41:56,920 adding 'evalscope/benchmarks/ner/cross_ner_entities/science.py' 2026-02-08T20:41:56,922 adding 'evalscope/benchmarks/ocr_bench/__init__.py' 2026-02-08T20:41:56,924 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py' 2026-02-08T20:41:56,925 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py' 2026-02-08T20:41:56,927 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py' 2026-02-08T20:41:56,931 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py' 2026-02-08T20:41:56,933 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py' 2026-02-08T20:41:56,935 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py' 2026-02-08T20:41:56,936 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py' 2026-02-08T20:41:56,938 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py' 2026-02-08T20:41:56,939 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py' 2026-02-08T20:41:56,941 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py' 2026-02-08T20:41:56,943 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py' 2026-02-08T20:41:56,945 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py' 2026-02-08T20:41:56,946 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt' 2026-02-08T20:41:56,949 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py' 2026-02-08T20:41:56,952 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py' 2026-02-08T20:41:56,954 adding 'evalscope/benchmarks/olympiad_bench/__init__.py' 2026-02-08T20:41:56,955 adding 'evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py' 2026-02-08T20:41:56,958 adding 'evalscope/benchmarks/olympiad_bench/utils.py' 2026-02-08T20:41:56,960 adding 'evalscope/benchmarks/omni_bench/__init__.py' 2026-02-08T20:41:56,962 adding 'evalscope/benchmarks/omni_bench/omni_bench_adapter.py' 2026-02-08T20:41:56,964 adding 'evalscope/benchmarks/omnidoc_bench/__init__.py' 2026-02-08T20:41:56,966 adding 'evalscope/benchmarks/omnidoc_bench/end2end_eval.py' 2026-02-08T20:41:56,969 adding 'evalscope/benchmarks/omnidoc_bench/metrics.py' 2026-02-08T20:41:56,971 adding 'evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py' 2026-02-08T20:41:56,979 adding 'evalscope/benchmarks/omnidoc_bench/utils.py' 2026-02-08T20:41:56,982 adding 'evalscope/benchmarks/openai_mrcr/__init__.py' 2026-02-08T20:41:56,984 adding 'evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py' 2026-02-08T20:41:56,985 adding 'evalscope/benchmarks/openai_mrcr/utils.py' 2026-02-08T20:41:56,987 adding 'evalscope/benchmarks/piqa/__init__.py' 2026-02-08T20:41:56,988 adding 'evalscope/benchmarks/piqa/piqa_adapter.py' 2026-02-08T20:41:56,990 adding 'evalscope/benchmarks/poly_math/__init__.py' 2026-02-08T20:41:56,991 adding 'evalscope/benchmarks/poly_math/poly_math_adapter.py' 2026-02-08T20:41:56,993 adding 'evalscope/benchmarks/poly_math/utils/instruction.py' 2026-02-08T20:41:56,995 adding 'evalscope/benchmarks/pope/__init__.py' 2026-02-08T20:41:56,996 adding 'evalscope/benchmarks/pope/pope_adapter.py' 2026-02-08T20:41:56,998 adding 'evalscope/benchmarks/process_bench/__init__.py' 2026-02-08T20:41:57,000 adding 'evalscope/benchmarks/process_bench/process_bench_adapter.py' 2026-02-08T20:41:57,001 adding 'evalscope/benchmarks/pumed_qa/__init__.py' 2026-02-08T20:41:57,003 adding 'evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py' 2026-02-08T20:41:57,004 adding 'evalscope/benchmarks/qasc/__init__.py' 2026-02-08T20:41:57,005 adding 'evalscope/benchmarks/qasc/qasc_adapter.py' 2026-02-08T20:41:57,007 adding 'evalscope/benchmarks/race/__init__.py' 2026-02-08T20:41:57,008 adding 'evalscope/benchmarks/race/race_adapter.py' 2026-02-08T20:41:57,010 adding 'evalscope/benchmarks/real_world_qa/__init__.py' 2026-02-08T20:41:57,011 adding 'evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py' 2026-02-08T20:41:57,013 adding 'evalscope/benchmarks/refcoco/__init__.py' 2026-02-08T20:41:57,014 adding 'evalscope/benchmarks/refcoco/evaluation_lib.py' 2026-02-08T20:41:57,016 adding 'evalscope/benchmarks/refcoco/refcoco_adapter.py' 2026-02-08T20:41:57,018 adding 'evalscope/benchmarks/refcoco/utils.py' 2026-02-08T20:41:57,019 adding 'evalscope/benchmarks/scicode/__init__.py' 2026-02-08T20:41:57,020 adding 'evalscope/benchmarks/scicode/prompt_templates.py' 2026-02-08T20:41:57,022 adding 'evalscope/benchmarks/scicode/scicode_adapter.py' 2026-02-08T20:41:57,024 adding 'evalscope/benchmarks/scicode/util.py' 2026-02-08T20:41:57,026 adding 'evalscope/benchmarks/scicode/docker/Dockerfile' 2026-02-08T20:41:57,027 adding 'evalscope/benchmarks/scicode/docker/docker_requirements.txt' 2026-02-08T20:41:57,028 adding 'evalscope/benchmarks/scicode/docker/process_data.py' 2026-02-08T20:41:57,030 adding 'evalscope/benchmarks/scicode/docker/test_util.py' 2026-02-08T20:41:57,031 adding 'evalscope/benchmarks/science_qa/__init__.py' 2026-02-08T20:41:57,033 adding 'evalscope/benchmarks/science_qa/science_qa_adapter.py' 2026-02-08T20:41:57,035 adding 'evalscope/benchmarks/sciq/__init__.py' 2026-02-08T20:41:57,036 adding 'evalscope/benchmarks/sciq/sciq_adapter.py' 2026-02-08T20:41:57,038 adding 'evalscope/benchmarks/seed_bench_2_plus/__init__.py' 2026-02-08T20:41:57,039 adding 'evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py' 2026-02-08T20:41:57,041 adding 'evalscope/benchmarks/simple_qa/__init__.py' 2026-02-08T20:41:57,043 adding 'evalscope/benchmarks/simple_qa/simple_qa_adapter.py' 2026-02-08T20:41:57,044 adding 'evalscope/benchmarks/simple_vqa/__init__.py' 2026-02-08T20:41:57,046 adding 'evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py' 2026-02-08T20:41:57,048 adding 'evalscope/benchmarks/siqa/__init__.py' 2026-02-08T20:41:57,049 adding 'evalscope/benchmarks/siqa/siqa_adapter.py' 2026-02-08T20:41:57,051 adding 'evalscope/benchmarks/super_gpqa/__init__.py' 2026-02-08T20:41:57,052 adding 'evalscope/benchmarks/super_gpqa/prompt.py' 2026-02-08T20:41:57,054 adding 'evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py' 2026-02-08T20:41:57,055 adding 'evalscope/benchmarks/super_gpqa/utils.py' 2026-02-08T20:41:57,057 adding 'evalscope/benchmarks/swe_bench/__init__.py' 2026-02-08T20:41:57,059 adding 'evalscope/benchmarks/swe_bench/build_images.py' 2026-02-08T20:41:57,060 adding 'evalscope/benchmarks/swe_bench/swe_bench_adapter.py' 2026-02-08T20:41:57,062 adding 'evalscope/benchmarks/swe_bench/utils.py' 2026-02-08T20:41:57,064 adding 'evalscope/benchmarks/tau_bench/__init__.py' 2026-02-08T20:41:57,065 adding 'evalscope/benchmarks/tau_bench/tau2_bench/__init__.py' 2026-02-08T20:41:57,067 adding 'evalscope/benchmarks/tau_bench/tau2_bench/generation.py' 2026-02-08T20:41:57,068 adding 'evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py' 2026-02-08T20:41:57,070 adding 'evalscope/benchmarks/tau_bench/tau_bench/__init__.py' 2026-02-08T20:41:57,071 adding 'evalscope/benchmarks/tau_bench/tau_bench/generation.py' 2026-02-08T20:41:57,073 adding 'evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py' 2026-02-08T20:41:57,075 adding 'evalscope/benchmarks/terminal_bench/__init__.py' 2026-02-08T20:41:57,076 adding 'evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py' 2026-02-08T20:41:57,078 adding 'evalscope/benchmarks/terminal_bench/utils.py' 2026-02-08T20:41:57,080 adding 'evalscope/benchmarks/text2image/__init__.py' 2026-02-08T20:41:57,081 adding 'evalscope/benchmarks/text2image/evalmuse_adapter.py' 2026-02-08T20:41:57,082 adding 'evalscope/benchmarks/text2image/genai_bench_adapter.py' 2026-02-08T20:41:57,084 adding 'evalscope/benchmarks/text2image/general_t2i_adapter.py' 2026-02-08T20:41:57,085 adding 'evalscope/benchmarks/text2image/hpdv2_adapter.py' 2026-02-08T20:41:57,086 adding 'evalscope/benchmarks/text2image/tifa_adapter.py' 2026-02-08T20:41:57,088 adding 'evalscope/benchmarks/tool_bench/__init__.py' 2026-02-08T20:41:57,090 adding 'evalscope/benchmarks/tool_bench/tool_bench_adapter.py' 2026-02-08T20:41:57,091 adding 'evalscope/benchmarks/tool_bench/utils.py' 2026-02-08T20:41:57,093 adding 'evalscope/benchmarks/torgo/__init__.py' 2026-02-08T20:41:57,095 adding 'evalscope/benchmarks/torgo/torgo_adapter.py' 2026-02-08T20:41:57,096 adding 'evalscope/benchmarks/trivia_qa/__init__.py' 2026-02-08T20:41:57,098 adding 'evalscope/benchmarks/trivia_qa/samples.jsonl' 2026-02-08T20:41:57,099 adding 'evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py' 2026-02-08T20:41:57,101 adding 'evalscope/benchmarks/truthful_qa/__init__.py' 2026-02-08T20:41:57,102 adding 'evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py' 2026-02-08T20:41:57,104 adding 'evalscope/benchmarks/visu_logic/__init__.py' 2026-02-08T20:41:57,106 adding 'evalscope/benchmarks/visu_logic/visu_logic_adapter.py' 2026-02-08T20:41:57,107 adding 'evalscope/benchmarks/vstar_bench/__init__.py' 2026-02-08T20:41:57,109 adding 'evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py' 2026-02-08T20:41:57,110 adding 'evalscope/benchmarks/winogrande/__init__.py' 2026-02-08T20:41:57,112 adding 'evalscope/benchmarks/winogrande/winogrande_adapter.py' 2026-02-08T20:41:57,113 adding 'evalscope/benchmarks/wmt/__init__.py' 2026-02-08T20:41:57,115 adding 'evalscope/benchmarks/wmt/wmt24_adapter.py' 2026-02-08T20:41:57,117 adding 'evalscope/benchmarks/zebralogicbench/__init__.py' 2026-02-08T20:41:57,118 adding 'evalscope/benchmarks/zebralogicbench/utils.py' 2026-02-08T20:41:57,120 adding 'evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py' 2026-02-08T20:41:57,122 adding 'evalscope/benchmarks/zerobench/__init__.py' 2026-02-08T20:41:57,124 adding 'evalscope/benchmarks/zerobench/zerobench_adapter.py' 2026-02-08T20:41:57,125 adding 'evalscope/cli/__init__.py' 2026-02-08T20:41:57,127 adding 'evalscope/cli/base.py' 2026-02-08T20:41:57,128 adding 'evalscope/cli/cli.py' 2026-02-08T20:41:57,129 adding 'evalscope/cli/start_app.py' 2026-02-08T20:41:57,130 adding 'evalscope/cli/start_eval.py' 2026-02-08T20:41:57,132 adding 'evalscope/cli/start_perf.py' 2026-02-08T20:41:57,133 adding 'evalscope/cli/start_service.py' 2026-02-08T20:41:57,135 adding 'evalscope/collections/__init__.py' 2026-02-08T20:41:57,136 adding 'evalscope/collections/sampler.py' 2026-02-08T20:41:57,138 adding 'evalscope/collections/schema.py' 2026-02-08T20:41:57,139 adding 'evalscope/evaluator/__init__.py' 2026-02-08T20:41:57,142 adding 'evalscope/evaluator/evaluator.py' 2026-02-08T20:41:57,143 adding 'evalscope/filters/__init__.py' 2026-02-08T20:41:57,145 adding 'evalscope/filters/extraction.py' 2026-02-08T20:41:57,146 adding 'evalscope/filters/selection.py' 2026-02-08T20:41:57,148 adding 'evalscope/metrics/__init__.py' 2026-02-08T20:41:57,150 adding 'evalscope/metrics/llm_judge.py' 2026-02-08T20:41:57,153 adding 'evalscope/metrics/math_parser.py' 2026-02-08T20:41:57,155 adding 'evalscope/metrics/metric.py' 2026-02-08T20:41:57,158 adding 'evalscope/metrics/metrics.py' 2026-02-08T20:41:57,160 adding 'evalscope/metrics/rouge_metric.py' 2026-02-08T20:41:57,161 adding 'evalscope/metrics/bert_score/__init__.py' 2026-02-08T20:41:57,163 adding 'evalscope/metrics/bert_score/scorer.py' 2026-02-08T20:41:57,167 adding 'evalscope/metrics/bert_score/utils.py' 2026-02-08T20:41:57,169 adding 'evalscope/metrics/bundled_rouge_score/__init__.py' 2026-02-08T20:41:57,171 adding 'evalscope/metrics/bundled_rouge_score/rouge_scorer.py' 2026-02-08T20:41:57,172 adding 'evalscope/metrics/sem_score/__init__.py' 2026-02-08T20:41:57,174 adding 'evalscope/metrics/sem_score/scorer.py' 2026-02-08T20:41:57,176 adding 'evalscope/metrics/t2v_metrics/__init__.py' 2026-02-08T20:41:57,178 adding 'evalscope/metrics/t2v_metrics/clipscore.py' 2026-02-08T20:41:57,179 adding 'evalscope/metrics/t2v_metrics/constants.py' 2026-02-08T20:41:57,180 adding 'evalscope/metrics/t2v_metrics/itmscore.py' 2026-02-08T20:41:57,182 adding 'evalscope/metrics/t2v_metrics/score.py' 2026-02-08T20:41:57,183 adding 'evalscope/metrics/t2v_metrics/vqascore.py' 2026-02-08T20:41:57,185 adding 'evalscope/metrics/t2v_metrics/models/__init__.py' 2026-02-08T20:41:57,186 adding 'evalscope/metrics/t2v_metrics/models/model.py' 2026-02-08T20:41:57,187 adding 'evalscope/metrics/t2v_metrics/models/utils.py' 2026-02-08T20:41:57,189 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py' 2026-02-08T20:41:57,191 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py' 2026-02-08T20:41:57,193 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py' 2026-02-08T20:41:57,194 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py' 2026-02-08T20:41:57,196 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py' 2026-02-08T20:41:57,198 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py' 2026-02-08T20:41:57,199 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py' 2026-02-08T20:41:57,200 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py' 2026-02-08T20:41:57,202 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py' 2026-02-08T20:41:57,204 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py' 2026-02-08T20:41:57,206 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py' 2026-02-08T20:41:57,207 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py' 2026-02-08T20:41:57,209 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py' 2026-02-08T20:41:57,211 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py' 2026-02-08T20:41:57,212 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py' 2026-02-08T20:41:57,214 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py' 2026-02-08T20:41:57,216 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py' 2026-02-08T20:41:57,218 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py' 2026-02-08T20:41:57,219 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py' 2026-02-08T20:41:57,221 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py' 2026-02-08T20:41:57,222 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py' 2026-02-08T20:41:57,223 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py' 2026-02-08T20:41:57,225 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py' 2026-02-08T20:41:57,228 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py' 2026-02-08T20:41:57,229 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py' 2026-02-08T20:41:57,231 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py' 2026-02-08T20:41:57,233 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py' 2026-02-08T20:41:57,234 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py' 2026-02-08T20:41:57,237 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py' 2026-02-08T20:41:57,239 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py' 2026-02-08T20:41:57,240 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py' 2026-02-08T20:41:57,241 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py' 2026-02-08T20:41:57,243 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py' 2026-02-08T20:41:57,244 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py' 2026-02-08T20:41:57,247 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py' 2026-02-08T20:41:57,248 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py' 2026-02-08T20:41:57,250 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py' 2026-02-08T20:41:57,252 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py' 2026-02-08T20:41:57,254 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml' 2026-02-08T20:41:57,255 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json' 2026-02-08T20:41:57,257 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json' 2026-02-08T20:41:57,258 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json' 2026-02-08T20:41:57,260 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml' 2026-02-08T20:41:57,261 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml' 2026-02-08T20:41:57,263 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml' 2026-02-08T20:41:57,264 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml' 2026-02-08T20:41:57,265 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml' 2026-02-08T20:41:57,267 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml' 2026-02-08T20:41:57,268 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml' 2026-02-08T20:41:57,270 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml' 2026-02-08T20:41:57,271 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml' 2026-02-08T20:41:57,272 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml' 2026-02-08T20:41:57,274 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml' 2026-02-08T20:41:57,275 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml' 2026-02-08T20:41:57,276 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml' 2026-02-08T20:41:57,278 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml' 2026-02-08T20:41:57,279 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml' 2026-02-08T20:41:57,280 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml' 2026-02-08T20:41:57,282 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml' 2026-02-08T20:41:57,283 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml' 2026-02-08T20:41:57,284 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml' 2026-02-08T20:41:57,286 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py' 2026-02-08T20:41:57,288 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py' 2026-02-08T20:41:57,290 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py' 2026-02-08T20:41:57,292 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py' 2026-02-08T20:41:57,298 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py' 2026-02-08T20:41:57,301 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py' 2026-02-08T20:41:57,306 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py' 2026-02-08T20:41:57,308 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py' 2026-02-08T20:41:57,309 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py' 2026-02-08T20:41:57,311 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py' 2026-02-08T20:41:57,313 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py' 2026-02-08T20:41:57,315 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py' 2026-02-08T20:41:57,318 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py' 2026-02-08T20:41:57,320 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py' 2026-02-08T20:41:57,325 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py' 2026-02-08T20:41:57,333 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py' 2026-02-08T20:41:57,336 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py' 2026-02-08T20:41:57,337 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py' 2026-02-08T20:41:57,339 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py' 2026-02-08T20:41:57,341 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py' 2026-02-08T20:41:57,343 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py' 2026-02-08T20:41:57,344 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py' 2026-02-08T20:41:57,346 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py' 2026-02-08T20:41:57,347 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py' 2026-02-08T20:41:57,349 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py' 2026-02-08T20:41:57,352 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py' 2026-02-08T20:41:57,356 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py' 2026-02-08T20:41:57,358 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py' 2026-02-08T20:41:57,359 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py' 2026-02-08T20:41:57,361 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py' 2026-02-08T20:41:57,363 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py' 2026-02-08T20:41:57,364 adding 'evalscope/metrics/text_normalizer/__init__.py' 2026-02-08T20:41:57,366 adding 'evalscope/metrics/text_normalizer/basic.py' 2026-02-08T20:41:57,372 adding 'evalscope/metrics/text_normalizer/chinese.py' 2026-02-08T20:41:57,381 adding 'evalscope/metrics/text_normalizer/english.json' 2026-02-08T20:41:57,383 adding 'evalscope/metrics/text_normalizer/english.py' 2026-02-08T20:41:57,385 adding 'evalscope/metrics/text_normalizer/wer.py' 2026-02-08T20:41:57,387 adding 'evalscope/models/__init__.py' 2026-02-08T20:41:57,388 adding 'evalscope/models/image_edit_model.py' 2026-02-08T20:41:57,390 adding 'evalscope/models/mockllm.py' 2026-02-08T20:41:57,391 adding 'evalscope/models/model_apis.py' 2026-02-08T20:41:57,393 adding 'evalscope/models/modelscope.py' 2026-02-08T20:41:57,395 adding 'evalscope/models/openai_compatible.py' 2026-02-08T20:41:57,397 adding 'evalscope/models/text2image_model.py' 2026-02-08T20:41:57,401 adding 'evalscope/models/utils/openai.py' 2026-02-08T20:41:57,402 adding 'evalscope/perf/__init__.py' 2026-02-08T20:41:57,405 adding 'evalscope/perf/arguments.py' 2026-02-08T20:41:57,406 adding 'evalscope/perf/benchmark.py' 2026-02-08T20:41:57,408 adding 'evalscope/perf/http_client.py' 2026-02-08T20:41:57,409 adding 'evalscope/perf/main.py' 2026-02-08T20:41:57,411 adding 'evalscope/perf/plugin/__init__.py' 2026-02-08T20:41:57,412 adding 'evalscope/perf/plugin/registry.py' 2026-02-08T20:41:57,414 adding 'evalscope/perf/plugin/api/__init__.py' 2026-02-08T20:41:57,416 adding 'evalscope/perf/plugin/api/base.py' 2026-02-08T20:41:57,417 adding 'evalscope/perf/plugin/api/custom_api.py' 2026-02-08T20:41:57,419 adding 'evalscope/perf/plugin/api/dashscope_api.py' 2026-02-08T20:41:57,421 adding 'evalscope/perf/plugin/api/default_api.py' 2026-02-08T20:41:57,424 adding 'evalscope/perf/plugin/api/openai_api.py' 2026-02-08T20:41:57,426 adding 'evalscope/perf/plugin/api/openai_embedding_api.py' 2026-02-08T20:41:57,428 adding 'evalscope/perf/plugin/api/openai_rerank_api.py' 2026-02-08T20:41:57,430 adding 'evalscope/perf/plugin/datasets/__init__.py' 2026-02-08T20:41:57,431 adding 'evalscope/perf/plugin/datasets/base.py' 2026-02-08T20:41:57,433 adding 'evalscope/perf/plugin/datasets/custom.py' 2026-02-08T20:41:57,434 adding 'evalscope/perf/plugin/datasets/embedding_dataset.py' 2026-02-08T20:41:57,436 adding 'evalscope/perf/plugin/datasets/flickr8k.py' 2026-02-08T20:41:57,437 adding 'evalscope/perf/plugin/datasets/kontext_bench.py' 2026-02-08T20:41:57,438 adding 'evalscope/perf/plugin/datasets/line_by_line.py' 2026-02-08T20:41:57,439 adding 'evalscope/perf/plugin/datasets/longalpaca.py' 2026-02-08T20:41:57,441 adding 'evalscope/perf/plugin/datasets/openqa.py' 2026-02-08T20:41:57,442 adding 'evalscope/perf/plugin/datasets/random_dataset.py' 2026-02-08T20:41:57,444 adding 'evalscope/perf/plugin/datasets/random_vl_dataset.py' 2026-02-08T20:41:57,445 adding 'evalscope/perf/plugin/datasets/rerank_dataset.py' 2026-02-08T20:41:57,447 adding 'evalscope/perf/plugin/datasets/speed_benchmark.py' 2026-02-08T20:41:57,448 adding 'evalscope/perf/plugin/datasets/utils.py' 2026-02-08T20:41:57,450 adding 'evalscope/perf/sla/__init__.py' 2026-02-08T20:41:57,451 adding 'evalscope/perf/sla/sla_criterion.py' 2026-02-08T20:41:57,453 adding 'evalscope/perf/sla/sla_run.py' 2026-02-08T20:41:57,455 adding 'evalscope/perf/utils/__init__.py' 2026-02-08T20:41:57,456 adding 'evalscope/perf/utils/analysis_result.py' 2026-02-08T20:41:57,458 adding 'evalscope/perf/utils/benchmark_util.py' 2026-02-08T20:41:57,460 adding 'evalscope/perf/utils/db_util.py' 2026-02-08T20:41:57,461 adding 'evalscope/perf/utils/handler.py' 2026-02-08T20:41:57,463 adding 'evalscope/perf/utils/local_server.py' 2026-02-08T20:41:57,464 adding 'evalscope/perf/utils/log_utils.py' 2026-02-08T20:41:57,466 adding 'evalscope/perf/utils/rich_display.py' 2026-02-08T20:41:57,468 adding 'evalscope/report/__init__.py' 2026-02-08T20:41:57,470 adding 'evalscope/report/combinator.py' 2026-02-08T20:41:57,472 adding 'evalscope/report/generator.py' 2026-02-08T20:41:57,473 adding 'evalscope/report/report.py' 2026-02-08T20:41:57,475 adding 'evalscope/service/__init__.py' 2026-02-08T20:41:57,477 adding 'evalscope/service/app.py' 2026-02-08T20:41:57,478 adding 'evalscope/service/utils.py' 2026-02-08T20:41:57,480 adding 'evalscope/service/frontend/__init__.py' 2026-02-08T20:41:57,481 adding 'evalscope/service/frontend/async_client.py' 2026-02-08T20:41:57,483 adding 'evalscope/service/frontend/main.py' 2026-02-08T20:41:57,485 adding 'evalscope/service/frontend/utils.py' 2026-02-08T20:41:57,487 adding 'evalscope/summarizer/__init__.py' 2026-02-08T20:41:57,488 adding 'evalscope/summarizer/summarizer.py' 2026-02-08T20:41:57,490 adding 'evalscope/third_party/__init__.py' 2026-02-08T20:41:57,492 adding 'evalscope/third_party/longbench_write/README.md' 2026-02-08T20:41:57,493 adding 'evalscope/third_party/longbench_write/__init__.py' 2026-02-08T20:41:57,494 adding 'evalscope/third_party/longbench_write/default_task.json' 2026-02-08T20:41:57,496 adding 'evalscope/third_party/longbench_write/default_task.yaml' 2026-02-08T20:41:57,498 adding 'evalscope/third_party/longbench_write/eval.py' 2026-02-08T20:41:57,499 adding 'evalscope/third_party/longbench_write/infer.py' 2026-02-08T20:41:57,500 adding 'evalscope/third_party/longbench_write/longbench_write.py' 2026-02-08T20:41:57,502 adding 'evalscope/third_party/longbench_write/utils.py' 2026-02-08T20:41:57,503 adding 'evalscope/third_party/longbench_write/resources/__init__.py' 2026-02-08T20:41:57,505 adding 'evalscope/third_party/longbench_write/resources/judge.txt' 2026-02-08T20:41:57,512 adding 'evalscope/third_party/longbench_write/resources/longbench_write.jsonl' 2026-02-08T20:41:57,516 adding 'evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl' 2026-02-08T20:41:57,517 adding 'evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl' 2026-02-08T20:41:57,519 adding 'evalscope/third_party/longbench_write/tools/__init__.py' 2026-02-08T20:41:57,521 adding 'evalscope/third_party/longbench_write/tools/data_etl.py' 2026-02-08T20:41:57,522 adding 'evalscope/third_party/longbench_write/tools/openai_api.py' 2026-02-08T20:41:57,524 adding 'evalscope/third_party/thinkbench/__init__.py' 2026-02-08T20:41:57,526 adding 'evalscope/third_party/thinkbench/eval.py' 2026-02-08T20:41:57,528 adding 'evalscope/third_party/thinkbench/infer.py' 2026-02-08T20:41:57,530 adding 'evalscope/third_party/thinkbench/resources/critique_template.txt' 2026-02-08T20:41:57,531 adding 'evalscope/third_party/thinkbench/resources/reformat_template.txt' 2026-02-08T20:41:57,533 adding 'evalscope/third_party/thinkbench/tools/__init__.py' 2026-02-08T20:41:57,534 adding 'evalscope/third_party/thinkbench/tools/llm.py' 2026-02-08T20:41:57,536 adding 'evalscope/third_party/thinkbench/tools/utils.py' 2026-02-08T20:41:57,538 adding 'evalscope/third_party/toolbench_static/README.md' 2026-02-08T20:41:57,539 adding 'evalscope/third_party/toolbench_static/__init__.py' 2026-02-08T20:41:57,540 adding 'evalscope/third_party/toolbench_static/config_default.json' 2026-02-08T20:41:57,541 adding 'evalscope/third_party/toolbench_static/config_default.yaml' 2026-02-08T20:41:57,542 adding 'evalscope/third_party/toolbench_static/eval.py' 2026-02-08T20:41:57,544 adding 'evalscope/third_party/toolbench_static/infer.py' 2026-02-08T20:41:57,545 adding 'evalscope/third_party/toolbench_static/requirements.txt' 2026-02-08T20:41:57,546 adding 'evalscope/third_party/toolbench_static/toolbench_static.py' 2026-02-08T20:41:57,548 adding 'evalscope/third_party/toolbench_static/llm/__init__.py' 2026-02-08T20:41:57,549 adding 'evalscope/third_party/toolbench_static/llm/swift_infer.py' 2026-02-08T20:41:57,551 adding 'evalscope/utils/__init__.py' 2026-02-08T20:41:57,553 adding 'evalscope/utils/argument_utils.py' 2026-02-08T20:41:57,554 adding 'evalscope/utils/chat_service.py' 2026-02-08T20:41:57,557 adding 'evalscope/utils/code_utils.py' 2026-02-08T20:41:57,559 adding 'evalscope/utils/deprecation_utils.py' 2026-02-08T20:41:57,561 adding 'evalscope/utils/function_utils.py' 2026-02-08T20:41:57,562 adding 'evalscope/utils/import_utils.py' 2026-02-08T20:41:57,565 adding 'evalscope/utils/io_utils.py' 2026-02-08T20:41:57,567 adding 'evalscope/utils/json_schema.py' 2026-02-08T20:41:57,568 adding 'evalscope/utils/logger.py' 2026-02-08T20:41:57,570 adding 'evalscope/utils/model_utils.py' 2026-02-08T20:41:57,571 adding 'evalscope/utils/multi_choices.py' 2026-02-08T20:41:57,573 adding 'evalscope/utils/ner.py' 2026-02-08T20:41:57,575 adding 'evalscope/utils/resource_utils.py' 2026-02-08T20:41:57,576 adding 'evalscope/utils/tqdm_utils.py' 2026-02-08T20:41:57,578 adding 'evalscope/utils/url_utils.py' 2026-02-08T20:41:57,581 adding 'evalscope-1.4.2.dist-info/licenses/LICENSE' 2026-02-08T20:41:57,586 adding 'evalscope-1.4.2.dist-info/METADATA' 2026-02-08T20:41:57,587 adding 'evalscope-1.4.2.dist-info/WHEEL' 2026-02-08T20:41:57,588 adding 'evalscope-1.4.2.dist-info/entry_points.txt' 2026-02-08T20:41:57,589 adding 'evalscope-1.4.2.dist-info/top_level.txt' 2026-02-08T20:41:57,601 adding 'evalscope-1.4.2.dist-info/RECORD' 2026-02-08T20:41:57,623 removing build/bdist.linux-armv7l/wheel 2026-02-08T20:41:57,944 Building wheel for evalscope (pyproject.toml): finished with status 'done' 2026-02-08T20:41:57,976 Created wheel for evalscope: filename=evalscope-1.4.2-py3-none-any.whl size=1247892 sha256=222e938fe502394b9935f3c00677cf1372892caa530b6fb48476ae909a91399a 2026-02-08T20:41:57,977 Stored in directory: /tmp/pip-ephem-wheel-cache-2yi8tsav/wheels/b3/f9/6c/b12c844778989d0e6a6fdf60b421bdde73c4e981133c8f0c84 2026-02-08T20:41:58,018 Successfully built evalscope 2026-02-08T20:41:58,057 Removed build tracker: '/tmp/pip-build-tracker-g2uz737x'