2026-01-05T08:14:57,510 Created temporary directory: /tmp/pip-ephem-wheel-cache-yilmo1ng
2026-01-05T08:14:57,512 Created temporary directory: /tmp/pip-build-tracker-59oc0_e5
2026-01-05T08:14:57,512 Initialized build tracking at /tmp/pip-build-tracker-59oc0_e5
2026-01-05T08:14:57,513 Created build tracker: /tmp/pip-build-tracker-59oc0_e5
2026-01-05T08:14:57,513 Entered build tracker: /tmp/pip-build-tracker-59oc0_e5
2026-01-05T08:14:57,514 Created temporary directory: /tmp/pip-wheel-4ehhxqua
2026-01-05T08:14:57,517 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453
2026-01-05T08:14:57,520 Created temporary directory: /tmp/pip-ephem-wheel-cache-nyri9b3e
2026-01-05T08:14:57,542 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2026-01-05T08:14:57,546 2 location(s) to search for versions of evalscope:
2026-01-05T08:14:57,546 * https://pypi.org/simple/evalscope/
2026-01-05T08:14:57,546 * https://www.piwheels.org/simple/evalscope/
2026-01-05T08:14:57,547 Fetching project page and analyzing links: https://pypi.org/simple/evalscope/
2026-01-05T08:14:57,547 Getting page https://pypi.org/simple/evalscope/
2026-01-05T08:14:57,549 Found index url https://pypi.org/simple
2026-01-05T08:14:57,772 Fetched page https://pypi.org/simple/evalscope/ as application/vnd.pypi.simple.v1+json
2026-01-05T08:14:57,788 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/94/d2/dc5e929802776bf4e662a46d794b765876bb93e2300189cafd113cac74d6/evalscope-0.5.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8)
2026-01-05T08:14:57,789
Found link https://files.pythonhosted.org/packages/e5/70/45a5dad24b1fa535bff194b99a4668e7f5f328be972b51b3b91eafb4cdbb/evalscope-0.5.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0rc0 2026-01-05T08:14:57,790 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ac/eb/341fe367df2bc9a0ae7ef5eb2037a5d549d9bb8c0d7ad84844c9926e0947/evalscope-0.5.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,791 Found link https://files.pythonhosted.org/packages/3b/3f/585f7f1cf2ce90b234c1cfd654bb26977be4d889c0e5eed0122cb3024c45/evalscope-0.5.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.0 2026-01-05T08:14:57,792 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ad/58/c0ce004159cfac6df9b5736c011576d52bdceb778943de8a022a419d86eb/evalscope-0.5.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,793 Found link https://files.pythonhosted.org/packages/94/33/7ad2f2285f5b68953ad4466d23dc5de1a2e57e7cc63d5924ab0e84d156ba/evalscope-0.5.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.2 2026-01-05T08:14:57,794 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/59/a9e1c4cf88018ece1fdd8d8b7fa976e28f9b4b181ef7ceb74a5e2db533ab/evalscope-0.5.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,795 Found link https://files.pythonhosted.org/packages/04/57/9ca7b1fd68f2acc32802b22236c83b597a5690a483d5938d38183b549d22/evalscope-0.5.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.3 2026-01-05T08:14:57,796 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/fa/c8/fcbaf01b7486c3b29b7790167c2cda560f00a04d100cec808ee9a3349ca0/evalscope-0.5.4-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,797 Found link https://files.pythonhosted.org/packages/8c/cc/abd412bad714c0266be1f0159b49a817d45db099c3bd031134d223589e93/evalscope-0.5.4.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.4 2026-01-05T08:14:57,798 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4d/da/d705d457683223f289e8c5d6cadbaad15d15e098692c53ea7e6196a94373/evalscope-0.5.5rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,799 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7e/4c/414dd545a1833245a53797d70a35a78ae1b9cfbcc81b35a4e1763e678437/evalscope-0.5.5rc1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,799 Found link https://files.pythonhosted.org/packages/56/5f/aa7fcf62102694dd66b69e88cb2523094bf04b53f785854e17ee6b7234a7/evalscope-0.5.5rc1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5rc1 2026-01-05T08:14:57,800 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/3e/d0/91a7a1f95f3fa19dd8d4e434dc711768abe5c006f32a514a8602b429e049/evalscope-0.5.5-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,801 Found link https://files.pythonhosted.org/packages/04/7e/a7f065d6ebac15fe172d3b0906ff5b26a71df5a9975c0f14978044211cf1/evalscope-0.5.5.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.5.5 2026-01-05T08:14:57,802 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/11/89/b812b01a5ed91fc079dea052e1341860cd65d25d463c75c90e5a30ab6ae8/evalscope-0.6.0rc0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,803 Found link https://files.pythonhosted.org/packages/d5/ad/57a2d5f33c5b7d5066f8a5dcb1d34f14bf246112f7900228b9f2fb41b21b/evalscope-0.6.0rc0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0rc0 2026-01-05T08:14:57,804 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/25/3f03d9d924f1b65610724c9f10727b48ef952afdfee8687a461949c88c78/evalscope-0.6.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,806 Found link https://files.pythonhosted.org/packages/9d/ac/1f432bcc46ccb8348b869b80d2aaabde5e583b370418ba48714083e31068/evalscope-0.6.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.0 2026-01-05T08:14:57,807 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7b/e1/42c9e58b4690f23ef48bce841fc95cbf0744c1579cdc80fa6f33b0453344/evalscope-0.6.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,808 Found link https://files.pythonhosted.org/packages/cd/d6/1d9d2db9acda6e61d4210074f51e7e3dee4d0212fabdd94999105db23eed/evalscope-0.6.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.6.1 2026-01-05T08:14:57,809 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/20/92/6fff1dfd12a4f73489c451dab56351b6a3c1095a92bb55025c2934fc625d/evalscope-0.7.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,810 Found link https://files.pythonhosted.org/packages/fe/3c/500d655a27ca80e1aba3fb2b1e8886951942732b869ad1516422d9e6ac97/evalscope-0.7.0.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.0 2026-01-05T08:14:57,811 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a5/3a/ae22d4d9a44ad37ac887da21b948bf6e784307001a09802d36c5bf04018d/evalscope-0.7.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,813 Found link https://files.pythonhosted.org/packages/88/47/69d067f0d3d784a7975cc4ea067fdc55c8f785ece6fbe86e5e21edc8b36f/evalscope-0.7.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.1 2026-01-05T08:14:57,814 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c2/23/acbcbc2ed6f00d3bb81220054651e6a2b1b02714d42b1aeb018a2f5574c4/evalscope-0.7.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,816 Found link https://files.pythonhosted.org/packages/a4/38/9126b9329cd2ad6ccfd4a73f04402bf71b65921564c3c12cb0d62b3b421d/evalscope-0.7.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.7.2 2026-01-05T08:14:57,817 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/11/1f/9598183da3026696adc19a6727a19d57f9abbf4fd7aeb20cdf12faee7693/evalscope-0.8.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,818 Found link https://files.pythonhosted.org/packages/63/7e/fc44d30e3a83dbc3070396d78279c9e3e8716cbc4cc05811a70f1b463bfb/evalscope-0.8.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.0 2026-01-05T08:14:57,819 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/37/a9/a9c6cda95a6b837c9303e9a1598999f2c4e605abb507365c6ff70b372a5a/evalscope-0.8.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,820 Found link 
https://files.pythonhosted.org/packages/63/5a/bbc230bb06a7bc40dd3985dde4615a8a71f111bf95761e40f6f0f8a7e1a6/evalscope-0.8.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.1 2026-01-05T08:14:57,821 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/07/39/a1eb2efed77d21e8253daa37f082669a004c4d288813a2ee9e15398f2e80/evalscope-0.8.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,822 Found link https://files.pythonhosted.org/packages/57/bc/5ff6d538e459d8b3c567577c991eec72ab6adbc19497dc164d96cd634d2f/evalscope-0.8.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.8.2 2026-01-05T08:14:57,823 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f6/29/81a188c03b272bf7def0c9bb556b9e9465adbc68ecb18907b636f1e8cbd7/evalscope-0.9.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,824 Found link https://files.pythonhosted.org/packages/c0/c2/19da3be1fbd6b548ecdc877d47269e92503518de53acbbfe96120c5c9753/evalscope-0.9.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.9.0 2026-01-05T08:14:57,825 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d9/6d/45d7407f31d6878494c3b493d7e49a8318b1839161839293c1a2e66aadcf/evalscope-0.10.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,827 Found link https://files.pythonhosted.org/packages/16/b1/b6cef37a0dd0acfa5873ca4763ac6b4ac4b19a0b15ca6bdc8f30d4443682/evalscope-0.10.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.0 2026-01-05T08:14:57,828 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/a7/8f/907045290d359b4e07e7acc96ec60173380748b49e9f3c91b7ddd8e8342d/evalscope-0.10.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,829 Found link https://files.pythonhosted.org/packages/e3/d3/dda2ac0513904bff8aa0c2efe77bc851d3acc6d514707db36648e4a903d2/evalscope-0.10.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.10.1 2026-01-05T08:14:57,830 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/d6/e5/852326943d86c85b5ca6b548a5f3753c217b771d93968c61ec2ca46ee0b1/evalscope-0.11.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,831 Found link https://files.pythonhosted.org/packages/5f/3f/e2816b99487b4ead257453a242b5282a85881f9d26fbb5efb21cc5cf88fa/evalscope-0.11.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.11.0 2026-01-05T08:14:57,832 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/7d/8e/f9eceecf8bc7d740603f915eef7fab3e9d657a01f5de2c523e531445299c/evalscope-0.12.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,833 Found link https://files.pythonhosted.org/packages/33/82/7765517ff80a73eac7465369767aa45a5be3d5e0fb7c4f4a3ff743811f8c/evalscope-0.12.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.0 2026-01-05T08:14:57,834 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/ee/4c/84a5e18985e149eb4283fef2b58a81ff2cec2d099c017684938ec3a3935f/evalscope-0.12.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,835 Found link https://files.pythonhosted.org/packages/b2/24/83b5530319bdb02142289e04640c6008dda1b988043c42feb7f0a5eab3b7/evalscope-0.12.1.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.12.1 2026-01-05T08:14:57,836 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/4e/a5/65faf0660cd8ae2660354b002b2b3a586b9419bc894120fea97efd506cb6/evalscope-0.13.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,837 Found link https://files.pythonhosted.org/packages/d0/39/d5eb469a94191760c61d1bfcb235e28be1d2a080d88b44792f53d76c45d1/evalscope-0.13.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.0 2026-01-05T08:14:57,838 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f7/fc/91b7b4379131d2e15ca1f575f533daea589357293e492ef2c93e0aac6b55/evalscope-0.13.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,839 Found link https://files.pythonhosted.org/packages/9e/a3/33b4ce270d5500fe7c8f32fa2160749b607f248141328d4785b6032c8f2a/evalscope-0.13.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.1 2026-01-05T08:14:57,840 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e6/66/83752c305879cf3dea54170398839ab08a046485bc18c41a34f41aca11ab/evalscope-0.13.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,842 Found link https://files.pythonhosted.org/packages/b7/8d/711ae30b80329e2dd7da760c001d9a5b45e4d8e5292f317f1ea10c744c29/evalscope-0.13.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.13.2 2026-01-05T08:14:57,843 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a8/b4/b22c7e52e6a7381333bdfa0bf92fae0258e7812064b1e208cbab56a62d08/evalscope-0.14.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,844 Found link 
https://files.pythonhosted.org/packages/40/44/7db2cb90e6ca0c9db92124f10c0273d7c6ef4b81523e1c98c34a88e67faa/evalscope-0.14.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.14.0 2026-01-05T08:14:57,845 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b2/47/976633e0f29b58b8c9f3faacf7373a2da734771a7915ac45d721d96e0ad7/evalscope-0.15.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,846 Found link https://files.pythonhosted.org/packages/b8/e3/bd534d69328afa98bdd497b5eaaf4b7416da9e8f56109d045c332b17d016/evalscope-0.15.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.0 2026-01-05T08:14:57,847 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5a/54/237aee5f0317fa04450c9ad67c3bf28b730460b2e6dc1e65b74b4bf2cd67/evalscope-0.15.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,849 Found link https://files.pythonhosted.org/packages/7a/d4/8b87e83a3a08f87ce5b4325f0cd5ab9bc54d296dc3f3492a1d3216a97a6d/evalscope-0.15.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.15.1 2026-01-05T08:14:57,850 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/1e/b5/6fec1cbb02a41ab79430eb3fb51eea7709525df0f9753ae2c54fbc4633f1/evalscope-0.16.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,851 Found link https://files.pythonhosted.org/packages/99/69/63997bedfa6fd33af671b539f77c375b111017eff23313f76693e24b8872/evalscope-0.16.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.0 2026-01-05T08:14:57,851 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/ea/76/639641578bdc92d25211eb8dae24d1fae19e40cd1649e17946f6ad8a5dc3/evalscope-0.16.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,853 Found link https://files.pythonhosted.org/packages/94/13/616bf9c33b0769db44a2bae32b54d33cf7129874392682aa76326a51e085/evalscope-0.16.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.1 2026-01-05T08:14:57,853 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f1/d3/9e83cc1b5a132342a05ef6ee79018bd7561f90a6406dc9db5c85fb0a281f/evalscope-0.16.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,854 Found link https://files.pythonhosted.org/packages/94/47/edd3faaddd321e464ae72db6e7bf82246dd4ca2f0f67127ca8c427cac664/evalscope-0.16.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.2 2026-01-05T08:14:57,855 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/5d/4e/34c56086cdfe7d1ad912241af819b3e67f20373f382016e33ac89dd43dde/evalscope-0.16.3-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,857 Found link https://files.pythonhosted.org/packages/22/07/14603c038a8019472881f574f1c47bd4193481f256b6dc702c65d8b8f984/evalscope-0.16.3.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.16.3 2026-01-05T08:14:57,858 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/f3/e2/104156f74779cf2849f53566bba585015492d7320c36a7cb76c7196b0ef5/evalscope-0.17.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:57,859 Found link https://files.pythonhosted.org/packages/c5/d0/b66d1b97ec67d65b6df54f18682e30cfeb6401604b93c9b1bdd1e97b8d79/evalscope-0.17.0.tar.gz (from 
https://pypi.org/simple/evalscope/) (requires-python:>=3.8), version: 0.17.0 2026-01-05T08:14:57,860 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/a9/b5/630d2c5dc5c32e9fbad5034e04d8aba6f6461dc08f255df77dd8d463857f/evalscope-0.17.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:57,861 Found link https://files.pythonhosted.org/packages/38/88/326e48929bb9577a6a36e07afa65bbf6bd870c1b644f82e5713874ae3238/evalscope-0.17.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 0.17.1 2026-01-05T08:14:57,862 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/21/18/9a6c208d2bc119ac67b5537b60851c6bccc99f25229eaa96cbe6e38721bd/evalscope-1.0.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:57,863 Found link https://files.pythonhosted.org/packages/d8/44/3d727dd28fcc50317c95d12e5bb850ed2863105f812373ac120877875434/evalscope-1.0.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.0 2026-01-05T08:14:57,864 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/e1/8a/50456fa7dd77be4c3a0ea0b3d96cb7ae5b2557454fdd35cbf0009a9d792f/evalscope-1.0.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:57,865 Found link https://files.pythonhosted.org/packages/8d/52/93569134b3d8dea2a0d0bc2134c03056f0ffee1840f7299eb83d475457df/evalscope-1.0.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.1 2026-01-05T08:14:57,866 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/22/f6/32fb0fef08a6c881ac840117455a5697a0d63226db8a24cce5208b720829/evalscope-1.0.2-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:57,867 Found link 
https://files.pythonhosted.org/packages/3d/f5/025baefe432d9af1ed845ab5738b638b7b97f2dd3767e9478b8eee10966f/evalscope-1.0.2.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.0.2 2026-01-05T08:14:57,868 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/6e/12/f31dbb18daa7e3c6cfecf856ddc323a303e115271166c100e06af58ea6b6/evalscope-1.1.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:57,869 Found link https://files.pythonhosted.org/packages/b4/98/7449040e89beaa4556bf35ba1e171b2d4955ff15b2b4c43f2ff55b048aeb/evalscope-1.1.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.9), version: 1.1.0 2026-01-05T08:14:57,870 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/79/d1/afc8b23345ad8f11a5e1f8c6c3112a8679604d833bdbc02aa06787952fd1/evalscope-1.1.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:57,871 Found link https://files.pythonhosted.org/packages/e8/3f/d67b73ce19789e914d6a78740fa7bfd0c07f161bc239b92cd3c26541f2fb/evalscope-1.1.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.1.1 2026-01-05T08:14:57,872 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/82/0a/d984751e4b5751f064209da3745b6aff6cc0f1d9d93f13cab0a1017a8639/evalscope-1.2.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:57,873 Found link https://files.pythonhosted.org/packages/7f/f9/0a2a069ee4500666ec5c3d10b302fc71d176c17bbe70447336e610953e1a/evalscope-1.2.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.2.0 2026-01-05T08:14:57,874 Skipping link: No binaries permitted for evalscope: 
https://files.pythonhosted.org/packages/d9/58/823d009dfa49cfdc750ac257e744eb97456d69528a09ac108ee8cab15316/evalscope-1.3.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:57,875 Found link https://files.pythonhosted.org/packages/42/5a/a309f7ce1fbe2b39e4b0a1f26cfcd7864eaa90e4792a5290e8cdd2ce3b4f/evalscope-1.3.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.3.0 2026-01-05T08:14:57,876 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/c6/57/005d3ef07ecdd5163e1bbed3413537b653b637d9c8b62a2bcdd97546607b/evalscope-1.4.0-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:57,877 Found link https://files.pythonhosted.org/packages/11/d5/268f610ac7db9c5c2109936f65ac4df8b4bc52106ed4369509d3b3c4f127/evalscope-1.4.0.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.0 2026-01-05T08:14:57,878 Skipping link: No binaries permitted for evalscope: https://files.pythonhosted.org/packages/b8/96/a2b4fabf6fa6cf09ac6669aa01dc483ac53576a8fd7c2c4be21ea281840f/evalscope-1.4.1-py3-none-any.whl (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:57,879 Found link https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz (from https://pypi.org/simple/evalscope/) (requires-python:>=3.10), version: 1.4.1 2026-01-05T08:14:57,880 Fetching project page and analyzing links: https://www.piwheels.org/simple/evalscope/ 2026-01-05T08:14:57,881 Getting page https://www.piwheels.org/simple/evalscope/ 2026-01-05T08:14:57,883 Found index url https://www.piwheels.org/simple 2026-01-05T08:14:58,049 Fetched page https://www.piwheels.org/simple/evalscope/ as text/html 2026-01-05T08:14:58,059 Skipping link: No binaries permitted for evalscope: 
https://www.piwheels.org/simple/evalscope/evalscope-1.4.0-py3-none-any.whl#sha256=307c9ed70f562ba776fdf9b0136b35fe4e361b1b974bb0f8ca39e425f4738e6e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:58,059 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.3.0-py3-none-any.whl#sha256=d9949bacf6c08b5ab341f872c6ee4b31995d724fe963f64ddf3c129e0e39145e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:58,060 Skipping link: No binaries permitted for evalscope: https://www.piwheels.org/simple/evalscope/evalscope-1.2.0-py3-none-any.whl#sha256=5969cf4a3132f6a29f9ef39aa35a8a3be24f114bbcd7a77a19c145bbec432be9 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:58,060 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.1-py3-none-any.whl#sha256=5bfb8c55f45e1bcd5df5cb0cd4ecceae2ccb93cd09c9477b0b0e6c097ecb1d1f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.10) 2026-01-05T08:14:58,061 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.1.0-py3-none-any.whl#sha256=ca7c951fa316bb7ec6fb0e38ad6503632067f07f469e37828fe9e39e51591994 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:58,062 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.2-py3-none-any.whl#sha256=e14016040022bcd666c05ffa806f3713775e9de19a98290bd2a0a36e5c435409 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:58,062 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.1-py3-none-any.whl#sha256=61b9c14f3409804d84ddf31b749b587c7c7441f9d1aa0d453991b1bd0bbda74c (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:58,063 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-1.0.0-py3-none-any.whl#sha256=ea1044755177db8d9e94cc4ccafbd56e35b173a8425db7d1653dd9f66e1463ad (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:58,064 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.1-py3-none-any.whl#sha256=aa0054b8aed77684e56d0836d4568080fa4827799a16d62fef6ec13802cd4050 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.9) 2026-01-05T08:14:58,064 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.17.0-py3-none-any.whl#sha256=e7a2549f9f5ac5b0061d01f8ca99900e06f6340b91d3e546163423b896287862 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,065 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.3-py3-none-any.whl#sha256=75dcbd7cc0a0336f68d407a3925fe065ecaaea0fa9030ceebfaa3f22f0f3b417 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,065 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.2-py3-none-any.whl#sha256=14e00f16b506b723a359799e3cb271370a3da768da3667563612e458982d6847 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,066 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.1-py3-none-any.whl#sha256=296b90c06a33f69c9e7049a768f16929acd1279af0830f47f18fc598560d0e13 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,067 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-0.16.0-py3-none-any.whl#sha256=0a6f11a7a4d564d4a1dca5fe424cda19608cf947b9ab739079f5b60842651c7e (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,067 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.1-py3-none-any.whl#sha256=504717cfc96a8fdbb0d4bc080d0292e80423891b658d61283385737794c4e95f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,068 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.15.0-py3-none-any.whl#sha256=263020a7b7f7788e17515c738604ea64904ca0da34d09664d2b7ee16d3522a00 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,068 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.14.0-py3-none-any.whl#sha256=cdf26beb4b188e1dd5e09feeb1832ccf909c9d2a535eb76e702fb3c66fc65688 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,069 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.2-py3-none-any.whl#sha256=7d360083cf9dd960996847cde2140085d0830ddc8b12aa8007b4f72d395c5211 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,070 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.1-py3-none-any.whl#sha256=6b26f11daca05d6b56da3cb1b78ea0a8f28de94901b7520171c66e2a16b1c638 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,070 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.13.0-py3-none-any.whl#sha256=8ca93c4011f04e35df239b5486248e03100a9c9e265664e01330ef4b1cc691f7 (from 
https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,071 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.1-py3-none-any.whl#sha256=72e5815923789c6cab6c32425477393bf645f670edcc493f8e4dec6e93f3da23 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,072 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.12.0-py3-none-any.whl#sha256=f26c7317a5bacac8527806723d45bac74563a2608ed761d822ace03be3a5e45f (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,072 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.11.0-py3-none-any.whl#sha256=6e7d3242c5bf97b54d24644a1300575fdf41c1d90eef7c1344c20bf0d1518671 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,073 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.1-py3-none-any.whl#sha256=a7e645410333c1aaec5a024be4c02166dfb0d6b4635b020c181ce672d31102d2 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,074 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.10.0-py3-none-any.whl#sha256=494f1742178def5f86e552c004562c877cd8b8b6f5d4267d29f961f20e8bf569 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,074 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.9.0-py3-none-any.whl#sha256=26fd59fd387850f3e84f995fd38357994460e71d0124bc09954ca8837027ec52 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,075 Skipping link: No binaries permitted for evalscope: 
https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.2-py3-none-any.whl#sha256=3c4ed25b5c39d7a706927607e552e6ced6d50532bc442bba74979f70720f4894 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,076 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.1-py3-none-any.whl#sha256=64e9306453082a95b0c0507d6fd1dbf50ed0ab210ebcfebdcb52c534769d1856 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,076 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.8.0-py3-none-any.whl#sha256=7758550ef3406d9c6096de05909cc1c97ed0c2f7be6edaf3fe5344244e34f233 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,076 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.2-py3-none-any.whl#sha256=21a1f4448ffca926853b7516b6375f801ef6a8067501dde546e7d794fb759f20 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,077 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.1-py3-none-any.whl#sha256=b959f2d9850544f2d96ac765b06ed112cf579a11899618fcc5245407f4e33843 (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,077 Skipping link: No binaries permitted for evalscope: https://archive1.piwheels.org/simple/evalscope/evalscope-0.7.0-py3-none-any.whl#sha256=2deddeb89bfb6fa02844b72a2be24627526136cbd45315e8e4dadb672d53c9fc (from https://www.piwheels.org/simple/evalscope/) (requires-python:>=3.8) 2026-01-05T08:14:58,078 Skipping link: not a file: https://www.piwheels.org/simple/evalscope/ 2026-01-05T08:14:58,079 Skipping link: not a file: https://pypi.org/simple/evalscope/ 2026-01-05T08:14:58,102 Given no hashes to check 1 links for project 'evalscope': 
discarding no candidates 2026-01-05T08:14:58,121 Collecting evalscope==1.4.1 2026-01-05T08:14:58,123 Created temporary directory: /tmp/pip-unpack-vmwk9uqg 2026-01-05T08:14:58,340 Downloading evalscope-1.4.1.tar.gz (937 kB) 2026-01-05T08:15:00,070 Added evalscope==1.4.1 from https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz to build tracker '/tmp/pip-build-tracker-59oc0_e5' 2026-01-05T08:15:00,076 Created temporary directory: /tmp/pip-build-env-k_pprhfw 2026-01-05T08:15:00,082 Installing build dependencies: started 2026-01-05T08:15:00,083 Running command pip subprocess to install build dependencies 2026-01-05T08:15:01,249 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-01-05T08:15:01,893 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. 
Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-01-05T08:15:01,916 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-01-05T08:15:03,621 Collecting setuptools>=69 2026-01-05T08:15:03,715 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2026-01-05T08:15:03,986 Collecting wheel 2026-01-05T08:15:04,003 Using cached https://www.piwheels.org/simple/wheel/wheel-0.45.1-py3-none-any.whl (72 kB) 2026-01-05T08:15:06,993 Installing collected packages: wheel, setuptools 2026-01-05T08:15:07,227 Creating /tmp/pip-build-env-k_pprhfw/overlay/local/bin 2026-01-05T08:15:07,229 changing mode of /tmp/pip-build-env-k_pprhfw/overlay/local/bin/wheel to 755 2026-01-05T08:15:10,876 Successfully installed setuptools-80.9.0 wheel-0.45.1 2026-01-05T08:15:11,153 Installing build dependencies: finished with status 'done' 2026-01-05T08:15:11,159 Getting requirements to build wheel: started 2026-01-05T08:15:11,160 Running command Getting requirements to build wheel 2026-01-05T08:15:11,878 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-01-05T08:15:11,879 !! 2026-01-05T08:15:11,880 ******************************************************************************** 2026-01-05T08:15:11,880 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-01-05T08:15:11,881 By 2026-Feb-18, you need to update your project and remove deprecated calls 2026-01-05T08:15:11,882 or your builds will no longer be supported. 2026-01-05T08:15:11,883 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 
2026-01-05T08:15:11,883 ******************************************************************************** 2026-01-05T08:15:11,884 !! 2026-01-05T08:15:11,885 corresp(dist, value, root_dir) 2026-01-05T08:15:11,980 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-01-05T08:15:11,981 !! 2026-01-05T08:15:11,982 ******************************************************************************** 2026-01-05T08:15:11,982 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-01-05T08:15:11,984 License :: OSI Approved :: Apache Software License 2026-01-05T08:15:11,985 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:11,985 ******************************************************************************** 2026-01-05T08:15:11,987 !! 2026-01-05T08:15:11,987 dist._finalize_license_expression() 2026-01-05T08:15:11,992 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-01-05T08:15:11,993 !! 2026-01-05T08:15:11,994 ******************************************************************************** 2026-01-05T08:15:11,995 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-01-05T08:15:11,996 License :: OSI Approved :: Apache Software License 2026-01-05T08:15:11,997 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:11,998 ******************************************************************************** 2026-01-05T08:15:11,999 !! 
2026-01-05T08:15:11,999 self._finalize_license_expression() 2026-01-05T08:15:12,004 running egg_info 2026-01-05T08:15:12,011 writing evalscope.egg-info/PKG-INFO 2026-01-05T08:15:12,042 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-01-05T08:15:12,044 writing entry points to evalscope.egg-info/entry_points.txt 2026-01-05T08:15:12,061 writing requirements to evalscope.egg-info/requires.txt 2026-01-05T08:15:12,063 writing top-level names to evalscope.egg-info/top_level.txt 2026-01-05T08:15:12,306 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:12,345 reading manifest template 'MANIFEST.in' 2026-01-05T08:15:12,667 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-01-05T08:15:12,671 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-01-05T08:15:12,676 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-01-05T08:15:12,681 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-01-05T08:15:12,682 adding license file 'LICENSE' 2026-01-05T08:15:12,726 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:12,827 Getting requirements to build wheel: finished with status 'done' 2026-01-05T08:15:12,830 Created temporary directory: /tmp/pip-modern-metadata-5tb4lth8 2026-01-05T08:15:12,832 Preparing metadata (pyproject.toml): started 2026-01-05T08:15:12,833 Running command Preparing metadata (pyproject.toml) 2026-01-05T08:15:13,450 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-01-05T08:15:13,450 !! 
2026-01-05T08:15:13,451 ******************************************************************************** 2026-01-05T08:15:13,451 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-01-05T08:15:13,452 By 2026-Feb-18, you need to update your project and remove deprecated calls 2026-01-05T08:15:13,453 or your builds will no longer be supported. 2026-01-05T08:15:13,454 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:13,454 ******************************************************************************** 2026-01-05T08:15:13,455 !! 2026-01-05T08:15:13,456 corresp(dist, value, root_dir) 2026-01-05T08:15:13,544 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-01-05T08:15:13,544 !! 2026-01-05T08:15:13,546 ******************************************************************************** 2026-01-05T08:15:13,546 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-01-05T08:15:13,547 License :: OSI Approved :: Apache Software License 2026-01-05T08:15:13,549 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:13,549 ******************************************************************************** 2026-01-05T08:15:13,551 !! 2026-01-05T08:15:13,551 dist._finalize_license_expression() 2026-01-05T08:15:13,553 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-01-05T08:15:13,554 !! 
2026-01-05T08:15:13,555 ******************************************************************************** 2026-01-05T08:15:13,556 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-01-05T08:15:13,557 License :: OSI Approved :: Apache Software License 2026-01-05T08:15:13,558 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:13,559 ******************************************************************************** 2026-01-05T08:15:13,560 !! 2026-01-05T08:15:13,561 self._finalize_license_expression() 2026-01-05T08:15:13,561 running dist_info 2026-01-05T08:15:13,573 creating /tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info 2026-01-05T08:15:13,574 writing /tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/PKG-INFO 2026-01-05T08:15:13,605 writing dependency_links to /tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/dependency_links.txt 2026-01-05T08:15:13,607 writing entry points to /tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/entry_points.txt 2026-01-05T08:15:13,624 writing requirements to /tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/requires.txt 2026-01-05T08:15:13,625 writing top-level names to /tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/top_level.txt 2026-01-05T08:15:13,627 writing manifest file '/tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:13,820 reading manifest file '/tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:13,822 reading manifest template 'MANIFEST.in' 2026-01-05T08:15:14,136 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-01-05T08:15:14,139 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-01-05T08:15:14,143 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-01-05T08:15:14,146 warning: no previously-included files 
matching '*.dylib' found anywhere in distribution 2026-01-05T08:15:14,147 adding license file 'LICENSE' 2026-01-05T08:15:14,182 writing manifest file '/tmp/pip-modern-metadata-5tb4lth8/evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:14,184 creating '/tmp/pip-modern-metadata-5tb4lth8/evalscope-1.4.1.dist-info' 2026-01-05T08:15:14,316 Preparing metadata (pyproject.toml): finished with status 'done' 2026-01-05T08:15:14,324 Source in /tmp/pip-wheel-4ehhxqua/evalscope_06f19ba56e88499fa5316444615b23b0 has version 1.4.1, which satisfies requirement evalscope==1.4.1 from https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz 2026-01-05T08:15:14,325 Removed evalscope==1.4.1 from https://files.pythonhosted.org/packages/1c/8d/4fa3d9a2a5443ba2df06147ebc057961b0550bef806c716b2552fe7c7f66/evalscope-1.4.1.tar.gz from build tracker '/tmp/pip-build-tracker-59oc0_e5' 2026-01-05T08:15:14,336 Created temporary directory: /tmp/pip-unpack-uyu7nklf 2026-01-05T08:15:14,337 Building wheels for collected packages: evalscope 2026-01-05T08:15:14,341 Created temporary directory: /tmp/pip-wheel-aqtrynwz 2026-01-05T08:15:14,342 Destination directory: /tmp/pip-wheel-aqtrynwz 2026-01-05T08:15:14,344 Building wheel for evalscope (pyproject.toml): started 2026-01-05T08:15:14,346 Running command Building wheel for evalscope (pyproject.toml) 2026-01-05T08:15:14,966 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-01-05T08:15:14,966 !! 2026-01-05T08:15:14,967 ******************************************************************************** 2026-01-05T08:15:14,968 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 
2026-01-05T08:15:14,969 By 2026-Feb-18, you need to update your project and remove deprecated calls 2026-01-05T08:15:14,970 or your builds will no longer be supported. 2026-01-05T08:15:14,971 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:14,971 ******************************************************************************** 2026-01-05T08:15:14,973 !! 2026-01-05T08:15:14,973 corresp(dist, value, root_dir) 2026-01-05T08:15:15,060 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-01-05T08:15:15,061 !! 2026-01-05T08:15:15,062 ******************************************************************************** 2026-01-05T08:15:15,062 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-01-05T08:15:15,064 License :: OSI Approved :: Apache Software License 2026-01-05T08:15:15,065 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-01-05T08:15:15,065 ******************************************************************************** 2026-01-05T08:15:15,066 !! 2026-01-05T08:15:15,066 dist._finalize_license_expression() 2026-01-05T08:15:15,070 /tmp/pip-build-env-k_pprhfw/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2026-01-05T08:15:15,070 !! 2026-01-05T08:15:15,071 ******************************************************************************** 2026-01-05T08:15:15,072 Please consider removing the following classifiers in favor of a SPDX license expression: 2026-01-05T08:15:15,072 License :: OSI Approved :: Apache Software License 2026-01-05T08:15:15,073 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 
2026-01-05T08:15:15,074 ******************************************************************************** 2026-01-05T08:15:15,075 !! 2026-01-05T08:15:15,075 self._finalize_license_expression() 2026-01-05T08:15:15,076 running bdist_wheel 2026-01-05T08:15:15,093 running build 2026-01-05T08:15:15,094 running build_py 2026-01-05T08:15:15,100 creating build/lib/evalscope 2026-01-05T08:15:15,102 copying evalscope/constants.py -> build/lib/evalscope 2026-01-05T08:15:15,105 copying evalscope/__init__.py -> build/lib/evalscope 2026-01-05T08:15:15,107 copying evalscope/version.py -> build/lib/evalscope 2026-01-05T08:15:15,109 copying evalscope/run.py -> build/lib/evalscope 2026-01-05T08:15:15,111 copying evalscope/arguments.py -> build/lib/evalscope 2026-01-05T08:15:15,113 copying evalscope/config.py -> build/lib/evalscope 2026-01-05T08:15:15,116 creating build/lib/evalscope/third_party 2026-01-05T08:15:15,117 copying evalscope/third_party/__init__.py -> build/lib/evalscope/third_party 2026-01-05T08:15:15,120 creating build/lib/evalscope/evaluator 2026-01-05T08:15:15,121 copying evalscope/evaluator/__init__.py -> build/lib/evalscope/evaluator 2026-01-05T08:15:15,123 copying evalscope/evaluator/evaluator.py -> build/lib/evalscope/evaluator 2026-01-05T08:15:15,126 creating build/lib/evalscope/summarizer 2026-01-05T08:15:15,127 copying evalscope/summarizer/__init__.py -> build/lib/evalscope/summarizer 2026-01-05T08:15:15,129 copying evalscope/summarizer/summarizer.py -> build/lib/evalscope/summarizer 2026-01-05T08:15:15,131 creating build/lib/evalscope/collections 2026-01-05T08:15:15,132 copying evalscope/collections/__init__.py -> build/lib/evalscope/collections 2026-01-05T08:15:15,134 copying evalscope/collections/schema.py -> build/lib/evalscope/collections 2026-01-05T08:15:15,137 copying evalscope/collections/sampler.py -> build/lib/evalscope/collections 2026-01-05T08:15:15,139 creating build/lib/evalscope/models 2026-01-05T08:15:15,140 copying 
evalscope/models/openai_compatible.py -> build/lib/evalscope/models 2026-01-05T08:15:15,143 copying evalscope/models/__init__.py -> build/lib/evalscope/models 2026-01-05T08:15:15,145 copying evalscope/models/model_apis.py -> build/lib/evalscope/models 2026-01-05T08:15:15,147 copying evalscope/models/modelscope.py -> build/lib/evalscope/models 2026-01-05T08:15:15,149 copying evalscope/models/text2image_model.py -> build/lib/evalscope/models 2026-01-05T08:15:15,151 copying evalscope/models/mockllm.py -> build/lib/evalscope/models 2026-01-05T08:15:15,153 copying evalscope/models/image_edit_model.py -> build/lib/evalscope/models 2026-01-05T08:15:15,156 creating build/lib/evalscope/service 2026-01-05T08:15:15,157 copying evalscope/service/__init__.py -> build/lib/evalscope/service 2026-01-05T08:15:15,159 copying evalscope/service/app.py -> build/lib/evalscope/service 2026-01-05T08:15:15,162 copying evalscope/service/utils.py -> build/lib/evalscope/service 2026-01-05T08:15:15,164 creating build/lib/evalscope/filters 2026-01-05T08:15:15,165 copying evalscope/filters/selection.py -> build/lib/evalscope/filters 2026-01-05T08:15:15,167 copying evalscope/filters/__init__.py -> build/lib/evalscope/filters 2026-01-05T08:15:15,169 copying evalscope/filters/extraction.py -> build/lib/evalscope/filters 2026-01-05T08:15:15,172 creating build/lib/evalscope/cli 2026-01-05T08:15:15,172 copying evalscope/cli/__init__.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,175 copying evalscope/cli/start_service.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,176 copying evalscope/cli/cli.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,178 copying evalscope/cli/start_perf.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,180 copying evalscope/cli/base.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,182 copying evalscope/cli/start_app.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,184 copying evalscope/cli/start_eval.py -> build/lib/evalscope/cli 2026-01-05T08:15:15,186 creating 
build/lib/evalscope/report 2026-01-05T08:15:15,187 copying evalscope/report/__init__.py -> build/lib/evalscope/report 2026-01-05T08:15:15,189 copying evalscope/report/combinator.py -> build/lib/evalscope/report 2026-01-05T08:15:15,191 copying evalscope/report/report.py -> build/lib/evalscope/report 2026-01-05T08:15:15,194 copying evalscope/report/generator.py -> build/lib/evalscope/report 2026-01-05T08:15:15,196 creating build/lib/evalscope/backend 2026-01-05T08:15:15,197 copying evalscope/backend/__init__.py -> build/lib/evalscope/backend 2026-01-05T08:15:15,199 copying evalscope/backend/base.py -> build/lib/evalscope/backend 2026-01-05T08:15:15,202 creating build/lib/evalscope/metrics 2026-01-05T08:15:15,203 copying evalscope/metrics/__init__.py -> build/lib/evalscope/metrics 2026-01-05T08:15:15,205 copying evalscope/metrics/metric.py -> build/lib/evalscope/metrics 2026-01-05T08:15:15,208 copying evalscope/metrics/rouge_metric.py -> build/lib/evalscope/metrics 2026-01-05T08:15:15,210 copying evalscope/metrics/llm_judge.py -> build/lib/evalscope/metrics 2026-01-05T08:15:15,212 copying evalscope/metrics/metrics.py -> build/lib/evalscope/metrics 2026-01-05T08:15:15,215 copying evalscope/metrics/math_parser.py -> build/lib/evalscope/metrics 2026-01-05T08:15:15,218 creating build/lib/evalscope/app 2026-01-05T08:15:15,219 copying evalscope/app/constants.py -> build/lib/evalscope/app 2026-01-05T08:15:15,221 copying evalscope/app/__init__.py -> build/lib/evalscope/app 2026-01-05T08:15:15,223 copying evalscope/app/arguments.py -> build/lib/evalscope/app 2026-01-05T08:15:15,225 copying evalscope/app/app.py -> build/lib/evalscope/app 2026-01-05T08:15:15,227 creating build/lib/evalscope/benchmarks 2026-01-05T08:15:15,228 copying evalscope/benchmarks/__init__.py -> build/lib/evalscope/benchmarks 2026-01-05T08:15:15,231 creating build/lib/evalscope/utils 2026-01-05T08:15:15,232 copying evalscope/utils/ner.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,235 copying 
evalscope/utils/deprecation_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,237 copying evalscope/utils/import_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,240 copying evalscope/utils/__init__.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,242 copying evalscope/utils/code_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,244 copying evalscope/utils/model_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,246 copying evalscope/utils/json_schema.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,249 copying evalscope/utils/multi_choices.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,252 copying evalscope/utils/resource_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,254 copying evalscope/utils/logger.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,256 copying evalscope/utils/chat_service.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,259 copying evalscope/utils/function_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,261 copying evalscope/utils/tqdm_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,263 copying evalscope/utils/argument_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,265 copying evalscope/utils/url_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,267 copying evalscope/utils/io_utils.py -> build/lib/evalscope/utils 2026-01-05T08:15:15,270 creating build/lib/evalscope/perf 2026-01-05T08:15:15,271 copying evalscope/perf/__init__.py -> build/lib/evalscope/perf 2026-01-05T08:15:15,273 copying evalscope/perf/benchmark.py -> build/lib/evalscope/perf 2026-01-05T08:15:15,275 copying evalscope/perf/main.py -> build/lib/evalscope/perf 2026-01-05T08:15:15,277 copying evalscope/perf/http_client.py -> build/lib/evalscope/perf 2026-01-05T08:15:15,280 copying evalscope/perf/arguments.py -> build/lib/evalscope/perf 2026-01-05T08:15:15,283 creating build/lib/evalscope/api 2026-01-05T08:15:15,284 copying evalscope/api/__init__.py -> build/lib/evalscope/api 
2026-01-05T08:15:15,286 copying evalscope/api/registry.py -> build/lib/evalscope/api 2026-01-05T08:15:15,288 creating build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:15,290 copying evalscope/third_party/longbench_write/__init__.py -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:15,292 copying evalscope/third_party/longbench_write/longbench_write.py -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:15,294 copying evalscope/third_party/longbench_write/utils.py -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:15,296 copying evalscope/third_party/longbench_write/eval.py -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:15,298 copying evalscope/third_party/longbench_write/infer.py -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:15,302 creating build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:15,303 copying evalscope/third_party/toolbench_static/toolbench_static.py -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:15,305 copying evalscope/third_party/toolbench_static/__init__.py -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:15,307 copying evalscope/third_party/toolbench_static/eval.py -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:15,309 copying evalscope/third_party/toolbench_static/infer.py -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:15,312 creating build/lib/evalscope/third_party/thinkbench 2026-01-05T08:15:15,314 copying evalscope/third_party/thinkbench/__init__.py -> build/lib/evalscope/third_party/thinkbench 2026-01-05T08:15:15,316 copying evalscope/third_party/thinkbench/eval.py -> build/lib/evalscope/third_party/thinkbench 2026-01-05T08:15:15,319 copying evalscope/third_party/thinkbench/infer.py -> build/lib/evalscope/third_party/thinkbench 2026-01-05T08:15:15,321 creating build/lib/evalscope/third_party/longbench_write/resources 
2026-01-05T08:15:15,322 copying evalscope/third_party/longbench_write/resources/__init__.py -> build/lib/evalscope/third_party/longbench_write/resources 2026-01-05T08:15:15,325 creating build/lib/evalscope/third_party/longbench_write/tools 2026-01-05T08:15:15,326 copying evalscope/third_party/longbench_write/tools/__init__.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-01-05T08:15:15,328 copying evalscope/third_party/longbench_write/tools/data_etl.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-01-05T08:15:15,330 copying evalscope/third_party/longbench_write/tools/openai_api.py -> build/lib/evalscope/third_party/longbench_write/tools 2026-01-05T08:15:15,333 creating build/lib/evalscope/third_party/toolbench_static/llm 2026-01-05T08:15:15,334 copying evalscope/third_party/toolbench_static/llm/__init__.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-01-05T08:15:15,337 copying evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/lib/evalscope/third_party/toolbench_static/llm 2026-01-05T08:15:15,339 creating build/lib/evalscope/third_party/thinkbench/tools 2026-01-05T08:15:15,340 copying evalscope/third_party/thinkbench/tools/__init__.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-01-05T08:15:15,342 copying evalscope/third_party/thinkbench/tools/llm.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-01-05T08:15:15,344 copying evalscope/third_party/thinkbench/tools/utils.py -> build/lib/evalscope/third_party/thinkbench/tools 2026-01-05T08:15:15,347 creating build/lib/evalscope/models/utils 2026-01-05T08:15:15,348 copying evalscope/models/utils/openai.py -> build/lib/evalscope/models/utils 2026-01-05T08:15:15,351 creating build/lib/evalscope/service/frontend 2026-01-05T08:15:15,352 copying evalscope/service/frontend/async_client.py -> build/lib/evalscope/service/frontend 2026-01-05T08:15:15,354 copying evalscope/service/frontend/__init__.py -> build/lib/evalscope/service/frontend 
2026-01-05T08:15:15,356 copying evalscope/service/frontend/main.py -> build/lib/evalscope/service/frontend 2026-01-05T08:15:15,358 copying evalscope/service/frontend/utils.py -> build/lib/evalscope/service/frontend 2026-01-05T08:15:15,361 creating build/lib/evalscope/backend/rag_eval 2026-01-05T08:15:15,362 copying evalscope/backend/rag_eval/__init__.py -> build/lib/evalscope/backend/rag_eval 2026-01-05T08:15:15,364 copying evalscope/backend/rag_eval/backend_manager.py -> build/lib/evalscope/backend/rag_eval 2026-01-05T08:15:15,366 creating build/lib/evalscope/backend/vlm_eval_kit 2026-01-05T08:15:15,367 copying evalscope/backend/vlm_eval_kit/__init__.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-01-05T08:15:15,369 copying evalscope/backend/vlm_eval_kit/backend_manager.py -> build/lib/evalscope/backend/vlm_eval_kit 2026-01-05T08:15:15,372 creating build/lib/evalscope/backend/opencompass 2026-01-05T08:15:15,373 copying evalscope/backend/opencompass/__init__.py -> build/lib/evalscope/backend/opencompass 2026-01-05T08:15:15,375 copying evalscope/backend/opencompass/api_meta_template.py -> build/lib/evalscope/backend/opencompass 2026-01-05T08:15:15,377 copying evalscope/backend/opencompass/backend_manager.py -> build/lib/evalscope/backend/opencompass 2026-01-05T08:15:15,380 creating build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:15,381 copying evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:15,383 copying evalscope/backend/rag_eval/clip_benchmark/arguments.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:15,384 copying evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:15,387 copying evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:15,389 creating 
build/lib/evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:15,390 copying evalscope/backend/rag_eval/cmteb/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:15,392 copying evalscope/backend/rag_eval/cmteb/arguments.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:15,394 copying evalscope/backend/rag_eval/cmteb/base.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:15,396 copying evalscope/backend/rag_eval/cmteb/task_template.py -> build/lib/evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:15,398 creating build/lib/evalscope/backend/rag_eval/ragas 2026-01-05T08:15:15,399 copying evalscope/backend/rag_eval/ragas/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-01-05T08:15:15,401 copying evalscope/backend/rag_eval/ragas/arguments.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-01-05T08:15:15,403 copying evalscope/backend/rag_eval/ragas/task_template.py -> build/lib/evalscope/backend/rag_eval/ragas 2026-01-05T08:15:15,406 creating build/lib/evalscope/backend/rag_eval/utils 2026-01-05T08:15:15,407 copying evalscope/backend/rag_eval/utils/__init__.py -> build/lib/evalscope/backend/rag_eval/utils 2026-01-05T08:15:15,408 copying evalscope/backend/rag_eval/utils/embedding.py -> build/lib/evalscope/backend/rag_eval/utils 2026-01-05T08:15:15,411 copying evalscope/backend/rag_eval/utils/llm.py -> build/lib/evalscope/backend/rag_eval/utils 2026-01-05T08:15:15,413 copying evalscope/backend/rag_eval/utils/tools.py -> build/lib/evalscope/backend/rag_eval/utils 2026-01-05T08:15:15,415 copying evalscope/backend/rag_eval/utils/clip.py -> build/lib/evalscope/backend/rag_eval/utils 2026-01-05T08:15:15,417 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:15,418 copying evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:15,420 copying 
evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:15,422 copying evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:15,425 copying evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:15,427 creating build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-01-05T08:15:15,428 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-01-05T08:15:15,431 creating build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,432 copying evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,434 copying evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,436 copying evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,439 copying evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,441 copying evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,443 copying evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,446 copying evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,447 copying evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/lib/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:15,450 creating build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-01-05T08:15:15,451 
copying evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/prompts 2026-01-05T08:15:15,454 creating build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:15,455 copying evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:15,457 copying evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:15,459 copying evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:15,461 copying evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:15,464 copying evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> build/lib/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:15,466 creating build/lib/evalscope/backend/opencompass/tasks 2026-01-05T08:15:15,467 copying evalscope/backend/opencompass/tasks/__init__.py -> build/lib/evalscope/backend/opencompass/tasks 2026-01-05T08:15:15,469 copying evalscope/backend/opencompass/tasks/eval_api.py -> build/lib/evalscope/backend/opencompass/tasks 2026-01-05T08:15:15,471 copying evalscope/backend/opencompass/tasks/eval_datasets.py -> build/lib/evalscope/backend/opencompass/tasks 2026-01-05T08:15:15,474 creating build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,475 copying evalscope/metrics/t2v_metrics/constants.py -> build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,477 copying evalscope/metrics/t2v_metrics/__init__.py -> build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,478 copying evalscope/metrics/t2v_metrics/score.py -> build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,480 copying evalscope/metrics/t2v_metrics/itmscore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,482 copying evalscope/metrics/t2v_metrics/clipscore.py 
-> build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,484 copying evalscope/metrics/t2v_metrics/vqascore.py -> build/lib/evalscope/metrics/t2v_metrics 2026-01-05T08:15:15,486 creating build/lib/evalscope/metrics/bert_score 2026-01-05T08:15:15,487 copying evalscope/metrics/bert_score/scorer.py -> build/lib/evalscope/metrics/bert_score 2026-01-05T08:15:15,490 copying evalscope/metrics/bert_score/__init__.py -> build/lib/evalscope/metrics/bert_score 2026-01-05T08:15:15,491 copying evalscope/metrics/bert_score/utils.py -> build/lib/evalscope/metrics/bert_score 2026-01-05T08:15:15,495 creating build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:15,496 copying evalscope/metrics/text_normalizer/basic.py -> build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:15,498 copying evalscope/metrics/text_normalizer/__init__.py -> build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:15,499 copying evalscope/metrics/text_normalizer/wer.py -> build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:15,502 copying evalscope/metrics/text_normalizer/chinese.py -> build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:15,505 copying evalscope/metrics/text_normalizer/english.py -> build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:15,508 creating build/lib/evalscope/metrics/bundled_rouge_score 2026-01-05T08:15:15,509 copying evalscope/metrics/bundled_rouge_score/__init__.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-01-05T08:15:15,511 copying evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/lib/evalscope/metrics/bundled_rouge_score 2026-01-05T08:15:15,514 creating build/lib/evalscope/metrics/sem_score 2026-01-05T08:15:15,515 copying evalscope/metrics/sem_score/scorer.py -> build/lib/evalscope/metrics/sem_score 2026-01-05T08:15:15,517 copying evalscope/metrics/sem_score/__init__.py -> build/lib/evalscope/metrics/sem_score 2026-01-05T08:15:15,520 creating build/lib/evalscope/metrics/t2v_metrics/models 
2026-01-05T08:15:15,521 copying evalscope/metrics/t2v_metrics/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-01-05T08:15:15,522 copying evalscope/metrics/t2v_metrics/models/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-01-05T08:15:15,524 copying evalscope/metrics/t2v_metrics/models/model.py -> build/lib/evalscope/metrics/t2v_metrics/models 2026-01-05T08:15:15,526 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:15,527 copying evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:15,530 copying evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:15,532 copying evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:15,534 copying evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:15,536 copying evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:15,539 creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:15,540 copying evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:15,542 copying evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:15,544 copying evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:15,546 copying 
evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:15,548 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:15,549 copying evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:15,551 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:15,553 copying evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:15,555 copying evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:15,557 copying evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:15,560 creating build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:15,562 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:15,563 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:15,566 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:15,568 copying evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:15,571 
creating build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:15,572 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:15,574 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:15,576 copying evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:15,579 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-01-05T08:15:15,580 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-01-05T08:15:15,582 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-01-05T08:15:15,583 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-01-05T08:15:15,586 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-01-05T08:15:15,587 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-01-05T08:15:15,590 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-01-05T08:15:15,591 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-01-05T08:15:15,593 creating 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-01-05T08:15:15,594 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-01-05T08:15:15,598 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-01-05T08:15:15,599 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-01-05T08:15:15,601 copying evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-01-05T08:15:15,604 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:15,605 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:15,608 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:15,611 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:15,614 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:15,616 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,617 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,620 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,622 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,625 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,628 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,631 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:15,635 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,636 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,638 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,641 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,643 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,645 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,647 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,649 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:15,652 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,653 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,657 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,659 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,662 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,665 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,668 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,671 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,673 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,676 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,679 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:15,682 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,683 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,686 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,688 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,691 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,693 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,696 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,698 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,700 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,703 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,705 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,708 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:15,712 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:15,713 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:15,715 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:15,717 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:15,721 creating build/lib/evalscope/app/ui 2026-01-05T08:15:15,722 copying evalscope/app/ui/__init__.py -> build/lib/evalscope/app/ui 2026-01-05T08:15:15,724 copying evalscope/app/ui/sidebar.py -> build/lib/evalscope/app/ui 2026-01-05T08:15:15,726 copying evalscope/app/ui/single_model.py -> build/lib/evalscope/app/ui 2026-01-05T08:15:15,729 copying evalscope/app/ui/app_ui.py -> build/lib/evalscope/app/ui 2026-01-05T08:15:15,731 copying evalscope/app/ui/visualization.py -> build/lib/evalscope/app/ui 2026-01-05T08:15:15,733 copying evalscope/app/ui/multi_model.py -> build/lib/evalscope/app/ui 2026-01-05T08:15:15,736 creating build/lib/evalscope/app/utils 2026-01-05T08:15:15,737 copying evalscope/app/utils/localization.py -> build/lib/evalscope/app/utils 2026-01-05T08:15:15,740 copying evalscope/app/utils/env_utils.py -> build/lib/evalscope/app/utils 2026-01-05T08:15:15,742 copying evalscope/app/utils/text_utils.py -> build/lib/evalscope/app/utils 2026-01-05T08:15:15,744 copying evalscope/app/utils/data_utils.py -> build/lib/evalscope/app/utils 2026-01-05T08:15:15,747 copying evalscope/app/utils/visualization.py -> build/lib/evalscope/app/utils 2026-01-05T08:15:15,749 creating build/lib/evalscope/benchmarks/math_verse 2026-01-05T08:15:15,750 copying evalscope/benchmarks/math_verse/__init__.py -> build/lib/evalscope/benchmarks/math_verse 2026-01-05T08:15:15,752 copying evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/lib/evalscope/benchmarks/math_verse 2026-01-05T08:15:15,755 creating build/lib/evalscope/benchmarks/process_bench 2026-01-05T08:15:15,756 copying evalscope/benchmarks/process_bench/__init__.py -> build/lib/evalscope/benchmarks/process_bench 2026-01-05T08:15:15,757 copying evalscope/benchmarks/process_bench/process_bench_adapter.py -> 
build/lib/evalscope/benchmarks/process_bench 2026-01-05T08:15:15,760 creating build/lib/evalscope/benchmarks/docvqa 2026-01-05T08:15:15,761 copying evalscope/benchmarks/docvqa/__init__.py -> build/lib/evalscope/benchmarks/docvqa 2026-01-05T08:15:15,763 copying evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/lib/evalscope/benchmarks/docvqa 2026-01-05T08:15:15,765 creating build/lib/evalscope/benchmarks/arena_hard 2026-01-05T08:15:15,766 copying evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/lib/evalscope/benchmarks/arena_hard 2026-01-05T08:15:15,769 copying evalscope/benchmarks/arena_hard/__init__.py -> build/lib/evalscope/benchmarks/arena_hard 2026-01-05T08:15:15,771 copying evalscope/benchmarks/arena_hard/utils.py -> build/lib/evalscope/benchmarks/arena_hard 2026-01-05T08:15:15,773 creating build/lib/evalscope/benchmarks/mri_mcqa 2026-01-05T08:15:15,774 copying evalscope/benchmarks/mri_mcqa/__init__.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-01-05T08:15:15,776 copying evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/lib/evalscope/benchmarks/mri_mcqa 2026-01-05T08:15:15,778 creating build/lib/evalscope/benchmarks/pope 2026-01-05T08:15:15,779 copying evalscope/benchmarks/pope/__init__.py -> build/lib/evalscope/benchmarks/pope 2026-01-05T08:15:15,781 copying evalscope/benchmarks/pope/pope_adapter.py -> build/lib/evalscope/benchmarks/pope 2026-01-05T08:15:15,783 creating build/lib/evalscope/benchmarks/zerobench 2026-01-05T08:15:15,784 copying evalscope/benchmarks/zerobench/__init__.py -> build/lib/evalscope/benchmarks/zerobench 2026-01-05T08:15:15,786 copying evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/lib/evalscope/benchmarks/zerobench 2026-01-05T08:15:15,788 creating build/lib/evalscope/benchmarks/general_mcq 2026-01-05T08:15:15,789 copying evalscope/benchmarks/general_mcq/__init__.py -> build/lib/evalscope/benchmarks/general_mcq 2026-01-05T08:15:15,791 copying 
evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/lib/evalscope/benchmarks/general_mcq 2026-01-05T08:15:15,793 creating build/lib/evalscope/benchmarks/mmlu_redux 2026-01-05T08:15:15,794 copying evalscope/benchmarks/mmlu_redux/__init__.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-01-05T08:15:15,796 copying evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/lib/evalscope/benchmarks/mmlu_redux 2026-01-05T08:15:15,799 creating build/lib/evalscope/benchmarks/vstar_bench 2026-01-05T08:15:15,800 copying evalscope/benchmarks/vstar_bench/__init__.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-01-05T08:15:15,802 copying evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/lib/evalscope/benchmarks/vstar_bench 2026-01-05T08:15:15,804 creating build/lib/evalscope/benchmarks/image_edit 2026-01-05T08:15:15,805 copying evalscope/benchmarks/image_edit/__init__.py -> build/lib/evalscope/benchmarks/image_edit 2026-01-05T08:15:15,808 creating build/lib/evalscope/benchmarks/humaneval 2026-01-05T08:15:15,809 copying evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/lib/evalscope/benchmarks/humaneval 2026-01-05T08:15:15,812 copying evalscope/benchmarks/humaneval/__init__.py -> build/lib/evalscope/benchmarks/humaneval 2026-01-05T08:15:15,814 copying evalscope/benchmarks/humaneval/utils.py -> build/lib/evalscope/benchmarks/humaneval 2026-01-05T08:15:15,817 creating build/lib/evalscope/benchmarks/arc 2026-01-05T08:15:15,818 copying evalscope/benchmarks/arc/__init__.py -> build/lib/evalscope/benchmarks/arc 2026-01-05T08:15:15,820 copying evalscope/benchmarks/arc/arc_adapter.py -> build/lib/evalscope/benchmarks/arc 2026-01-05T08:15:15,822 creating build/lib/evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:15,824 copying evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:15,826 copying evalscope/benchmarks/olympiad_bench/__init__.py -> 
build/lib/evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:15,828 copying evalscope/benchmarks/olympiad_bench/utils.py -> build/lib/evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:15,831 creating build/lib/evalscope/benchmarks/siqa 2026-01-05T08:15:15,832 copying evalscope/benchmarks/siqa/__init__.py -> build/lib/evalscope/benchmarks/siqa 2026-01-05T08:15:15,834 copying evalscope/benchmarks/siqa/siqa_adapter.py -> build/lib/evalscope/benchmarks/siqa 2026-01-05T08:15:15,836 creating build/lib/evalscope/benchmarks/healthbench 2026-01-05T08:15:15,837 copying evalscope/benchmarks/healthbench/__init__.py -> build/lib/evalscope/benchmarks/healthbench 2026-01-05T08:15:15,839 copying evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/lib/evalscope/benchmarks/healthbench 2026-01-05T08:15:15,841 copying evalscope/benchmarks/healthbench/utils.py -> build/lib/evalscope/benchmarks/healthbench 2026-01-05T08:15:15,844 creating build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,845 copying evalscope/benchmarks/ifbench/evaluation_lib.py -> build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,847 copying evalscope/benchmarks/ifbench/__init__.py -> build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,849 copying evalscope/benchmarks/ifbench/instructions_util.py -> build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,852 copying evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,854 copying evalscope/benchmarks/ifbench/instructions_registry.py -> build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,856 copying evalscope/benchmarks/ifbench/instructions.py -> build/lib/evalscope/benchmarks/ifbench 2026-01-05T08:15:15,860 creating build/lib/evalscope/benchmarks/aa_lcr 2026-01-05T08:15:15,861 copying evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/lib/evalscope/benchmarks/aa_lcr 2026-01-05T08:15:15,863 copying evalscope/benchmarks/aa_lcr/__init__.py -> 
build/lib/evalscope/benchmarks/aa_lcr 2026-01-05T08:15:15,865 creating build/lib/evalscope/benchmarks/wmt 2026-01-05T08:15:15,866 copying evalscope/benchmarks/wmt/__init__.py -> build/lib/evalscope/benchmarks/wmt 2026-01-05T08:15:15,868 copying evalscope/benchmarks/wmt/wmt24_adapter.py -> build/lib/evalscope/benchmarks/wmt 2026-01-05T08:15:15,871 creating build/lib/evalscope/benchmarks/eq_bench 2026-01-05T08:15:15,872 copying evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/lib/evalscope/benchmarks/eq_bench 2026-01-05T08:15:15,875 copying evalscope/benchmarks/eq_bench/__init__.py -> build/lib/evalscope/benchmarks/eq_bench 2026-01-05T08:15:15,877 copying evalscope/benchmarks/eq_bench/answer_validation.py -> build/lib/evalscope/benchmarks/eq_bench 2026-01-05T08:15:15,879 creating build/lib/evalscope/benchmarks/frames 2026-01-05T08:15:15,881 copying evalscope/benchmarks/frames/__init__.py -> build/lib/evalscope/benchmarks/frames 2026-01-05T08:15:15,882 copying evalscope/benchmarks/frames/utils.py -> build/lib/evalscope/benchmarks/frames 2026-01-05T08:15:15,884 copying evalscope/benchmarks/frames/frames_adapter.py -> build/lib/evalscope/benchmarks/frames 2026-01-05T08:15:15,887 creating build/lib/evalscope/benchmarks/tau_bench 2026-01-05T08:15:15,889 copying evalscope/benchmarks/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench 2026-01-05T08:15:15,891 creating build/lib/evalscope/benchmarks/general_vmcq 2026-01-05T08:15:15,892 copying evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-01-05T08:15:15,895 copying evalscope/benchmarks/general_vmcq/__init__.py -> build/lib/evalscope/benchmarks/general_vmcq 2026-01-05T08:15:15,897 creating build/lib/evalscope/benchmarks/cmmlu 2026-01-05T08:15:15,898 copying evalscope/benchmarks/cmmlu/__init__.py -> build/lib/evalscope/benchmarks/cmmlu 2026-01-05T08:15:15,900 copying evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> 
build/lib/evalscope/benchmarks/cmmlu 2026-01-05T08:15:15,903 creating build/lib/evalscope/benchmarks/mmlu 2026-01-05T08:15:15,904 copying evalscope/benchmarks/mmlu/__init__.py -> build/lib/evalscope/benchmarks/mmlu 2026-01-05T08:15:15,906 copying evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/lib/evalscope/benchmarks/mmlu 2026-01-05T08:15:15,908 creating build/lib/evalscope/benchmarks/math_vision 2026-01-05T08:15:15,910 copying evalscope/benchmarks/math_vision/__init__.py -> build/lib/evalscope/benchmarks/math_vision 2026-01-05T08:15:15,911 copying evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/lib/evalscope/benchmarks/math_vision 2026-01-05T08:15:15,915 creating build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,916 copying evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,918 copying evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,920 copying evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,922 copying evalscope/benchmarks/ner/copious_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,924 copying evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,926 copying evalscope/benchmarks/ner/bc2gm_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,927 copying evalscope/benchmarks/ner/__init__.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,929 copying evalscope/benchmarks/ner/wnut2017_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,931 copying evalscope/benchmarks/ner/anat_em_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,933 copying evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,935 copying evalscope/benchmarks/ner/bc5cdr_adapter.py -> build/lib/evalscope/benchmarks/ner 
2026-01-05T08:15:15,936 copying evalscope/benchmarks/ner/fin_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,938 copying evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,940 copying evalscope/benchmarks/ner/ncbi_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,942 copying evalscope/benchmarks/ner/jnlpba_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,944 copying evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,946 copying evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,948 copying evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,950 copying evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,952 copying evalscope/benchmarks/ner/genia_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,954 copying evalscope/benchmarks/ner/conll2003_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,956 copying evalscope/benchmarks/ner/conllpp_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,958 copying evalscope/benchmarks/ner/cross_ner_adapter.py -> build/lib/evalscope/benchmarks/ner 2026-01-05T08:15:15,961 creating build/lib/evalscope/benchmarks/minerva_math 2026-01-05T08:15:15,962 copying evalscope/benchmarks/minerva_math/__init__.py -> build/lib/evalscope/benchmarks/minerva_math 2026-01-05T08:15:15,964 copying evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/lib/evalscope/benchmarks/minerva_math 2026-01-05T08:15:15,967 creating build/lib/evalscope/benchmarks/biomix_qa 2026-01-05T08:15:15,968 copying evalscope/benchmarks/biomix_qa/__init__.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-01-05T08:15:15,970 copying 
evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/lib/evalscope/benchmarks/biomix_qa 2026-01-05T08:15:15,972 creating build/lib/evalscope/benchmarks/science_qa 2026-01-05T08:15:15,973 copying evalscope/benchmarks/science_qa/__init__.py -> build/lib/evalscope/benchmarks/science_qa 2026-01-05T08:15:15,974 copying evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/lib/evalscope/benchmarks/science_qa 2026-01-05T08:15:15,977 creating build/lib/evalscope/benchmarks/a_okvqa 2026-01-05T08:15:15,978 copying evalscope/benchmarks/a_okvqa/__init__.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-01-05T08:15:15,979 copying evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/lib/evalscope/benchmarks/a_okvqa 2026-01-05T08:15:15,982 creating build/lib/evalscope/benchmarks/hellaswag 2026-01-05T08:15:15,983 copying evalscope/benchmarks/hellaswag/__init__.py -> build/lib/evalscope/benchmarks/hellaswag 2026-01-05T08:15:15,984 copying evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/lib/evalscope/benchmarks/hellaswag 2026-01-05T08:15:15,987 creating build/lib/evalscope/benchmarks/poly_math 2026-01-05T08:15:15,988 copying evalscope/benchmarks/poly_math/__init__.py -> build/lib/evalscope/benchmarks/poly_math 2026-01-05T08:15:15,989 copying evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/lib/evalscope/benchmarks/poly_math 2026-01-05T08:15:15,992 creating build/lib/evalscope/benchmarks/simple_vqa 2026-01-05T08:15:15,993 copying evalscope/benchmarks/simple_vqa/__init__.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-01-05T08:15:15,994 copying evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/lib/evalscope/benchmarks/simple_vqa 2026-01-05T08:15:15,997 creating build/lib/evalscope/benchmarks/hallusion_bench 2026-01-05T08:15:15,998 copying evalscope/benchmarks/hallusion_bench/__init__.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-01-05T08:15:16,000 copying 
evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/lib/evalscope/benchmarks/hallusion_bench 2026-01-05T08:15:16,003 creating build/lib/evalscope/benchmarks/needle_haystack 2026-01-05T08:15:16,004 copying evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-01-05T08:15:16,007 copying evalscope/benchmarks/needle_haystack/__init__.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-01-05T08:15:16,008 copying evalscope/benchmarks/needle_haystack/utils.py -> build/lib/evalscope/benchmarks/needle_haystack 2026-01-05T08:15:16,011 creating build/lib/evalscope/benchmarks/terminal_bench 2026-01-05T08:15:16,012 copying evalscope/benchmarks/terminal_bench/__init__.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-01-05T08:15:16,014 copying evalscope/benchmarks/terminal_bench/utils.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-01-05T08:15:16,015 copying evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/lib/evalscope/benchmarks/terminal_bench 2026-01-05T08:15:16,018 creating build/lib/evalscope/benchmarks/refcoco 2026-01-05T08:15:16,019 copying evalscope/benchmarks/refcoco/evaluation_lib.py -> build/lib/evalscope/benchmarks/refcoco 2026-01-05T08:15:16,021 copying evalscope/benchmarks/refcoco/__init__.py -> build/lib/evalscope/benchmarks/refcoco 2026-01-05T08:15:16,023 copying evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/lib/evalscope/benchmarks/refcoco 2026-01-05T08:15:16,025 copying evalscope/benchmarks/refcoco/utils.py -> build/lib/evalscope/benchmarks/refcoco 2026-01-05T08:15:16,027 creating build/lib/evalscope/benchmarks/piqa 2026-01-05T08:15:16,028 copying evalscope/benchmarks/piqa/__init__.py -> build/lib/evalscope/benchmarks/piqa 2026-01-05T08:15:16,030 copying evalscope/benchmarks/piqa/piqa_adapter.py -> build/lib/evalscope/benchmarks/piqa 2026-01-05T08:15:16,032 creating build/lib/evalscope/benchmarks/truthful_qa 
2026-01-05T08:15:16,034 copying evalscope/benchmarks/truthful_qa/__init__.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-01-05T08:15:16,036 copying evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/lib/evalscope/benchmarks/truthful_qa 2026-01-05T08:15:16,038 creating build/lib/evalscope/benchmarks/ceval 2026-01-05T08:15:16,039 copying evalscope/benchmarks/ceval/__init__.py -> build/lib/evalscope/benchmarks/ceval 2026-01-05T08:15:16,040 copying evalscope/benchmarks/ceval/ceval_adapter.py -> build/lib/evalscope/benchmarks/ceval 2026-01-05T08:15:16,043 creating build/lib/evalscope/benchmarks/chartqa 2026-01-05T08:15:16,044 copying evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/lib/evalscope/benchmarks/chartqa 2026-01-05T08:15:16,046 copying evalscope/benchmarks/chartqa/__init__.py -> build/lib/evalscope/benchmarks/chartqa 2026-01-05T08:15:16,047 copying evalscope/benchmarks/chartqa/utils.py -> build/lib/evalscope/benchmarks/chartqa 2026-01-05T08:15:16,049 creating build/lib/evalscope/benchmarks/librispeech 2026-01-05T08:15:16,050 copying evalscope/benchmarks/librispeech/__init__.py -> build/lib/evalscope/benchmarks/librispeech 2026-01-05T08:15:16,052 copying evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/lib/evalscope/benchmarks/librispeech 2026-01-05T08:15:16,054 creating build/lib/evalscope/benchmarks/med_mcqa 2026-01-05T08:15:16,055 copying evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-01-05T08:15:16,057 copying evalscope/benchmarks/med_mcqa/__init__.py -> build/lib/evalscope/benchmarks/med_mcqa 2026-01-05T08:15:16,059 creating build/lib/evalscope/benchmarks/general_arena 2026-01-05T08:15:16,060 copying evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/lib/evalscope/benchmarks/general_arena 2026-01-05T08:15:16,062 copying evalscope/benchmarks/general_arena/__init__.py -> build/lib/evalscope/benchmarks/general_arena 2026-01-05T08:15:16,064 copying 
evalscope/benchmarks/general_arena/utils.py -> build/lib/evalscope/benchmarks/general_arena 2026-01-05T08:15:16,067 creating build/lib/evalscope/benchmarks/mmmu 2026-01-05T08:15:16,068 copying evalscope/benchmarks/mmmu/__init__.py -> build/lib/evalscope/benchmarks/mmmu 2026-01-05T08:15:16,070 copying evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/lib/evalscope/benchmarks/mmmu 2026-01-05T08:15:16,072 creating build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,073 copying evalscope/benchmarks/ifeval/__init__.py -> build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,075 copying evalscope/benchmarks/ifeval/instructions_util.py -> build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,078 copying evalscope/benchmarks/ifeval/utils.py -> build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,080 copying evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,082 copying evalscope/benchmarks/ifeval/instructions_registry.py -> build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,084 copying evalscope/benchmarks/ifeval/instructions.py -> build/lib/evalscope/benchmarks/ifeval 2026-01-05T08:15:16,088 creating build/lib/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:16,089 copying evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:16,092 copying evalscope/benchmarks/omnidoc_bench/__init__.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:16,094 copying evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:16,096 copying evalscope/benchmarks/omnidoc_bench/metrics.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:16,099 copying evalscope/benchmarks/omnidoc_bench/utils.py -> build/lib/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:16,103 creating build/lib/evalscope/benchmarks/blink 2026-01-05T08:15:16,104 copying 
evalscope/benchmarks/blink/__init__.py -> build/lib/evalscope/benchmarks/blink 2026-01-05T08:15:16,106 copying evalscope/benchmarks/blink/blink_adapter.py -> build/lib/evalscope/benchmarks/blink 2026-01-05T08:15:16,108 creating build/lib/evalscope/benchmarks/infovqa 2026-01-05T08:15:16,109 copying evalscope/benchmarks/infovqa/__init__.py -> build/lib/evalscope/benchmarks/infovqa 2026-01-05T08:15:16,111 copying evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/lib/evalscope/benchmarks/infovqa 2026-01-05T08:15:16,113 creating build/lib/evalscope/benchmarks/real_world_qa 2026-01-05T08:15:16,114 copying evalscope/benchmarks/real_world_qa/__init__.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-01-05T08:15:16,116 copying evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/lib/evalscope/benchmarks/real_world_qa 2026-01-05T08:15:16,118 creating build/lib/evalscope/benchmarks/gpqa 2026-01-05T08:15:16,119 copying evalscope/benchmarks/gpqa/__init__.py -> build/lib/evalscope/benchmarks/gpqa 2026-01-05T08:15:16,121 copying evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/lib/evalscope/benchmarks/gpqa 2026-01-05T08:15:16,123 copying evalscope/benchmarks/gpqa/prompt.py -> build/lib/evalscope/benchmarks/gpqa 2026-01-05T08:15:16,125 creating build/lib/evalscope/benchmarks/fleurs 2026-01-05T08:15:16,126 copying evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/lib/evalscope/benchmarks/fleurs 2026-01-05T08:15:16,129 copying evalscope/benchmarks/fleurs/__init__.py -> build/lib/evalscope/benchmarks/fleurs 2026-01-05T08:15:16,131 creating build/lib/evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:16,132 copying evalscope/benchmarks/zebralogicbench/__init__.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:16,134 copying evalscope/benchmarks/zebralogicbench/utils.py -> build/lib/evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:16,136 copying evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> 
build/lib/evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:16,139 creating build/lib/evalscope/benchmarks/cmmmu 2026-01-05T08:15:16,140 copying evalscope/benchmarks/cmmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmmu 2026-01-05T08:15:16,142 copying evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmmu 2026-01-05T08:15:16,144 copying evalscope/benchmarks/cmmmu/utils.py -> build/lib/evalscope/benchmarks/cmmmu 2026-01-05T08:15:16,147 creating build/lib/evalscope/benchmarks/ai2d 2026-01-05T08:15:16,148 copying evalscope/benchmarks/ai2d/__init__.py -> build/lib/evalscope/benchmarks/ai2d 2026-01-05T08:15:16,150 copying evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/lib/evalscope/benchmarks/ai2d 2026-01-05T08:15:16,152 creating build/lib/evalscope/benchmarks/bbh 2026-01-05T08:15:16,153 copying evalscope/benchmarks/bbh/__init__.py -> build/lib/evalscope/benchmarks/bbh 2026-01-05T08:15:16,155 copying evalscope/benchmarks/bbh/bbh_adapter.py -> build/lib/evalscope/benchmarks/bbh 2026-01-05T08:15:16,158 creating build/lib/evalscope/benchmarks/swe_bench 2026-01-05T08:15:16,159 copying evalscope/benchmarks/swe_bench/__init__.py -> build/lib/evalscope/benchmarks/swe_bench 2026-01-05T08:15:16,161 copying evalscope/benchmarks/swe_bench/build_images.py -> build/lib/evalscope/benchmarks/swe_bench 2026-01-05T08:15:16,163 copying evalscope/benchmarks/swe_bench/utils.py -> build/lib/evalscope/benchmarks/swe_bench 2026-01-05T08:15:16,165 copying evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/lib/evalscope/benchmarks/swe_bench 2026-01-05T08:15:16,168 creating build/lib/evalscope/benchmarks/drop 2026-01-05T08:15:16,170 copying evalscope/benchmarks/drop/__init__.py -> build/lib/evalscope/benchmarks/drop 2026-01-05T08:15:16,171 copying evalscope/benchmarks/drop/utils.py -> build/lib/evalscope/benchmarks/drop 2026-01-05T08:15:16,173 copying evalscope/benchmarks/drop/drop_adapter.py -> build/lib/evalscope/benchmarks/drop 
2026-01-05T08:15:16,177 creating build/lib/evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:16,178 copying evalscope/benchmarks/openai_mrcr/__init__.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:16,179 copying evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:16,182 copying evalscope/benchmarks/openai_mrcr/utils.py -> build/lib/evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:16,184 creating build/lib/evalscope/benchmarks/math_500 2026-01-05T08:15:16,185 copying evalscope/benchmarks/math_500/__init__.py -> build/lib/evalscope/benchmarks/math_500 2026-01-05T08:15:16,187 copying evalscope/benchmarks/math_500/math_500_adapter.py -> build/lib/evalscope/benchmarks/math_500 2026-01-05T08:15:16,189 creating build/lib/evalscope/benchmarks/race 2026-01-05T08:15:16,190 copying evalscope/benchmarks/race/__init__.py -> build/lib/evalscope/benchmarks/race 2026-01-05T08:15:16,192 copying evalscope/benchmarks/race/race_adapter.py -> build/lib/evalscope/benchmarks/race 2026-01-05T08:15:16,194 creating build/lib/evalscope/benchmarks/math_qa 2026-01-05T08:15:16,195 copying evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/lib/evalscope/benchmarks/math_qa 2026-01-05T08:15:16,197 copying evalscope/benchmarks/math_qa/__init__.py -> build/lib/evalscope/benchmarks/math_qa 2026-01-05T08:15:16,199 creating build/lib/evalscope/benchmarks/chinese_simple_qa 2026-01-05T08:15:16,200 copying evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-01-05T08:15:16,203 copying evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/lib/evalscope/benchmarks/chinese_simple_qa 2026-01-05T08:15:16,205 creating build/lib/evalscope/benchmarks/visu_logic 2026-01-05T08:15:16,206 copying evalscope/benchmarks/visu_logic/__init__.py -> build/lib/evalscope/benchmarks/visu_logic 2026-01-05T08:15:16,207 copying 
evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/lib/evalscope/benchmarks/visu_logic 2026-01-05T08:15:16,210 creating build/lib/evalscope/benchmarks/musr 2026-01-05T08:15:16,211 copying evalscope/benchmarks/musr/__init__.py -> build/lib/evalscope/benchmarks/musr 2026-01-05T08:15:16,212 copying evalscope/benchmarks/musr/musr_adapter.py -> build/lib/evalscope/benchmarks/musr 2026-01-05T08:15:16,215 creating build/lib/evalscope/benchmarks/sciq 2026-01-05T08:15:16,216 copying evalscope/benchmarks/sciq/__init__.py -> build/lib/evalscope/benchmarks/sciq 2026-01-05T08:15:16,217 copying evalscope/benchmarks/sciq/sciq_adapter.py -> build/lib/evalscope/benchmarks/sciq 2026-01-05T08:15:16,220 creating build/lib/evalscope/benchmarks/trivia_qa 2026-01-05T08:15:16,221 copying evalscope/benchmarks/trivia_qa/__init__.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-01-05T08:15:16,223 copying evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/lib/evalscope/benchmarks/trivia_qa 2026-01-05T08:15:16,225 creating build/lib/evalscope/benchmarks/commonsense_qa 2026-01-05T08:15:16,226 copying evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-01-05T08:15:16,228 copying evalscope/benchmarks/commonsense_qa/__init__.py -> build/lib/evalscope/benchmarks/commonsense_qa 2026-01-05T08:15:16,230 creating build/lib/evalscope/benchmarks/general_fc 2026-01-05T08:15:16,231 copying evalscope/benchmarks/general_fc/__init__.py -> build/lib/evalscope/benchmarks/general_fc 2026-01-05T08:15:16,233 copying evalscope/benchmarks/general_fc/general_fc_adapter.py -> build/lib/evalscope/benchmarks/general_fc 2026-01-05T08:15:16,235 creating build/lib/evalscope/benchmarks/logi_qa 2026-01-05T08:15:16,236 copying evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/lib/evalscope/benchmarks/logi_qa 2026-01-05T08:15:16,238 copying evalscope/benchmarks/logi_qa/__int__.py -> build/lib/evalscope/benchmarks/logi_qa 
2026-01-05T08:15:16,240 creating build/lib/evalscope/benchmarks/pumed_qa 2026-01-05T08:15:16,242 copying evalscope/benchmarks/pumed_qa/__init__.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-01-05T08:15:16,243 copying evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/lib/evalscope/benchmarks/pumed_qa 2026-01-05T08:15:16,246 creating build/lib/evalscope/benchmarks/mbpp 2026-01-05T08:15:16,247 copying evalscope/benchmarks/mbpp/__init__.py -> build/lib/evalscope/benchmarks/mbpp 2026-01-05T08:15:16,249 copying evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/lib/evalscope/benchmarks/mbpp 2026-01-05T08:15:16,251 creating build/lib/evalscope/benchmarks/simple_qa 2026-01-05T08:15:16,252 copying evalscope/benchmarks/simple_qa/__init__.py -> build/lib/evalscope/benchmarks/simple_qa 2026-01-05T08:15:16,254 copying evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/lib/evalscope/benchmarks/simple_qa 2026-01-05T08:15:16,257 creating build/lib/evalscope/benchmarks/micro_vqa 2026-01-05T08:15:16,258 copying evalscope/benchmarks/micro_vqa/__init__.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-01-05T08:15:16,259 copying evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> build/lib/evalscope/benchmarks/micro_vqa 2026-01-05T08:15:16,262 creating build/lib/evalscope/benchmarks/aime 2026-01-05T08:15:16,263 copying evalscope/benchmarks/aime/__init__.py -> build/lib/evalscope/benchmarks/aime 2026-01-05T08:15:16,264 copying evalscope/benchmarks/aime/grader.py -> build/lib/evalscope/benchmarks/aime 2026-01-05T08:15:16,267 copying evalscope/benchmarks/aime/aime24_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-01-05T08:15:16,269 copying evalscope/benchmarks/aime/aime25_adapter.py -> build/lib/evalscope/benchmarks/aime 2026-01-05T08:15:16,271 copying evalscope/benchmarks/aime/math_normalize.py -> build/lib/evalscope/benchmarks/aime 2026-01-05T08:15:16,274 creating build/lib/evalscope/benchmarks/mm_bench 2026-01-05T08:15:16,275 copying 
evalscope/benchmarks/mm_bench/__init__.py -> build/lib/evalscope/benchmarks/mm_bench 2026-01-05T08:15:16,276 copying evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/lib/evalscope/benchmarks/mm_bench 2026-01-05T08:15:16,279 creating build/lib/evalscope/benchmarks/winogrande 2026-01-05T08:15:16,280 copying evalscope/benchmarks/winogrande/__init__.py -> build/lib/evalscope/benchmarks/winogrande 2026-01-05T08:15:16,282 copying evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/lib/evalscope/benchmarks/winogrande 2026-01-05T08:15:16,284 creating build/lib/evalscope/benchmarks/docmath 2026-01-05T08:15:16,285 copying evalscope/benchmarks/docmath/__init__.py -> build/lib/evalscope/benchmarks/docmath 2026-01-05T08:15:16,287 copying evalscope/benchmarks/docmath/utils.py -> build/lib/evalscope/benchmarks/docmath 2026-01-05T08:15:16,289 copying evalscope/benchmarks/docmath/docmath_adapter.py -> build/lib/evalscope/benchmarks/docmath 2026-01-05T08:15:16,292 creating build/lib/evalscope/benchmarks/general_qa 2026-01-05T08:15:16,293 copying evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/lib/evalscope/benchmarks/general_qa 2026-01-05T08:15:16,295 copying evalscope/benchmarks/general_qa/__init__.py -> build/lib/evalscope/benchmarks/general_qa 2026-01-05T08:15:16,297 creating build/lib/evalscope/benchmarks/text2image 2026-01-05T08:15:16,298 copying evalscope/benchmarks/text2image/__init__.py -> build/lib/evalscope/benchmarks/text2image 2026-01-05T08:15:16,300 copying evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-01-05T08:15:16,302 copying evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-01-05T08:15:16,304 copying evalscope/benchmarks/text2image/tifa_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-01-05T08:15:16,306 copying evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/lib/evalscope/benchmarks/text2image 
2026-01-05T08:15:16,308 copying evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/lib/evalscope/benchmarks/text2image 2026-01-05T08:15:16,310 creating build/lib/evalscope/benchmarks/halu_eval 2026-01-05T08:15:16,312 copying evalscope/benchmarks/halu_eval/__init__.py -> build/lib/evalscope/benchmarks/halu_eval 2026-01-05T08:15:16,313 copying evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/lib/evalscope/benchmarks/halu_eval 2026-01-05T08:15:16,316 copying evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/lib/evalscope/benchmarks/halu_eval 2026-01-05T08:15:16,318 creating build/lib/evalscope/benchmarks/qasc 2026-01-05T08:15:16,319 copying evalscope/benchmarks/qasc/__init__.py -> build/lib/evalscope/benchmarks/qasc 2026-01-05T08:15:16,321 copying evalscope/benchmarks/qasc/qasc_adapter.py -> build/lib/evalscope/benchmarks/qasc 2026-01-05T08:15:16,323 creating build/lib/evalscope/benchmarks/multipl_e 2026-01-05T08:15:16,324 copying evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-01-05T08:15:16,327 copying evalscope/benchmarks/multipl_e/__init__.py -> build/lib/evalscope/benchmarks/multipl_e 2026-01-05T08:15:16,328 copying evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/lib/evalscope/benchmarks/multipl_e 2026-01-05T08:15:16,331 copying evalscope/benchmarks/multipl_e/utils.py -> build/lib/evalscope/benchmarks/multipl_e 2026-01-05T08:15:16,333 creating build/lib/evalscope/benchmarks/data_collection 2026-01-05T08:15:16,334 copying evalscope/benchmarks/data_collection/__init__.py -> build/lib/evalscope/benchmarks/data_collection 2026-01-05T08:15:16,336 copying evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/lib/evalscope/benchmarks/data_collection 2026-01-05T08:15:16,339 creating build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-01-05T08:15:16,340 copying evalscope/benchmarks/seed_bench_2_plus/__init__.py -> 
build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-01-05T08:15:16,341 copying evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/lib/evalscope/benchmarks/seed_bench_2_plus 2026-01-05T08:15:16,344 creating build/lib/evalscope/benchmarks/iquiz 2026-01-05T08:15:16,345 copying evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/lib/evalscope/benchmarks/iquiz 2026-01-05T08:15:16,347 copying evalscope/benchmarks/iquiz/__init__.py -> build/lib/evalscope/benchmarks/iquiz 2026-01-05T08:15:16,349 creating build/lib/evalscope/benchmarks/omni_bench 2026-01-05T08:15:16,350 copying evalscope/benchmarks/omni_bench/__init__.py -> build/lib/evalscope/benchmarks/omni_bench 2026-01-05T08:15:16,352 copying evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/lib/evalscope/benchmarks/omni_bench 2026-01-05T08:15:16,354 creating build/lib/evalscope/benchmarks/alpaca_eval 2026-01-05T08:15:16,355 copying evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-01-05T08:15:16,357 copying evalscope/benchmarks/alpaca_eval/__init__.py -> build/lib/evalscope/benchmarks/alpaca_eval 2026-01-05T08:15:16,359 creating build/lib/evalscope/benchmarks/coin_flip 2026-01-05T08:15:16,360 copying evalscope/benchmarks/coin_flip/__init__.py -> build/lib/evalscope/benchmarks/coin_flip 2026-01-05T08:15:16,362 copying evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/lib/evalscope/benchmarks/coin_flip 2026-01-05T08:15:16,365 creating build/lib/evalscope/benchmarks/scicode 2026-01-05T08:15:16,366 copying evalscope/benchmarks/scicode/__init__.py -> build/lib/evalscope/benchmarks/scicode 2026-01-05T08:15:16,368 copying evalscope/benchmarks/scicode/prompt_templates.py -> build/lib/evalscope/benchmarks/scicode 2026-01-05T08:15:16,369 copying evalscope/benchmarks/scicode/util.py -> build/lib/evalscope/benchmarks/scicode 2026-01-05T08:15:16,371 copying evalscope/benchmarks/scicode/scicode_adapter.py -> 
build/lib/evalscope/benchmarks/scicode 2026-01-05T08:15:16,374 creating build/lib/evalscope/benchmarks/tool_bench 2026-01-05T08:15:16,375 copying evalscope/benchmarks/tool_bench/__init__.py -> build/lib/evalscope/benchmarks/tool_bench 2026-01-05T08:15:16,377 copying evalscope/benchmarks/tool_bench/utils.py -> build/lib/evalscope/benchmarks/tool_bench 2026-01-05T08:15:16,379 copying evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> build/lib/evalscope/benchmarks/tool_bench 2026-01-05T08:15:16,382 creating build/lib/evalscope/benchmarks/mm_star 2026-01-05T08:15:16,383 copying evalscope/benchmarks/mm_star/__init__.py -> build/lib/evalscope/benchmarks/mm_star 2026-01-05T08:15:16,384 copying evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/lib/evalscope/benchmarks/mm_star 2026-01-05T08:15:16,386 creating build/lib/evalscope/benchmarks/competition_math 2026-01-05T08:15:16,387 copying evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/lib/evalscope/benchmarks/competition_math 2026-01-05T08:15:16,389 copying evalscope/benchmarks/competition_math/__init__.py -> build/lib/evalscope/benchmarks/competition_math 2026-01-05T08:15:16,392 creating build/lib/evalscope/benchmarks/multi_if 2026-01-05T08:15:16,393 copying evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/lib/evalscope/benchmarks/multi_if 2026-01-05T08:15:16,395 copying evalscope/benchmarks/multi_if/__init__.py -> build/lib/evalscope/benchmarks/multi_if 2026-01-05T08:15:16,396 copying evalscope/benchmarks/multi_if/ifeval.py -> build/lib/evalscope/benchmarks/multi_if 2026-01-05T08:15:16,401 copying evalscope/benchmarks/multi_if/metrics.py -> build/lib/evalscope/benchmarks/multi_if 2026-01-05T08:15:16,404 creating build/lib/evalscope/benchmarks/amc 2026-01-05T08:15:16,405 copying evalscope/benchmarks/amc/__init__.py -> build/lib/evalscope/benchmarks/amc 2026-01-05T08:15:16,406 copying evalscope/benchmarks/amc/amc_adapter.py -> build/lib/evalscope/benchmarks/amc 
2026-01-05T08:15:16,409 creating build/lib/evalscope/benchmarks/general_vqa 2026-01-05T08:15:16,410 copying evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/lib/evalscope/benchmarks/general_vqa 2026-01-05T08:15:16,412 copying evalscope/benchmarks/general_vqa/__init__.py -> build/lib/evalscope/benchmarks/general_vqa 2026-01-05T08:15:16,414 creating build/lib/evalscope/benchmarks/drivelology 2026-01-05T08:15:16,415 copying evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-01-05T08:15:16,418 copying evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-01-05T08:15:16,420 copying evalscope/benchmarks/drivelology/__init__.py -> build/lib/evalscope/benchmarks/drivelology 2026-01-05T08:15:16,422 copying evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-01-05T08:15:16,424 copying evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/lib/evalscope/benchmarks/drivelology 2026-01-05T08:15:16,426 creating build/lib/evalscope/benchmarks/math_vista 2026-01-05T08:15:16,427 copying evalscope/benchmarks/math_vista/__init__.py -> build/lib/evalscope/benchmarks/math_vista 2026-01-05T08:15:16,429 copying evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/lib/evalscope/benchmarks/math_vista 2026-01-05T08:15:16,432 creating build/lib/evalscope/benchmarks/ocr_bench 2026-01-05T08:15:16,432 copying evalscope/benchmarks/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench 2026-01-05T08:15:16,435 creating build/lib/evalscope/benchmarks/torgo 2026-01-05T08:15:16,436 copying evalscope/benchmarks/torgo/__init__.py -> build/lib/evalscope/benchmarks/torgo 2026-01-05T08:15:16,437 copying evalscope/benchmarks/torgo/torgo_adapter.py -> build/lib/evalscope/benchmarks/torgo 2026-01-05T08:15:16,440 creating build/lib/evalscope/benchmarks/music_trivia 
2026-01-05T08:15:16,441 copying evalscope/benchmarks/music_trivia/__init__.py -> build/lib/evalscope/benchmarks/music_trivia 2026-01-05T08:15:16,443 copying evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> build/lib/evalscope/benchmarks/music_trivia 2026-01-05T08:15:16,445 creating build/lib/evalscope/benchmarks/mmlu_pro 2026-01-05T08:15:16,446 copying evalscope/benchmarks/mmlu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-01-05T08:15:16,448 copying evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmlu_pro 2026-01-05T08:15:16,450 creating build/lib/evalscope/benchmarks/mgsm 2026-01-05T08:15:16,451 copying evalscope/benchmarks/mgsm/__init__.py -> build/lib/evalscope/benchmarks/mgsm 2026-01-05T08:15:16,452 copying evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/lib/evalscope/benchmarks/mgsm 2026-01-05T08:15:16,454 creating build/lib/evalscope/benchmarks/gsm8k 2026-01-05T08:15:16,455 copying evalscope/benchmarks/gsm8k/__init__.py -> build/lib/evalscope/benchmarks/gsm8k 2026-01-05T08:15:16,457 copying evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/lib/evalscope/benchmarks/gsm8k 2026-01-05T08:15:16,459 creating build/lib/evalscope/benchmarks/mmmu_pro 2026-01-05T08:15:16,460 copying evalscope/benchmarks/mmmu_pro/__init__.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-01-05T08:15:16,462 copying evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/lib/evalscope/benchmarks/mmmu_pro 2026-01-05T08:15:16,464 creating build/lib/evalscope/benchmarks/gsm8k_v 2026-01-05T08:15:16,465 copying evalscope/benchmarks/gsm8k_v/__init__.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-01-05T08:15:16,467 copying evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> build/lib/evalscope/benchmarks/gsm8k_v 2026-01-05T08:15:16,469 creating build/lib/evalscope/benchmarks/hle 2026-01-05T08:15:16,470 copying evalscope/benchmarks/hle/hle_adapter.py -> build/lib/evalscope/benchmarks/hle 2026-01-05T08:15:16,472 copying 
evalscope/benchmarks/hle/__init__.py -> build/lib/evalscope/benchmarks/hle 2026-01-05T08:15:16,474 creating build/lib/evalscope/benchmarks/super_gpqa 2026-01-05T08:15:16,475 copying evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-01-05T08:15:16,477 copying evalscope/benchmarks/super_gpqa/__init__.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-01-05T08:15:16,479 copying evalscope/benchmarks/super_gpqa/utils.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-01-05T08:15:16,481 copying evalscope/benchmarks/super_gpqa/prompt.py -> build/lib/evalscope/benchmarks/super_gpqa 2026-01-05T08:15:16,483 creating build/lib/evalscope/benchmarks/cmmu 2026-01-05T08:15:16,484 copying evalscope/benchmarks/cmmu/__init__.py -> build/lib/evalscope/benchmarks/cmmu 2026-01-05T08:15:16,486 copying evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/lib/evalscope/benchmarks/cmmu 2026-01-05T08:15:16,488 copying evalscope/benchmarks/cmmu/prompt.py -> build/lib/evalscope/benchmarks/cmmu 2026-01-05T08:15:16,490 creating build/lib/evalscope/benchmarks/maritime_bench 2026-01-05T08:15:16,491 copying evalscope/benchmarks/maritime_bench/__init__.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-01-05T08:15:16,493 copying evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/lib/evalscope/benchmarks/maritime_bench 2026-01-05T08:15:16,495 creating build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,496 copying evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,499 copying evalscope/benchmarks/live_code_bench/__init__.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,500 copying evalscope/benchmarks/live_code_bench/prompts.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,503 copying evalscope/benchmarks/live_code_bench/evaluate_utils.py -> 
build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,505 copying evalscope/benchmarks/live_code_bench/testing_util.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,508 copying evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,510 copying evalscope/benchmarks/live_code_bench/extract_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,512 copying evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,513 copying evalscope/benchmarks/live_code_bench/load_utils.py -> build/lib/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:16,515 creating build/lib/evalscope/benchmarks/bfcl 2026-01-05T08:15:16,516 copying evalscope/benchmarks/bfcl/__init__.py -> build/lib/evalscope/benchmarks/bfcl 2026-01-05T08:15:16,518 creating build/lib/evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:16,519 copying evalscope/benchmarks/image_edit/gedit/__init__.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:16,521 copying evalscope/benchmarks/image_edit/gedit/utils.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:16,523 copying evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:16,525 copying evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/lib/evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:16,528 creating build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:16,529 copying evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:16,531 copying evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:16,533 copying evalscope/benchmarks/tau_bench/tau_bench/generation.py -> 
build/lib/evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:16,536 creating build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:16,537 copying evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:16,538 copying evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:16,540 copying evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/lib/evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:16,543 creating build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,544 copying evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,546 copying evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,548 copying evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,550 copying evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,552 copying evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,553 copying evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/lib/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:16,556 creating build/lib/evalscope/benchmarks/poly_math/utils 2026-01-05T08:15:16,557 copying evalscope/benchmarks/poly_math/utils/instruction.py -> build/lib/evalscope/benchmarks/poly_math/utils 2026-01-05T08:15:16,560 creating build/lib/evalscope/benchmarks/scicode/docker 2026-01-05T08:15:16,561 copying evalscope/benchmarks/scicode/docker/test_util.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-01-05T08:15:16,563 copying 
evalscope/benchmarks/scicode/docker/process_data.py -> build/lib/evalscope/benchmarks/scicode/docker 2026-01-05T08:15:16,566 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,567 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,569 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,571 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,574 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,576 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,578 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,580 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,582 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,584 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:16,586 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-01-05T08:15:16,587 copying evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-01-05T08:15:16,589 copying evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench 2026-01-05T08:15:16,592 creating build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 
2026-01-05T08:15:16,593 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:16,594 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:16,597 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:16,600 creating build/lib/evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:16,601 copying evalscope/benchmarks/bfcl/v4/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:16,603 copying evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:16,605 copying evalscope/benchmarks/bfcl/v4/utils.py -> build/lib/evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:16,608 creating build/lib/evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:16,609 copying evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:16,612 copying evalscope/benchmarks/bfcl/v3/__init__.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:16,613 copying evalscope/benchmarks/bfcl/v3/generation.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:16,616 copying evalscope/benchmarks/bfcl/v3/utils.py -> build/lib/evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:16,618 creating build/lib/evalscope/perf/sla 2026-01-05T08:15:16,619 copying evalscope/perf/sla/__init__.py -> build/lib/evalscope/perf/sla 2026-01-05T08:15:16,621 copying evalscope/perf/sla/sla_criterion.py -> build/lib/evalscope/perf/sla 2026-01-05T08:15:16,622 copying evalscope/perf/sla/sla_run.py -> build/lib/evalscope/perf/sla 2026-01-05T08:15:16,626 creating build/lib/evalscope/perf/plugin 2026-01-05T08:15:16,627 copying evalscope/perf/plugin/__init__.py -> 
build/lib/evalscope/perf/plugin 2026-01-05T08:15:16,629 copying evalscope/perf/plugin/registry.py -> build/lib/evalscope/perf/plugin 2026-01-05T08:15:16,631 creating build/lib/evalscope/perf/utils 2026-01-05T08:15:16,632 copying evalscope/perf/utils/__init__.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,634 copying evalscope/perf/utils/log_utils.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,636 copying evalscope/perf/utils/db_util.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,639 copying evalscope/perf/utils/benchmark_util.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,641 copying evalscope/perf/utils/handler.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,643 copying evalscope/perf/utils/rich_display.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,645 copying evalscope/perf/utils/analysis_result.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,647 copying evalscope/perf/utils/local_server.py -> build/lib/evalscope/perf/utils 2026-01-05T08:15:16,649 creating build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,650 copying evalscope/perf/plugin/datasets/longalpaca.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,652 copying evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,654 copying evalscope/perf/plugin/datasets/__init__.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,656 copying evalscope/perf/plugin/datasets/line_by_line.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,658 copying evalscope/perf/plugin/datasets/custom.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,659 copying evalscope/perf/plugin/datasets/kontext_bench.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,661 copying evalscope/perf/plugin/datasets/base.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,663 copying 
evalscope/perf/plugin/datasets/speed_benchmark.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,665 copying evalscope/perf/plugin/datasets/flickr8k.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,667 copying evalscope/perf/plugin/datasets/openqa.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,669 copying evalscope/perf/plugin/datasets/random_dataset.py -> build/lib/evalscope/perf/plugin/datasets 2026-01-05T08:15:16,671 creating build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,672 copying evalscope/perf/plugin/api/default_api.py -> build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,675 copying evalscope/perf/plugin/api/dashscope_api.py -> build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,677 copying evalscope/perf/plugin/api/__init__.py -> build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,679 copying evalscope/perf/plugin/api/custom_api.py -> build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,681 copying evalscope/perf/plugin/api/base.py -> build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,683 copying evalscope/perf/plugin/api/openai_api.py -> build/lib/evalscope/perf/plugin/api 2026-01-05T08:15:16,686 creating build/lib/evalscope/api/evaluator 2026-01-05T08:15:16,686 copying evalscope/api/evaluator/cache.py -> build/lib/evalscope/api/evaluator 2026-01-05T08:15:16,689 copying evalscope/api/evaluator/__init__.py -> build/lib/evalscope/api/evaluator 2026-01-05T08:15:16,691 copying evalscope/api/evaluator/state.py -> build/lib/evalscope/api/evaluator 2026-01-05T08:15:16,693 copying evalscope/api/evaluator/evaluator.py -> build/lib/evalscope/api/evaluator 2026-01-05T08:15:16,696 creating build/lib/evalscope/api/messages 2026-01-05T08:15:16,697 copying evalscope/api/messages/__init__.py -> build/lib/evalscope/api/messages 2026-01-05T08:15:16,699 copying evalscope/api/messages/content.py -> build/lib/evalscope/api/messages 2026-01-05T08:15:16,700 copying 
evalscope/api/messages/utils.py -> build/lib/evalscope/api/messages 2026-01-05T08:15:16,702 copying evalscope/api/messages/chat_message.py -> build/lib/evalscope/api/messages 2026-01-05T08:15:16,705 creating build/lib/evalscope/api/model 2026-01-05T08:15:16,706 copying evalscope/api/model/generate_config.py -> build/lib/evalscope/api/model 2026-01-05T08:15:16,709 copying evalscope/api/model/__init__.py -> build/lib/evalscope/api/model 2026-01-05T08:15:16,711 copying evalscope/api/model/lazy_model.py -> build/lib/evalscope/api/model 2026-01-05T08:15:16,713 copying evalscope/api/model/model_output.py -> build/lib/evalscope/api/model 2026-01-05T08:15:16,715 copying evalscope/api/model/model.py -> build/lib/evalscope/api/model 2026-01-05T08:15:16,718 creating build/lib/evalscope/api/dataset 2026-01-05T08:15:16,719 copying evalscope/api/dataset/__init__.py -> build/lib/evalscope/api/dataset 2026-01-05T08:15:16,721 copying evalscope/api/dataset/loader.py -> build/lib/evalscope/api/dataset 2026-01-05T08:15:16,723 copying evalscope/api/dataset/dataset.py -> build/lib/evalscope/api/dataset 2026-01-05T08:15:16,725 copying evalscope/api/dataset/utils.py -> build/lib/evalscope/api/dataset 2026-01-05T08:15:16,728 creating build/lib/evalscope/api/metric 2026-01-05T08:15:16,729 copying evalscope/api/metric/scorer.py -> build/lib/evalscope/api/metric 2026-01-05T08:15:16,731 copying evalscope/api/metric/__init__.py -> build/lib/evalscope/api/metric 2026-01-05T08:15:16,732 copying evalscope/api/metric/metric.py -> build/lib/evalscope/api/metric 2026-01-05T08:15:16,735 creating build/lib/evalscope/api/benchmark 2026-01-05T08:15:16,736 copying evalscope/api/benchmark/meta.py -> build/lib/evalscope/api/benchmark 2026-01-05T08:15:16,738 copying evalscope/api/benchmark/__init__.py -> build/lib/evalscope/api/benchmark 2026-01-05T08:15:16,740 copying evalscope/api/benchmark/benchmark.py -> build/lib/evalscope/api/benchmark 2026-01-05T08:15:16,743 creating build/lib/evalscope/api/filter 
2026-01-05T08:15:16,744 copying evalscope/api/filter/__init__.py -> build/lib/evalscope/api/filter 2026-01-05T08:15:16,745 copying evalscope/api/filter/filter.py -> build/lib/evalscope/api/filter 2026-01-05T08:15:16,748 creating build/lib/evalscope/api/tool 2026-01-05T08:15:16,749 copying evalscope/api/tool/__init__.py -> build/lib/evalscope/api/tool 2026-01-05T08:15:16,751 copying evalscope/api/tool/tool_call.py -> build/lib/evalscope/api/tool 2026-01-05T08:15:16,752 copying evalscope/api/tool/tool_info.py -> build/lib/evalscope/api/tool 2026-01-05T08:15:16,755 copying evalscope/api/tool/utils.py -> build/lib/evalscope/api/tool 2026-01-05T08:15:16,757 creating build/lib/evalscope/api/mixin 2026-01-05T08:15:16,758 copying evalscope/api/mixin/__init__.py -> build/lib/evalscope/api/mixin 2026-01-05T08:15:16,760 copying evalscope/api/mixin/sandbox_mixin.py -> build/lib/evalscope/api/mixin 2026-01-05T08:15:16,762 copying evalscope/api/mixin/llm_judge_mixin.py -> build/lib/evalscope/api/mixin 2026-01-05T08:15:16,765 creating build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,766 copying evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,768 copying evalscope/api/benchmark/adapters/__init__.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,770 copying evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,772 copying evalscope/api/benchmark/adapters/agent_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,774 copying evalscope/api/benchmark/adapters/text2image_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,776 copying evalscope/api/benchmark/adapters/ner_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,779 copying evalscope/api/benchmark/adapters/image_edit_adapter.py -> build/lib/evalscope/api/benchmark/adapters 
2026-01-05T08:15:16,781 copying evalscope/api/benchmark/adapters/default_data_adapter.py -> build/lib/evalscope/api/benchmark/adapters 2026-01-05T08:15:16,785 running egg_info 2026-01-05T08:15:16,797 writing evalscope.egg-info/PKG-INFO 2026-01-05T08:15:16,826 writing dependency_links to evalscope.egg-info/dependency_links.txt 2026-01-05T08:15:16,827 writing entry points to evalscope.egg-info/entry_points.txt 2026-01-05T08:15:16,844 writing requirements to evalscope.egg-info/requires.txt 2026-01-05T08:15:16,845 writing top-level names to evalscope.egg-info/top_level.txt 2026-01-05T08:15:17,024 reading manifest file 'evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:17,063 reading manifest template 'MANIFEST.in' 2026-01-05T08:15:17,389 warning: no previously-included files matching '*.py[cod]' found anywhere in distribution 2026-01-05T08:15:17,393 warning: no previously-included files matching '__pycache__' found anywhere in distribution 2026-01-05T08:15:17,397 warning: no previously-included files matching '*.so' found anywhere in distribution 2026-01-05T08:15:17,402 warning: no previously-included files matching '*.dylib' found anywhere in distribution 2026-01-05T08:15:17,403 adding license file 'LICENSE' 2026-01-05T08:15:17,446 writing manifest file 'evalscope.egg-info/SOURCES.txt' 2026-01-05T08:15:17,540 copying evalscope/third_party/longbench_write/README.md -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:17,543 copying evalscope/third_party/longbench_write/default_task.json -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:17,545 copying evalscope/third_party/longbench_write/default_task.yaml -> build/lib/evalscope/third_party/longbench_write 2026-01-05T08:15:17,547 copying evalscope/third_party/toolbench_static/README.md -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:17,549 copying evalscope/third_party/toolbench_static/config_default.json -> build/lib/evalscope/third_party/toolbench_static 
2026-01-05T08:15:17,551 copying evalscope/third_party/toolbench_static/config_default.yaml -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:17,553 copying evalscope/third_party/toolbench_static/requirements.txt -> build/lib/evalscope/third_party/toolbench_static 2026-01-05T08:15:17,555 copying evalscope/third_party/longbench_write/resources/judge.txt -> build/lib/evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,557 copying evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,560 copying evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,563 copying evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/lib/evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,566 creating build/lib/evalscope/third_party/thinkbench/resources 2026-01-05T08:15:17,567 copying evalscope/third_party/thinkbench/resources/critique_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-01-05T08:15:17,569 copying evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/lib/evalscope/third_party/thinkbench/resources 2026-01-05T08:15:17,571 copying evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/lib/evalscope/backend/rag_eval/clip_benchmark/utils 2026-01-05T08:15:17,573 copying evalscope/metrics/text_normalizer/english.json -> build/lib/evalscope/metrics/text_normalizer 2026-01-05T08:15:17,576 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-01-05T08:15:17,577 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-01-05T08:15:17,579 creating 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:17,580 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:17,583 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:17,585 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:17,587 creating build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,588 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,590 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,592 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,594 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,596 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 
2026-01-05T08:15:17,598 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,600 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,603 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,605 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,607 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,609 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,611 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,613 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,615 copying 
evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,617 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,620 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,622 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,624 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,626 copying evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:17,628 copying evalscope/benchmarks/trivia_qa/samples.jsonl -> build/lib/evalscope/benchmarks/trivia_qa 2026-01-05T08:15:17,630 creating build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,631 copying evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,633 copying evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,635 copying evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,637 
copying evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,639 copying evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,642 copying evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,644 copying evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,646 copying evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,649 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,651 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,653 copying evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,655 copying evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,658 copying evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,660 copying evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,662 copying evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,664 copying evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,666 copying evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 
2026-01-05T08:15:17,668 copying evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,671 copying evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,673 copying evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,675 copying evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,677 copying evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,680 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,682 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,684 copying evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,686 copying evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,688 copying evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/lib/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:17,690 copying evalscope/benchmarks/scicode/docker/Dockerfile -> build/lib/evalscope/benchmarks/scicode/docker 2026-01-05T08:15:17,693 copying evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/lib/evalscope/benchmarks/scicode/docker 2026-01-05T08:15:17,695 copying evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:17,786 installing to build/bdist.linux-armv7l/wheel 
2026-01-05T08:15:17,787 running install 2026-01-05T08:15:17,809 running install_lib 2026-01-05T08:15:17,816 creating build/bdist.linux-armv7l/wheel 2026-01-05T08:15:17,818 creating build/bdist.linux-armv7l/wheel/evalscope 2026-01-05T08:15:17,819 copying build/lib/evalscope/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-01-05T08:15:17,822 creating build/bdist.linux-armv7l/wheel/evalscope/third_party 2026-01-05T08:15:17,823 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write 2026-01-05T08:15:17,824 copying build/lib/evalscope/third_party/longbench_write/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,826 copying build/lib/evalscope/third_party/longbench_write/longbench_write.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,828 copying build/lib/evalscope/third_party/longbench_write/default_task.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,830 copying build/lib/evalscope/third_party/longbench_write/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,832 copying build/lib/evalscope/third_party/longbench_write/default_task.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,834 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,835 copying build/lib/evalscope/third_party/longbench_write/resources/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,837 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,840 copying build/lib/evalscope/third_party/longbench_write/resources/longbench_write.jsonl -> 
build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,843 copying build/lib/evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,845 copying build/lib/evalscope/third_party/longbench_write/resources/judge.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/resources 2026-01-05T08:15:17,847 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/longbench_write/tools 2026-01-05T08:15:17,848 copying build/lib/evalscope/third_party/longbench_write/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-01-05T08:15:17,850 copying build/lib/evalscope/third_party/longbench_write/tools/data_etl.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-01-05T08:15:17,852 copying build/lib/evalscope/third_party/longbench_write/tools/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write/tools 2026-01-05T08:15:17,855 copying build/lib/evalscope/third_party/longbench_write/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,857 copying build/lib/evalscope/third_party/longbench_write/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,859 copying build/lib/evalscope/third_party/longbench_write/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/longbench_write 2026-01-05T08:15:17,861 copying build/lib/evalscope/third_party/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party 2026-01-05T08:15:17,863 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static 2026-01-05T08:15:17,864 copying build/lib/evalscope/third_party/toolbench_static/config_default.yaml -> 
build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,866 copying build/lib/evalscope/third_party/toolbench_static/toolbench_static.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,868 copying build/lib/evalscope/third_party/toolbench_static/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,870 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/toolbench_static/llm 2026-01-05T08:15:17,871 copying build/lib/evalscope/third_party/toolbench_static/llm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-01-05T08:15:17,873 copying build/lib/evalscope/third_party/toolbench_static/llm/swift_infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static/llm 2026-01-05T08:15:17,875 copying build/lib/evalscope/third_party/toolbench_static/config_default.json -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,877 copying build/lib/evalscope/third_party/toolbench_static/requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,879 copying build/lib/evalscope/third_party/toolbench_static/README.md -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,881 copying build/lib/evalscope/third_party/toolbench_static/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,883 copying build/lib/evalscope/third_party/toolbench_static/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/toolbench_static 2026-01-05T08:15:17,886 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench 2026-01-05T08:15:17,887 copying build/lib/evalscope/third_party/thinkbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-01-05T08:15:17,889 
creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/resources 2026-01-05T08:15:17,890 copying build/lib/evalscope/third_party/thinkbench/resources/reformat_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-01-05T08:15:17,893 copying build/lib/evalscope/third_party/thinkbench/resources/critique_template.txt -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/resources 2026-01-05T08:15:17,895 creating build/bdist.linux-armv7l/wheel/evalscope/third_party/thinkbench/tools 2026-01-05T08:15:17,896 copying build/lib/evalscope/third_party/thinkbench/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-01-05T08:15:17,898 copying build/lib/evalscope/third_party/thinkbench/tools/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-01-05T08:15:17,900 copying build/lib/evalscope/third_party/thinkbench/tools/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench/tools 2026-01-05T08:15:17,901 copying build/lib/evalscope/third_party/thinkbench/eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-01-05T08:15:17,904 copying build/lib/evalscope/third_party/thinkbench/infer.py -> build/bdist.linux-armv7l/wheel/./evalscope/third_party/thinkbench 2026-01-05T08:15:17,907 creating build/bdist.linux-armv7l/wheel/evalscope/evaluator 2026-01-05T08:15:17,907 copying build/lib/evalscope/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-01-05T08:15:17,909 copying build/lib/evalscope/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/evaluator 2026-01-05T08:15:17,912 creating build/bdist.linux-armv7l/wheel/evalscope/summarizer 2026-01-05T08:15:17,913 copying build/lib/evalscope/summarizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-01-05T08:15:17,915 copying build/lib/evalscope/summarizer/summarizer.py 
-> build/bdist.linux-armv7l/wheel/./evalscope/summarizer 2026-01-05T08:15:17,917 copying build/lib/evalscope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-01-05T08:15:17,920 creating build/bdist.linux-armv7l/wheel/evalscope/collections 2026-01-05T08:15:17,921 copying build/lib/evalscope/collections/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-01-05T08:15:17,922 copying build/lib/evalscope/collections/schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-01-05T08:15:17,925 copying build/lib/evalscope/collections/sampler.py -> build/bdist.linux-armv7l/wheel/./evalscope/collections 2026-01-05T08:15:17,927 creating build/bdist.linux-armv7l/wheel/evalscope/models 2026-01-05T08:15:17,929 copying build/lib/evalscope/models/openai_compatible.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,931 copying build/lib/evalscope/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,933 copying build/lib/evalscope/models/model_apis.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,935 copying build/lib/evalscope/models/modelscope.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,937 copying build/lib/evalscope/models/text2image_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,939 copying build/lib/evalscope/models/mockllm.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,942 creating build/bdist.linux-armv7l/wheel/evalscope/models/utils 2026-01-05T08:15:17,943 copying build/lib/evalscope/models/utils/openai.py -> build/bdist.linux-armv7l/wheel/./evalscope/models/utils 2026-01-05T08:15:17,945 copying build/lib/evalscope/models/image_edit_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/models 2026-01-05T08:15:17,948 creating build/bdist.linux-armv7l/wheel/evalscope/service 2026-01-05T08:15:17,950 creating 
build/bdist.linux-armv7l/wheel/evalscope/service/frontend 2026-01-05T08:15:17,951 copying build/lib/evalscope/service/frontend/async_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-01-05T08:15:17,953 copying build/lib/evalscope/service/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-01-05T08:15:17,955 copying build/lib/evalscope/service/frontend/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-01-05T08:15:17,957 copying build/lib/evalscope/service/frontend/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service/frontend 2026-01-05T08:15:17,959 copying build/lib/evalscope/service/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-01-05T08:15:17,961 copying build/lib/evalscope/service/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-01-05T08:15:17,963 copying build/lib/evalscope/service/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/service 2026-01-05T08:15:17,966 copying build/lib/evalscope/version.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-01-05T08:15:17,967 copying build/lib/evalscope/run.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-01-05T08:15:17,970 creating build/bdist.linux-armv7l/wheel/evalscope/filters 2026-01-05T08:15:17,971 copying build/lib/evalscope/filters/selection.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-01-05T08:15:17,973 copying build/lib/evalscope/filters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-01-05T08:15:17,975 copying build/lib/evalscope/filters/extraction.py -> build/bdist.linux-armv7l/wheel/./evalscope/filters 2026-01-05T08:15:17,977 copying build/lib/evalscope/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-01-05T08:15:17,979 creating build/bdist.linux-armv7l/wheel/evalscope/cli 2026-01-05T08:15:17,980 copying build/lib/evalscope/cli/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 
2026-01-05T08:15:17,982 copying build/lib/evalscope/cli/start_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-01-05T08:15:17,984 copying build/lib/evalscope/cli/cli.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-01-05T08:15:17,986 copying build/lib/evalscope/cli/start_perf.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-01-05T08:15:17,988 copying build/lib/evalscope/cli/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-01-05T08:15:17,990 copying build/lib/evalscope/cli/start_app.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-01-05T08:15:17,992 copying build/lib/evalscope/cli/start_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/cli 2026-01-05T08:15:17,994 copying build/lib/evalscope/config.py -> build/bdist.linux-armv7l/wheel/./evalscope 2026-01-05T08:15:17,997 creating build/bdist.linux-armv7l/wheel/evalscope/report 2026-01-05T08:15:17,998 copying build/lib/evalscope/report/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-01-05T08:15:18,000 copying build/lib/evalscope/report/combinator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-01-05T08:15:18,002 copying build/lib/evalscope/report/report.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-01-05T08:15:18,004 copying build/lib/evalscope/report/generator.py -> build/bdist.linux-armv7l/wheel/./evalscope/report 2026-01-05T08:15:18,007 creating build/bdist.linux-armv7l/wheel/evalscope/backend 2026-01-05T08:15:18,008 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval 2026-01-05T08:15:18,010 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:18,011 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:18,013 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/arguments.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:18,015 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:18,017 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark 2026-01-05T08:15:18,020 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:18,021 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:18,022 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:18,025 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:18,027 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/tasks 2026-01-05T08:15:18,029 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/clip_benchmark/utils 2026-01-05T08:15:18,030 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-01-05T08:15:18,032 copying build/lib/evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/clip_benchmark/utils 2026-01-05T08:15:18,034 copying build/lib/evalscope/backend/rag_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-01-05T08:15:18,037 creating 
build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:18,038 copying build/lib/evalscope/backend/rag_eval/cmteb/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:18,040 copying build/lib/evalscope/backend/rag_eval/cmteb/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:18,042 copying build/lib/evalscope/backend/rag_eval/cmteb/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:18,043 copying build/lib/evalscope/backend/rag_eval/cmteb/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb 2026-01-05T08:15:18,046 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,047 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,050 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,052 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/STS.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,054 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,056 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,058 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Clustering.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,060 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 
2026-01-05T08:15:18,062 copying build/lib/evalscope/backend/rag_eval/cmteb/tasks/Reranking.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/cmteb/tasks 2026-01-05T08:15:18,065 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas 2026-01-05T08:15:18,066 copying build/lib/evalscope/backend/rag_eval/ragas/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-01-05T08:15:18,068 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/prompts 2026-01-05T08:15:18,069 copying build/lib/evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/prompts 2026-01-05T08:15:18,071 copying build/lib/evalscope/backend/rag_eval/ragas/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-01-05T08:15:18,073 copying build/lib/evalscope/backend/rag_eval/ragas/task_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas 2026-01-05T08:15:18,076 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:18,077 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:18,079 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_distribution.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:18,080 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/testset_generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:18,083 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/build_transform.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:18,085 copying build/lib/evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/ragas/tasks 2026-01-05T08:15:18,087 copying build/lib/evalscope/backend/rag_eval/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval 2026-01-05T08:15:18,089 creating build/bdist.linux-armv7l/wheel/evalscope/backend/rag_eval/utils 2026-01-05T08:15:18,090 copying build/lib/evalscope/backend/rag_eval/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-01-05T08:15:18,092 copying build/lib/evalscope/backend/rag_eval/utils/embedding.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-01-05T08:15:18,094 copying build/lib/evalscope/backend/rag_eval/utils/llm.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-01-05T08:15:18,096 copying build/lib/evalscope/backend/rag_eval/utils/tools.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-01-05T08:15:18,098 copying build/lib/evalscope/backend/rag_eval/utils/clip.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/rag_eval/utils 2026-01-05T08:15:18,100 copying build/lib/evalscope/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-01-05T08:15:18,102 creating build/bdist.linux-armv7l/wheel/evalscope/backend/vlm_eval_kit 2026-01-05T08:15:18,103 copying build/lib/evalscope/backend/vlm_eval_kit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-01-05T08:15:18,105 copying build/lib/evalscope/backend/vlm_eval_kit/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/vlm_eval_kit 2026-01-05T08:15:18,107 copying build/lib/evalscope/backend/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend 2026-01-05T08:15:18,109 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass 2026-01-05T08:15:18,111 copying build/lib/evalscope/backend/opencompass/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 
2026-01-05T08:15:18,112 copying build/lib/evalscope/backend/opencompass/api_meta_template.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-01-05T08:15:18,114 copying build/lib/evalscope/backend/opencompass/backend_manager.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass 2026-01-05T08:15:18,117 creating build/bdist.linux-armv7l/wheel/evalscope/backend/opencompass/tasks 2026-01-05T08:15:18,118 copying build/lib/evalscope/backend/opencompass/tasks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-01-05T08:15:18,121 copying build/lib/evalscope/backend/opencompass/tasks/eval_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-01-05T08:15:18,123 copying build/lib/evalscope/backend/opencompass/tasks/eval_datasets.py -> build/bdist.linux-armv7l/wheel/./evalscope/backend/opencompass/tasks 2026-01-05T08:15:18,126 creating build/bdist.linux-armv7l/wheel/evalscope/metrics 2026-01-05T08:15:18,127 copying build/lib/evalscope/metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-01-05T08:15:18,130 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,131 copying build/lib/evalscope/metrics/t2v_metrics/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,133 copying build/lib/evalscope/metrics/t2v_metrics/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,135 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models 2026-01-05T08:15:18,136 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:18,138 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:18,140 copying 
build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:18,142 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:18,144 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:18,146 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:18,147 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:18,149 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:18,151 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:18,153 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model 2026-01-05T08:15:18,155 copying build/lib/evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/clipscore_models 2026-01-05T08:15:18,157 copying build/lib/evalscope/metrics/t2v_metrics/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 
2026-01-05T08:15:18,160 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:18,161 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:18,163 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:18,166 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:18,167 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:18,169 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:18,172 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward 2026-01-05T08:15:18,174 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:18,176 copying build/lib/evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/itmscore_models 2026-01-05T08:15:18,178 copying build/lib/evalscope/metrics/t2v_metrics/models/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-01-05T08:15:18,180 copying build/lib/evalscope/metrics/t2v_metrics/models/model.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models 2026-01-05T08:15:18,182 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:18,184 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-01-05T08:15:18,185 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5 2026-01-05T08:15:18,187 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-01-05T08:15:18,189 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-01-05T08:15:18,190 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector 2026-01-05T08:15:18,192 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model 2026-01-05T08:15:18,194 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-01-05T08:15:18,195 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model 2026-01-05T08:15:18,199 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-01-05T08:15:18,200 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-01-05T08:15:18,202 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder 2026-01-05T08:15:18,204 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:18,206 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:18,209 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:18,211 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:18,213 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models 2026-01-05T08:15:18,217 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-01-05T08:15:18,218 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:18,219 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:18,222 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:18,224 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:18,226 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors 2026-01-05T08:15:18,228 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis 2026-01-05T08:15:18,230 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,231 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,234 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,237 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,238 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,241 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 
2026-01-05T08:15:18,243 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,246 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,249 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,251 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,253 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,255 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,258 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,261 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models 2026-01-05T08:15:18,263 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,266 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,269 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,271 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models 2026-01-05T08:15:18,274 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,275 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,277 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,279 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,282 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,284 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,286 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,289 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,290 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,293 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,295 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 2026-01-05T08:15:18,297 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models 
2026-01-05T08:15:18,300 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-01-05T08:15:18,301 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs 2026-01-05T08:15:18,304 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:18,305 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:18,307 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,308 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,310 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,312 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,313 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,315 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,317 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,319 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,320 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,322 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,324 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,326 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,328 copying 
build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,329 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,331 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,333 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,334 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,336 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,338 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,340 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2 2026-01-05T08:15:18,341 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:18,343 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models 2026-01-05T08:15:18,346 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,347 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:18,348 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:18,350 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:18,352 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools 2026-01-05T08:15:18,355 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,356 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,359 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,361 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,363 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,365 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,367 copying build/lib/evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common 2026-01-05T08:15:18,369 copying build/lib/evalscope/metrics/t2v_metrics/score.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,371 copying build/lib/evalscope/metrics/t2v_metrics/itmscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,372 copying build/lib/evalscope/metrics/t2v_metrics/clipscore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,374 copying build/lib/evalscope/metrics/t2v_metrics/vqascore.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/t2v_metrics 2026-01-05T08:15:18,376 copying build/lib/evalscope/metrics/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-01-05T08:15:18,378 copying build/lib/evalscope/metrics/rouge_metric.py 
-> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-01-05T08:15:18,381 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bert_score 2026-01-05T08:15:18,382 copying build/lib/evalscope/metrics/bert_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-01-05T08:15:18,384 copying build/lib/evalscope/metrics/bert_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-01-05T08:15:18,386 copying build/lib/evalscope/metrics/bert_score/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bert_score 2026-01-05T08:15:18,389 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/text_normalizer 2026-01-05T08:15:18,390 copying build/lib/evalscope/metrics/text_normalizer/basic.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-01-05T08:15:18,392 copying build/lib/evalscope/metrics/text_normalizer/english.json -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-01-05T08:15:18,395 copying build/lib/evalscope/metrics/text_normalizer/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-01-05T08:15:18,396 copying build/lib/evalscope/metrics/text_normalizer/wer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-01-05T08:15:18,399 copying build/lib/evalscope/metrics/text_normalizer/chinese.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-01-05T08:15:18,402 copying build/lib/evalscope/metrics/text_normalizer/english.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/text_normalizer 2026-01-05T08:15:18,404 copying build/lib/evalscope/metrics/llm_judge.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-01-05T08:15:18,407 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/bundled_rouge_score 2026-01-05T08:15:18,408 copying build/lib/evalscope/metrics/bundled_rouge_score/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-01-05T08:15:18,410 copying build/lib/evalscope/metrics/bundled_rouge_score/rouge_scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/bundled_rouge_score 2026-01-05T08:15:18,412 copying build/lib/evalscope/metrics/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-01-05T08:15:18,415 copying build/lib/evalscope/metrics/math_parser.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics 2026-01-05T08:15:18,418 creating build/bdist.linux-armv7l/wheel/evalscope/metrics/sem_score 2026-01-05T08:15:18,419 copying build/lib/evalscope/metrics/sem_score/scorer.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-01-05T08:15:18,421 copying build/lib/evalscope/metrics/sem_score/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/metrics/sem_score 2026-01-05T08:15:18,423 creating build/bdist.linux-armv7l/wheel/evalscope/app 2026-01-05T08:15:18,424 copying build/lib/evalscope/app/constants.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-01-05T08:15:18,426 copying build/lib/evalscope/app/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-01-05T08:15:18,428 creating build/bdist.linux-armv7l/wheel/evalscope/app/ui 2026-01-05T08:15:18,429 copying build/lib/evalscope/app/ui/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-01-05T08:15:18,431 copying build/lib/evalscope/app/ui/sidebar.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-01-05T08:15:18,433 copying build/lib/evalscope/app/ui/single_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-01-05T08:15:18,435 copying build/lib/evalscope/app/ui/app_ui.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-01-05T08:15:18,437 copying build/lib/evalscope/app/ui/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-01-05T08:15:18,439 copying build/lib/evalscope/app/ui/multi_model.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/app/ui 2026-01-05T08:15:18,441 copying build/lib/evalscope/app/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-01-05T08:15:18,443 copying build/lib/evalscope/app/app.py -> build/bdist.linux-armv7l/wheel/./evalscope/app 2026-01-05T08:15:18,445 creating build/bdist.linux-armv7l/wheel/evalscope/app/utils 2026-01-05T08:15:18,446 copying build/lib/evalscope/app/utils/localization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-01-05T08:15:18,449 copying build/lib/evalscope/app/utils/env_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-01-05T08:15:18,450 copying build/lib/evalscope/app/utils/text_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-01-05T08:15:18,452 copying build/lib/evalscope/app/utils/data_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-01-05T08:15:18,454 copying build/lib/evalscope/app/utils/visualization.py -> build/bdist.linux-armv7l/wheel/./evalscope/app/utils 2026-01-05T08:15:18,458 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks 2026-01-05T08:15:18,460 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_verse 2026-01-05T08:15:18,461 copying build/lib/evalscope/benchmarks/math_verse/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-01-05T08:15:18,462 copying build/lib/evalscope/benchmarks/math_verse/math_verse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_verse 2026-01-05T08:15:18,465 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/process_bench 2026-01-05T08:15:18,466 copying build/lib/evalscope/benchmarks/process_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-01-05T08:15:18,468 copying build/lib/evalscope/benchmarks/process_bench/process_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/process_bench 2026-01-05T08:15:18,470 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docvqa 2026-01-05T08:15:18,471 copying build/lib/evalscope/benchmarks/docvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-01-05T08:15:18,473 copying build/lib/evalscope/benchmarks/docvqa/docvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docvqa 2026-01-05T08:15:18,475 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arena_hard 2026-01-05T08:15:18,476 copying build/lib/evalscope/benchmarks/arena_hard/arena_hard_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-01-05T08:15:18,478 copying build/lib/evalscope/benchmarks/arena_hard/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-01-05T08:15:18,480 copying build/lib/evalscope/benchmarks/arena_hard/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arena_hard 2026-01-05T08:15:18,482 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mri_mcqa 2026-01-05T08:15:18,483 copying build/lib/evalscope/benchmarks/mri_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-01-05T08:15:18,485 copying build/lib/evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mri_mcqa 2026-01-05T08:15:18,487 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pope 2026-01-05T08:15:18,488 copying build/lib/evalscope/benchmarks/pope/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-01-05T08:15:18,490 copying build/lib/evalscope/benchmarks/pope/pope_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pope 2026-01-05T08:15:18,492 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zerobench 2026-01-05T08:15:18,493 copying build/lib/evalscope/benchmarks/zerobench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-01-05T08:15:18,495 copying 
build/lib/evalscope/benchmarks/zerobench/zerobench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zerobench 2026-01-05T08:15:18,497 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_mcq 2026-01-05T08:15:18,498 copying build/lib/evalscope/benchmarks/general_mcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-01-05T08:15:18,499 copying build/lib/evalscope/benchmarks/general_mcq/general_mcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_mcq 2026-01-05T08:15:18,502 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_redux 2026-01-05T08:15:18,503 copying build/lib/evalscope/benchmarks/mmlu_redux/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-01-05T08:15:18,504 copying build/lib/evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_redux 2026-01-05T08:15:18,507 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/vstar_bench 2026-01-05T08:15:18,508 copying build/lib/evalscope/benchmarks/vstar_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-01-05T08:15:18,510 copying build/lib/evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/vstar_bench 2026-01-05T08:15:18,512 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit 2026-01-05T08:15:18,513 copying build/lib/evalscope/benchmarks/image_edit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit 2026-01-05T08:15:18,515 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:18,516 copying build/lib/evalscope/benchmarks/image_edit/gedit/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:18,518 copying build/lib/evalscope/benchmarks/image_edit/gedit/utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:18,520 copying build/lib/evalscope/benchmarks/image_edit/gedit/gedit_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:18,522 copying build/lib/evalscope/benchmarks/image_edit/gedit/vie_prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/image_edit/gedit 2026-01-05T08:15:18,526 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/humaneval 2026-01-05T08:15:18,527 copying build/lib/evalscope/benchmarks/humaneval/humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-01-05T08:15:18,529 copying build/lib/evalscope/benchmarks/humaneval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-01-05T08:15:18,530 copying build/lib/evalscope/benchmarks/humaneval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/humaneval 2026-01-05T08:15:18,533 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/arc 2026-01-05T08:15:18,534 copying build/lib/evalscope/benchmarks/arc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-01-05T08:15:18,536 copying build/lib/evalscope/benchmarks/arc/arc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/arc 2026-01-05T08:15:18,538 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:18,539 copying build/lib/evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:18,541 copying build/lib/evalscope/benchmarks/olympiad_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:18,542 copying build/lib/evalscope/benchmarks/olympiad_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/olympiad_bench 2026-01-05T08:15:18,545 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/siqa 2026-01-05T08:15:18,546 copying build/lib/evalscope/benchmarks/siqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-01-05T08:15:18,548 copying build/lib/evalscope/benchmarks/siqa/siqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/siqa 2026-01-05T08:15:18,550 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/healthbench 2026-01-05T08:15:18,551 copying build/lib/evalscope/benchmarks/healthbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-01-05T08:15:18,553 copying build/lib/evalscope/benchmarks/healthbench/healthbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-01-05T08:15:18,555 copying build/lib/evalscope/benchmarks/healthbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/healthbench 2026-01-05T08:15:18,557 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifbench 2026-01-05T08:15:18,559 copying build/lib/evalscope/benchmarks/ifbench/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-01-05T08:15:18,561 copying build/lib/evalscope/benchmarks/ifbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-01-05T08:15:18,562 copying build/lib/evalscope/benchmarks/ifbench/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-01-05T08:15:18,565 copying build/lib/evalscope/benchmarks/ifbench/ifbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-01-05T08:15:18,567 copying build/lib/evalscope/benchmarks/ifbench/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-01-05T08:15:18,569 copying build/lib/evalscope/benchmarks/ifbench/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifbench 2026-01-05T08:15:18,573 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aa_lcr 2026-01-05T08:15:18,574 copying build/lib/evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-01-05T08:15:18,576 copying build/lib/evalscope/benchmarks/aa_lcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aa_lcr 2026-01-05T08:15:18,578 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/wmt 2026-01-05T08:15:18,579 copying build/lib/evalscope/benchmarks/wmt/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-01-05T08:15:18,581 copying build/lib/evalscope/benchmarks/wmt/wmt24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/wmt 2026-01-05T08:15:18,583 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/eq_bench 2026-01-05T08:15:18,584 copying build/lib/evalscope/benchmarks/eq_bench/eq_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-01-05T08:15:18,587 copying build/lib/evalscope/benchmarks/eq_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-01-05T08:15:18,588 copying build/lib/evalscope/benchmarks/eq_bench/answer_validation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/eq_bench 2026-01-05T08:15:18,591 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/frames 2026-01-05T08:15:18,592 copying build/lib/evalscope/benchmarks/frames/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-01-05T08:15:18,594 copying build/lib/evalscope/benchmarks/frames/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-01-05T08:15:18,595 copying build/lib/evalscope/benchmarks/frames/frames_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/frames 2026-01-05T08:15:18,598 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench 2026-01-05T08:15:18,600 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:18,601 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:18,602 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:18,605 copying build/lib/evalscope/benchmarks/tau_bench/tau_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau_bench 2026-01-05T08:15:18,607 copying build/lib/evalscope/benchmarks/tau_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench 2026-01-05T08:15:18,609 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:18,610 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:18,612 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:18,614 copying build/lib/evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tau_bench/tau2_bench 2026-01-05T08:15:18,616 copying build/lib/evalscope/benchmarks/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks 2026-01-05T08:15:18,619 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vmcq 2026-01-05T08:15:18,620 copying build/lib/evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-01-05T08:15:18,622 copying build/lib/evalscope/benchmarks/general_vmcq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vmcq 2026-01-05T08:15:18,624 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmlu 2026-01-05T08:15:18,625 copying build/lib/evalscope/benchmarks/cmmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-01-05T08:15:18,627 copying build/lib/evalscope/benchmarks/cmmlu/cmmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmlu 2026-01-05T08:15:18,629 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu 2026-01-05T08:15:18,630 copying build/lib/evalscope/benchmarks/mmlu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-01-05T08:15:18,632 copying build/lib/evalscope/benchmarks/mmlu/mmlu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu 2026-01-05T08:15:18,635 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vision 2026-01-05T08:15:18,636 copying build/lib/evalscope/benchmarks/math_vision/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-01-05T08:15:18,637 copying build/lib/evalscope/benchmarks/math_vision/math_vision_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vision 2026-01-05T08:15:18,640 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner 2026-01-05T08:15:18,641 copying build/lib/evalscope/benchmarks/ner/jnlpba_rare_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,643 copying build/lib/evalscope/benchmarks/ner/tweebank_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,645 copying build/lib/evalscope/benchmarks/ner/tweet_ner_7_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,646 copying build/lib/evalscope/benchmarks/ner/copious_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,648 copying build/lib/evalscope/benchmarks/ner/mit_restaurant_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 
2026-01-05T08:15:18,650 copying build/lib/evalscope/benchmarks/ner/bc2gm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,652 copying build/lib/evalscope/benchmarks/ner/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,653 copying build/lib/evalscope/benchmarks/ner/wnut2017_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,656 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,657 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/science.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,658 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,660 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/politics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,662 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/ai.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,664 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/music.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,666 copying build/lib/evalscope/benchmarks/ner/cross_ner_entities/literature.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner/cross_ner_entities 2026-01-05T08:15:18,667 copying build/lib/evalscope/benchmarks/ner/anat_em_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,669 copying build/lib/evalscope/benchmarks/ner/multi_nerd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,671 copying build/lib/evalscope/benchmarks/ner/bc5cdr_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,673 copying build/lib/evalscope/benchmarks/ner/fin_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,675 copying build/lib/evalscope/benchmarks/ner/ontonotes5_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,676 copying build/lib/evalscope/benchmarks/ner/ncbi_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,678 copying build/lib/evalscope/benchmarks/ner/jnlpba_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,680 copying build/lib/evalscope/benchmarks/ner/bc4chemd_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,682 copying build/lib/evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,684 copying build/lib/evalscope/benchmarks/ner/harvey_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,685 copying build/lib/evalscope/benchmarks/ner/mit_movie_trivia_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,687 copying build/lib/evalscope/benchmarks/ner/genia_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,689 copying build/lib/evalscope/benchmarks/ner/conll2003_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,691 copying build/lib/evalscope/benchmarks/ner/conllpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,692 copying build/lib/evalscope/benchmarks/ner/cross_ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ner 2026-01-05T08:15:18,695 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/minerva_math 2026-01-05T08:15:18,696 copying 
build/lib/evalscope/benchmarks/minerva_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-01-05T08:15:18,697 copying build/lib/evalscope/benchmarks/minerva_math/minerva_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/minerva_math 2026-01-05T08:15:18,700 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/biomix_qa 2026-01-05T08:15:18,701 copying build/lib/evalscope/benchmarks/biomix_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-01-05T08:15:18,703 copying build/lib/evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/biomix_qa 2026-01-05T08:15:18,705 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/science_qa 2026-01-05T08:15:18,706 copying build/lib/evalscope/benchmarks/science_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-01-05T08:15:18,707 copying build/lib/evalscope/benchmarks/science_qa/science_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/science_qa 2026-01-05T08:15:18,710 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/a_okvqa 2026-01-05T08:15:18,711 copying build/lib/evalscope/benchmarks/a_okvqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-01-05T08:15:18,712 copying build/lib/evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/a_okvqa 2026-01-05T08:15:18,715 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hellaswag 2026-01-05T08:15:18,716 copying build/lib/evalscope/benchmarks/hellaswag/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-01-05T08:15:18,718 copying build/lib/evalscope/benchmarks/hellaswag/hellaswag_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hellaswag 2026-01-05T08:15:18,720 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math 2026-01-05T08:15:18,721 copying build/lib/evalscope/benchmarks/poly_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-01-05T08:15:18,723 copying build/lib/evalscope/benchmarks/poly_math/poly_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math 2026-01-05T08:15:18,726 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/poly_math/utils 2026-01-05T08:15:18,728 copying build/lib/evalscope/benchmarks/poly_math/utils/instruction.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/poly_math/utils 2026-01-05T08:15:18,732 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_vqa 2026-01-05T08:15:18,733 copying build/lib/evalscope/benchmarks/simple_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-01-05T08:15:18,735 copying build/lib/evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_vqa 2026-01-05T08:15:18,737 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hallusion_bench 2026-01-05T08:15:18,738 copying build/lib/evalscope/benchmarks/hallusion_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-01-05T08:15:18,740 copying build/lib/evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hallusion_bench 2026-01-05T08:15:18,742 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/needle_haystack 2026-01-05T08:15:18,743 copying build/lib/evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-01-05T08:15:18,746 copying build/lib/evalscope/benchmarks/needle_haystack/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-01-05T08:15:18,748 copying 
build/lib/evalscope/benchmarks/needle_haystack/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/needle_haystack 2026-01-05T08:15:18,750 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/terminal_bench 2026-01-05T08:15:18,751 copying build/lib/evalscope/benchmarks/terminal_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-01-05T08:15:18,753 copying build/lib/evalscope/benchmarks/terminal_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-01-05T08:15:18,755 copying build/lib/evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/terminal_bench 2026-01-05T08:15:18,758 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/refcoco 2026-01-05T08:15:18,759 copying build/lib/evalscope/benchmarks/refcoco/evaluation_lib.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-01-05T08:15:18,761 copying build/lib/evalscope/benchmarks/refcoco/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-01-05T08:15:18,763 copying build/lib/evalscope/benchmarks/refcoco/refcoco_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-01-05T08:15:18,766 copying build/lib/evalscope/benchmarks/refcoco/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/refcoco 2026-01-05T08:15:18,768 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/piqa 2026-01-05T08:15:18,770 copying build/lib/evalscope/benchmarks/piqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-01-05T08:15:18,772 copying build/lib/evalscope/benchmarks/piqa/piqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/piqa 2026-01-05T08:15:18,774 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/truthful_qa 2026-01-05T08:15:18,776 copying build/lib/evalscope/benchmarks/truthful_qa/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-01-05T08:15:18,778 copying build/lib/evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/truthful_qa 2026-01-05T08:15:18,784 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ceval 2026-01-05T08:15:18,785 copying build/lib/evalscope/benchmarks/ceval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-01-05T08:15:18,787 copying build/lib/evalscope/benchmarks/ceval/ceval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ceval 2026-01-05T08:15:18,791 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chartqa 2026-01-05T08:15:18,792 copying build/lib/evalscope/benchmarks/chartqa/chartqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-01-05T08:15:18,795 copying build/lib/evalscope/benchmarks/chartqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-01-05T08:15:18,798 copying build/lib/evalscope/benchmarks/chartqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chartqa 2026-01-05T08:15:18,801 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/librispeech 2026-01-05T08:15:18,803 copying build/lib/evalscope/benchmarks/librispeech/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-01-05T08:15:18,805 copying build/lib/evalscope/benchmarks/librispeech/librispeech_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/librispeech 2026-01-05T08:15:18,808 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/med_mcqa 2026-01-05T08:15:18,811 copying build/lib/evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 2026-01-05T08:15:18,813 copying build/lib/evalscope/benchmarks/med_mcqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/med_mcqa 
2026-01-05T08:15:18,815 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_arena 2026-01-05T08:15:18,816 copying build/lib/evalscope/benchmarks/general_arena/general_arena_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-01-05T08:15:18,819 copying build/lib/evalscope/benchmarks/general_arena/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-01-05T08:15:18,821 copying build/lib/evalscope/benchmarks/general_arena/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_arena 2026-01-05T08:15:18,823 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu 2026-01-05T08:15:18,824 copying build/lib/evalscope/benchmarks/mmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-01-05T08:15:18,826 copying build/lib/evalscope/benchmarks/mmmu/mmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu 2026-01-05T08:15:18,829 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ifeval 2026-01-05T08:15:18,830 copying build/lib/evalscope/benchmarks/ifeval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-01-05T08:15:18,832 copying build/lib/evalscope/benchmarks/ifeval/instructions_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-01-05T08:15:18,835 copying build/lib/evalscope/benchmarks/ifeval/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-01-05T08:15:18,837 copying build/lib/evalscope/benchmarks/ifeval/ifeval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-01-05T08:15:18,839 copying build/lib/evalscope/benchmarks/ifeval/instructions_registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-01-05T08:15:18,842 copying build/lib/evalscope/benchmarks/ifeval/instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ifeval 2026-01-05T08:15:18,846 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:18,848 copying build/lib/evalscope/benchmarks/omnidoc_bench/end2end_eval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:18,851 copying build/lib/evalscope/benchmarks/omnidoc_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:18,853 copying build/lib/evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:18,855 copying build/lib/evalscope/benchmarks/omnidoc_bench/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:18,858 copying build/lib/evalscope/benchmarks/omnidoc_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omnidoc_bench 2026-01-05T08:15:18,864 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/blink 2026-01-05T08:15:18,866 copying build/lib/evalscope/benchmarks/blink/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-01-05T08:15:18,868 copying build/lib/evalscope/benchmarks/blink/blink_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/blink 2026-01-05T08:15:18,870 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/infovqa 2026-01-05T08:15:18,872 copying build/lib/evalscope/benchmarks/infovqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-01-05T08:15:18,874 copying build/lib/evalscope/benchmarks/infovqa/infovqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/infovqa 2026-01-05T08:15:18,876 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/real_world_qa 2026-01-05T08:15:18,878 copying build/lib/evalscope/benchmarks/real_world_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-01-05T08:15:18,880 copying 
build/lib/evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/real_world_qa 2026-01-05T08:15:18,883 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gpqa 2026-01-05T08:15:18,884 copying build/lib/evalscope/benchmarks/gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-01-05T08:15:18,886 copying build/lib/evalscope/benchmarks/gpqa/gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-01-05T08:15:18,888 copying build/lib/evalscope/benchmarks/gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gpqa 2026-01-05T08:15:18,892 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/fleurs 2026-01-05T08:15:18,893 copying build/lib/evalscope/benchmarks/fleurs/fleurs_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-01-05T08:15:18,896 copying build/lib/evalscope/benchmarks/fleurs/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/fleurs 2026-01-05T08:15:18,898 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:18,899 copying build/lib/evalscope/benchmarks/zebralogicbench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:18,901 copying build/lib/evalscope/benchmarks/zebralogicbench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:18,903 copying build/lib/evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/zebralogicbench 2026-01-05T08:15:18,906 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmmu 2026-01-05T08:15:18,907 copying build/lib/evalscope/benchmarks/cmmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-01-05T08:15:18,909 copying build/lib/evalscope/benchmarks/cmmmu/cmmmu_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-01-05T08:15:18,912 copying build/lib/evalscope/benchmarks/cmmmu/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmmu 2026-01-05T08:15:18,915 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ai2d 2026-01-05T08:15:18,916 copying build/lib/evalscope/benchmarks/ai2d/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-01-05T08:15:18,917 copying build/lib/evalscope/benchmarks/ai2d/ai2d_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ai2d 2026-01-05T08:15:18,920 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh 2026-01-05T08:15:18,921 copying build/lib/evalscope/benchmarks/bbh/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-01-05T08:15:18,923 copying build/lib/evalscope/benchmarks/bbh/bbh_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh 2026-01-05T08:15:18,926 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,927 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,929 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,931 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,932 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,934 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,936 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,938 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,940 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,942 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,944 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/object_counting.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,946 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,948 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,950 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,951 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,953 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,955 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,957 copying 
build/lib/evalscope/benchmarks/bbh/cot_prompts/snarks.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,959 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,961 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,963 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,965 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,967 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,970 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,972 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,974 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,976 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,978 copying build/lib/evalscope/benchmarks/bbh/cot_prompts/navigate.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bbh/cot_prompts 2026-01-05T08:15:18,981 creating 
build/bdist.linux-armv7l/wheel/evalscope/benchmarks/swe_bench 2026-01-05T08:15:18,982 copying build/lib/evalscope/benchmarks/swe_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-01-05T08:15:18,984 copying build/lib/evalscope/benchmarks/swe_bench/build_images.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-01-05T08:15:18,987 copying build/lib/evalscope/benchmarks/swe_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-01-05T08:15:18,989 copying build/lib/evalscope/benchmarks/swe_bench/swe_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/swe_bench 2026-01-05T08:15:18,992 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drop 2026-01-05T08:15:18,994 copying build/lib/evalscope/benchmarks/drop/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-01-05T08:15:18,996 copying build/lib/evalscope/benchmarks/drop/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-01-05T08:15:18,998 copying build/lib/evalscope/benchmarks/drop/drop_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drop 2026-01-05T08:15:19,001 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:19,003 copying build/lib/evalscope/benchmarks/openai_mrcr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:19,005 copying build/lib/evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:19,008 copying build/lib/evalscope/benchmarks/openai_mrcr/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/openai_mrcr 2026-01-05T08:15:19,010 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_500 2026-01-05T08:15:19,012 copying build/lib/evalscope/benchmarks/math_500/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-01-05T08:15:19,014 copying build/lib/evalscope/benchmarks/math_500/math_500_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_500 2026-01-05T08:15:19,016 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/race 2026-01-05T08:15:19,018 copying build/lib/evalscope/benchmarks/race/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-01-05T08:15:19,020 copying build/lib/evalscope/benchmarks/race/race_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/race 2026-01-05T08:15:19,023 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_qa 2026-01-05T08:15:19,024 copying build/lib/evalscope/benchmarks/math_qa/math_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-01-05T08:15:19,026 copying build/lib/evalscope/benchmarks/math_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_qa 2026-01-05T08:15:19,029 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/chinese_simple_qa 2026-01-05T08:15:19,030 copying build/lib/evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-01-05T08:15:19,033 copying build/lib/evalscope/benchmarks/chinese_simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/chinese_simple_qa 2026-01-05T08:15:19,035 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/visu_logic 2026-01-05T08:15:19,037 copying build/lib/evalscope/benchmarks/visu_logic/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-01-05T08:15:19,039 copying build/lib/evalscope/benchmarks/visu_logic/visu_logic_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/visu_logic 2026-01-05T08:15:19,041 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/musr 2026-01-05T08:15:19,043 copying 
build/lib/evalscope/benchmarks/musr/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-01-05T08:15:19,045 copying build/lib/evalscope/benchmarks/musr/musr_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/musr 2026-01-05T08:15:19,047 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/sciq 2026-01-05T08:15:19,049 copying build/lib/evalscope/benchmarks/sciq/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-01-05T08:15:19,051 copying build/lib/evalscope/benchmarks/sciq/sciq_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/sciq 2026-01-05T08:15:19,054 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/trivia_qa 2026-01-05T08:15:19,055 copying build/lib/evalscope/benchmarks/trivia_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-01-05T08:15:19,057 copying build/lib/evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-01-05T08:15:19,060 copying build/lib/evalscope/benchmarks/trivia_qa/samples.jsonl -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/trivia_qa 2026-01-05T08:15:19,062 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/commonsense_qa 2026-01-05T08:15:19,064 copying build/lib/evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-01-05T08:15:19,066 copying build/lib/evalscope/benchmarks/commonsense_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/commonsense_qa 2026-01-05T08:15:19,069 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_fc 2026-01-05T08:15:19,070 copying build/lib/evalscope/benchmarks/general_fc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-01-05T08:15:19,072 copying build/lib/evalscope/benchmarks/general_fc/general_fc_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_fc 2026-01-05T08:15:19,075 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/logi_qa 2026-01-05T08:15:19,076 copying build/lib/evalscope/benchmarks/logi_qa/logi_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-01-05T08:15:19,079 copying build/lib/evalscope/benchmarks/logi_qa/__int__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/logi_qa 2026-01-05T08:15:19,081 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/pumed_qa 2026-01-05T08:15:19,082 copying build/lib/evalscope/benchmarks/pumed_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-01-05T08:15:19,084 copying build/lib/evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/pumed_qa 2026-01-05T08:15:19,087 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mbpp 2026-01-05T08:15:19,088 copying build/lib/evalscope/benchmarks/mbpp/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-01-05T08:15:19,090 copying build/lib/evalscope/benchmarks/mbpp/mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mbpp 2026-01-05T08:15:19,093 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/simple_qa 2026-01-05T08:15:19,095 copying build/lib/evalscope/benchmarks/simple_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-01-05T08:15:19,097 copying build/lib/evalscope/benchmarks/simple_qa/simple_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/simple_qa 2026-01-05T08:15:19,100 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/micro_vqa 2026-01-05T08:15:19,102 copying build/lib/evalscope/benchmarks/micro_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-01-05T08:15:19,105 copying build/lib/evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/micro_vqa 2026-01-05T08:15:19,108 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/aime 2026-01-05T08:15:19,109 copying build/lib/evalscope/benchmarks/aime/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-01-05T08:15:19,111 copying build/lib/evalscope/benchmarks/aime/grader.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-01-05T08:15:19,113 copying build/lib/evalscope/benchmarks/aime/aime24_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-01-05T08:15:19,116 copying build/lib/evalscope/benchmarks/aime/aime25_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-01-05T08:15:19,118 copying build/lib/evalscope/benchmarks/aime/math_normalize.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/aime 2026-01-05T08:15:19,121 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_bench 2026-01-05T08:15:19,122 copying build/lib/evalscope/benchmarks/mm_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-01-05T08:15:19,124 copying build/lib/evalscope/benchmarks/mm_bench/mm_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_bench 2026-01-05T08:15:19,127 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/winogrande 2026-01-05T08:15:19,128 copying build/lib/evalscope/benchmarks/winogrande/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-01-05T08:15:19,130 copying build/lib/evalscope/benchmarks/winogrande/winogrande_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/winogrande 2026-01-05T08:15:19,133 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/docmath 2026-01-05T08:15:19,135 copying build/lib/evalscope/benchmarks/docmath/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-01-05T08:15:19,137 copying 
build/lib/evalscope/benchmarks/docmath/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-01-05T08:15:19,139 copying build/lib/evalscope/benchmarks/docmath/docmath_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/docmath 2026-01-05T08:15:19,142 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_qa 2026-01-05T08:15:19,144 copying build/lib/evalscope/benchmarks/general_qa/general_qa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-01-05T08:15:19,146 copying build/lib/evalscope/benchmarks/general_qa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_qa 2026-01-05T08:15:19,148 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/text2image 2026-01-05T08:15:19,150 copying build/lib/evalscope/benchmarks/text2image/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-01-05T08:15:19,152 copying build/lib/evalscope/benchmarks/text2image/general_t2i_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-01-05T08:15:19,154 copying build/lib/evalscope/benchmarks/text2image/evalmuse_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-01-05T08:15:19,156 copying build/lib/evalscope/benchmarks/text2image/tifa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-01-05T08:15:19,158 copying build/lib/evalscope/benchmarks/text2image/genai_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-01-05T08:15:19,160 copying build/lib/evalscope/benchmarks/text2image/hpdv2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/text2image 2026-01-05T08:15:19,163 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/halu_eval 2026-01-05T08:15:19,164 copying build/lib/evalscope/benchmarks/halu_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 
2026-01-05T08:15:19,166 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_instructions.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-01-05T08:15:19,168 copying build/lib/evalscope/benchmarks/halu_eval/halu_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/halu_eval 2026-01-05T08:15:19,171 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/qasc 2026-01-05T08:15:19,173 copying build/lib/evalscope/benchmarks/qasc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-01-05T08:15:19,174 copying build/lib/evalscope/benchmarks/qasc/qasc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/qasc 2026-01-05T08:15:19,177 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multipl_e 2026-01-05T08:15:19,178 copying build/lib/evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-01-05T08:15:19,181 copying build/lib/evalscope/benchmarks/multipl_e/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-01-05T08:15:19,183 copying build/lib/evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-01-05T08:15:19,185 copying build/lib/evalscope/benchmarks/multipl_e/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multipl_e 2026-01-05T08:15:19,188 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/data_collection 2026-01-05T08:15:19,189 copying build/lib/evalscope/benchmarks/data_collection/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-01-05T08:15:19,191 copying build/lib/evalscope/benchmarks/data_collection/data_collection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/data_collection 2026-01-05T08:15:19,194 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/seed_bench_2_plus 
2026-01-05T08:15:19,195 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-01-05T08:15:19,197 copying build/lib/evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/seed_bench_2_plus 2026-01-05T08:15:19,199 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/iquiz 2026-01-05T08:15:19,200 copying build/lib/evalscope/benchmarks/iquiz/iquiz_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-01-05T08:15:19,202 copying build/lib/evalscope/benchmarks/iquiz/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/iquiz 2026-01-05T08:15:19,205 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/omni_bench 2026-01-05T08:15:19,206 copying build/lib/evalscope/benchmarks/omni_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-01-05T08:15:19,207 copying build/lib/evalscope/benchmarks/omni_bench/omni_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/omni_bench 2026-01-05T08:15:19,210 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/alpaca_eval 2026-01-05T08:15:19,211 copying build/lib/evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-01-05T08:15:19,213 copying build/lib/evalscope/benchmarks/alpaca_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/alpaca_eval 2026-01-05T08:15:19,215 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/coin_flip 2026-01-05T08:15:19,216 copying build/lib/evalscope/benchmarks/coin_flip/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 2026-01-05T08:15:19,218 copying build/lib/evalscope/benchmarks/coin_flip/coin_flip_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/coin_flip 
2026-01-05T08:15:19,221 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode 2026-01-05T08:15:19,222 copying build/lib/evalscope/benchmarks/scicode/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-01-05T08:15:19,223 copying build/lib/evalscope/benchmarks/scicode/prompt_templates.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-01-05T08:15:19,226 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/scicode/docker 2026-01-05T08:15:19,227 copying build/lib/evalscope/benchmarks/scicode/docker/docker_requirements.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-01-05T08:15:19,229 copying build/lib/evalscope/benchmarks/scicode/docker/test_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-01-05T08:15:19,231 copying build/lib/evalscope/benchmarks/scicode/docker/process_data.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-01-05T08:15:19,233 copying build/lib/evalscope/benchmarks/scicode/docker/Dockerfile -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode/docker 2026-01-05T08:15:19,235 copying build/lib/evalscope/benchmarks/scicode/util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-01-05T08:15:19,236 copying build/lib/evalscope/benchmarks/scicode/scicode_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/scicode 2026-01-05T08:15:19,239 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/tool_bench 2026-01-05T08:15:19,241 copying build/lib/evalscope/benchmarks/tool_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-01-05T08:15:19,242 copying build/lib/evalscope/benchmarks/tool_bench/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-01-05T08:15:19,244 copying build/lib/evalscope/benchmarks/tool_bench/tool_bench_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/tool_bench 2026-01-05T08:15:19,247 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mm_star 2026-01-05T08:15:19,248 copying build/lib/evalscope/benchmarks/mm_star/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-01-05T08:15:19,250 copying build/lib/evalscope/benchmarks/mm_star/mm_star_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mm_star 2026-01-05T08:15:19,252 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/competition_math 2026-01-05T08:15:19,253 copying build/lib/evalscope/benchmarks/competition_math/competition_math_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-01-05T08:15:19,255 copying build/lib/evalscope/benchmarks/competition_math/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/competition_math 2026-01-05T08:15:19,258 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/multi_if 2026-01-05T08:15:19,259 copying build/lib/evalscope/benchmarks/multi_if/multi_if_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-01-05T08:15:19,261 copying build/lib/evalscope/benchmarks/multi_if/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-01-05T08:15:19,263 copying build/lib/evalscope/benchmarks/multi_if/ifeval.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-01-05T08:15:19,267 copying build/lib/evalscope/benchmarks/multi_if/metrics.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/multi_if 2026-01-05T08:15:19,270 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/amc 2026-01-05T08:15:19,271 copying build/lib/evalscope/benchmarks/amc/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 2026-01-05T08:15:19,273 copying build/lib/evalscope/benchmarks/amc/amc_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/amc 
2026-01-05T08:15:19,275 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/general_vqa 2026-01-05T08:15:19,277 copying build/lib/evalscope/benchmarks/general_vqa/general_vqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-01-05T08:15:19,279 copying build/lib/evalscope/benchmarks/general_vqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/general_vqa 2026-01-05T08:15:19,281 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/drivelology 2026-01-05T08:15:19,283 copying build/lib/evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-01-05T08:15:19,285 copying build/lib/evalscope/benchmarks/drivelology/drivelology_writing_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-01-05T08:15:19,288 copying build/lib/evalscope/benchmarks/drivelology/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-01-05T08:15:19,289 copying build/lib/evalscope/benchmarks/drivelology/drivelology_binary_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-01-05T08:15:19,291 copying build/lib/evalscope/benchmarks/drivelology/drivelology_selection_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/drivelology 2026-01-05T08:15:19,294 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/math_vista 2026-01-05T08:15:19,295 copying build/lib/evalscope/benchmarks/math_vista/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-01-05T08:15:19,296 copying build/lib/evalscope/benchmarks/math_vista/math_vista_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/math_vista 2026-01-05T08:15:19,299 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench 2026-01-05T08:15:19,301 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2 
2026-01-05T08:15:19,302 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:19,304 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:19,306 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:19,308 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:19,311 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval 2026-01-05T08:15:19,314 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,316 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,319 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,322 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,325 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,327 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,330 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,332 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,335 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench_v2 2026-01-05T08:15:19,337 copying build/lib/evalscope/benchmarks/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench 2026-01-05T08:15:19,340 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/ocr_bench/ocr_bench 2026-01-05T08:15:19,341 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-01-05T08:15:19,343 copying build/lib/evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/ocr_bench/ocr_bench 2026-01-05T08:15:19,346 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/torgo 2026-01-05T08:15:19,347 copying build/lib/evalscope/benchmarks/torgo/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-01-05T08:15:19,349 copying build/lib/evalscope/benchmarks/torgo/torgo_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/torgo 2026-01-05T08:15:19,352 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/music_trivia 2026-01-05T08:15:19,354 copying build/lib/evalscope/benchmarks/music_trivia/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-01-05T08:15:19,355 copying build/lib/evalscope/benchmarks/music_trivia/music_trivia_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/music_trivia 2026-01-05T08:15:19,358 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmlu_pro 2026-01-05T08:15:19,359 copying build/lib/evalscope/benchmarks/mmlu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-01-05T08:15:19,361 copying build/lib/evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmlu_pro 2026-01-05T08:15:19,364 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mgsm 2026-01-05T08:15:19,365 copying build/lib/evalscope/benchmarks/mgsm/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-01-05T08:15:19,367 copying build/lib/evalscope/benchmarks/mgsm/mgsm_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mgsm 2026-01-05T08:15:19,370 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k 2026-01-05T08:15:19,371 copying build/lib/evalscope/benchmarks/gsm8k/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-01-05T08:15:19,374 copying build/lib/evalscope/benchmarks/gsm8k/gsm8k_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k 2026-01-05T08:15:19,376 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/mmmu_pro 2026-01-05T08:15:19,378 copying build/lib/evalscope/benchmarks/mmmu_pro/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-01-05T08:15:19,380 copying build/lib/evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/mmmu_pro 2026-01-05T08:15:19,383 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/gsm8k_v 2026-01-05T08:15:19,385 copying build/lib/evalscope/benchmarks/gsm8k_v/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-01-05T08:15:19,387 copying build/lib/evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/gsm8k_v 2026-01-05T08:15:19,390 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/hle 2026-01-05T08:15:19,391 copying build/lib/evalscope/benchmarks/hle/hle_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-01-05T08:15:19,394 copying build/lib/evalscope/benchmarks/hle/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/hle 2026-01-05T08:15:19,397 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/super_gpqa 2026-01-05T08:15:19,398 copying build/lib/evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-01-05T08:15:19,400 copying build/lib/evalscope/benchmarks/super_gpqa/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-01-05T08:15:19,402 copying build/lib/evalscope/benchmarks/super_gpqa/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-01-05T08:15:19,404 copying build/lib/evalscope/benchmarks/super_gpqa/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/super_gpqa 2026-01-05T08:15:19,407 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/cmmu 2026-01-05T08:15:19,409 copying build/lib/evalscope/benchmarks/cmmu/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-01-05T08:15:19,411 copying build/lib/evalscope/benchmarks/cmmu/cmmu_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-01-05T08:15:19,413 copying build/lib/evalscope/benchmarks/cmmu/prompt.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/cmmu 2026-01-05T08:15:19,416 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/maritime_bench 2026-01-05T08:15:19,418 copying build/lib/evalscope/benchmarks/maritime_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-01-05T08:15:19,421 copying 
build/lib/evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/maritime_bench 2026-01-05T08:15:19,424 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,426 copying build/lib/evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,430 copying build/lib/evalscope/benchmarks/live_code_bench/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,432 copying build/lib/evalscope/benchmarks/live_code_bench/prompts.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,436 copying build/lib/evalscope/benchmarks/live_code_bench/evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,439 copying build/lib/evalscope/benchmarks/live_code_bench/testing_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,443 copying build/lib/evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,447 copying build/lib/evalscope/benchmarks/live_code_bench/extract_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,450 copying build/lib/evalscope/benchmarks/live_code_bench/pass_k_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,453 copying build/lib/evalscope/benchmarks/live_code_bench/load_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/live_code_bench 2026-01-05T08:15:19,456 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl 2026-01-05T08:15:19,459 copying build/lib/evalscope/benchmarks/bfcl/__init__.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl 2026-01-05T08:15:19,461 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:19,463 copying build/lib/evalscope/benchmarks/bfcl/v4/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:19,465 copying build/lib/evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:19,468 copying build/lib/evalscope/benchmarks/bfcl/v4/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v4 2026-01-05T08:15:19,472 creating build/bdist.linux-armv7l/wheel/evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:19,473 copying build/lib/evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:19,477 copying build/lib/evalscope/benchmarks/bfcl/v3/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:19,479 copying build/lib/evalscope/benchmarks/bfcl/v3/generation.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:19,482 copying build/lib/evalscope/benchmarks/bfcl/v3/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/benchmarks/bfcl/v3 2026-01-05T08:15:19,485 creating build/bdist.linux-armv7l/wheel/evalscope/utils 2026-01-05T08:15:19,487 copying build/lib/evalscope/utils/ner.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,490 copying build/lib/evalscope/utils/deprecation_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,492 copying build/lib/evalscope/utils/import_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,495 copying build/lib/evalscope/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,497 copying build/lib/evalscope/utils/code_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 
2026-01-05T08:15:19,501 copying build/lib/evalscope/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,504 copying build/lib/evalscope/utils/json_schema.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,507 copying build/lib/evalscope/utils/multi_choices.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,510 copying build/lib/evalscope/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,513 copying build/lib/evalscope/utils/logger.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,516 copying build/lib/evalscope/utils/chat_service.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,519 copying build/lib/evalscope/utils/function_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,522 copying build/lib/evalscope/utils/tqdm_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,524 copying build/lib/evalscope/utils/argument_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,526 copying build/lib/evalscope/utils/url_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,529 copying build/lib/evalscope/utils/io_utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/utils 2026-01-05T08:15:19,532 creating build/bdist.linux-armv7l/wheel/evalscope/perf 2026-01-05T08:15:19,534 copying build/lib/evalscope/perf/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-01-05T08:15:19,536 creating build/bdist.linux-armv7l/wheel/evalscope/perf/sla 2026-01-05T08:15:19,538 copying build/lib/evalscope/perf/sla/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-01-05T08:15:19,540 copying build/lib/evalscope/perf/sla/sla_criterion.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-01-05T08:15:19,542 copying build/lib/evalscope/perf/sla/sla_run.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/perf/sla 2026-01-05T08:15:19,545 copying build/lib/evalscope/perf/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-01-05T08:15:19,548 copying build/lib/evalscope/perf/main.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-01-05T08:15:19,550 copying build/lib/evalscope/perf/http_client.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-01-05T08:15:19,553 copying build/lib/evalscope/perf/arguments.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf 2026-01-05T08:15:19,556 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin 2026-01-05T08:15:19,558 copying build/lib/evalscope/perf/plugin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-01-05T08:15:19,561 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/datasets 2026-01-05T08:15:19,562 copying build/lib/evalscope/perf/plugin/datasets/longalpaca.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,564 copying build/lib/evalscope/perf/plugin/datasets/random_vl_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,566 copying build/lib/evalscope/perf/plugin/datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,569 copying build/lib/evalscope/perf/plugin/datasets/line_by_line.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,571 copying build/lib/evalscope/perf/plugin/datasets/custom.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,573 copying build/lib/evalscope/perf/plugin/datasets/kontext_bench.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,575 copying build/lib/evalscope/perf/plugin/datasets/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,577 copying 
build/lib/evalscope/perf/plugin/datasets/speed_benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,579 copying build/lib/evalscope/perf/plugin/datasets/flickr8k.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,582 copying build/lib/evalscope/perf/plugin/datasets/openqa.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,584 copying build/lib/evalscope/perf/plugin/datasets/random_dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/datasets 2026-01-05T08:15:19,587 copying build/lib/evalscope/perf/plugin/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin 2026-01-05T08:15:19,590 creating build/bdist.linux-armv7l/wheel/evalscope/perf/plugin/api 2026-01-05T08:15:19,591 copying build/lib/evalscope/perf/plugin/api/default_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-01-05T08:15:19,594 copying build/lib/evalscope/perf/plugin/api/dashscope_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-01-05T08:15:19,597 copying build/lib/evalscope/perf/plugin/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-01-05T08:15:19,599 copying build/lib/evalscope/perf/plugin/api/custom_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-01-05T08:15:19,602 copying build/lib/evalscope/perf/plugin/api/base.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-01-05T08:15:19,604 copying build/lib/evalscope/perf/plugin/api/openai_api.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/plugin/api 2026-01-05T08:15:19,608 creating build/bdist.linux-armv7l/wheel/evalscope/perf/utils 2026-01-05T08:15:19,609 copying build/lib/evalscope/perf/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,611 copying build/lib/evalscope/perf/utils/log_utils.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,614 copying build/lib/evalscope/perf/utils/db_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,617 copying build/lib/evalscope/perf/utils/benchmark_util.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,619 copying build/lib/evalscope/perf/utils/handler.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,621 copying build/lib/evalscope/perf/utils/rich_display.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,623 copying build/lib/evalscope/perf/utils/analysis_result.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,625 copying build/lib/evalscope/perf/utils/local_server.py -> build/bdist.linux-armv7l/wheel/./evalscope/perf/utils 2026-01-05T08:15:19,627 creating build/bdist.linux-armv7l/wheel/evalscope/api 2026-01-05T08:15:19,629 creating build/bdist.linux-armv7l/wheel/evalscope/api/evaluator 2026-01-05T08:15:19,630 copying build/lib/evalscope/api/evaluator/cache.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-01-05T08:15:19,632 copying build/lib/evalscope/api/evaluator/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-01-05T08:15:19,634 copying build/lib/evalscope/api/evaluator/state.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-01-05T08:15:19,636 copying build/lib/evalscope/api/evaluator/evaluator.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/evaluator 2026-01-05T08:15:19,639 copying build/lib/evalscope/api/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-01-05T08:15:19,641 creating build/bdist.linux-armv7l/wheel/evalscope/api/messages 2026-01-05T08:15:19,642 copying build/lib/evalscope/api/messages/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-01-05T08:15:19,644 copying build/lib/evalscope/api/messages/content.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-01-05T08:15:19,646 copying build/lib/evalscope/api/messages/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-01-05T08:15:19,648 copying build/lib/evalscope/api/messages/chat_message.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/messages 2026-01-05T08:15:19,651 creating build/bdist.linux-armv7l/wheel/evalscope/api/model 2026-01-05T08:15:19,652 copying build/lib/evalscope/api/model/generate_config.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-01-05T08:15:19,654 copying build/lib/evalscope/api/model/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-01-05T08:15:19,656 copying build/lib/evalscope/api/model/lazy_model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-01-05T08:15:19,658 copying build/lib/evalscope/api/model/model_output.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-01-05T08:15:19,661 copying build/lib/evalscope/api/model/model.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/model 2026-01-05T08:15:19,664 creating build/bdist.linux-armv7l/wheel/evalscope/api/dataset 2026-01-05T08:15:19,665 copying build/lib/evalscope/api/dataset/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-01-05T08:15:19,667 copying build/lib/evalscope/api/dataset/loader.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-01-05T08:15:19,669 copying build/lib/evalscope/api/dataset/dataset.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-01-05T08:15:19,672 copying build/lib/evalscope/api/dataset/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/dataset 2026-01-05T08:15:19,674 copying build/lib/evalscope/api/registry.py -> build/bdist.linux-armv7l/wheel/./evalscope/api 2026-01-05T08:15:19,677 creating build/bdist.linux-armv7l/wheel/evalscope/api/metric 2026-01-05T08:15:19,678 copying build/lib/evalscope/api/metric/scorer.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-01-05T08:15:19,680 copying build/lib/evalscope/api/metric/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-01-05T08:15:19,682 copying build/lib/evalscope/api/metric/metric.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/metric 2026-01-05T08:15:19,685 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark 2026-01-05T08:15:19,686 copying build/lib/evalscope/api/benchmark/meta.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-01-05T08:15:19,688 copying build/lib/evalscope/api/benchmark/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-01-05T08:15:19,690 copying build/lib/evalscope/api/benchmark/benchmark.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark 2026-01-05T08:15:19,693 creating build/bdist.linux-armv7l/wheel/evalscope/api/benchmark/adapters 2026-01-05T08:15:19,694 copying build/lib/evalscope/api/benchmark/adapters/multi_choice_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,696 copying build/lib/evalscope/api/benchmark/adapters/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,698 copying build/lib/evalscope/api/benchmark/adapters/vision_language_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,700 copying build/lib/evalscope/api/benchmark/adapters/agent_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,702 copying build/lib/evalscope/api/benchmark/adapters/text2image_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,704 copying build/lib/evalscope/api/benchmark/adapters/ner_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,706 copying build/lib/evalscope/api/benchmark/adapters/image_edit_adapter.py -> 
build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,708 copying build/lib/evalscope/api/benchmark/adapters/default_data_adapter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/benchmark/adapters 2026-01-05T08:15:19,712 creating build/bdist.linux-armv7l/wheel/evalscope/api/filter 2026-01-05T08:15:19,713 copying build/lib/evalscope/api/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-01-05T08:15:19,715 copying build/lib/evalscope/api/filter/filter.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/filter 2026-01-05T08:15:19,717 creating build/bdist.linux-armv7l/wheel/evalscope/api/tool 2026-01-05T08:15:19,718 copying build/lib/evalscope/api/tool/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-01-05T08:15:19,720 copying build/lib/evalscope/api/tool/tool_call.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-01-05T08:15:19,722 copying build/lib/evalscope/api/tool/tool_info.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-01-05T08:15:19,725 copying build/lib/evalscope/api/tool/utils.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/tool 2026-01-05T08:15:19,727 creating build/bdist.linux-armv7l/wheel/evalscope/api/mixin 2026-01-05T08:15:19,728 copying build/lib/evalscope/api/mixin/__init__.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-01-05T08:15:19,731 copying build/lib/evalscope/api/mixin/sandbox_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-01-05T08:15:19,734 copying build/lib/evalscope/api/mixin/llm_judge_mixin.py -> build/bdist.linux-armv7l/wheel/./evalscope/api/mixin 2026-01-05T08:15:19,736 running install_egg_info 2026-01-05T08:15:19,742 Copying evalscope.egg-info to build/bdist.linux-armv7l/wheel/./evalscope-1.4.1-py3.11.egg-info 2026-01-05T08:15:19,755 running install_scripts 2026-01-05T08:15:19,769 creating build/bdist.linux-armv7l/wheel/evalscope-1.4.1.dist-info/WHEEL 2026-01-05T08:15:19,772 
creating '/tmp/pip-wheel-aqtrynwz/.tmp-s7154hu1/evalscope-1.4.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-01-05T08:15:19,774 adding 'evalscope/__init__.py' 2026-01-05T08:15:19,776 adding 'evalscope/arguments.py' 2026-01-05T08:15:19,778 adding 'evalscope/config.py' 2026-01-05T08:15:19,780 adding 'evalscope/constants.py' 2026-01-05T08:15:19,782 adding 'evalscope/run.py' 2026-01-05T08:15:19,783 adding 'evalscope/version.py' 2026-01-05T08:15:19,785 adding 'evalscope/api/__init__.py' 2026-01-05T08:15:19,786 adding 'evalscope/api/registry.py' 2026-01-05T08:15:19,788 adding 'evalscope/api/benchmark/__init__.py' 2026-01-05T08:15:19,790 adding 'evalscope/api/benchmark/benchmark.py' 2026-01-05T08:15:19,791 adding 'evalscope/api/benchmark/meta.py' 2026-01-05T08:15:19,793 adding 'evalscope/api/benchmark/adapters/__init__.py' 2026-01-05T08:15:19,794 adding 'evalscope/api/benchmark/adapters/agent_adapter.py' 2026-01-05T08:15:19,798 adding 'evalscope/api/benchmark/adapters/default_data_adapter.py' 2026-01-05T08:15:19,800 adding 'evalscope/api/benchmark/adapters/image_edit_adapter.py' 2026-01-05T08:15:19,801 adding 'evalscope/api/benchmark/adapters/multi_choice_adapter.py' 2026-01-05T08:15:19,802 adding 'evalscope/api/benchmark/adapters/ner_adapter.py' 2026-01-05T08:15:19,804 adding 'evalscope/api/benchmark/adapters/text2image_adapter.py' 2026-01-05T08:15:19,805 adding 'evalscope/api/benchmark/adapters/vision_language_adapter.py' 2026-01-05T08:15:19,807 adding 'evalscope/api/dataset/__init__.py' 2026-01-05T08:15:19,809 adding 'evalscope/api/dataset/dataset.py' 2026-01-05T08:15:19,811 adding 'evalscope/api/dataset/loader.py' 2026-01-05T08:15:19,812 adding 'evalscope/api/dataset/utils.py' 2026-01-05T08:15:19,814 adding 'evalscope/api/evaluator/__init__.py' 2026-01-05T08:15:19,816 adding 'evalscope/api/evaluator/cache.py' 2026-01-05T08:15:19,817 adding 'evalscope/api/evaluator/evaluator.py' 2026-01-05T08:15:19,819 adding 
'evalscope/api/evaluator/state.py' 2026-01-05T08:15:19,820 adding 'evalscope/api/filter/__init__.py' 2026-01-05T08:15:19,822 adding 'evalscope/api/filter/filter.py' 2026-01-05T08:15:19,823 adding 'evalscope/api/messages/__init__.py' 2026-01-05T08:15:19,825 adding 'evalscope/api/messages/chat_message.py' 2026-01-05T08:15:19,827 adding 'evalscope/api/messages/content.py' 2026-01-05T08:15:19,828 adding 'evalscope/api/messages/utils.py' 2026-01-05T08:15:19,830 adding 'evalscope/api/metric/__init__.py' 2026-01-05T08:15:19,831 adding 'evalscope/api/metric/metric.py' 2026-01-05T08:15:19,832 adding 'evalscope/api/metric/scorer.py' 2026-01-05T08:15:19,834 adding 'evalscope/api/mixin/__init__.py' 2026-01-05T08:15:19,836 adding 'evalscope/api/mixin/llm_judge_mixin.py' 2026-01-05T08:15:19,837 adding 'evalscope/api/mixin/sandbox_mixin.py' 2026-01-05T08:15:19,839 adding 'evalscope/api/model/__init__.py' 2026-01-05T08:15:19,841 adding 'evalscope/api/model/generate_config.py' 2026-01-05T08:15:19,842 adding 'evalscope/api/model/lazy_model.py' 2026-01-05T08:15:19,844 adding 'evalscope/api/model/model.py' 2026-01-05T08:15:19,846 adding 'evalscope/api/model/model_output.py' 2026-01-05T08:15:19,848 adding 'evalscope/api/tool/__init__.py' 2026-01-05T08:15:19,849 adding 'evalscope/api/tool/tool_call.py' 2026-01-05T08:15:19,851 adding 'evalscope/api/tool/tool_info.py' 2026-01-05T08:15:19,852 adding 'evalscope/api/tool/utils.py' 2026-01-05T08:15:19,854 adding 'evalscope/app/__init__.py' 2026-01-05T08:15:19,855 adding 'evalscope/app/app.py' 2026-01-05T08:15:19,856 adding 'evalscope/app/arguments.py' 2026-01-05T08:15:19,857 adding 'evalscope/app/constants.py' 2026-01-05T08:15:19,859 adding 'evalscope/app/ui/__init__.py' 2026-01-05T08:15:19,861 adding 'evalscope/app/ui/app_ui.py' 2026-01-05T08:15:19,863 adding 'evalscope/app/ui/multi_model.py' 2026-01-05T08:15:19,864 adding 'evalscope/app/ui/sidebar.py' 2026-01-05T08:15:19,866 adding 'evalscope/app/ui/single_model.py' 2026-01-05T08:15:19,867 
adding 'evalscope/app/ui/visualization.py' 2026-01-05T08:15:19,869 adding 'evalscope/app/utils/data_utils.py' 2026-01-05T08:15:19,871 adding 'evalscope/app/utils/env_utils.py' 2026-01-05T08:15:19,872 adding 'evalscope/app/utils/localization.py' 2026-01-05T08:15:19,874 adding 'evalscope/app/utils/text_utils.py' 2026-01-05T08:15:19,875 adding 'evalscope/app/utils/visualization.py' 2026-01-05T08:15:19,876 adding 'evalscope/backend/__init__.py' 2026-01-05T08:15:19,877 adding 'evalscope/backend/base.py' 2026-01-05T08:15:19,879 adding 'evalscope/backend/opencompass/__init__.py' 2026-01-05T08:15:19,880 adding 'evalscope/backend/opencompass/api_meta_template.py' 2026-01-05T08:15:19,882 adding 'evalscope/backend/opencompass/backend_manager.py' 2026-01-05T08:15:19,883 adding 'evalscope/backend/opencompass/tasks/__init__.py' 2026-01-05T08:15:19,885 adding 'evalscope/backend/opencompass/tasks/eval_api.py' 2026-01-05T08:15:19,886 adding 'evalscope/backend/opencompass/tasks/eval_datasets.py' 2026-01-05T08:15:19,888 adding 'evalscope/backend/rag_eval/__init__.py' 2026-01-05T08:15:19,889 adding 'evalscope/backend/rag_eval/backend_manager.py' 2026-01-05T08:15:19,891 adding 'evalscope/backend/rag_eval/clip_benchmark/__init__.py' 2026-01-05T08:15:19,892 adding 'evalscope/backend/rag_eval/clip_benchmark/arguments.py' 2026-01-05T08:15:19,894 adding 'evalscope/backend/rag_eval/clip_benchmark/dataset_builder.py' 2026-01-05T08:15:19,895 adding 'evalscope/backend/rag_eval/clip_benchmark/task_template.py' 2026-01-05T08:15:19,897 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/__init__.py' 2026-01-05T08:15:19,898 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/image_caption.py' 2026-01-05T08:15:19,899 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_classification.py' 2026-01-05T08:15:19,901 adding 'evalscope/backend/rag_eval/clip_benchmark/tasks/zeroshot_retrieval.py' 2026-01-05T08:15:19,903 adding 
'evalscope/backend/rag_eval/clip_benchmark/utils/webdataset_convert.py' 2026-01-05T08:15:19,904 adding 'evalscope/backend/rag_eval/clip_benchmark/utils/webdatasets.txt' 2026-01-05T08:15:19,906 adding 'evalscope/backend/rag_eval/cmteb/__init__.py' 2026-01-05T08:15:19,907 adding 'evalscope/backend/rag_eval/cmteb/arguments.py' 2026-01-05T08:15:19,909 adding 'evalscope/backend/rag_eval/cmteb/base.py' 2026-01-05T08:15:19,910 adding 'evalscope/backend/rag_eval/cmteb/task_template.py' 2026-01-05T08:15:19,913 adding 'evalscope/backend/rag_eval/cmteb/tasks/Classification.py' 2026-01-05T08:15:19,914 adding 'evalscope/backend/rag_eval/cmteb/tasks/Clustering.py' 2026-01-05T08:15:19,915 adding 'evalscope/backend/rag_eval/cmteb/tasks/CustomTask.py' 2026-01-05T08:15:19,917 adding 'evalscope/backend/rag_eval/cmteb/tasks/PairClassification.py' 2026-01-05T08:15:19,918 adding 'evalscope/backend/rag_eval/cmteb/tasks/Reranking.py' 2026-01-05T08:15:19,920 adding 'evalscope/backend/rag_eval/cmteb/tasks/Retrieval.py' 2026-01-05T08:15:19,922 adding 'evalscope/backend/rag_eval/cmteb/tasks/STS.py' 2026-01-05T08:15:19,923 adding 'evalscope/backend/rag_eval/cmteb/tasks/__init__.py' 2026-01-05T08:15:19,925 adding 'evalscope/backend/rag_eval/ragas/__init__.py' 2026-01-05T08:15:19,926 adding 'evalscope/backend/rag_eval/ragas/arguments.py' 2026-01-05T08:15:19,927 adding 'evalscope/backend/rag_eval/ragas/task_template.py' 2026-01-05T08:15:19,929 adding 'evalscope/backend/rag_eval/ragas/prompts/persona_prompt.py' 2026-01-05T08:15:19,931 adding 'evalscope/backend/rag_eval/ragas/tasks/__init__.py' 2026-01-05T08:15:19,932 adding 'evalscope/backend/rag_eval/ragas/tasks/build_distribution.py' 2026-01-05T08:15:19,933 adding 'evalscope/backend/rag_eval/ragas/tasks/build_transform.py' 2026-01-05T08:15:19,935 adding 'evalscope/backend/rag_eval/ragas/tasks/testset_generation.py' 2026-01-05T08:15:19,936 adding 'evalscope/backend/rag_eval/ragas/tasks/translate_prompt.py' 2026-01-05T08:15:19,938 adding 
'evalscope/backend/rag_eval/utils/__init__.py' 2026-01-05T08:15:19,939 adding 'evalscope/backend/rag_eval/utils/clip.py' 2026-01-05T08:15:19,941 adding 'evalscope/backend/rag_eval/utils/embedding.py' 2026-01-05T08:15:19,943 adding 'evalscope/backend/rag_eval/utils/llm.py' 2026-01-05T08:15:19,944 adding 'evalscope/backend/rag_eval/utils/tools.py' 2026-01-05T08:15:19,946 adding 'evalscope/backend/vlm_eval_kit/__init__.py' 2026-01-05T08:15:19,947 adding 'evalscope/backend/vlm_eval_kit/backend_manager.py' 2026-01-05T08:15:19,951 adding 'evalscope/benchmarks/__init__.py' 2026-01-05T08:15:19,952 adding 'evalscope/benchmarks/a_okvqa/__init__.py' 2026-01-05T08:15:19,953 adding 'evalscope/benchmarks/a_okvqa/a_okvqa_adapter.py' 2026-01-05T08:15:19,955 adding 'evalscope/benchmarks/aa_lcr/__init__.py' 2026-01-05T08:15:19,957 adding 'evalscope/benchmarks/aa_lcr/aa_lcr_adapter.py' 2026-01-05T08:15:19,958 adding 'evalscope/benchmarks/ai2d/__init__.py' 2026-01-05T08:15:19,960 adding 'evalscope/benchmarks/ai2d/ai2d_adapter.py' 2026-01-05T08:15:19,961 adding 'evalscope/benchmarks/aime/__init__.py' 2026-01-05T08:15:19,962 adding 'evalscope/benchmarks/aime/aime24_adapter.py' 2026-01-05T08:15:19,964 adding 'evalscope/benchmarks/aime/aime25_adapter.py' 2026-01-05T08:15:19,965 adding 'evalscope/benchmarks/aime/grader.py' 2026-01-05T08:15:19,967 adding 'evalscope/benchmarks/aime/math_normalize.py' 2026-01-05T08:15:19,969 adding 'evalscope/benchmarks/alpaca_eval/__init__.py' 2026-01-05T08:15:19,970 adding 'evalscope/benchmarks/alpaca_eval/alpaca_eval_adapter.py' 2026-01-05T08:15:19,972 adding 'evalscope/benchmarks/amc/__init__.py' 2026-01-05T08:15:19,973 adding 'evalscope/benchmarks/amc/amc_adapter.py' 2026-01-05T08:15:19,974 adding 'evalscope/benchmarks/arc/__init__.py' 2026-01-05T08:15:19,975 adding 'evalscope/benchmarks/arc/arc_adapter.py' 2026-01-05T08:15:19,977 adding 'evalscope/benchmarks/arena_hard/__init__.py' 2026-01-05T08:15:19,979 adding 
'evalscope/benchmarks/arena_hard/arena_hard_adapter.py' 2026-01-05T08:15:19,980 adding 'evalscope/benchmarks/arena_hard/utils.py' 2026-01-05T08:15:19,982 adding 'evalscope/benchmarks/bbh/__init__.py' 2026-01-05T08:15:19,984 adding 'evalscope/benchmarks/bbh/bbh_adapter.py' 2026-01-05T08:15:19,986 adding 'evalscope/benchmarks/bbh/cot_prompts/boolean_expressions.txt' 2026-01-05T08:15:19,987 adding 'evalscope/benchmarks/bbh/cot_prompts/causal_judgement.txt' 2026-01-05T08:15:19,989 adding 'evalscope/benchmarks/bbh/cot_prompts/date_understanding.txt' 2026-01-05T08:15:19,990 adding 'evalscope/benchmarks/bbh/cot_prompts/disambiguation_qa.txt' 2026-01-05T08:15:19,991 adding 'evalscope/benchmarks/bbh/cot_prompts/dyck_languages.txt' 2026-01-05T08:15:19,993 adding 'evalscope/benchmarks/bbh/cot_prompts/formal_fallacies.txt' 2026-01-05T08:15:19,994 adding 'evalscope/benchmarks/bbh/cot_prompts/geometric_shapes.txt' 2026-01-05T08:15:19,995 adding 'evalscope/benchmarks/bbh/cot_prompts/hyperbaton.txt' 2026-01-05T08:15:19,997 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_five_objects.txt' 2026-01-05T08:15:19,998 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_seven_objects.txt' 2026-01-05T08:15:20,000 adding 'evalscope/benchmarks/bbh/cot_prompts/logical_deduction_three_objects.txt' 2026-01-05T08:15:20,001 adding 'evalscope/benchmarks/bbh/cot_prompts/movie_recommendation.txt' 2026-01-05T08:15:20,002 adding 'evalscope/benchmarks/bbh/cot_prompts/multistep_arithmetic_two.txt' 2026-01-05T08:15:20,004 adding 'evalscope/benchmarks/bbh/cot_prompts/navigate.txt' 2026-01-05T08:15:20,005 adding 'evalscope/benchmarks/bbh/cot_prompts/object_counting.txt' 2026-01-05T08:15:20,007 adding 'evalscope/benchmarks/bbh/cot_prompts/penguins_in_a_table.txt' 2026-01-05T08:15:20,008 adding 'evalscope/benchmarks/bbh/cot_prompts/reasoning_about_colored_objects.txt' 2026-01-05T08:15:20,010 adding 'evalscope/benchmarks/bbh/cot_prompts/ruin_names.txt' 2026-01-05T08:15:20,011 adding 
'evalscope/benchmarks/bbh/cot_prompts/salient_translation_error_detection.txt' 2026-01-05T08:15:20,013 adding 'evalscope/benchmarks/bbh/cot_prompts/snarks.txt' 2026-01-05T08:15:20,014 adding 'evalscope/benchmarks/bbh/cot_prompts/sports_understanding.txt' 2026-01-05T08:15:20,016 adding 'evalscope/benchmarks/bbh/cot_prompts/temporal_sequences.txt' 2026-01-05T08:15:20,017 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_five_objects.txt' 2026-01-05T08:15:20,018 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_seven_objects.txt' 2026-01-05T08:15:20,020 adding 'evalscope/benchmarks/bbh/cot_prompts/tracking_shuffled_objects_three_objects.txt' 2026-01-05T08:15:20,021 adding 'evalscope/benchmarks/bbh/cot_prompts/web_of_lies.txt' 2026-01-05T08:15:20,022 adding 'evalscope/benchmarks/bbh/cot_prompts/word_sorting.txt' 2026-01-05T08:15:20,024 adding 'evalscope/benchmarks/bfcl/__init__.py' 2026-01-05T08:15:20,025 adding 'evalscope/benchmarks/bfcl/v3/__init__.py' 2026-01-05T08:15:20,027 adding 'evalscope/benchmarks/bfcl/v3/bfcl_v3_adapter.py' 2026-01-05T08:15:20,029 adding 'evalscope/benchmarks/bfcl/v3/generation.py' 2026-01-05T08:15:20,030 adding 'evalscope/benchmarks/bfcl/v3/utils.py' 2026-01-05T08:15:20,032 adding 'evalscope/benchmarks/bfcl/v4/__init__.py' 2026-01-05T08:15:20,034 adding 'evalscope/benchmarks/bfcl/v4/bfcl_v4_adapter.py' 2026-01-05T08:15:20,036 adding 'evalscope/benchmarks/bfcl/v4/utils.py' 2026-01-05T08:15:20,037 adding 'evalscope/benchmarks/biomix_qa/__init__.py' 2026-01-05T08:15:20,039 adding 'evalscope/benchmarks/biomix_qa/biomix_qa_adapter.py' 2026-01-05T08:15:20,040 adding 'evalscope/benchmarks/blink/__init__.py' 2026-01-05T08:15:20,042 adding 'evalscope/benchmarks/blink/blink_adapter.py' 2026-01-05T08:15:20,043 adding 'evalscope/benchmarks/ceval/__init__.py' 2026-01-05T08:15:20,045 adding 'evalscope/benchmarks/ceval/ceval_adapter.py' 2026-01-05T08:15:20,047 adding 'evalscope/benchmarks/chartqa/__init__.py' 
2026-01-05T08:15:20,048 adding 'evalscope/benchmarks/chartqa/chartqa_adapter.py' 2026-01-05T08:15:20,049 adding 'evalscope/benchmarks/chartqa/utils.py' 2026-01-05T08:15:20,051 adding 'evalscope/benchmarks/chinese_simple_qa/__init__.py' 2026-01-05T08:15:20,053 adding 'evalscope/benchmarks/chinese_simple_qa/csimple_qa_adapter.py' 2026-01-05T08:15:20,054 adding 'evalscope/benchmarks/cmmlu/__init__.py' 2026-01-05T08:15:20,056 adding 'evalscope/benchmarks/cmmlu/cmmlu_adapter.py' 2026-01-05T08:15:20,057 adding 'evalscope/benchmarks/cmmmu/__init__.py' 2026-01-05T08:15:20,059 adding 'evalscope/benchmarks/cmmmu/cmmmu_adapter.py' 2026-01-05T08:15:20,061 adding 'evalscope/benchmarks/cmmmu/utils.py' 2026-01-05T08:15:20,063 adding 'evalscope/benchmarks/cmmu/__init__.py' 2026-01-05T08:15:20,065 adding 'evalscope/benchmarks/cmmu/cmmu_adapter.py' 2026-01-05T08:15:20,066 adding 'evalscope/benchmarks/cmmu/prompt.py' 2026-01-05T08:15:20,067 adding 'evalscope/benchmarks/coin_flip/__init__.py' 2026-01-05T08:15:20,069 adding 'evalscope/benchmarks/coin_flip/coin_flip_adapter.py' 2026-01-05T08:15:20,071 adding 'evalscope/benchmarks/commonsense_qa/__init__.py' 2026-01-05T08:15:20,072 adding 'evalscope/benchmarks/commonsense_qa/commonsense_qa_adapter.py' 2026-01-05T08:15:20,074 adding 'evalscope/benchmarks/competition_math/__init__.py' 2026-01-05T08:15:20,075 adding 'evalscope/benchmarks/competition_math/competition_math_adapter.py' 2026-01-05T08:15:20,077 adding 'evalscope/benchmarks/data_collection/__init__.py' 2026-01-05T08:15:20,079 adding 'evalscope/benchmarks/data_collection/data_collection_adapter.py' 2026-01-05T08:15:20,080 adding 'evalscope/benchmarks/docmath/__init__.py' 2026-01-05T08:15:20,082 adding 'evalscope/benchmarks/docmath/docmath_adapter.py' 2026-01-05T08:15:20,083 adding 'evalscope/benchmarks/docmath/utils.py' 2026-01-05T08:15:20,085 adding 'evalscope/benchmarks/docvqa/__init__.py' 2026-01-05T08:15:20,087 adding 'evalscope/benchmarks/docvqa/docvqa_adapter.py' 
2026-01-05T08:15:20,088 adding 'evalscope/benchmarks/drivelology/__init__.py' 2026-01-05T08:15:20,090 adding 'evalscope/benchmarks/drivelology/drivelology_binary_adapter.py' 2026-01-05T08:15:20,092 adding 'evalscope/benchmarks/drivelology/drivelology_multilabel_adapter.py' 2026-01-05T08:15:20,093 adding 'evalscope/benchmarks/drivelology/drivelology_selection_adapter.py' 2026-01-05T08:15:20,095 adding 'evalscope/benchmarks/drivelology/drivelology_writing_adapter.py' 2026-01-05T08:15:20,097 adding 'evalscope/benchmarks/drop/__init__.py' 2026-01-05T08:15:20,099 adding 'evalscope/benchmarks/drop/drop_adapter.py' 2026-01-05T08:15:20,100 adding 'evalscope/benchmarks/drop/utils.py' 2026-01-05T08:15:20,102 adding 'evalscope/benchmarks/eq_bench/__init__.py' 2026-01-05T08:15:20,104 adding 'evalscope/benchmarks/eq_bench/answer_validation.py' 2026-01-05T08:15:20,106 adding 'evalscope/benchmarks/eq_bench/eq_bench_adapter.py' 2026-01-05T08:15:20,107 adding 'evalscope/benchmarks/fleurs/__init__.py' 2026-01-05T08:15:20,109 adding 'evalscope/benchmarks/fleurs/fleurs_adapter.py' 2026-01-05T08:15:20,110 adding 'evalscope/benchmarks/frames/__init__.py' 2026-01-05T08:15:20,112 adding 'evalscope/benchmarks/frames/frames_adapter.py' 2026-01-05T08:15:20,113 adding 'evalscope/benchmarks/frames/utils.py' 2026-01-05T08:15:20,114 adding 'evalscope/benchmarks/general_arena/__init__.py' 2026-01-05T08:15:20,117 adding 'evalscope/benchmarks/general_arena/general_arena_adapter.py' 2026-01-05T08:15:20,119 adding 'evalscope/benchmarks/general_arena/utils.py' 2026-01-05T08:15:20,120 adding 'evalscope/benchmarks/general_fc/__init__.py' 2026-01-05T08:15:20,122 adding 'evalscope/benchmarks/general_fc/general_fc_adapter.py' 2026-01-05T08:15:20,123 adding 'evalscope/benchmarks/general_mcq/__init__.py' 2026-01-05T08:15:20,125 adding 'evalscope/benchmarks/general_mcq/general_mcq_adapter.py' 2026-01-05T08:15:20,126 adding 'evalscope/benchmarks/general_qa/__init__.py' 2026-01-05T08:15:20,127 adding 
'evalscope/benchmarks/general_qa/general_qa_adapter.py' 2026-01-05T08:15:20,129 adding 'evalscope/benchmarks/general_vmcq/__init__.py' 2026-01-05T08:15:20,131 adding 'evalscope/benchmarks/general_vmcq/general_vmcq_adapter.py' 2026-01-05T08:15:20,132 adding 'evalscope/benchmarks/general_vqa/__init__.py' 2026-01-05T08:15:20,134 adding 'evalscope/benchmarks/general_vqa/general_vqa_adapter.py' 2026-01-05T08:15:20,135 adding 'evalscope/benchmarks/gpqa/__init__.py' 2026-01-05T08:15:20,137 adding 'evalscope/benchmarks/gpqa/gpqa_adapter.py' 2026-01-05T08:15:20,138 adding 'evalscope/benchmarks/gpqa/prompt.py' 2026-01-05T08:15:20,140 adding 'evalscope/benchmarks/gsm8k/__init__.py' 2026-01-05T08:15:20,141 adding 'evalscope/benchmarks/gsm8k/gsm8k_adapter.py' 2026-01-05T08:15:20,143 adding 'evalscope/benchmarks/gsm8k_v/__init__.py' 2026-01-05T08:15:20,144 adding 'evalscope/benchmarks/gsm8k_v/gsm8k_v_adapter.py' 2026-01-05T08:15:20,146 adding 'evalscope/benchmarks/hallusion_bench/__init__.py' 2026-01-05T08:15:20,147 adding 'evalscope/benchmarks/hallusion_bench/hallusion_bench_adapter.py' 2026-01-05T08:15:20,149 adding 'evalscope/benchmarks/halu_eval/__init__.py' 2026-01-05T08:15:20,151 adding 'evalscope/benchmarks/halu_eval/halu_eval_adapter.py' 2026-01-05T08:15:20,153 adding 'evalscope/benchmarks/halu_eval/halu_eval_instructions.py' 2026-01-05T08:15:20,154 adding 'evalscope/benchmarks/healthbench/__init__.py' 2026-01-05T08:15:20,156 adding 'evalscope/benchmarks/healthbench/healthbench_adapter.py' 2026-01-05T08:15:20,158 adding 'evalscope/benchmarks/healthbench/utils.py' 2026-01-05T08:15:20,160 adding 'evalscope/benchmarks/hellaswag/__init__.py' 2026-01-05T08:15:20,161 adding 'evalscope/benchmarks/hellaswag/hellaswag_adapter.py' 2026-01-05T08:15:20,162 adding 'evalscope/benchmarks/hle/__init__.py' 2026-01-05T08:15:20,164 adding 'evalscope/benchmarks/hle/hle_adapter.py' 2026-01-05T08:15:20,166 adding 'evalscope/benchmarks/humaneval/__init__.py' 2026-01-05T08:15:20,167 adding 
'evalscope/benchmarks/humaneval/humaneval_adapter.py' 2026-01-05T08:15:20,169 adding 'evalscope/benchmarks/humaneval/utils.py' 2026-01-05T08:15:20,171 adding 'evalscope/benchmarks/ifbench/__init__.py' 2026-01-05T08:15:20,172 adding 'evalscope/benchmarks/ifbench/evaluation_lib.py' 2026-01-05T08:15:20,174 adding 'evalscope/benchmarks/ifbench/ifbench_adapter.py' 2026-01-05T08:15:20,181 adding 'evalscope/benchmarks/ifbench/instructions.py' 2026-01-05T08:15:20,183 adding 'evalscope/benchmarks/ifbench/instructions_registry.py' 2026-01-05T08:15:20,186 adding 'evalscope/benchmarks/ifbench/instructions_util.py' 2026-01-05T08:15:20,188 adding 'evalscope/benchmarks/ifeval/__init__.py' 2026-01-05T08:15:20,190 adding 'evalscope/benchmarks/ifeval/ifeval_adapter.py' 2026-01-05T08:15:20,194 adding 'evalscope/benchmarks/ifeval/instructions.py' 2026-01-05T08:15:20,196 adding 'evalscope/benchmarks/ifeval/instructions_registry.py' 2026-01-05T08:15:20,199 adding 'evalscope/benchmarks/ifeval/instructions_util.py' 2026-01-05T08:15:20,201 adding 'evalscope/benchmarks/ifeval/utils.py' 2026-01-05T08:15:20,202 adding 'evalscope/benchmarks/image_edit/__init__.py' 2026-01-05T08:15:20,204 adding 'evalscope/benchmarks/image_edit/gedit/__init__.py' 2026-01-05T08:15:20,205 adding 'evalscope/benchmarks/image_edit/gedit/gedit_adapter.py' 2026-01-05T08:15:20,207 adding 'evalscope/benchmarks/image_edit/gedit/utils.py' 2026-01-05T08:15:20,209 adding 'evalscope/benchmarks/image_edit/gedit/vie_prompts.py' 2026-01-05T08:15:20,211 adding 'evalscope/benchmarks/infovqa/__init__.py' 2026-01-05T08:15:20,212 adding 'evalscope/benchmarks/infovqa/infovqa_adapter.py' 2026-01-05T08:15:20,214 adding 'evalscope/benchmarks/iquiz/__init__.py' 2026-01-05T08:15:20,215 adding 'evalscope/benchmarks/iquiz/iquiz_adapter.py' 2026-01-05T08:15:20,216 adding 'evalscope/benchmarks/librispeech/__init__.py' 2026-01-05T08:15:20,218 adding 'evalscope/benchmarks/librispeech/librispeech_adapter.py' 2026-01-05T08:15:20,219 adding 
'evalscope/benchmarks/live_code_bench/__init__.py' 2026-01-05T08:15:20,221 adding 'evalscope/benchmarks/live_code_bench/evaluate_utils.py' 2026-01-05T08:15:20,222 adding 'evalscope/benchmarks/live_code_bench/extract_utils.py' 2026-01-05T08:15:20,224 adding 'evalscope/benchmarks/live_code_bench/live_code_bench_adapter.py' 2026-01-05T08:15:20,225 adding 'evalscope/benchmarks/live_code_bench/load_utils.py' 2026-01-05T08:15:20,226 adding 'evalscope/benchmarks/live_code_bench/pass_k_utils.py' 2026-01-05T08:15:20,228 adding 'evalscope/benchmarks/live_code_bench/prompts.py' 2026-01-05T08:15:20,230 adding 'evalscope/benchmarks/live_code_bench/sandbox_evaluate_utils.py' 2026-01-05T08:15:20,232 adding 'evalscope/benchmarks/live_code_bench/testing_util.py' 2026-01-05T08:15:20,234 adding 'evalscope/benchmarks/logi_qa/__int__.py' 2026-01-05T08:15:20,235 adding 'evalscope/benchmarks/logi_qa/logi_qa_adapter.py' 2026-01-05T08:15:20,237 adding 'evalscope/benchmarks/maritime_bench/__init__.py' 2026-01-05T08:15:20,238 adding 'evalscope/benchmarks/maritime_bench/maritime_bench_adapter.py' 2026-01-05T08:15:20,240 adding 'evalscope/benchmarks/math_500/__init__.py' 2026-01-05T08:15:20,241 adding 'evalscope/benchmarks/math_500/math_500_adapter.py' 2026-01-05T08:15:20,243 adding 'evalscope/benchmarks/math_qa/__init__.py' 2026-01-05T08:15:20,244 adding 'evalscope/benchmarks/math_qa/math_qa_adapter.py' 2026-01-05T08:15:20,246 adding 'evalscope/benchmarks/math_verse/__init__.py' 2026-01-05T08:15:20,247 adding 'evalscope/benchmarks/math_verse/math_verse_adapter.py' 2026-01-05T08:15:20,249 adding 'evalscope/benchmarks/math_vision/__init__.py' 2026-01-05T08:15:20,251 adding 'evalscope/benchmarks/math_vision/math_vision_adapter.py' 2026-01-05T08:15:20,252 adding 'evalscope/benchmarks/math_vista/__init__.py' 2026-01-05T08:15:20,254 adding 'evalscope/benchmarks/math_vista/math_vista_adapter.py' 2026-01-05T08:15:20,256 adding 'evalscope/benchmarks/mbpp/__init__.py' 2026-01-05T08:15:20,257 adding 
'evalscope/benchmarks/mbpp/mbpp_adapter.py' 2026-01-05T08:15:20,259 adding 'evalscope/benchmarks/med_mcqa/__init__.py' 2026-01-05T08:15:20,260 adding 'evalscope/benchmarks/med_mcqa/med_mcqa_adapter.py' 2026-01-05T08:15:20,262 adding 'evalscope/benchmarks/mgsm/__init__.py' 2026-01-05T08:15:20,263 adding 'evalscope/benchmarks/mgsm/mgsm_adapter.py' 2026-01-05T08:15:20,265 adding 'evalscope/benchmarks/micro_vqa/__init__.py' 2026-01-05T08:15:20,266 adding 'evalscope/benchmarks/micro_vqa/micro_vqa_adapter.py' 2026-01-05T08:15:20,267 adding 'evalscope/benchmarks/minerva_math/__init__.py' 2026-01-05T08:15:20,269 adding 'evalscope/benchmarks/minerva_math/minerva_math_adapter.py' 2026-01-05T08:15:20,270 adding 'evalscope/benchmarks/mm_bench/__init__.py' 2026-01-05T08:15:20,271 adding 'evalscope/benchmarks/mm_bench/mm_bench_adapter.py' 2026-01-05T08:15:20,273 adding 'evalscope/benchmarks/mm_star/__init__.py' 2026-01-05T08:15:20,274 adding 'evalscope/benchmarks/mm_star/mm_star_adapter.py' 2026-01-05T08:15:20,276 adding 'evalscope/benchmarks/mmlu/__init__.py' 2026-01-05T08:15:20,277 adding 'evalscope/benchmarks/mmlu/mmlu_adapter.py' 2026-01-05T08:15:20,279 adding 'evalscope/benchmarks/mmlu_pro/__init__.py' 2026-01-05T08:15:20,280 adding 'evalscope/benchmarks/mmlu_pro/mmlu_pro_adapter.py' 2026-01-05T08:15:20,282 adding 'evalscope/benchmarks/mmlu_redux/__init__.py' 2026-01-05T08:15:20,283 adding 'evalscope/benchmarks/mmlu_redux/mmlu_redux_adapter.py' 2026-01-05T08:15:20,285 adding 'evalscope/benchmarks/mmmu/__init__.py' 2026-01-05T08:15:20,286 adding 'evalscope/benchmarks/mmmu/mmmu_adapter.py' 2026-01-05T08:15:20,288 adding 'evalscope/benchmarks/mmmu_pro/__init__.py' 2026-01-05T08:15:20,289 adding 'evalscope/benchmarks/mmmu_pro/mmmu_pro_adapter.py' 2026-01-05T08:15:20,291 adding 'evalscope/benchmarks/mri_mcqa/__init__.py' 2026-01-05T08:15:20,292 adding 'evalscope/benchmarks/mri_mcqa/mri_mcqa_adapter.py' 2026-01-05T08:15:20,294 adding 'evalscope/benchmarks/multi_if/__init__.py' 
2026-01-05T08:15:20,302 adding 'evalscope/benchmarks/multi_if/ifeval.py' 2026-01-05T08:15:20,304 adding 'evalscope/benchmarks/multi_if/metrics.py' 2026-01-05T08:15:20,306 adding 'evalscope/benchmarks/multi_if/multi_if_adapter.py' 2026-01-05T08:15:20,307 adding 'evalscope/benchmarks/multipl_e/__init__.py' 2026-01-05T08:15:20,309 adding 'evalscope/benchmarks/multipl_e/multiple_humaneval_adapter.py' 2026-01-05T08:15:20,311 adding 'evalscope/benchmarks/multipl_e/multiple_mbpp_adapter.py' 2026-01-05T08:15:20,312 adding 'evalscope/benchmarks/multipl_e/utils.py' 2026-01-05T08:15:20,314 adding 'evalscope/benchmarks/music_trivia/__init__.py' 2026-01-05T08:15:20,315 adding 'evalscope/benchmarks/music_trivia/music_trivia_adapter.py' 2026-01-05T08:15:20,317 adding 'evalscope/benchmarks/musr/__init__.py' 2026-01-05T08:15:20,318 adding 'evalscope/benchmarks/musr/musr_adapter.py' 2026-01-05T08:15:20,320 adding 'evalscope/benchmarks/needle_haystack/__init__.py' 2026-01-05T08:15:20,322 adding 'evalscope/benchmarks/needle_haystack/needle_haystack_adapter.py' 2026-01-05T08:15:20,324 adding 'evalscope/benchmarks/needle_haystack/utils.py' 2026-01-05T08:15:20,327 adding 'evalscope/benchmarks/ner/__init__.py' 2026-01-05T08:15:20,328 adding 'evalscope/benchmarks/ner/anat_em_adapter.py' 2026-01-05T08:15:20,330 adding 'evalscope/benchmarks/ner/bc2gm_adapter.py' 2026-01-05T08:15:20,332 adding 'evalscope/benchmarks/ner/bc4chemd_adapter.py' 2026-01-05T08:15:20,333 adding 'evalscope/benchmarks/ner/bc5cdr_adapter.py' 2026-01-05T08:15:20,335 adding 'evalscope/benchmarks/ner/broad_twitter_corpus_adapter.py' 2026-01-05T08:15:20,337 adding 'evalscope/benchmarks/ner/conll2003_adapter.py' 2026-01-05T08:15:20,339 adding 'evalscope/benchmarks/ner/conllpp_adapter.py' 2026-01-05T08:15:20,341 adding 'evalscope/benchmarks/ner/copious_adapter.py' 2026-01-05T08:15:20,343 adding 'evalscope/benchmarks/ner/cross_ner_adapter.py' 2026-01-05T08:15:20,345 adding 'evalscope/benchmarks/ner/fin_ner_adapter.py' 
2026-01-05T08:15:20,347 adding 'evalscope/benchmarks/ner/genia_ner_adapter.py' 2026-01-05T08:15:20,349 adding 'evalscope/benchmarks/ner/harvey_ner_adapter.py' 2026-01-05T08:15:20,351 adding 'evalscope/benchmarks/ner/jnlpba_adapter.py' 2026-01-05T08:15:20,352 adding 'evalscope/benchmarks/ner/jnlpba_rare_adapter.py' 2026-01-05T08:15:20,354 adding 'evalscope/benchmarks/ner/mit_movie_trivia_adapter.py' 2026-01-05T08:15:20,356 adding 'evalscope/benchmarks/ner/mit_restaurant_adapter.py' 2026-01-05T08:15:20,358 adding 'evalscope/benchmarks/ner/multi_nerd_adapter.py' 2026-01-05T08:15:20,360 adding 'evalscope/benchmarks/ner/ncbi_adapter.py' 2026-01-05T08:15:20,362 adding 'evalscope/benchmarks/ner/ontonotes5_adapter.py' 2026-01-05T08:15:20,363 adding 'evalscope/benchmarks/ner/tweebank_ner_adapter.py' 2026-01-05T08:15:20,365 adding 'evalscope/benchmarks/ner/tweet_ner_7_adapter.py' 2026-01-05T08:15:20,367 adding 'evalscope/benchmarks/ner/wnut2017_adapter.py' 2026-01-05T08:15:20,369 adding 'evalscope/benchmarks/ner/cross_ner_entities/__init__.py' 2026-01-05T08:15:20,371 adding 'evalscope/benchmarks/ner/cross_ner_entities/ai.py' 2026-01-05T08:15:20,372 adding 'evalscope/benchmarks/ner/cross_ner_entities/literature.py' 2026-01-05T08:15:20,374 adding 'evalscope/benchmarks/ner/cross_ner_entities/music.py' 2026-01-05T08:15:20,375 adding 'evalscope/benchmarks/ner/cross_ner_entities/politics.py' 2026-01-05T08:15:20,377 adding 'evalscope/benchmarks/ner/cross_ner_entities/science.py' 2026-01-05T08:15:20,379 adding 'evalscope/benchmarks/ocr_bench/__init__.py' 2026-01-05T08:15:20,381 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/__init__.py' 2026-01-05T08:15:20,383 adding 'evalscope/benchmarks/ocr_bench/ocr_bench/ocr_bench_adapter.py' 2026-01-05T08:15:20,385 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/IoUscore_metric.py' 2026-01-05T08:15:20,391 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/TEDS_metric.py' 2026-01-05T08:15:20,393 adding 
'evalscope/benchmarks/ocr_bench/ocr_bench_v2/__init__.py' 2026-01-05T08:15:20,395 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/ocr_bench_v2_adapter.py' 2026-01-05T08:15:20,396 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/page_ocr_metric.py' 2026-01-05T08:15:20,398 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/parallel.py' 2026-01-05T08:15:20,400 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_metric.py' 2026-01-05T08:15:20,402 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/utils.py' 2026-01-05T08:15:20,404 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/vqa_metric.py' 2026-01-05T08:15:20,407 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/__init__.py' 2026-01-05T08:15:20,408 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/readme.txt' 2026-01-05T08:15:20,411 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/rrc_evaluation_funcs_1_1.py' 2026-01-05T08:15:20,415 adding 'evalscope/benchmarks/ocr_bench/ocr_bench_v2/spotting_eval/script.py' 2026-01-05T08:15:20,417 adding 'evalscope/benchmarks/olympiad_bench/__init__.py' 2026-01-05T08:15:20,419 adding 'evalscope/benchmarks/olympiad_bench/olympiad_bench_adapter.py' 2026-01-05T08:15:20,423 adding 'evalscope/benchmarks/olympiad_bench/utils.py' 2026-01-05T08:15:20,425 adding 'evalscope/benchmarks/omni_bench/__init__.py' 2026-01-05T08:15:20,426 adding 'evalscope/benchmarks/omni_bench/omni_bench_adapter.py' 2026-01-05T08:15:20,428 adding 'evalscope/benchmarks/omnidoc_bench/__init__.py' 2026-01-05T08:15:20,430 adding 'evalscope/benchmarks/omnidoc_bench/end2end_eval.py' 2026-01-05T08:15:20,433 adding 'evalscope/benchmarks/omnidoc_bench/metrics.py' 2026-01-05T08:15:20,434 adding 'evalscope/benchmarks/omnidoc_bench/omnidoc_bench_adapter.py' 2026-01-05T08:15:20,443 adding 'evalscope/benchmarks/omnidoc_bench/utils.py' 2026-01-05T08:15:20,445 adding 'evalscope/benchmarks/openai_mrcr/__init__.py' 2026-01-05T08:15:20,447 adding 
'evalscope/benchmarks/openai_mrcr/openai_mrcr_adapter.py' 2026-01-05T08:15:20,448 adding 'evalscope/benchmarks/openai_mrcr/utils.py' 2026-01-05T08:15:20,450 adding 'evalscope/benchmarks/piqa/__init__.py' 2026-01-05T08:15:20,451 adding 'evalscope/benchmarks/piqa/piqa_adapter.py' 2026-01-05T08:15:20,453 adding 'evalscope/benchmarks/poly_math/__init__.py' 2026-01-05T08:15:20,454 adding 'evalscope/benchmarks/poly_math/poly_math_adapter.py' 2026-01-05T08:15:20,457 adding 'evalscope/benchmarks/poly_math/utils/instruction.py' 2026-01-05T08:15:20,458 adding 'evalscope/benchmarks/pope/__init__.py' 2026-01-05T08:15:20,459 adding 'evalscope/benchmarks/pope/pope_adapter.py' 2026-01-05T08:15:20,461 adding 'evalscope/benchmarks/process_bench/__init__.py' 2026-01-05T08:15:20,462 adding 'evalscope/benchmarks/process_bench/process_bench_adapter.py' 2026-01-05T08:15:20,464 adding 'evalscope/benchmarks/pumed_qa/__init__.py' 2026-01-05T08:15:20,465 adding 'evalscope/benchmarks/pumed_qa/pubmed_qa_adapter.py' 2026-01-05T08:15:20,467 adding 'evalscope/benchmarks/qasc/__init__.py' 2026-01-05T08:15:20,468 adding 'evalscope/benchmarks/qasc/qasc_adapter.py' 2026-01-05T08:15:20,470 adding 'evalscope/benchmarks/race/__init__.py' 2026-01-05T08:15:20,471 adding 'evalscope/benchmarks/race/race_adapter.py' 2026-01-05T08:15:20,472 adding 'evalscope/benchmarks/real_world_qa/__init__.py' 2026-01-05T08:15:20,474 adding 'evalscope/benchmarks/real_world_qa/real_world_qa_adapter.py' 2026-01-05T08:15:20,475 adding 'evalscope/benchmarks/refcoco/__init__.py' 2026-01-05T08:15:20,477 adding 'evalscope/benchmarks/refcoco/evaluation_lib.py' 2026-01-05T08:15:20,478 adding 'evalscope/benchmarks/refcoco/refcoco_adapter.py' 2026-01-05T08:15:20,479 adding 'evalscope/benchmarks/refcoco/utils.py' 2026-01-05T08:15:20,481 adding 'evalscope/benchmarks/scicode/__init__.py' 2026-01-05T08:15:20,482 adding 'evalscope/benchmarks/scicode/prompt_templates.py' 2026-01-05T08:15:20,484 adding 
'evalscope/benchmarks/scicode/scicode_adapter.py' 2026-01-05T08:15:20,485 adding 'evalscope/benchmarks/scicode/util.py' 2026-01-05T08:15:20,487 adding 'evalscope/benchmarks/scicode/docker/Dockerfile' 2026-01-05T08:15:20,488 adding 'evalscope/benchmarks/scicode/docker/docker_requirements.txt' 2026-01-05T08:15:20,490 adding 'evalscope/benchmarks/scicode/docker/process_data.py' 2026-01-05T08:15:20,491 adding 'evalscope/benchmarks/scicode/docker/test_util.py' 2026-01-05T08:15:20,493 adding 'evalscope/benchmarks/science_qa/__init__.py' 2026-01-05T08:15:20,494 adding 'evalscope/benchmarks/science_qa/science_qa_adapter.py' 2026-01-05T08:15:20,496 adding 'evalscope/benchmarks/sciq/__init__.py' 2026-01-05T08:15:20,497 adding 'evalscope/benchmarks/sciq/sciq_adapter.py' 2026-01-05T08:15:20,499 adding 'evalscope/benchmarks/seed_bench_2_plus/__init__.py' 2026-01-05T08:15:20,500 adding 'evalscope/benchmarks/seed_bench_2_plus/seed_bench_2_plus_adapter.py' 2026-01-05T08:15:20,502 adding 'evalscope/benchmarks/simple_qa/__init__.py' 2026-01-05T08:15:20,503 adding 'evalscope/benchmarks/simple_qa/simple_qa_adapter.py' 2026-01-05T08:15:20,505 adding 'evalscope/benchmarks/simple_vqa/__init__.py' 2026-01-05T08:15:20,507 adding 'evalscope/benchmarks/simple_vqa/simple_vqa_adapter.py' 2026-01-05T08:15:20,509 adding 'evalscope/benchmarks/siqa/__init__.py' 2026-01-05T08:15:20,510 adding 'evalscope/benchmarks/siqa/siqa_adapter.py' 2026-01-05T08:15:20,512 adding 'evalscope/benchmarks/super_gpqa/__init__.py' 2026-01-05T08:15:20,513 adding 'evalscope/benchmarks/super_gpqa/prompt.py' 2026-01-05T08:15:20,515 adding 'evalscope/benchmarks/super_gpqa/super_gpqa_adapter.py' 2026-01-05T08:15:20,516 adding 'evalscope/benchmarks/super_gpqa/utils.py' 2026-01-05T08:15:20,518 adding 'evalscope/benchmarks/swe_bench/__init__.py' 2026-01-05T08:15:20,520 adding 'evalscope/benchmarks/swe_bench/build_images.py' 2026-01-05T08:15:20,521 adding 'evalscope/benchmarks/swe_bench/swe_bench_adapter.py' 
2026-01-05T08:15:20,523 adding 'evalscope/benchmarks/swe_bench/utils.py' 2026-01-05T08:15:20,525 adding 'evalscope/benchmarks/tau_bench/__init__.py' 2026-01-05T08:15:20,526 adding 'evalscope/benchmarks/tau_bench/tau2_bench/__init__.py' 2026-01-05T08:15:20,528 adding 'evalscope/benchmarks/tau_bench/tau2_bench/generation.py' 2026-01-05T08:15:20,529 adding 'evalscope/benchmarks/tau_bench/tau2_bench/tau2_bench_adapter.py' 2026-01-05T08:15:20,531 adding 'evalscope/benchmarks/tau_bench/tau_bench/__init__.py' 2026-01-05T08:15:20,533 adding 'evalscope/benchmarks/tau_bench/tau_bench/generation.py' 2026-01-05T08:15:20,534 adding 'evalscope/benchmarks/tau_bench/tau_bench/tau_bench_adapter.py' 2026-01-05T08:15:20,536 adding 'evalscope/benchmarks/terminal_bench/__init__.py' 2026-01-05T08:15:20,537 adding 'evalscope/benchmarks/terminal_bench/terminal_bench_adapter.py' 2026-01-05T08:15:20,539 adding 'evalscope/benchmarks/terminal_bench/utils.py' 2026-01-05T08:15:20,540 adding 'evalscope/benchmarks/text2image/__init__.py' 2026-01-05T08:15:20,541 adding 'evalscope/benchmarks/text2image/evalmuse_adapter.py' 2026-01-05T08:15:20,543 adding 'evalscope/benchmarks/text2image/genai_bench_adapter.py' 2026-01-05T08:15:20,544 adding 'evalscope/benchmarks/text2image/general_t2i_adapter.py' 2026-01-05T08:15:20,545 adding 'evalscope/benchmarks/text2image/hpdv2_adapter.py' 2026-01-05T08:15:20,546 adding 'evalscope/benchmarks/text2image/tifa_adapter.py' 2026-01-05T08:15:20,547 adding 'evalscope/benchmarks/tool_bench/__init__.py' 2026-01-05T08:15:20,549 adding 'evalscope/benchmarks/tool_bench/tool_bench_adapter.py' 2026-01-05T08:15:20,550 adding 'evalscope/benchmarks/tool_bench/utils.py' 2026-01-05T08:15:20,552 adding 'evalscope/benchmarks/torgo/__init__.py' 2026-01-05T08:15:20,553 adding 'evalscope/benchmarks/torgo/torgo_adapter.py' 2026-01-05T08:15:20,555 adding 'evalscope/benchmarks/trivia_qa/__init__.py' 2026-01-05T08:15:20,556 adding 'evalscope/benchmarks/trivia_qa/samples.jsonl' 
2026-01-05T08:15:20,557 adding 'evalscope/benchmarks/trivia_qa/trivia_qa_adapter.py' 2026-01-05T08:15:20,559 adding 'evalscope/benchmarks/truthful_qa/__init__.py' 2026-01-05T08:15:20,561 adding 'evalscope/benchmarks/truthful_qa/truthful_qa_adapter.py' 2026-01-05T08:15:20,562 adding 'evalscope/benchmarks/visu_logic/__init__.py' 2026-01-05T08:15:20,564 adding 'evalscope/benchmarks/visu_logic/visu_logic_adapter.py' 2026-01-05T08:15:20,565 adding 'evalscope/benchmarks/vstar_bench/__init__.py' 2026-01-05T08:15:20,567 adding 'evalscope/benchmarks/vstar_bench/vstar_bench_adapter.py' 2026-01-05T08:15:20,569 adding 'evalscope/benchmarks/winogrande/__init__.py' 2026-01-05T08:15:20,570 adding 'evalscope/benchmarks/winogrande/winogrande_adapter.py' 2026-01-05T08:15:20,572 adding 'evalscope/benchmarks/wmt/__init__.py' 2026-01-05T08:15:20,573 adding 'evalscope/benchmarks/wmt/wmt24_adapter.py' 2026-01-05T08:15:20,575 adding 'evalscope/benchmarks/zebralogicbench/__init__.py' 2026-01-05T08:15:20,576 adding 'evalscope/benchmarks/zebralogicbench/utils.py' 2026-01-05T08:15:20,578 adding 'evalscope/benchmarks/zebralogicbench/zebralogicbench_adapter.py' 2026-01-05T08:15:20,580 adding 'evalscope/benchmarks/zerobench/__init__.py' 2026-01-05T08:15:20,581 adding 'evalscope/benchmarks/zerobench/zerobench_adapter.py' 2026-01-05T08:15:20,583 adding 'evalscope/cli/__init__.py' 2026-01-05T08:15:20,584 adding 'evalscope/cli/base.py' 2026-01-05T08:15:20,585 adding 'evalscope/cli/cli.py' 2026-01-05T08:15:20,587 adding 'evalscope/cli/start_app.py' 2026-01-05T08:15:20,588 adding 'evalscope/cli/start_eval.py' 2026-01-05T08:15:20,589 adding 'evalscope/cli/start_perf.py' 2026-01-05T08:15:20,590 adding 'evalscope/cli/start_service.py' 2026-01-05T08:15:20,592 adding 'evalscope/collections/__init__.py' 2026-01-05T08:15:20,594 adding 'evalscope/collections/sampler.py' 2026-01-05T08:15:20,595 adding 'evalscope/collections/schema.py' 2026-01-05T08:15:20,597 adding 'evalscope/evaluator/__init__.py' 
2026-01-05T08:15:20,600 adding 'evalscope/evaluator/evaluator.py' 2026-01-05T08:15:20,602 adding 'evalscope/filters/__init__.py' 2026-01-05T08:15:20,603 adding 'evalscope/filters/extraction.py' 2026-01-05T08:15:20,604 adding 'evalscope/filters/selection.py' 2026-01-05T08:15:20,606 adding 'evalscope/metrics/__init__.py' 2026-01-05T08:15:20,608 adding 'evalscope/metrics/llm_judge.py' 2026-01-05T08:15:20,611 adding 'evalscope/metrics/math_parser.py' 2026-01-05T08:15:20,613 adding 'evalscope/metrics/metric.py' 2026-01-05T08:15:20,616 adding 'evalscope/metrics/metrics.py' 2026-01-05T08:15:20,617 adding 'evalscope/metrics/rouge_metric.py' 2026-01-05T08:15:20,619 adding 'evalscope/metrics/bert_score/__init__.py' 2026-01-05T08:15:20,621 adding 'evalscope/metrics/bert_score/scorer.py' 2026-01-05T08:15:20,624 adding 'evalscope/metrics/bert_score/utils.py' 2026-01-05T08:15:20,626 adding 'evalscope/metrics/bundled_rouge_score/__init__.py' 2026-01-05T08:15:20,628 adding 'evalscope/metrics/bundled_rouge_score/rouge_scorer.py' 2026-01-05T08:15:20,629 adding 'evalscope/metrics/sem_score/__init__.py' 2026-01-05T08:15:20,631 adding 'evalscope/metrics/sem_score/scorer.py' 2026-01-05T08:15:20,632 adding 'evalscope/metrics/t2v_metrics/__init__.py' 2026-01-05T08:15:20,634 adding 'evalscope/metrics/t2v_metrics/clipscore.py' 2026-01-05T08:15:20,635 adding 'evalscope/metrics/t2v_metrics/constants.py' 2026-01-05T08:15:20,636 adding 'evalscope/metrics/t2v_metrics/itmscore.py' 2026-01-05T08:15:20,637 adding 'evalscope/metrics/t2v_metrics/score.py' 2026-01-05T08:15:20,639 adding 'evalscope/metrics/t2v_metrics/vqascore.py' 2026-01-05T08:15:20,640 adding 'evalscope/metrics/t2v_metrics/models/__init__.py' 2026-01-05T08:15:20,641 adding 'evalscope/metrics/t2v_metrics/models/model.py' 2026-01-05T08:15:20,643 adding 'evalscope/metrics/t2v_metrics/models/utils.py' 2026-01-05T08:15:20,644 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/__init__.py' 2026-01-05T08:15:20,646 adding 
'evalscope/metrics/t2v_metrics/models/clipscore_models/clip_model.py' 2026-01-05T08:15:20,647 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/hpsv2_model.py' 2026-01-05T08:15:20,648 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/mps_model.py' 2026-01-05T08:15:20,650 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/pickscore_model.py' 2026-01-05T08:15:20,654 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/__init__.py' 2026-01-05T08:15:20,655 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/base_model.py' 2026-01-05T08:15:20,656 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/clip_model.py' 2026-01-05T08:15:20,658 adding 'evalscope/metrics/t2v_metrics/models/clipscore_models/build_mps_model/cross_modeling.py' 2026-01-05T08:15:20,660 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/__init__.py' 2026-01-05T08:15:20,661 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/blip2_itm_model.py' 2026-01-05T08:15:20,663 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/fga_blip2_model.py' 2026-01-05T08:15:20,664 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward_model.py' 2026-01-05T08:15:20,666 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/ImageReward.py' 2026-01-05T08:15:20,667 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/__init__.py' 2026-01-05T08:15:20,669 adding 'evalscope/metrics/t2v_metrics/models/itmscore_models/image_reward/blip_pretrain.py' 2026-01-05T08:15:20,671 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/__init__.py' 2026-01-05T08:15:20,672 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5_model.py' 2026-01-05T08:15:20,674 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/gpt4v_model.py' 2026-01-05T08:15:20,675 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/mm_utils.py' 2026-01-05T08:15:20,677 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/vqa_model.py' 2026-01-05T08:15:20,678 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/__init__.py' 2026-01-05T08:15:20,680 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/__init__.py' 2026-01-05T08:15:20,683 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/language_model/clip_t5.py' 2026-01-05T08:15:20,684 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/builder.py' 2026-01-05T08:15:20,686 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_encoder/clip_encoder.py' 2026-01-05T08:15:20,688 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/clip_t5/model/multimodal_projector/builder.py' 2026-01-05T08:15:20,689 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/__init__.py' 2026-01-05T08:15:20,692 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/config.py' 2026-01-05T08:15:20,693 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/dist_utils.py' 2026-01-05T08:15:20,695 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/gradcam.py' 2026-01-05T08:15:20,696 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/logger.py' 2026-01-05T08:15:20,697 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/optims.py' 2026-01-05T08:15:20,699 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/registry.py' 2026-01-05T08:15:20,701 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/utils.py' 2026-01-05T08:15:20,703 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/__init__.py' 2026-01-05T08:15:20,705 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa.py' 2026-01-05T08:15:20,707 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/common/vqa_tools/vqa_eval.py' 2026-01-05T08:15:20,708 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/default.yaml' 2026-01-05T08:15:20,710 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config.json' 2026-01-05T08:15:20,711 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_config_albef.json' 2026-01-05T08:15:20,712 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/med_large_config.json' 2026-01-05T08:15:20,714 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_flant5xl.yaml' 2026-01-05T08:15:20,715 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt2.7b.yaml' 2026-01-05T08:15:20,717 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_caption_opt6.7b.yaml' 2026-01-05T08:15:20,718 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_coco.yaml' 2026-01-05T08:15:20,719 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xl.yaml' 2026-01-05T08:15:20,720 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_flant5xxl.yaml' 2026-01-05T08:15:20,722 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna13b.yaml' 2026-01-05T08:15:20,723 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_instruct_vicuna7b.yaml' 2026-01-05T08:15:20,724 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain.yaml' 2026-01-05T08:15:20,726 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl.yaml' 2026-01-05T08:15:20,727 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_no_prefix.yaml' 2026-01-05T08:15:20,728 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_iter_80k_total_100k_prefix.yaml' 2026-01-05T08:15:20,729 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xl_vitL.yaml' 2026-01-05T08:15:20,731 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_flant5xxl.yaml' 2026-01-05T08:15:20,732 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt2.7b.yaml' 2026-01-05T08:15:20,733 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_opt6.7b.yaml' 2026-01-05T08:15:20,734 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_pretrain_vitL.yaml' 2026-01-05T08:15:20,735 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna13b.yaml' 2026-01-05T08:15:20,737 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/configs/models/blip2/blip2_vicuna7b.yaml' 2026-01-05T08:15:20,739 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/__init__.py' 2026-01-05T08:15:20,740 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/base_model.py' 2026-01-05T08:15:20,742 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/clip_vit.py' 2026-01-05T08:15:20,745 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/eva_vit.py' 2026-01-05T08:15:20,750 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/med.py' 
2026-01-05T08:15:20,753 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/vit.py' 2026-01-05T08:15:20,759 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/Qformer.py' 2026-01-05T08:15:20,760 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/__init__.py' 2026-01-05T08:15:20,762 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2.py' 2026-01-05T08:15:20,763 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_image_text_matching.py' 2026-01-05T08:15:20,765 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_qformer.py' 2026-01-05T08:15:20,767 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5.py' 2026-01-05T08:15:20,770 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/blip2_t5_instruct.py' 2026-01-05T08:15:20,772 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/fga_blip2.py' 2026-01-05T08:15:20,777 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_llama.py' 2026-01-05T08:15:20,785 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip2_models/modeling_t5.py' 2026-01-05T08:15:20,787 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/__init__.py' 2026-01-05T08:15:20,789 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip.py' 2026-01-05T08:15:20,790 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_caption.py' 2026-01-05T08:15:20,792 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_classification.py' 2026-01-05T08:15:20,794 adding 
'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_feature_extractor.py' 2026-01-05T08:15:20,795 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_image_text_matching.py' 2026-01-05T08:15:20,797 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_nlvr.py' 2026-01-05T08:15:20,798 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_outputs.py' 2026-01-05T08:15:20,801 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_pretrain.py' 2026-01-05T08:15:20,803 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/blip_vqa.py' 2026-01-05T08:15:20,807 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/models/blip_models/nlvr_encoder.py' 2026-01-05T08:15:20,809 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/__init__.py' 2026-01-05T08:15:20,810 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/base_processor.py' 2026-01-05T08:15:20,811 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/blip_processors.py' 2026-01-05T08:15:20,813 adding 'evalscope/metrics/t2v_metrics/models/vqascore_models/lavis/processors/randaugment.py' 2026-01-05T08:15:20,815 adding 'evalscope/metrics/text_normalizer/__init__.py' 2026-01-05T08:15:20,816 adding 'evalscope/metrics/text_normalizer/basic.py' 2026-01-05T08:15:20,822 adding 'evalscope/metrics/text_normalizer/chinese.py' 2026-01-05T08:15:20,831 adding 'evalscope/metrics/text_normalizer/english.json' 2026-01-05T08:15:20,834 adding 'evalscope/metrics/text_normalizer/english.py' 2026-01-05T08:15:20,836 adding 'evalscope/metrics/text_normalizer/wer.py' 2026-01-05T08:15:20,838 adding 'evalscope/models/__init__.py' 2026-01-05T08:15:20,839 adding 'evalscope/models/image_edit_model.py' 2026-01-05T08:15:20,840 adding 
'evalscope/models/mockllm.py' 2026-01-05T08:15:20,842 adding 'evalscope/models/model_apis.py' 2026-01-05T08:15:20,844 adding 'evalscope/models/modelscope.py' 2026-01-05T08:15:20,846 adding 'evalscope/models/openai_compatible.py' 2026-01-05T08:15:20,847 adding 'evalscope/models/text2image_model.py' 2026-01-05T08:15:20,851 adding 'evalscope/models/utils/openai.py' 2026-01-05T08:15:20,853 adding 'evalscope/perf/__init__.py' 2026-01-05T08:15:20,855 adding 'evalscope/perf/arguments.py' 2026-01-05T08:15:20,857 adding 'evalscope/perf/benchmark.py' 2026-01-05T08:15:20,858 adding 'evalscope/perf/http_client.py' 2026-01-05T08:15:20,860 adding 'evalscope/perf/main.py' 2026-01-05T08:15:20,861 adding 'evalscope/perf/plugin/__init__.py' 2026-01-05T08:15:20,863 adding 'evalscope/perf/plugin/registry.py' 2026-01-05T08:15:20,864 adding 'evalscope/perf/plugin/api/__init__.py' 2026-01-05T08:15:20,866 adding 'evalscope/perf/plugin/api/base.py' 2026-01-05T08:15:20,867 adding 'evalscope/perf/plugin/api/custom_api.py' 2026-01-05T08:15:20,869 adding 'evalscope/perf/plugin/api/dashscope_api.py' 2026-01-05T08:15:20,870 adding 'evalscope/perf/plugin/api/default_api.py' 2026-01-05T08:15:20,872 adding 'evalscope/perf/plugin/api/openai_api.py' 2026-01-05T08:15:20,874 adding 'evalscope/perf/plugin/datasets/__init__.py' 2026-01-05T08:15:20,876 adding 'evalscope/perf/plugin/datasets/base.py' 2026-01-05T08:15:20,877 adding 'evalscope/perf/plugin/datasets/custom.py' 2026-01-05T08:15:20,878 adding 'evalscope/perf/plugin/datasets/flickr8k.py' 2026-01-05T08:15:20,880 adding 'evalscope/perf/plugin/datasets/kontext_bench.py' 2026-01-05T08:15:20,881 adding 'evalscope/perf/plugin/datasets/line_by_line.py' 2026-01-05T08:15:20,883 adding 'evalscope/perf/plugin/datasets/longalpaca.py' 2026-01-05T08:15:20,884 adding 'evalscope/perf/plugin/datasets/openqa.py' 2026-01-05T08:15:20,886 adding 'evalscope/perf/plugin/datasets/random_dataset.py' 2026-01-05T08:15:20,887 adding 
'evalscope/perf/plugin/datasets/random_vl_dataset.py' 2026-01-05T08:15:20,889 adding 'evalscope/perf/plugin/datasets/speed_benchmark.py' 2026-01-05T08:15:20,891 adding 'evalscope/perf/sla/__init__.py' 2026-01-05T08:15:20,892 adding 'evalscope/perf/sla/sla_criterion.py' 2026-01-05T08:15:20,894 adding 'evalscope/perf/sla/sla_run.py' 2026-01-05T08:15:20,896 adding 'evalscope/perf/utils/__init__.py' 2026-01-05T08:15:20,898 adding 'evalscope/perf/utils/analysis_result.py' 2026-01-05T08:15:20,900 adding 'evalscope/perf/utils/benchmark_util.py' 2026-01-05T08:15:20,902 adding 'evalscope/perf/utils/db_util.py' 2026-01-05T08:15:20,903 adding 'evalscope/perf/utils/handler.py' 2026-01-05T08:15:20,905 adding 'evalscope/perf/utils/local_server.py' 2026-01-05T08:15:20,906 adding 'evalscope/perf/utils/log_utils.py' 2026-01-05T08:15:20,908 adding 'evalscope/perf/utils/rich_display.py' 2026-01-05T08:15:20,910 adding 'evalscope/report/__init__.py' 2026-01-05T08:15:20,912 adding 'evalscope/report/combinator.py' 2026-01-05T08:15:20,913 adding 'evalscope/report/generator.py' 2026-01-05T08:15:20,915 adding 'evalscope/report/report.py' 2026-01-05T08:15:20,917 adding 'evalscope/service/__init__.py' 2026-01-05T08:15:20,918 adding 'evalscope/service/app.py' 2026-01-05T08:15:20,920 adding 'evalscope/service/utils.py' 2026-01-05T08:15:20,922 adding 'evalscope/service/frontend/__init__.py' 2026-01-05T08:15:20,923 adding 'evalscope/service/frontend/async_client.py' 2026-01-05T08:15:20,925 adding 'evalscope/service/frontend/main.py' 2026-01-05T08:15:20,926 adding 'evalscope/service/frontend/utils.py' 2026-01-05T08:15:20,928 adding 'evalscope/summarizer/__init__.py' 2026-01-05T08:15:20,930 adding 'evalscope/summarizer/summarizer.py' 2026-01-05T08:15:20,931 adding 'evalscope/third_party/__init__.py' 2026-01-05T08:15:20,933 adding 'evalscope/third_party/longbench_write/README.md' 2026-01-05T08:15:20,934 adding 'evalscope/third_party/longbench_write/__init__.py' 2026-01-05T08:15:20,936 adding 
'evalscope/third_party/longbench_write/default_task.json' 2026-01-05T08:15:20,937 adding 'evalscope/third_party/longbench_write/default_task.yaml' 2026-01-05T08:15:20,939 adding 'evalscope/third_party/longbench_write/eval.py' 2026-01-05T08:15:20,940 adding 'evalscope/third_party/longbench_write/infer.py' 2026-01-05T08:15:20,942 adding 'evalscope/third_party/longbench_write/longbench_write.py' 2026-01-05T08:15:20,943 adding 'evalscope/third_party/longbench_write/utils.py' 2026-01-05T08:15:20,944 adding 'evalscope/third_party/longbench_write/resources/__init__.py' 2026-01-05T08:15:20,946 adding 'evalscope/third_party/longbench_write/resources/judge.txt' 2026-01-05T08:15:20,953 adding 'evalscope/third_party/longbench_write/resources/longbench_write.jsonl' 2026-01-05T08:15:20,957 adding 'evalscope/third_party/longbench_write/resources/longbench_write_en.jsonl' 2026-01-05T08:15:20,958 adding 'evalscope/third_party/longbench_write/resources/longwrite_ruler.jsonl' 2026-01-05T08:15:20,960 adding 'evalscope/third_party/longbench_write/tools/__init__.py' 2026-01-05T08:15:20,962 adding 'evalscope/third_party/longbench_write/tools/data_etl.py' 2026-01-05T08:15:20,963 adding 'evalscope/third_party/longbench_write/tools/openai_api.py' 2026-01-05T08:15:20,965 adding 'evalscope/third_party/thinkbench/__init__.py' 2026-01-05T08:15:20,968 adding 'evalscope/third_party/thinkbench/eval.py' 2026-01-05T08:15:20,969 adding 'evalscope/third_party/thinkbench/infer.py' 2026-01-05T08:15:20,971 adding 'evalscope/third_party/thinkbench/resources/critique_template.txt' 2026-01-05T08:15:20,972 adding 'evalscope/third_party/thinkbench/resources/reformat_template.txt' 2026-01-05T08:15:20,974 adding 'evalscope/third_party/thinkbench/tools/__init__.py' 2026-01-05T08:15:20,975 adding 'evalscope/third_party/thinkbench/tools/llm.py' 2026-01-05T08:15:20,976 adding 'evalscope/third_party/thinkbench/tools/utils.py' 2026-01-05T08:15:20,978 adding 'evalscope/third_party/toolbench_static/README.md' 
2026-01-05T08:15:20,979 adding 'evalscope/third_party/toolbench_static/__init__.py' 2026-01-05T08:15:20,981 adding 'evalscope/third_party/toolbench_static/config_default.json' 2026-01-05T08:15:20,982 adding 'evalscope/third_party/toolbench_static/config_default.yaml' 2026-01-05T08:15:20,984 adding 'evalscope/third_party/toolbench_static/eval.py' 2026-01-05T08:15:20,986 adding 'evalscope/third_party/toolbench_static/infer.py' 2026-01-05T08:15:20,987 adding 'evalscope/third_party/toolbench_static/requirements.txt' 2026-01-05T08:15:20,988 adding 'evalscope/third_party/toolbench_static/toolbench_static.py' 2026-01-05T08:15:20,990 adding 'evalscope/third_party/toolbench_static/llm/__init__.py' 2026-01-05T08:15:20,991 adding 'evalscope/third_party/toolbench_static/llm/swift_infer.py' 2026-01-05T08:15:20,993 adding 'evalscope/utils/__init__.py' 2026-01-05T08:15:20,994 adding 'evalscope/utils/argument_utils.py' 2026-01-05T08:15:20,996 adding 'evalscope/utils/chat_service.py' 2026-01-05T08:15:20,999 adding 'evalscope/utils/code_utils.py' 2026-01-05T08:15:21,000 adding 'evalscope/utils/deprecation_utils.py' 2026-01-05T08:15:21,002 adding 'evalscope/utils/function_utils.py' 2026-01-05T08:15:21,004 adding 'evalscope/utils/import_utils.py' 2026-01-05T08:15:21,006 adding 'evalscope/utils/io_utils.py' 2026-01-05T08:15:21,008 adding 'evalscope/utils/json_schema.py' 2026-01-05T08:15:21,009 adding 'evalscope/utils/logger.py' 2026-01-05T08:15:21,011 adding 'evalscope/utils/model_utils.py' 2026-01-05T08:15:21,012 adding 'evalscope/utils/multi_choices.py' 2026-01-05T08:15:21,014 adding 'evalscope/utils/ner.py' 2026-01-05T08:15:21,016 adding 'evalscope/utils/resource_utils.py' 2026-01-05T08:15:21,017 adding 'evalscope/utils/tqdm_utils.py' 2026-01-05T08:15:21,018 adding 'evalscope/utils/url_utils.py' 2026-01-05T08:15:21,022 adding 'evalscope-1.4.1.dist-info/licenses/LICENSE' 2026-01-05T08:15:21,026 adding 'evalscope-1.4.1.dist-info/METADATA' 2026-01-05T08:15:21,027 adding 
'evalscope-1.4.1.dist-info/WHEEL' 2026-01-05T08:15:21,028 adding 'evalscope-1.4.1.dist-info/entry_points.txt' 2026-01-05T08:15:21,029 adding 'evalscope-1.4.1.dist-info/top_level.txt' 2026-01-05T08:15:21,041 adding 'evalscope-1.4.1.dist-info/RECORD' 2026-01-05T08:15:21,065 removing build/bdist.linux-armv7l/wheel 2026-01-05T08:15:21,409 Building wheel for evalscope (pyproject.toml): finished with status 'done' 2026-01-05T08:15:21,436 Created wheel for evalscope: filename=evalscope-1.4.1-py3-none-any.whl size=1230811 sha256=d74ddb7150b19de1eb995026c697b598cdfc0b75fee3a2d110219256c4241688 2026-01-05T08:15:21,437 Stored in directory: /tmp/pip-ephem-wheel-cache-nyri9b3e/wheels/31/aa/fd/f368b242c27b2e0732876538d3e4c603b7733dbd1101ae23d8 2026-01-05T08:15:21,478 Successfully built evalscope 2026-01-05T08:15:21,509 Removed build tracker: '/tmp/pip-build-tracker-59oc0_e5'