2025-06-05T03:37:20,828 Created temporary directory: /tmp/pip-build-tracker-5t3c6rya 2025-06-05T03:37:20,829 Initialized build tracking at /tmp/pip-build-tracker-5t3c6rya 2025-06-05T03:37:20,830 Created build tracker: /tmp/pip-build-tracker-5t3c6rya 2025-06-05T03:37:20,830 Entered build tracker: /tmp/pip-build-tracker-5t3c6rya 2025-06-05T03:37:20,831 Created temporary directory: /tmp/pip-wheel-poc9r3bs 2025-06-05T03:37:20,835 Created temporary directory: /tmp/pip-ephem-wheel-cache-kac7qprv 2025-06-05T03:37:20,891 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-06-05T03:37:20,893 2 location(s) to search for versions of py-data-juicer: 2025-06-05T03:37:20,893 * https://pypi.org/simple/py-data-juicer/ 2025-06-05T03:37:20,893 * https://www.piwheels.org/simple/py-data-juicer/ 2025-06-05T03:37:20,894 Fetching project page and analyzing links: https://pypi.org/simple/py-data-juicer/ 2025-06-05T03:37:20,895 Getting page https://pypi.org/simple/py-data-juicer/ 2025-06-05T03:37:20,897 Found index url https://pypi.org/simple/ 2025-06-05T03:37:21,127 Fetched page https://pypi.org/simple/py-data-juicer/ as application/vnd.pypi.simple.v1+json 2025-06-05T03:37:21,138 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/e3/d3/d0724d922e7c55a0664485fdf642124bdcd801df2697e29c9463c4360958/py_data_juicer-0.1.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,139 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/fd/c6/e1428310bf534319ddc2e362c6eee21985f097e70cedc0e9cd3e914de826/py_data_juicer-0.1.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,140 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/67/70/731e349d2a92bf59a767230b956665dd27ea24a7a948d4a1710154d77a24/py_data_juicer-0.1.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,141 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/66/08/f584efbdf8277a061ce73c9befc33fec813cbd08020761af2380dd331692/py_data_juicer-0.1.3-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,142 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/69/20/9862dfe7a94f10caa0e6387834d1265420613345c7adadd41a3ced7230e8/py_data_juicer-0.2.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,143 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/05/34/33fb401d350f1a66cdaefc224419f9d01484722829f5b0c71ef59a8365a4/py_data_juicer-1.0.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,143 Found link https://files.pythonhosted.org/packages/49/e9/cfb994255490c36554048e0b3956858e61f5a75ffacfde20849f31f04ef8/py_data_juicer-1.0.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.0 2025-06-05T03:37:21,144 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/27/37/150e2198f14349fdd6b647a63feaede809e21dbfc4617aba487f18049828/py_data_juicer-1.0.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,145 Found link https://files.pythonhosted.org/packages/4a/60/dadcbe4337a76d8f98022b60d142ac31073d568c9e06a5461b3014d16fc6/py_data_juicer-1.0.1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.1 2025-06-05T03:37:21,146 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/10/dd/f9cadd6ed2f19c4f94e61a3fd4bb5bb2029c20b2cf61ec86c3a05f89f74b/py_data_juicer-1.0.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,146 Found link https://files.pythonhosted.org/packages/95/8f/c0447ce091ca0b86283e8074a3cb350572a5436ca13dceb288887a0de332/py_data_juicer-1.0.2.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.2 2025-06-05T03:37:21,147 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/1f/dd/0f1804c5cfce50c52c8b60a1b710f30d893f681e42a6acc41bfd29a20059/py_data_juicer-1.0.3-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,148 Found link https://files.pythonhosted.org/packages/6e/d0/eefb1ca00cd4c8e62c0e8f62a1b5357b961ce50c5d2647107f51f68bf128/py_data_juicer-1.0.3.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.3 2025-06-05T03:37:21,149 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/5b/06/5df5581f724d49731b91db0b7c65fdde739358617af9e7ba9916c43f2a76/py_data_juicer-1.1.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,149 Found link https://files.pythonhosted.org/packages/69/43/f20a50fdbfc0ba44a4816d3c7f9bfcfaf55682ffebf78a10eeff6b40feeb/py_data_juicer-1.1.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.1.0 2025-06-05T03:37:21,150 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/a0/90/a95bb5b60200125b9cbb29bf1c39b706c3e5f50b247d3599386f7be973ea/py_data_juicer-1.1.0.post1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,150 Found link https://files.pythonhosted.org/packages/79/f8/3e29fb4eeeda4413bc746b0a661bc61d5786f8bf67ae9071976550fd0b41/py_data_juicer-1.1.0.post1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.1.0.post1 2025-06-05T03:37:21,151 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/9d/ca/e32ecfafde96a367dcb63a4c4ee104bc5e952205c55169fd1120df0e86ee/py_data_juicer-1.2.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,152 Found link https://files.pythonhosted.org/packages/3d/bc/607884c148a0b6bf2198960166743e95818d230e970cbcdbd69d02758541/py_data_juicer-1.2.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.2.0 2025-06-05T03:37:21,152 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/80/a2/4788daf084637e94c05615f237298e1d7fe13753f7622907ce0904a4b18c/py_data_juicer-1.2.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,153 Found link https://files.pythonhosted.org/packages/df/b6/0ca40521ecc5d3a79f8e3614af2974f4a50ae0c175fee658c8f2b091eb6b/py_data_juicer-1.2.1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.2.1 2025-06-05T03:37:21,154 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/2d/ec/75b61b05a4b19fe2e425f10b0e7f023c13d404be4ec7aa7b39ad85f18e3f/py_data_juicer-1.2.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,154 Found link https://files.pythonhosted.org/packages/8e/01/e4ad384e1c25acb029fd433f6bcc5d0c146cf729bd3801308cc681354ac1/py_data_juicer-1.2.2.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.2.2 2025-06-05T03:37:21,155 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/a6/3a/6b6ec164c0b6270a2dde6c03c449387f21e3f72d98ecc2e1bb98d7c6224f/py_data_juicer-1.3.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,156 Found link https://files.pythonhosted.org/packages/6c/56/b2fccdf35cfbc8bd34f90baa136bd86a81205f35e4ed1057503fb33a1ce8/py_data_juicer-1.3.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.0 2025-06-05T03:37:21,156 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/f7/b5/a43d80e76bc0c728fe1a380b24d5d9c4566bfc867c29191075be6cd03f1c/py_data_juicer-1.3.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,157 Found link https://files.pythonhosted.org/packages/78/dd/71fd69f0652b12c930f7cd5d72b99e36fb32a923fdae0ec393c5bf2033a8/py_data_juicer-1.3.1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.1 2025-06-05T03:37:21,158 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/28/2b/f5005649d29bbff5235ada3c1465fc972fcc59816a874f04bcaf4963f58e/py_data_juicer-1.3.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,159 Found link https://files.pythonhosted.org/packages/b0/17/0a05201a5190476b0e9ea9483f3406769f2155da6c9ff1d99d78b783b621/py_data_juicer-1.3.2.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.2 2025-06-05T03:37:21,160 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/85/42/6ac1b8d0dd752bd5d4e156c1ccc6452e6f483481e1826aa923932fba7a6e/py_data_juicer-1.3.3-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-06-05T03:37:21,160 Found link https://files.pythonhosted.org/packages/22/f8/30a56cc7fd809a4892234b3cb39696013bd22e6a0ce35e52207a7951b87d/py_data_juicer-1.3.3.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.3 2025-06-05T03:37:21,161 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/44/5c/a550f1dc80743cda2513177b4d8f5a50ffdcbd320efe20078380ce2d2e0c/py_data_juicer-1.4.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10) 2025-06-05T03:37:21,162 Found link https://files.pythonhosted.org/packages/cb/de/07edb7f56fea68160c4107b6df3e1affe79a19f6ac64b7545a070860c130/py_data_juicer-1.4.0.tar.gz (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10), version: 1.4.0 2025-06-05T03:37:21,163 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/ca/09/ea50d9a6dbc5d00b4979d52a8ac8da1b4fa7332567a043048ac58c4cfb3e/py_data_juicer-1.4.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10) 2025-06-05T03:37:21,163 Found link https://files.pythonhosted.org/packages/c5/6f/df7cee4e71b590d05b2f935fa02f4a077cd109674561818ddca24a4c0779/py_data_juicer-1.4.1.tar.gz (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10), version: 1.4.1 2025-06-05T03:37:21,164 Fetching project page and analyzing links: https://www.piwheels.org/simple/py-data-juicer/ 2025-06-05T03:37:21,165 Getting page https://www.piwheels.org/simple/py-data-juicer/ 2025-06-05T03:37:21,166 Found index url https://www.piwheels.org/simple/ 2025-06-05T03:37:21,330 WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-06-05T03:37:21,986 WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-06-05T03:37:23,166 WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-06-05T03:37:25,339 WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-06-05T03:37:29,541 WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-06-05T03:37:29,719 Could not fetch URL https://www.piwheels.org/simple/py-data-juicer/: There was a problem confirming the ssl certificate: HTTPSConnectionPool(host='www.piwheels.org', port=443): Max retries exceeded with url: /simple/py-data-juicer/ (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))) - skipping 2025-06-05T03:37:29,721 Skipping link: not a file: https://www.piwheels.org/simple/py-data-juicer/ 2025-06-05T03:37:29,722 Skipping link: not a file: https://pypi.org/simple/py-data-juicer/ 2025-06-05T03:37:29,744 Given no hashes to check 1 links for project 'py-data-juicer': discarding no candidates 2025-06-05T03:37:29,746 Collecting py-data-juicer==1.3.2 2025-06-05T03:37:29,749 Created temporary directory: /tmp/pip-unpack-zubyd17r 2025-06-05T03:37:29,977 Downloading py_data_juicer-1.3.2.tar.gz (346 kB) 2025-06-05T03:37:30,612 Added py-data-juicer==1.3.2 from https://files.pythonhosted.org/packages/b0/17/0a05201a5190476b0e9ea9483f3406769f2155da6c9ff1d99d78b783b621/py_data_juicer-1.3.2.tar.gz to build tracker '/tmp/pip-build-tracker-5t3c6rya' 2025-06-05T03:37:30,615 Running setup.py (path:/tmp/pip-wheel-poc9r3bs/py-data-juicer_5b06723a8e1742faae446f305ad9212a/setup.py) egg_info for package py-data-juicer 2025-06-05T03:37:30,616 Created temporary directory: /tmp/pip-pip-egg-info-4hcch1r9 2025-06-05T03:37:30,616 Preparing metadata (setup.py): started 2025-06-05T03:37:30,618 Running command python setup.py egg_info 2025-06-05T03:37:31,126 WARNING:root:target file does not exist: environments/minimal_requires.txt 2025-06-05T03:37:31,126 WARNING:root:target file does not exist: environments/science_requires.txt 2025-06-05T03:37:31,127 WARNING:root:target file does not exist: environments/dist_requires.txt 2025-06-05T03:37:31,127 WARNING:root:target file does not exist: environments/dev_requires.txt 2025-06-05T03:37:31,128 WARNING:root:target file does not exist: environments/preprocess_requires.txt 2025-06-05T03:37:31,128 WARNING:root:target file does not exist: environments/quality_classifier_requires.txt 2025-06-05T03:37:31,129 WARNING:root:target file does not exist: environments/sandbox_requires.txt 2025-06-05T03:37:31,551 /usr/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2025-06-05T03:37:31,552 !! 2025-06-05T03:37:31,553 ******************************************************************************** 2025-06-05T03:37:31,554 Please consider removing the following classifiers in favor of a SPDX license expression: 2025-06-05T03:37:31,555 License :: OSI Approved :: Apache Software License 2025-06-05T03:37:31,556 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2025-06-05T03:37:31,557 ******************************************************************************** 2025-06-05T03:37:31,558 !! 2025-06-05T03:37:31,559 self._finalize_license_expression() 2025-06-05T03:37:31,583 INFO:root:running egg_info 2025-06-05T03:37:31,615 INFO:root:creating /tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info 2025-06-05T03:37:31,616 INFO:root:writing /tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/PKG-INFO 2025-06-05T03:37:31,621 INFO:root:writing dependency_links to /tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/dependency_links.txt 2025-06-05T03:37:31,623 INFO:root:writing entry points to /tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/entry_points.txt 2025-06-05T03:37:31,625 INFO:root:writing requirements to /tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/requires.txt 2025-06-05T03:37:31,626 INFO:root:writing top-level names to /tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/top_level.txt 2025-06-05T03:37:31,628 INFO:root:writing manifest file '/tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/SOURCES.txt' 2025-06-05T03:37:31,751 INFO:root:reading manifest file '/tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/SOURCES.txt' 2025-06-05T03:37:31,753 INFO:root:adding license file 'LICENSE' 2025-06-05T03:37:31,762 INFO:root:writing manifest file '/tmp/pip-pip-egg-info-4hcch1r9/py_data_juicer.egg-info/SOURCES.txt' 2025-06-05T03:37:31,854 Preparing metadata (setup.py): finished with status 'done' 2025-06-05T03:37:31,861 Source in /tmp/pip-wheel-poc9r3bs/py-data-juicer_5b06723a8e1742faae446f305ad9212a has version 1.3.2, which satisfies requirement py-data-juicer==1.3.2 from https://files.pythonhosted.org/packages/b0/17/0a05201a5190476b0e9ea9483f3406769f2155da6c9ff1d99d78b783b621/py_data_juicer-1.3.2.tar.gz 2025-06-05T03:37:31,862 Removed py-data-juicer==1.3.2 from https://files.pythonhosted.org/packages/b0/17/0a05201a5190476b0e9ea9483f3406769f2155da6c9ff1d99d78b783b621/py_data_juicer-1.3.2.tar.gz from build tracker '/tmp/pip-build-tracker-5t3c6rya' 2025-06-05T03:37:31,870 Created temporary directory: /tmp/pip-unpack-9hx85qzr 2025-06-05T03:37:31,872 Created temporary directory: /tmp/pip-unpack-ligf77b0 2025-06-05T03:37:31,872 Building wheels for collected packages: py-data-juicer 2025-06-05T03:37:31,877 Created temporary directory: /tmp/pip-wheel-6v12qr72 2025-06-05T03:37:31,878 DEPRECATION: Building 'py-data-juicer' using the legacy setup.py bdist_wheel mechanism, which will be removed in a future version. pip 25.3 will enforce this behaviour change. A possible replacement is to use the standardized build interface by setting the `--use-pep517` option, (possibly combined with `--no-build-isolation`), or adding a `pyproject.toml` file to the source tree of 'py-data-juicer'. Discussion can be found at https://github.com/pypa/pip/issues/6334 2025-06-05T03:37:31,879 Building wheel for py-data-juicer (setup.py): started 2025-06-05T03:37:31,881 Destination directory: /tmp/pip-wheel-6v12qr72 2025-06-05T03:37:31,881 Running command python setup.py bdist_wheel 2025-06-05T03:37:32,368 WARNING:root:target file does not exist: environments/minimal_requires.txt 2025-06-05T03:37:32,369 WARNING:root:target file does not exist: environments/science_requires.txt 2025-06-05T03:37:32,369 WARNING:root:target file does not exist: environments/dist_requires.txt 2025-06-05T03:37:32,370 WARNING:root:target file does not exist: environments/dev_requires.txt 2025-06-05T03:37:32,371 WARNING:root:target file does not exist: environments/preprocess_requires.txt 2025-06-05T03:37:32,371 WARNING:root:target file does not exist: environments/quality_classifier_requires.txt 2025-06-05T03:37:32,372 WARNING:root:target file does not exist: environments/sandbox_requires.txt 2025-06-05T03:37:32,747 /usr/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2025-06-05T03:37:32,748 !! 2025-06-05T03:37:32,749 ******************************************************************************** 2025-06-05T03:37:32,749 Please consider removing the following classifiers in favor of a SPDX license expression: 2025-06-05T03:37:32,751 License :: OSI Approved :: Apache Software License 2025-06-05T03:37:32,752 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2025-06-05T03:37:32,752 ******************************************************************************** 2025-06-05T03:37:32,754 !! 2025-06-05T03:37:32,754 self._finalize_license_expression() 2025-06-05T03:37:32,755 INFO:root:running bdist_wheel 2025-06-05T03:37:32,891 INFO:root:running build 2025-06-05T03:37:32,892 INFO:root:running build_py 2025-06-05T03:37:32,923 INFO:root:creating build/lib/data_juicer 2025-06-05T03:37:32,925 INFO:root:copying data_juicer/__init__.py -> build/lib/data_juicer 2025-06-05T03:37:32,928 INFO:root:creating build/lib/data_juicer/analysis 2025-06-05T03:37:32,929 INFO:root:copying data_juicer/analysis/column_wise_analysis.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,932 INFO:root:copying data_juicer/analysis/diversity_analysis.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,935 INFO:root:copying data_juicer/analysis/draw.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,937 INFO:root:copying data_juicer/analysis/__init__.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,939 INFO:root:copying data_juicer/analysis/measure.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,941 INFO:root:copying data_juicer/analysis/overall_analysis.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,944 INFO:root:copying data_juicer/analysis/collector.py -> build/lib/data_juicer/analysis 2025-06-05T03:37:32,947 INFO:root:creating build/lib/data_juicer/download 2025-06-05T03:37:32,948 INFO:root:copying data_juicer/download/commoncrawl.py -> build/lib/data_juicer/download 2025-06-05T03:37:32,950 INFO:root:copying data_juicer/download/__init__.py -> build/lib/data_juicer/download 2025-06-05T03:37:32,952 INFO:root:copying data_juicer/download/wikipedia.py -> build/lib/data_juicer/download 2025-06-05T03:37:32,955 INFO:root:copying data_juicer/download/arxiv.py -> build/lib/data_juicer/download 2025-06-05T03:37:32,958 INFO:root:copying data_juicer/download/downloader.py -> build/lib/data_juicer/download 2025-06-05T03:37:32,961 INFO:root:creating build/lib/data_juicer/format 2025-06-05T03:37:32,962 INFO:root:copying data_juicer/format/load.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,965 INFO:root:copying data_juicer/format/json_formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,967 INFO:root:copying data_juicer/format/tsv_formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,969 INFO:root:copying data_juicer/format/formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,972 INFO:root:copying data_juicer/format/__init__.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,974 INFO:root:copying data_juicer/format/csv_formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,976 INFO:root:copying data_juicer/format/text_formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,978 INFO:root:copying data_juicer/format/parquet_formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,981 INFO:root:copying data_juicer/format/empty_formatter.py -> build/lib/data_juicer/format 2025-06-05T03:37:32,984 INFO:root:creating build/lib/data_juicer/utils 2025-06-05T03:37:32,985 INFO:root:copying data_juicer/utils/ckpt_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:32,988 INFO:root:copying data_juicer/utils/sample.py -> build/lib/data_juicer/utils 2025-06-05T03:37:32,990 INFO:root:copying data_juicer/utils/compress.py -> build/lib/data_juicer/utils 2025-06-05T03:37:32,993 INFO:root:copying data_juicer/utils/auto_install_mapping.py -> build/lib/data_juicer/utils 2025-06-05T03:37:32,996 INFO:root:copying data_juicer/utils/auto_install_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:32,998 INFO:root:copying data_juicer/utils/logger_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,001 INFO:root:copying data_juicer/utils/file_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,004 INFO:root:copying data_juicer/utils/availability_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,006 INFO:root:copying data_juicer/utils/constant.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,009 INFO:root:copying data_juicer/utils/fingerprint_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,011 INFO:root:copying data_juicer/utils/mm_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,015 INFO:root:copying data_juicer/utils/common_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,018 INFO:root:copying data_juicer/utils/__init__.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,019 INFO:root:copying data_juicer/utils/process_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,022 INFO:root:copying data_juicer/utils/nltk_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,025 INFO:root:copying data_juicer/utils/unittest_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,028 INFO:root:copying data_juicer/utils/cache_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,030 INFO:root:copying data_juicer/utils/resource_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,032 INFO:root:copying data_juicer/utils/registry.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,034 INFO:root:copying data_juicer/utils/lazy_loader.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,036 INFO:root:copying data_juicer/utils/asset_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,038 INFO:root:copying data_juicer/utils/model_utils.py -> build/lib/data_juicer/utils 2025-06-05T03:37:33,042 INFO:root:creating build/lib/data_juicer/core 2025-06-05T03:37:33,043 INFO:root:copying data_juicer/core/tracer.py -> build/lib/data_juicer/core 2025-06-05T03:37:33,046 INFO:root:copying data_juicer/core/exporter.py -> build/lib/data_juicer/core 2025-06-05T03:37:33,048 INFO:root:copying data_juicer/core/__init__.py -> build/lib/data_juicer/core 2025-06-05T03:37:33,050 INFO:root:copying data_juicer/core/monitor.py -> build/lib/data_juicer/core 2025-06-05T03:37:33,053 INFO:root:copying data_juicer/core/analyzer.py -> build/lib/data_juicer/core 2025-06-05T03:37:33,055 INFO:root:copying data_juicer/core/adapter.py -> build/lib/data_juicer/core 2025-06-05T03:37:33,058 INFO:root:creating build/lib/data_juicer/config 2025-06-05T03:37:33,060 INFO:root:copying data_juicer/config/__init__.py -> build/lib/data_juicer/config 2025-06-05T03:37:33,062 INFO:root:copying data_juicer/config/config.py -> build/lib/data_juicer/config 2025-06-05T03:37:33,066 INFO:root:creating build/lib/data_juicer/ops 2025-06-05T03:37:33,067 INFO:root:copying data_juicer/ops/load.py -> build/lib/data_juicer/ops 2025-06-05T03:37:33,070 INFO:root:copying data_juicer/ops/__init__.py -> build/lib/data_juicer/ops 2025-06-05T03:37:33,072 INFO:root:copying data_juicer/ops/mixins.py -> build/lib/data_juicer/ops 2025-06-05T03:37:33,075 INFO:root:copying data_juicer/ops/base_op.py -> build/lib/data_juicer/ops 2025-06-05T03:37:33,078 INFO:root:copying data_juicer/ops/op_fusion.py -> build/lib/data_juicer/ops 2025-06-05T03:37:33,081 INFO:root:creating build/lib/data_juicer/core/executor 2025-06-05T03:37:33,082 INFO:root:copying data_juicer/core/executor/base.py -> build/lib/data_juicer/core/executor 2025-06-05T03:37:33,084 INFO:root:copying data_juicer/core/executor/factory.py -> build/lib/data_juicer/core/executor 2025-06-05T03:37:33,086 INFO:root:copying data_juicer/core/executor/__init__.py -> build/lib/data_juicer/core/executor 2025-06-05T03:37:33,088 INFO:root:copying data_juicer/core/executor/ray_executor.py -> build/lib/data_juicer/core/executor 2025-06-05T03:37:33,091 INFO:root:copying data_juicer/core/executor/default_executor.py -> build/lib/data_juicer/core/executor 2025-06-05T03:37:33,094 INFO:root:creating build/lib/data_juicer/core/data 2025-06-05T03:37:33,095 INFO:root:copying data_juicer/core/data/dj_dataset.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,098 INFO:root:copying data_juicer/core/data/load_strategy.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,100 INFO:root:copying data_juicer/core/data/__init__.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,103 INFO:root:copying data_juicer/core/data/schema.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,105 INFO:root:copying data_juicer/core/data/data_validator.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,107 INFO:root:copying data_juicer/core/data/config_validator.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,110 INFO:root:copying data_juicer/core/data/dataset_builder.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,112 INFO:root:copying data_juicer/core/data/ray_dataset.py -> build/lib/data_juicer/core/data 2025-06-05T03:37:33,116 INFO:root:creating build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,117 INFO:root:copying data_juicer/ops/deduplicator/ray_image_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,120 INFO:root:copying data_juicer/ops/deduplicator/image_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,123 INFO:root:copying data_juicer/ops/deduplicator/document_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,125 INFO:root:copying data_juicer/ops/deduplicator/document_simhash_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,128 INFO:root:copying data_juicer/ops/deduplicator/document_minhash_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,131 INFO:root:copying data_juicer/ops/deduplicator/__init__.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,133 INFO:root:copying data_juicer/ops/deduplicator/video_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,136 INFO:root:copying data_juicer/ops/deduplicator/ray_video_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,138 INFO:root:copying data_juicer/ops/deduplicator/ray_basic_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,140 INFO:root:copying data_juicer/ops/deduplicator/ray_document_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,142 INFO:root:copying data_juicer/ops/deduplicator/ray_bts_minhash_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-06-05T03:37:33,148 INFO:root:creating build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,149 INFO:root:copying data_juicer/ops/mapper/mllm_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,151 INFO:root:copying data_juicer/ops/mapper/generate_qa_from_text_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,154 INFO:root:copying data_juicer/ops/mapper/whitespace_normalization_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,156 INFO:root:copying data_juicer/ops/mapper/image_captioning_from_gpt4v_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,159 INFO:root:copying data_juicer/ops/mapper/chinese_convert_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,161 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_summarizer_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,164 INFO:root:copying data_juicer/ops/mapper/replace_content_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,166 INFO:root:copying data_juicer/ops/mapper/calibrate_response_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,169 INFO:root:copying data_juicer/ops/mapper/expand_macro_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,171 INFO:root:copying data_juicer/ops/mapper/nlpcda_zh_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,173 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_frames_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,176 INFO:root:copying data_juicer/ops/mapper/optimize_query_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,178 INFO:root:copying data_juicer/ops/mapper/pair_preference_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,181 INFO:root:copying data_juicer/ops/mapper/text_chunk_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,184 INFO:root:copying data_juicer/ops/mapper/clean_copyright_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,186 INFO:root:copying data_juicer/ops/mapper/extract_keyword_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,189 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_audio_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,192 INFO:root:copying data_juicer/ops/mapper/video_extract_frames_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,194 INFO:root:copying data_juicer/ops/mapper/video_tagging_from_frames_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,197 INFO:root:copying data_juicer/ops/mapper/image_diffusion_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,200 INFO:root:copying data_juicer/ops/mapper/query_sentiment_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,202 INFO:root:copying data_juicer/ops/mapper/video_face_blur_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,205 INFO:root:copying data_juicer/ops/mapper/nlpaug_en_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,207 INFO:root:copying data_juicer/ops/mapper/remove_comments_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,210 INFO:root:copying data_juicer/ops/mapper/remove_header_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,212 INFO:root:copying data_juicer/ops/mapper/extract_tables_from_html_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,215 INFO:root:copying data_juicer/ops/mapper/image_tagging_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,217 INFO:root:copying data_juicer/ops/mapper/remove_repeat_sentences_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,219 INFO:root:copying data_juicer/ops/mapper/clean_html_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,221 INFO:root:copying data_juicer/ops/mapper/remove_long_words_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,223 INFO:root:copying data_juicer/ops/mapper/dialog_sentiment_intensity_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,226 INFO:root:copying data_juicer/ops/mapper/audio_add_gaussian_noise_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,228 INFO:root:copying data_juicer/ops/mapper/optimize_response_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,231 INFO:root:copying data_juicer/ops/mapper/extract_support_text_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,233 INFO:root:copying data_juicer/ops/mapper/image_remove_background_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,236 INFO:root:copying data_juicer/ops/mapper/remove_non_chinese_character_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,238 INFO:root:copying data_juicer/ops/mapper/__init__.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,241 INFO:root:copying data_juicer/ops/mapper/clean_links_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,243 INFO:root:copying data_juicer/ops/mapper/calibrate_qa_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,245 INFO:root:copying data_juicer/ops/mapper/video_split_by_key_frame_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,248 INFO:root:copying data_juicer/ops/mapper/video_split_by_scene_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,250 INFO:root:copying data_juicer/ops/mapper/punctuation_normalization_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,252 INFO:root:copying data_juicer/ops/mapper/extract_event_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,255 INFO:root:copying data_juicer/ops/mapper/audio_ffmpeg_wrapped_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,258 INFO:root:copying data_juicer/ops/mapper/fix_unicode_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,260 INFO:root:copying data_juicer/ops/mapper/remove_bibliography_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,262 INFO:root:copying data_juicer/ops/mapper/extract_entity_relation_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,265 INFO:root:copying data_juicer/ops/mapper/relation_identity_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,268 INFO:root:copying data_juicer/ops/mapper/calibrate_query_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,270 INFO:root:copying data_juicer/ops/mapper/python_lambda_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,272 INFO:root:copying data_juicer/ops/mapper/python_file_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,274 INFO:root:copying data_juicer/ops/mapper/sdxl_prompt2prompt_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,277 INFO:root:copying data_juicer/ops/mapper/image_captioning_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,280 INFO:root:copying data_juicer/ops/mapper/remove_table_text_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,282 INFO:root:copying data_juicer/ops/mapper/query_intent_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,284 INFO:root:copying data_juicer/ops/mapper/optimize_qa_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,286 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_video_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,289 INFO:root:copying data_juicer/ops/mapper/extract_entity_attribute_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,292 INFO:root:copying data_juicer/ops/mapper/video_ffmpeg_wrapped_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,294 INFO:root:copying data_juicer/ops/mapper/video_split_by_duration_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,297 INFO:root:copying data_juicer/ops/mapper/video_resize_resolution_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,300 INFO:root:copying data_juicer/ops/mapper/image_segment_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,302 INFO:root:copying data_juicer/ops/mapper/generate_qa_from_examples_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,305 INFO:root:copying data_juicer/ops/mapper/image_face_blur_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,308 INFO:root:copying data_juicer/ops/mapper/sentence_split_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,310 INFO:root:copying data_juicer/ops/mapper/extract_nickname_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,313 INFO:root:copying data_juicer/ops/mapper/video_remove_watermark_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,315 INFO:root:copying data_juicer/ops/mapper/query_topic_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,318 INFO:root:copying data_juicer/ops/mapper/sentence_augmentation_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,320 INFO:root:copying data_juicer/ops/mapper/clean_email_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,323 INFO:root:copying data_juicer/ops/mapper/dialog_intent_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,325 INFO:root:copying data_juicer/ops/mapper/remove_words_with_incorrect_substrings_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,327 INFO:root:copying data_juicer/ops/mapper/dialog_topic_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,330 INFO:root:copying data_juicer/ops/mapper/image_blur_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,332 INFO:root:copying data_juicer/ops/mapper/clean_ip_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,334 INFO:root:copying data_juicer/ops/mapper/remove_specific_chars_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,336 INFO:root:copying data_juicer/ops/mapper/dialog_sentiment_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,339 INFO:root:copying data_juicer/ops/mapper/video_tagging_from_audio_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,341 INFO:root:copying data_juicer/ops/mapper/video_resize_aspect_ratio_mapper.py -> build/lib/data_juicer/ops/mapper 2025-06-05T03:37:33,344 INFO:root:creating build/lib/data_juicer/ops/grouper 2025-06-05T03:37:33,346 INFO:root:copying data_juicer/ops/grouper/key_value_grouper.py -> build/lib/data_juicer/ops/grouper 2025-06-05T03:37:33,348 INFO:root:copying data_juicer/ops/grouper/naive_reverse_grouper.py -> build/lib/data_juicer/ops/grouper 2025-06-05T03:37:33,350 INFO:root:copying data_juicer/ops/grouper/__init__.py -> build/lib/data_juicer/ops/grouper 2025-06-05T03:37:33,352 INFO:root:copying data_juicer/ops/grouper/naive_grouper.py -> build/lib/data_juicer/ops/grouper 2025-06-05T03:37:33,355 INFO:root:creating build/lib/data_juicer/ops/aggregator 2025-06-05T03:37:33,356 INFO:root:copying data_juicer/ops/aggregator/meta_tags_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-06-05T03:37:33,359 INFO:root:copying data_juicer/ops/aggregator/entity_attribute_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-06-05T03:37:33,361 INFO:root:copying data_juicer/ops/aggregator/__init__.py -> build/lib/data_juicer/ops/aggregator 2025-06-05T03:37:33,363 INFO:root:copying data_juicer/ops/aggregator/nested_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-06-05T03:37:33,366 INFO:root:copying data_juicer/ops/aggregator/most_relevant_entities_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-06-05T03:37:33,369 INFO:root:creating build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,370 INFO:root:copying data_juicer/ops/selector/random_selector.py -> build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,372 INFO:root:copying data_juicer/ops/selector/tags_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,374 INFO:root:copying data_juicer/ops/selector/topk_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,376 INFO:root:copying data_juicer/ops/selector/__init__.py -> build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,379 INFO:root:copying data_juicer/ops/selector/frequency_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,381 INFO:root:copying data_juicer/ops/selector/range_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-06-05T03:37:33,384 INFO:root:creating build/lib/data_juicer/ops/common 2025-06-05T03:37:33,385 INFO:root:copying data_juicer/ops/common/helper_func.py -> build/lib/data_juicer/ops/common 2025-06-05T03:37:33,387 INFO:root:copying data_juicer/ops/common/__init__.py -> build/lib/data_juicer/ops/common 2025-06-05T03:37:33,389 INFO:root:copying data_juicer/ops/common/special_characters.py -> build/lib/data_juicer/ops/common 2025-06-05T03:37:33,392 INFO:root:copying data_juicer/ops/common/prompt2prompt_pipeline.py -> build/lib/data_juicer/ops/common 2025-06-05T03:37:33,397 INFO:root:creating build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,398 INFO:root:copying data_juicer/ops/filter/perplexity_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,401 INFO:root:copying data_juicer/ops/filter/image_pair_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,403 INFO:root:copying data_juicer/ops/filter/image_aesthetics_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,406 INFO:root:copying data_juicer/ops/filter/maximum_line_length_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,408 INFO:root:copying data_juicer/ops/filter/video_nsfw_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,410 INFO:root:copying data_juicer/ops/filter/text_entity_dependency_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,412 INFO:root:copying data_juicer/ops/filter/video_aspect_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,415 INFO:root:copying data_juicer/ops/filter/video_duration_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,417 INFO:root:copying data_juicer/ops/filter/video_motion_score_raft_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,419 INFO:root:copying data_juicer/ops/filter/image_aspect_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,421 INFO:root:copying data_juicer/ops/filter/word_repetition_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,424 INFO:root:copying data_juicer/ops/filter/text_length_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,426 INFO:root:copying data_juicer/ops/filter/audio_size_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,429 INFO:root:copying data_juicer/ops/filter/image_face_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,431 INFO:root:copying data_juicer/ops/filter/phrase_grounding_recall_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,434 INFO:root:copying data_juicer/ops/filter/specified_numeric_field_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,436 INFO:root:copying data_juicer/ops/filter/image_face_count_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,439 INFO:root:copying data_juicer/ops/filter/language_id_score_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,441 INFO:root:copying data_juicer/ops/filter/flagged_words_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,444 INFO:root:copying data_juicer/ops/filter/__init__.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,446 INFO:root:copying data_juicer/ops/filter/llm_difficulty_score_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,449 INFO:root:copying data_juicer/ops/filter/image_size_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,451 INFO:root:copying data_juicer/ops/filter/image_shape_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,453 INFO:root:copying data_juicer/ops/filter/video_tagging_from_frames_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,455 INFO:root:copying data_juicer/ops/filter/text_action_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,458 INFO:root:copying data_juicer/ops/filter/video_aesthetics_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,460 INFO:root:copying data_juicer/ops/filter/llm_quality_score_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,463 INFO:root:copying data_juicer/ops/filter/words_num_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,465 INFO:root:copying data_juicer/ops/filter/image_watermark_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,467 INFO:root:copying data_juicer/ops/filter/video_ocr_area_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,470 INFO:root:copying data_juicer/ops/filter/average_line_length_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,472 INFO:root:copying data_juicer/ops/filter/special_characters_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,474 INFO:root:copying data_juicer/ops/filter/audio_nmf_snr_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,477 INFO:root:copying data_juicer/ops/filter/stopwords_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,480 INFO:root:copying data_juicer/ops/filter/audio_duration_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,482 INFO:root:copying data_juicer/ops/filter/image_text_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,485 INFO:root:copying data_juicer/ops/filter/video_frames_text_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,488 INFO:root:copying data_juicer/ops/filter/video_resolution_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,490 INFO:root:copying data_juicer/ops/filter/specified_field_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,492 INFO:root:copying data_juicer/ops/filter/suffix_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,495 INFO:root:copying data_juicer/ops/filter/text_pair_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,497 INFO:root:copying data_juicer/ops/filter/video_motion_score_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,500 INFO:root:copying data_juicer/ops/filter/token_num_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,502 INFO:root:copying data_juicer/ops/filter/character_repetition_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,505 INFO:root:copying data_juicer/ops/filter/image_text_matching_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,507 INFO:root:copying data_juicer/ops/filter/image_nsfw_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,510 INFO:root:copying data_juicer/ops/filter/alphanumeric_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,512 INFO:root:copying data_juicer/ops/filter/video_watermark_filter.py -> build/lib/data_juicer/ops/filter 2025-06-05T03:37:33,515 INFO:root:creating build/lib/data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,516 INFO:root:copying data_juicer/ops/mapper/annotation/human_preference_annotation_mapper.py -> build/lib/data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,519 INFO:root:copying data_juicer/ops/mapper/annotation/__init__.py -> build/lib/data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,521 INFO:root:copying data_juicer/ops/mapper/annotation/annotation_mapper.py -> build/lib/data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,524 INFO:root:creating build/lib/data_juicer/tools 2025-06-05T03:37:33,525 INFO:root:copying tools/sandbox_starter.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,528 INFO:root:copying tools/process_data.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,530 INFO:root:copying tools/analyze_data.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,532 INFO:root:copying tools/__init__.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,533 INFO:root:copying tools/data_resplit.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,535 INFO:root:copying tools/dj_install.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,538 INFO:root:copying tools/generate_smtp_cert.py -> build/lib/data_juicer/tools 2025-06-05T03:37:33,596 /usr/local/lib/python3.11/dist-packages/setuptools/_distutils/cmd.py:90: SetuptoolsDeprecationWarning: setup.py install is deprecated. 2025-06-05T03:37:33,597 !! 2025-06-05T03:37:33,598 ******************************************************************************** 2025-06-05T03:37:33,599 Please avoid running ``setup.py`` directly. 2025-06-05T03:37:33,600 Instead, use pypa/build, pypa/installer or other 2025-06-05T03:37:33,600 standards-based tools. 2025-06-05T03:37:33,601 By 2025-Oct-31, you need to update your project and remove deprecated calls 2025-06-05T03:37:33,602 or your builds will no longer be supported. 2025-06-05T03:37:33,603 See https://blog.ganssle.io/articles/2021/10/setup-py-deprecated.html for details. 2025-06-05T03:37:33,604 ******************************************************************************** 2025-06-05T03:37:33,605 !! 2025-06-05T03:37:33,606 self.initialize_options() 2025-06-05T03:37:33,625 INFO:root:installing to build/bdist.linux-armv7l/wheel 2025-06-05T03:37:33,626 INFO:root:running install 2025-06-05T03:37:33,651 INFO:root:running install_lib 2025-06-05T03:37:33,679 INFO:root:creating build/bdist.linux-armv7l/wheel 2025-06-05T03:37:33,681 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer 2025-06-05T03:37:33,683 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/analysis 2025-06-05T03:37:33,684 INFO:root:copying build/lib/data_juicer/analysis/column_wise_analysis.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,687 INFO:root:copying build/lib/data_juicer/analysis/diversity_analysis.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,690 INFO:root:copying build/lib/data_juicer/analysis/draw.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,692 INFO:root:copying build/lib/data_juicer/analysis/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,694 INFO:root:copying build/lib/data_juicer/analysis/measure.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,697 INFO:root:copying build/lib/data_juicer/analysis/overall_analysis.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,699 INFO:root:copying build/lib/data_juicer/analysis/collector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-06-05T03:37:33,702 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/download 2025-06-05T03:37:33,703 INFO:root:copying build/lib/data_juicer/download/commoncrawl.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-06-05T03:37:33,704 INFO:root:copying build/lib/data_juicer/download/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-06-05T03:37:33,706 INFO:root:copying build/lib/data_juicer/download/wikipedia.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-06-05T03:37:33,709 INFO:root:copying build/lib/data_juicer/download/arxiv.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-06-05T03:37:33,712 INFO:root:copying build/lib/data_juicer/download/downloader.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-06-05T03:37:33,715 INFO:root:copying build/lib/data_juicer/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer 2025-06-05T03:37:33,718 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/format 2025-06-05T03:37:33,719 INFO:root:copying build/lib/data_juicer/format/load.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,722 INFO:root:copying build/lib/data_juicer/format/json_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,724 INFO:root:copying build/lib/data_juicer/format/tsv_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,726 INFO:root:copying build/lib/data_juicer/format/formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,729 INFO:root:copying build/lib/data_juicer/format/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,731 INFO:root:copying build/lib/data_juicer/format/csv_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,733 INFO:root:copying build/lib/data_juicer/format/text_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,736 INFO:root:copying build/lib/data_juicer/format/parquet_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,738 INFO:root:copying build/lib/data_juicer/format/empty_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-06-05T03:37:33,741 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/utils 2025-06-05T03:37:33,742 INFO:root:copying build/lib/data_juicer/utils/ckpt_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,745 INFO:root:copying build/lib/data_juicer/utils/sample.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,747 INFO:root:copying build/lib/data_juicer/utils/compress.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,750 INFO:root:copying build/lib/data_juicer/utils/auto_install_mapping.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,753 INFO:root:copying build/lib/data_juicer/utils/auto_install_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,755 INFO:root:copying build/lib/data_juicer/utils/logger_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,758 INFO:root:copying build/lib/data_juicer/utils/file_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,761 INFO:root:copying build/lib/data_juicer/utils/availability_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,763 INFO:root:copying build/lib/data_juicer/utils/constant.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,766 INFO:root:copying build/lib/data_juicer/utils/fingerprint_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,769 INFO:root:copying build/lib/data_juicer/utils/mm_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,772 INFO:root:copying build/lib/data_juicer/utils/common_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,774 INFO:root:copying build/lib/data_juicer/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,776 INFO:root:copying build/lib/data_juicer/utils/process_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,779 INFO:root:copying build/lib/data_juicer/utils/nltk_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,782 INFO:root:copying build/lib/data_juicer/utils/unittest_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,784 INFO:root:copying build/lib/data_juicer/utils/cache_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,786 INFO:root:copying build/lib/data_juicer/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,788 INFO:root:copying build/lib/data_juicer/utils/registry.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,790 INFO:root:copying build/lib/data_juicer/utils/lazy_loader.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,793 INFO:root:copying build/lib/data_juicer/utils/asset_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,795 INFO:root:copying build/lib/data_juicer/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-06-05T03:37:33,798 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/core 2025-06-05T03:37:33,800 INFO:root:copying build/lib/data_juicer/core/tracer.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-06-05T03:37:33,802 INFO:root:copying build/lib/data_juicer/core/exporter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-06-05T03:37:33,805 INFO:root:copying build/lib/data_juicer/core/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-06-05T03:37:33,808 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/core/executor 2025-06-05T03:37:33,809 INFO:root:copying build/lib/data_juicer/core/executor/base.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-06-05T03:37:33,811 INFO:root:copying build/lib/data_juicer/core/executor/factory.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-06-05T03:37:33,813 INFO:root:copying build/lib/data_juicer/core/executor/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-06-05T03:37:33,815 INFO:root:copying build/lib/data_juicer/core/executor/ray_executor.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-06-05T03:37:33,818 INFO:root:copying build/lib/data_juicer/core/executor/default_executor.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-06-05T03:37:33,820 INFO:root:copying build/lib/data_juicer/core/monitor.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-06-05T03:37:33,823 INFO:root:copying build/lib/data_juicer/core/analyzer.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-06-05T03:37:33,826 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/core/data 2025-06-05T03:37:33,828 INFO:root:copying build/lib/data_juicer/core/data/dj_dataset.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,831 INFO:root:copying build/lib/data_juicer/core/data/load_strategy.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,834 INFO:root:copying build/lib/data_juicer/core/data/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,836 INFO:root:copying build/lib/data_juicer/core/data/schema.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,838 INFO:root:copying build/lib/data_juicer/core/data/data_validator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,841 INFO:root:copying build/lib/data_juicer/core/data/config_validator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,843 INFO:root:copying build/lib/data_juicer/core/data/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,846 INFO:root:copying build/lib/data_juicer/core/data/ray_dataset.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-06-05T03:37:33,849 INFO:root:copying build/lib/data_juicer/core/adapter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-06-05T03:37:33,852 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/config 2025-06-05T03:37:33,853 INFO:root:copying build/lib/data_juicer/config/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/config 2025-06-05T03:37:33,855 INFO:root:copying build/lib/data_juicer/config/config.py -> build/bdist.linux-armv7l/wheel/./data_juicer/config 2025-06-05T03:37:33,859 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops 2025-06-05T03:37:33,860 INFO:root:copying build/lib/data_juicer/ops/load.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-06-05T03:37:33,863 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/deduplicator 2025-06-05T03:37:33,865 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_image_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,867 INFO:root:copying build/lib/data_juicer/ops/deduplicator/image_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,869 INFO:root:copying build/lib/data_juicer/ops/deduplicator/document_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,872 INFO:root:copying build/lib/data_juicer/ops/deduplicator/document_simhash_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,874 INFO:root:copying build/lib/data_juicer/ops/deduplicator/document_minhash_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,877 INFO:root:copying build/lib/data_juicer/ops/deduplicator/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,879 INFO:root:copying build/lib/data_juicer/ops/deduplicator/video_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,882 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_video_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,884 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_basic_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,886 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_document_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,888 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_bts_minhash_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-06-05T03:37:33,894 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/mapper 2025-06-05T03:37:33,895 INFO:root:copying build/lib/data_juicer/ops/mapper/mllm_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,898 INFO:root:copying build/lib/data_juicer/ops/mapper/generate_qa_from_text_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,900 INFO:root:copying build/lib/data_juicer/ops/mapper/whitespace_normalization_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,903 INFO:root:copying build/lib/data_juicer/ops/mapper/image_captioning_from_gpt4v_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,905 INFO:root:copying build/lib/data_juicer/ops/mapper/chinese_convert_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,908 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_summarizer_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,911 INFO:root:copying build/lib/data_juicer/ops/mapper/replace_content_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,913 INFO:root:copying build/lib/data_juicer/ops/mapper/calibrate_response_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,915 INFO:root:copying build/lib/data_juicer/ops/mapper/expand_macro_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,917 INFO:root:copying build/lib/data_juicer/ops/mapper/nlpcda_zh_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,919 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_frames_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,922 INFO:root:copying build/lib/data_juicer/ops/mapper/optimize_query_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,924 INFO:root:copying build/lib/data_juicer/ops/mapper/pair_preference_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,927 INFO:root:copying build/lib/data_juicer/ops/mapper/text_chunk_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,929 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_copyright_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,931 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_keyword_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,934 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_audio_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,938 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,939 INFO:root:copying build/lib/data_juicer/ops/mapper/annotation/human_preference_annotation_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,942 INFO:root:copying build/lib/data_juicer/ops/mapper/annotation/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,944 INFO:root:copying build/lib/data_juicer/ops/mapper/annotation/annotation_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper/annotation 2025-06-05T03:37:33,947 INFO:root:copying build/lib/data_juicer/ops/mapper/video_extract_frames_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,950 INFO:root:copying build/lib/data_juicer/ops/mapper/video_tagging_from_frames_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,952 INFO:root:copying build/lib/data_juicer/ops/mapper/image_diffusion_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,955 INFO:root:copying build/lib/data_juicer/ops/mapper/query_sentiment_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,957 INFO:root:copying build/lib/data_juicer/ops/mapper/video_face_blur_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,960 INFO:root:copying build/lib/data_juicer/ops/mapper/nlpaug_en_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,963 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_comments_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,965 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_header_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,968 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_tables_from_html_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,970 INFO:root:copying build/lib/data_juicer/ops/mapper/image_tagging_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,972 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_repeat_sentences_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,974 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_html_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,976 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_long_words_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,978 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_sentiment_intensity_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,981 INFO:root:copying build/lib/data_juicer/ops/mapper/audio_add_gaussian_noise_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,983 INFO:root:copying build/lib/data_juicer/ops/mapper/optimize_response_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,985 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_support_text_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,988 INFO:root:copying build/lib/data_juicer/ops/mapper/image_remove_background_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,990 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_non_chinese_character_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,992 INFO:root:copying build/lib/data_juicer/ops/mapper/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,995 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_links_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:33,997 INFO:root:copying build/lib/data_juicer/ops/mapper/calibrate_qa_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,000 INFO:root:copying build/lib/data_juicer/ops/mapper/video_split_by_key_frame_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,002 INFO:root:copying build/lib/data_juicer/ops/mapper/video_split_by_scene_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,005 INFO:root:copying build/lib/data_juicer/ops/mapper/punctuation_normalization_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,007 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_event_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,010 INFO:root:copying build/lib/data_juicer/ops/mapper/audio_ffmpeg_wrapped_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,012 INFO:root:copying build/lib/data_juicer/ops/mapper/fix_unicode_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,015 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_bibliography_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,017 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_entity_relation_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,020 INFO:root:copying build/lib/data_juicer/ops/mapper/relation_identity_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,023 INFO:root:copying build/lib/data_juicer/ops/mapper/calibrate_query_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,025 INFO:root:copying build/lib/data_juicer/ops/mapper/python_lambda_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,027 INFO:root:copying build/lib/data_juicer/ops/mapper/python_file_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,030 INFO:root:copying build/lib/data_juicer/ops/mapper/sdxl_prompt2prompt_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,032 INFO:root:copying build/lib/data_juicer/ops/mapper/image_captioning_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,035 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_table_text_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,037 INFO:root:copying build/lib/data_juicer/ops/mapper/query_intent_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,039 INFO:root:copying build/lib/data_juicer/ops/mapper/optimize_qa_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,041 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_video_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,044 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_entity_attribute_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,047 INFO:root:copying build/lib/data_juicer/ops/mapper/video_ffmpeg_wrapped_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,049 INFO:root:copying build/lib/data_juicer/ops/mapper/video_split_by_duration_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,051 INFO:root:copying build/lib/data_juicer/ops/mapper/video_resize_resolution_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,054 INFO:root:copying build/lib/data_juicer/ops/mapper/image_segment_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,056 INFO:root:copying build/lib/data_juicer/ops/mapper/generate_qa_from_examples_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,059 INFO:root:copying build/lib/data_juicer/ops/mapper/image_face_blur_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,062 INFO:root:copying build/lib/data_juicer/ops/mapper/sentence_split_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,064 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_nickname_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,067 INFO:root:copying build/lib/data_juicer/ops/mapper/video_remove_watermark_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,070 INFO:root:copying build/lib/data_juicer/ops/mapper/query_topic_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,072 INFO:root:copying build/lib/data_juicer/ops/mapper/sentence_augmentation_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,074 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_email_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,077 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_intent_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,080 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_words_with_incorrect_substrings_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,082 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_topic_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,085 INFO:root:copying build/lib/data_juicer/ops/mapper/image_blur_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,087 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_ip_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,090 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_specific_chars_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,092 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_sentiment_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,095 INFO:root:copying build/lib/data_juicer/ops/mapper/video_tagging_from_audio_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,098 INFO:root:copying build/lib/data_juicer/ops/mapper/video_resize_aspect_ratio_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-06-05T03:37:34,102 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/grouper 2025-06-05T03:37:34,104 INFO:root:copying build/lib/data_juicer/ops/grouper/key_value_grouper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-06-05T03:37:34,107 INFO:root:copying build/lib/data_juicer/ops/grouper/naive_reverse_grouper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-06-05T03:37:34,110 INFO:root:copying build/lib/data_juicer/ops/grouper/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-06-05T03:37:34,112 INFO:root:copying build/lib/data_juicer/ops/grouper/naive_grouper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-06-05T03:37:34,115 INFO:root:copying build/lib/data_juicer/ops/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-06-05T03:37:34,118 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/aggregator 2025-06-05T03:37:34,120 INFO:root:copying build/lib/data_juicer/ops/aggregator/meta_tags_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-06-05T03:37:34,123 INFO:root:copying build/lib/data_juicer/ops/aggregator/entity_attribute_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-06-05T03:37:34,126 INFO:root:copying build/lib/data_juicer/ops/aggregator/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-06-05T03:37:34,128 INFO:root:copying build/lib/data_juicer/ops/aggregator/nested_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-06-05T03:37:34,131 INFO:root:copying build/lib/data_juicer/ops/aggregator/most_relevant_entities_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-06-05T03:37:34,135 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/selector 2025-06-05T03:37:34,136 INFO:root:copying build/lib/data_juicer/ops/selector/random_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-06-05T03:37:34,139 INFO:root:copying build/lib/data_juicer/ops/selector/tags_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-06-05T03:37:34,141 INFO:root:copying build/lib/data_juicer/ops/selector/topk_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-06-05T03:37:34,144 INFO:root:copying build/lib/data_juicer/ops/selector/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-06-05T03:37:34,146 INFO:root:copying build/lib/data_juicer/ops/selector/frequency_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-06-05T03:37:34,149 INFO:root:copying build/lib/data_juicer/ops/selector/range_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-06-05T03:37:34,152 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/common 2025-06-05T03:37:34,154 INFO:root:copying build/lib/data_juicer/ops/common/helper_func.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-06-05T03:37:34,157 INFO:root:copying build/lib/data_juicer/ops/common/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-06-05T03:37:34,160 INFO:root:copying build/lib/data_juicer/ops/common/special_characters.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-06-05T03:37:34,162 INFO:root:copying build/lib/data_juicer/ops/common/prompt2prompt_pipeline.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-06-05T03:37:34,166 INFO:root:copying build/lib/data_juicer/ops/mixins.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-06-05T03:37:34,170 INFO:root:copying build/lib/data_juicer/ops/base_op.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-06-05T03:37:34,174 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/filter 2025-06-05T03:37:34,176 INFO:root:copying build/lib/data_juicer/ops/filter/perplexity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,179 INFO:root:copying build/lib/data_juicer/ops/filter/image_pair_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,181 INFO:root:copying build/lib/data_juicer/ops/filter/image_aesthetics_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,184 INFO:root:copying build/lib/data_juicer/ops/filter/maximum_line_length_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,187 INFO:root:copying build/lib/data_juicer/ops/filter/video_nsfw_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,190 INFO:root:copying build/lib/data_juicer/ops/filter/text_entity_dependency_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,193 INFO:root:copying build/lib/data_juicer/ops/filter/video_aspect_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,195 INFO:root:copying build/lib/data_juicer/ops/filter/video_duration_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,197 INFO:root:copying build/lib/data_juicer/ops/filter/video_motion_score_raft_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,200 INFO:root:copying build/lib/data_juicer/ops/filter/image_aspect_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,202 INFO:root:copying build/lib/data_juicer/ops/filter/word_repetition_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,205 INFO:root:copying build/lib/data_juicer/ops/filter/text_length_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,207 INFO:root:copying build/lib/data_juicer/ops/filter/audio_size_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,210 INFO:root:copying build/lib/data_juicer/ops/filter/image_face_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,213 INFO:root:copying build/lib/data_juicer/ops/filter/phrase_grounding_recall_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,216 INFO:root:copying build/lib/data_juicer/ops/filter/specified_numeric_field_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,218 INFO:root:copying build/lib/data_juicer/ops/filter/image_face_count_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,221 INFO:root:copying build/lib/data_juicer/ops/filter/language_id_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,224 INFO:root:copying build/lib/data_juicer/ops/filter/flagged_words_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,227 INFO:root:copying build/lib/data_juicer/ops/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,230 INFO:root:copying build/lib/data_juicer/ops/filter/llm_difficulty_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,233 INFO:root:copying build/lib/data_juicer/ops/filter/image_size_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,235 INFO:root:copying build/lib/data_juicer/ops/filter/image_shape_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,238 INFO:root:copying build/lib/data_juicer/ops/filter/video_tagging_from_frames_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,241 INFO:root:copying build/lib/data_juicer/ops/filter/text_action_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,244 INFO:root:copying build/lib/data_juicer/ops/filter/video_aesthetics_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,247 INFO:root:copying build/lib/data_juicer/ops/filter/llm_quality_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,250 INFO:root:copying build/lib/data_juicer/ops/filter/words_num_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,252 INFO:root:copying build/lib/data_juicer/ops/filter/image_watermark_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,254 INFO:root:copying build/lib/data_juicer/ops/filter/video_ocr_area_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,257 INFO:root:copying build/lib/data_juicer/ops/filter/average_line_length_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,259 INFO:root:copying build/lib/data_juicer/ops/filter/special_characters_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,261 INFO:root:copying build/lib/data_juicer/ops/filter/audio_nmf_snr_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,264 INFO:root:copying build/lib/data_juicer/ops/filter/stopwords_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,266 INFO:root:copying build/lib/data_juicer/ops/filter/audio_duration_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,269 INFO:root:copying build/lib/data_juicer/ops/filter/image_text_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,271 INFO:root:copying build/lib/data_juicer/ops/filter/video_frames_text_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,274 INFO:root:copying build/lib/data_juicer/ops/filter/video_resolution_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,277 INFO:root:copying build/lib/data_juicer/ops/filter/specified_field_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,279 INFO:root:copying build/lib/data_juicer/ops/filter/suffix_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,282 INFO:root:copying build/lib/data_juicer/ops/filter/text_pair_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,284 INFO:root:copying build/lib/data_juicer/ops/filter/video_motion_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,287 INFO:root:copying build/lib/data_juicer/ops/filter/token_num_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,290 INFO:root:copying build/lib/data_juicer/ops/filter/character_repetition_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,292 INFO:root:copying build/lib/data_juicer/ops/filter/image_text_matching_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,295 INFO:root:copying build/lib/data_juicer/ops/filter/image_nsfw_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,297 INFO:root:copying build/lib/data_juicer/ops/filter/alphanumeric_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,300 INFO:root:copying build/lib/data_juicer/ops/filter/video_watermark_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-06-05T03:37:34,303 INFO:root:copying build/lib/data_juicer/ops/op_fusion.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-06-05T03:37:34,306 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/tools 2025-06-05T03:37:34,307 INFO:root:copying build/lib/data_juicer/tools/sandbox_starter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,309 INFO:root:copying build/lib/data_juicer/tools/process_data.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,311 INFO:root:copying build/lib/data_juicer/tools/analyze_data.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,313 INFO:root:copying build/lib/data_juicer/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,315 INFO:root:copying build/lib/data_juicer/tools/data_resplit.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,318 INFO:root:copying build/lib/data_juicer/tools/dj_install.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,320 INFO:root:copying build/lib/data_juicer/tools/generate_smtp_cert.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-06-05T03:37:34,323 INFO:root:running install_egg_info 2025-06-05T03:37:34,362 INFO:root:running egg_info 2025-06-05T03:37:34,392 INFO:root:writing py_data_juicer.egg-info/PKG-INFO 2025-06-05T03:37:34,397 INFO:root:writing dependency_links to py_data_juicer.egg-info/dependency_links.txt 2025-06-05T03:37:34,399 INFO:root:writing entry points to py_data_juicer.egg-info/entry_points.txt 2025-06-05T03:37:34,400 INFO:root:writing requirements to py_data_juicer.egg-info/requires.txt 2025-06-05T03:37:34,402 INFO:root:writing top-level names to py_data_juicer.egg-info/top_level.txt 2025-06-05T03:37:34,458 INFO:root:reading manifest file 'py_data_juicer.egg-info/SOURCES.txt' 2025-06-05T03:37:34,473 INFO:root:adding license file 'LICENSE' 2025-06-05T03:37:34,486 INFO:root:writing manifest file 'py_data_juicer.egg-info/SOURCES.txt' 2025-06-05T03:37:34,488 INFO:root:Copying py_data_juicer.egg-info to build/bdist.linux-armv7l/wheel/./py_data_juicer-1.3.2-py3.11.egg-info 2025-06-05T03:37:34,501 INFO:root:running install_scripts 2025-06-05T03:37:34,518 INFO:root:creating build/bdist.linux-armv7l/wheel/py_data_juicer-1.3.2.dist-info/WHEEL 2025-06-05T03:37:34,521 INFO:wheel:creating '/tmp/pip-wheel-6v12qr72/py_data_juicer-1.3.2-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-06-05T03:37:34,524 INFO:wheel:adding 'data_juicer/__init__.py' 2025-06-05T03:37:34,526 INFO:wheel:adding 'data_juicer/analysis/__init__.py' 2025-06-05T03:37:34,528 INFO:wheel:adding 'data_juicer/analysis/collector.py' 2025-06-05T03:37:34,531 INFO:wheel:adding 'data_juicer/analysis/column_wise_analysis.py' 2025-06-05T03:37:34,533 INFO:wheel:adding 'data_juicer/analysis/diversity_analysis.py' 2025-06-05T03:37:34,535 INFO:wheel:adding 'data_juicer/analysis/draw.py' 2025-06-05T03:37:34,537 INFO:wheel:adding 'data_juicer/analysis/measure.py' 2025-06-05T03:37:34,539 INFO:wheel:adding 'data_juicer/analysis/overall_analysis.py' 2025-06-05T03:37:34,541 INFO:wheel:adding 'data_juicer/config/__init__.py' 2025-06-05T03:37:34,547 INFO:wheel:adding 'data_juicer/config/config.py' 2025-06-05T03:37:34,549 INFO:wheel:adding 'data_juicer/core/__init__.py' 2025-06-05T03:37:34,551 INFO:wheel:adding 'data_juicer/core/adapter.py' 2025-06-05T03:37:34,553 INFO:wheel:adding 'data_juicer/core/analyzer.py' 2025-06-05T03:37:34,555 INFO:wheel:adding 'data_juicer/core/exporter.py' 2025-06-05T03:37:34,557 INFO:wheel:adding 'data_juicer/core/monitor.py' 2025-06-05T03:37:34,558 INFO:wheel:adding 'data_juicer/core/tracer.py' 2025-06-05T03:37:34,560 INFO:wheel:adding 'data_juicer/core/data/__init__.py' 2025-06-05T03:37:34,561 INFO:wheel:adding 'data_juicer/core/data/config_validator.py' 2025-06-05T03:37:34,563 INFO:wheel:adding 'data_juicer/core/data/data_validator.py' 2025-06-05T03:37:34,566 INFO:wheel:adding 'data_juicer/core/data/dataset_builder.py' 2025-06-05T03:37:34,568 INFO:wheel:adding 'data_juicer/core/data/dj_dataset.py' 2025-06-05T03:37:34,570 INFO:wheel:adding 'data_juicer/core/data/load_strategy.py' 2025-06-05T03:37:34,572 INFO:wheel:adding 'data_juicer/core/data/ray_dataset.py' 2025-06-05T03:37:34,574 INFO:wheel:adding 'data_juicer/core/data/schema.py' 2025-06-05T03:37:34,575 INFO:wheel:adding 'data_juicer/core/executor/__init__.py' 2025-06-05T03:37:34,576 INFO:wheel:adding 'data_juicer/core/executor/base.py' 2025-06-05T03:37:34,578 INFO:wheel:adding 'data_juicer/core/executor/default_executor.py' 2025-06-05T03:37:34,579 INFO:wheel:adding 'data_juicer/core/executor/factory.py' 2025-06-05T03:37:34,581 INFO:wheel:adding 'data_juicer/core/executor/ray_executor.py' 2025-06-05T03:37:34,582 INFO:wheel:adding 'data_juicer/download/__init__.py' 2025-06-05T03:37:34,585 INFO:wheel:adding 'data_juicer/download/arxiv.py' 2025-06-05T03:37:34,586 INFO:wheel:adding 'data_juicer/download/commoncrawl.py' 2025-06-05T03:37:34,588 INFO:wheel:adding 'data_juicer/download/downloader.py' 2025-06-05T03:37:34,593 INFO:wheel:adding 'data_juicer/download/wikipedia.py' 2025-06-05T03:37:34,595 INFO:wheel:adding 'data_juicer/format/__init__.py' 2025-06-05T03:37:34,596 INFO:wheel:adding 'data_juicer/format/csv_formatter.py' 2025-06-05T03:37:34,597 INFO:wheel:adding 'data_juicer/format/empty_formatter.py' 2025-06-05T03:37:34,599 INFO:wheel:adding 'data_juicer/format/formatter.py' 2025-06-05T03:37:34,601 INFO:wheel:adding 'data_juicer/format/json_formatter.py' 2025-06-05T03:37:34,602 INFO:wheel:adding 'data_juicer/format/load.py' 2025-06-05T03:37:34,603 INFO:wheel:adding 'data_juicer/format/parquet_formatter.py' 2025-06-05T03:37:34,605 INFO:wheel:adding 'data_juicer/format/text_formatter.py' 2025-06-05T03:37:34,606 INFO:wheel:adding 'data_juicer/format/tsv_formatter.py' 2025-06-05T03:37:34,608 INFO:wheel:adding 'data_juicer/ops/__init__.py' 2025-06-05T03:37:34,611 INFO:wheel:adding 'data_juicer/ops/base_op.py' 2025-06-05T03:37:34,612 INFO:wheel:adding 'data_juicer/ops/load.py' 2025-06-05T03:37:34,615 INFO:wheel:adding 'data_juicer/ops/mixins.py' 2025-06-05T03:37:34,618 INFO:wheel:adding 'data_juicer/ops/op_fusion.py' 2025-06-05T03:37:34,620 INFO:wheel:adding 'data_juicer/ops/aggregator/__init__.py' 2025-06-05T03:37:34,622 INFO:wheel:adding 'data_juicer/ops/aggregator/entity_attribute_aggregator.py' 2025-06-05T03:37:34,623 INFO:wheel:adding 'data_juicer/ops/aggregator/meta_tags_aggregator.py' 2025-06-05T03:37:34,625 INFO:wheel:adding 'data_juicer/ops/aggregator/most_relevant_entities_aggregator.py' 2025-06-05T03:37:34,627 INFO:wheel:adding 'data_juicer/ops/aggregator/nested_aggregator.py' 2025-06-05T03:37:34,629 INFO:wheel:adding 'data_juicer/ops/common/__init__.py' 2025-06-05T03:37:34,631 INFO:wheel:adding 'data_juicer/ops/common/helper_func.py' 2025-06-05T03:37:34,637 INFO:wheel:adding 'data_juicer/ops/common/prompt2prompt_pipeline.py' 2025-06-05T03:37:34,639 INFO:wheel:adding 'data_juicer/ops/common/special_characters.py' 2025-06-05T03:37:34,641 INFO:wheel:adding 'data_juicer/ops/deduplicator/__init__.py' 2025-06-05T03:37:34,643 INFO:wheel:adding 'data_juicer/ops/deduplicator/document_deduplicator.py' 2025-06-05T03:37:34,645 INFO:wheel:adding 'data_juicer/ops/deduplicator/document_minhash_deduplicator.py' 2025-06-05T03:37:34,647 INFO:wheel:adding 'data_juicer/ops/deduplicator/document_simhash_deduplicator.py' 2025-06-05T03:37:34,648 INFO:wheel:adding 'data_juicer/ops/deduplicator/image_deduplicator.py' 2025-06-05T03:37:34,650 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_basic_deduplicator.py' 2025-06-05T03:37:34,653 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_bts_minhash_deduplicator.py' 2025-06-05T03:37:34,654 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_document_deduplicator.py' 2025-06-05T03:37:34,656 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_image_deduplicator.py' 2025-06-05T03:37:34,657 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_video_deduplicator.py' 2025-06-05T03:37:34,658 INFO:wheel:adding 'data_juicer/ops/deduplicator/video_deduplicator.py' 2025-06-05T03:37:34,661 INFO:wheel:adding 'data_juicer/ops/filter/__init__.py' 2025-06-05T03:37:34,663 INFO:wheel:adding 'data_juicer/ops/filter/alphanumeric_filter.py' 2025-06-05T03:37:34,664 INFO:wheel:adding 'data_juicer/ops/filter/audio_duration_filter.py' 2025-06-05T03:37:34,666 INFO:wheel:adding 'data_juicer/ops/filter/audio_nmf_snr_filter.py' 2025-06-05T03:37:34,667 INFO:wheel:adding 'data_juicer/ops/filter/audio_size_filter.py' 2025-06-05T03:37:34,668 INFO:wheel:adding 'data_juicer/ops/filter/average_line_length_filter.py' 2025-06-05T03:37:34,670 INFO:wheel:adding 'data_juicer/ops/filter/character_repetition_filter.py' 2025-06-05T03:37:34,671 INFO:wheel:adding 'data_juicer/ops/filter/flagged_words_filter.py' 2025-06-05T03:37:34,673 INFO:wheel:adding 'data_juicer/ops/filter/image_aesthetics_filter.py' 2025-06-05T03:37:34,675 INFO:wheel:adding 'data_juicer/ops/filter/image_aspect_ratio_filter.py' 2025-06-05T03:37:34,677 INFO:wheel:adding 'data_juicer/ops/filter/image_face_count_filter.py' 2025-06-05T03:37:34,679 INFO:wheel:adding 'data_juicer/ops/filter/image_face_ratio_filter.py' 2025-06-05T03:37:34,680 INFO:wheel:adding 'data_juicer/ops/filter/image_nsfw_filter.py' 2025-06-05T03:37:34,682 INFO:wheel:adding 'data_juicer/ops/filter/image_pair_similarity_filter.py' 2025-06-05T03:37:34,683 INFO:wheel:adding 'data_juicer/ops/filter/image_shape_filter.py' 2025-06-05T03:37:34,684 INFO:wheel:adding 'data_juicer/ops/filter/image_size_filter.py' 2025-06-05T03:37:34,686 INFO:wheel:adding 'data_juicer/ops/filter/image_text_matching_filter.py' 2025-06-05T03:37:34,688 INFO:wheel:adding 'data_juicer/ops/filter/image_text_similarity_filter.py' 2025-06-05T03:37:34,689 INFO:wheel:adding 'data_juicer/ops/filter/image_watermark_filter.py' 2025-06-05T03:37:34,691 INFO:wheel:adding 'data_juicer/ops/filter/language_id_score_filter.py' 2025-06-05T03:37:34,693 INFO:wheel:adding 'data_juicer/ops/filter/llm_difficulty_score_filter.py' 2025-06-05T03:37:34,694 INFO:wheel:adding 'data_juicer/ops/filter/llm_quality_score_filter.py' 2025-06-05T03:37:34,696 INFO:wheel:adding 'data_juicer/ops/filter/maximum_line_length_filter.py' 2025-06-05T03:37:34,697 INFO:wheel:adding 'data_juicer/ops/filter/perplexity_filter.py' 2025-06-05T03:37:34,700 INFO:wheel:adding 'data_juicer/ops/filter/phrase_grounding_recall_filter.py' 2025-06-05T03:37:34,701 INFO:wheel:adding 'data_juicer/ops/filter/special_characters_filter.py' 2025-06-05T03:37:34,702 INFO:wheel:adding 'data_juicer/ops/filter/specified_field_filter.py' 2025-06-05T03:37:34,704 INFO:wheel:adding 'data_juicer/ops/filter/specified_numeric_field_filter.py' 2025-06-05T03:37:34,705 INFO:wheel:adding 'data_juicer/ops/filter/stopwords_filter.py' 2025-06-05T03:37:34,707 INFO:wheel:adding 'data_juicer/ops/filter/suffix_filter.py' 2025-06-05T03:37:34,708 INFO:wheel:adding 'data_juicer/ops/filter/text_action_filter.py' 2025-06-05T03:37:34,710 INFO:wheel:adding 'data_juicer/ops/filter/text_entity_dependency_filter.py' 2025-06-05T03:37:34,711 INFO:wheel:adding 'data_juicer/ops/filter/text_length_filter.py' 2025-06-05T03:37:34,713 INFO:wheel:adding 'data_juicer/ops/filter/text_pair_similarity_filter.py' 2025-06-05T03:37:34,714 INFO:wheel:adding 'data_juicer/ops/filter/token_num_filter.py' 2025-06-05T03:37:34,716 INFO:wheel:adding 'data_juicer/ops/filter/video_aesthetics_filter.py' 2025-06-05T03:37:34,718 INFO:wheel:adding 'data_juicer/ops/filter/video_aspect_ratio_filter.py' 2025-06-05T03:37:34,719 INFO:wheel:adding 'data_juicer/ops/filter/video_duration_filter.py' 2025-06-05T03:37:34,721 INFO:wheel:adding 'data_juicer/ops/filter/video_frames_text_similarity_filter.py' 2025-06-05T03:37:34,723 INFO:wheel:adding 'data_juicer/ops/filter/video_motion_score_filter.py' 2025-06-05T03:37:34,725 INFO:wheel:adding 'data_juicer/ops/filter/video_motion_score_raft_filter.py' 2025-06-05T03:37:34,726 INFO:wheel:adding 'data_juicer/ops/filter/video_nsfw_filter.py' 2025-06-05T03:37:34,728 INFO:wheel:adding 'data_juicer/ops/filter/video_ocr_area_ratio_filter.py' 2025-06-05T03:37:34,730 INFO:wheel:adding 'data_juicer/ops/filter/video_resolution_filter.py' 2025-06-05T03:37:34,731 INFO:wheel:adding 'data_juicer/ops/filter/video_tagging_from_frames_filter.py' 2025-06-05T03:37:34,733 INFO:wheel:adding 'data_juicer/ops/filter/video_watermark_filter.py' 2025-06-05T03:37:34,734 INFO:wheel:adding 'data_juicer/ops/filter/word_repetition_filter.py' 2025-06-05T03:37:34,736 INFO:wheel:adding 'data_juicer/ops/filter/words_num_filter.py' 2025-06-05T03:37:34,738 INFO:wheel:adding 'data_juicer/ops/grouper/__init__.py' 2025-06-05T03:37:34,739 INFO:wheel:adding 'data_juicer/ops/grouper/key_value_grouper.py' 2025-06-05T03:37:34,740 INFO:wheel:adding 'data_juicer/ops/grouper/naive_grouper.py' 2025-06-05T03:37:34,742 INFO:wheel:adding 'data_juicer/ops/grouper/naive_reverse_grouper.py' 2025-06-05T03:37:34,745 INFO:wheel:adding 'data_juicer/ops/mapper/__init__.py' 2025-06-05T03:37:34,747 INFO:wheel:adding 'data_juicer/ops/mapper/audio_add_gaussian_noise_mapper.py' 2025-06-05T03:37:34,748 INFO:wheel:adding 'data_juicer/ops/mapper/audio_ffmpeg_wrapped_mapper.py' 2025-06-05T03:37:34,750 INFO:wheel:adding 'data_juicer/ops/mapper/calibrate_qa_mapper.py' 2025-06-05T03:37:34,751 INFO:wheel:adding 'data_juicer/ops/mapper/calibrate_query_mapper.py' 2025-06-05T03:37:34,752 INFO:wheel:adding 'data_juicer/ops/mapper/calibrate_response_mapper.py' 2025-06-05T03:37:34,754 INFO:wheel:adding 'data_juicer/ops/mapper/chinese_convert_mapper.py' 2025-06-05T03:37:34,755 INFO:wheel:adding 'data_juicer/ops/mapper/clean_copyright_mapper.py' 2025-06-05T03:37:34,756 INFO:wheel:adding 'data_juicer/ops/mapper/clean_email_mapper.py' 2025-06-05T03:37:34,758 INFO:wheel:adding 'data_juicer/ops/mapper/clean_html_mapper.py' 2025-06-05T03:37:34,759 INFO:wheel:adding 'data_juicer/ops/mapper/clean_ip_mapper.py' 2025-06-05T03:37:34,760 INFO:wheel:adding 'data_juicer/ops/mapper/clean_links_mapper.py' 2025-06-05T03:37:34,762 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_intent_detection_mapper.py' 2025-06-05T03:37:34,764 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_sentiment_detection_mapper.py' 2025-06-05T03:37:34,766 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_sentiment_intensity_mapper.py' 2025-06-05T03:37:34,768 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_topic_detection_mapper.py' 2025-06-05T03:37:34,770 INFO:wheel:adding 'data_juicer/ops/mapper/expand_macro_mapper.py' 2025-06-05T03:37:34,772 INFO:wheel:adding 'data_juicer/ops/mapper/extract_entity_attribute_mapper.py' 2025-06-05T03:37:34,775 INFO:wheel:adding 'data_juicer/ops/mapper/extract_entity_relation_mapper.py' 2025-06-05T03:37:34,777 INFO:wheel:adding 'data_juicer/ops/mapper/extract_event_mapper.py' 2025-06-05T03:37:34,779 INFO:wheel:adding 'data_juicer/ops/mapper/extract_keyword_mapper.py' 2025-06-05T03:37:34,781 INFO:wheel:adding 'data_juicer/ops/mapper/extract_nickname_mapper.py' 2025-06-05T03:37:34,783 INFO:wheel:adding 'data_juicer/ops/mapper/extract_support_text_mapper.py' 2025-06-05T03:37:34,784 INFO:wheel:adding 'data_juicer/ops/mapper/extract_tables_from_html_mapper.py' 2025-06-05T03:37:34,785 INFO:wheel:adding 'data_juicer/ops/mapper/fix_unicode_mapper.py' 2025-06-05T03:37:34,788 INFO:wheel:adding 'data_juicer/ops/mapper/generate_qa_from_examples_mapper.py' 2025-06-05T03:37:34,789 INFO:wheel:adding 'data_juicer/ops/mapper/generate_qa_from_text_mapper.py' 2025-06-05T03:37:34,791 INFO:wheel:adding 'data_juicer/ops/mapper/image_blur_mapper.py' 2025-06-05T03:37:34,793 INFO:wheel:adding 'data_juicer/ops/mapper/image_captioning_from_gpt4v_mapper.py' 2025-06-05T03:37:34,795 INFO:wheel:adding 'data_juicer/ops/mapper/image_captioning_mapper.py' 2025-06-05T03:37:34,797 INFO:wheel:adding 'data_juicer/ops/mapper/image_diffusion_mapper.py' 2025-06-05T03:37:34,799 INFO:wheel:adding 'data_juicer/ops/mapper/image_face_blur_mapper.py' 2025-06-05T03:37:34,801 INFO:wheel:adding 'data_juicer/ops/mapper/image_remove_background_mapper.py' 2025-06-05T03:37:34,802 INFO:wheel:adding 'data_juicer/ops/mapper/image_segment_mapper.py' 2025-06-05T03:37:34,803 INFO:wheel:adding 'data_juicer/ops/mapper/image_tagging_mapper.py' 2025-06-05T03:37:34,805 INFO:wheel:adding 'data_juicer/ops/mapper/mllm_mapper.py' 2025-06-05T03:37:34,806 INFO:wheel:adding 'data_juicer/ops/mapper/nlpaug_en_mapper.py' 2025-06-05T03:37:34,808 INFO:wheel:adding 'data_juicer/ops/mapper/nlpcda_zh_mapper.py' 2025-06-05T03:37:34,809 INFO:wheel:adding 'data_juicer/ops/mapper/optimize_qa_mapper.py' 2025-06-05T03:37:34,811 INFO:wheel:adding 'data_juicer/ops/mapper/optimize_query_mapper.py' 2025-06-05T03:37:34,812 INFO:wheel:adding 'data_juicer/ops/mapper/optimize_response_mapper.py' 2025-06-05T03:37:34,813 INFO:wheel:adding 'data_juicer/ops/mapper/pair_preference_mapper.py' 2025-06-05T03:37:34,815 INFO:wheel:adding 'data_juicer/ops/mapper/punctuation_normalization_mapper.py' 2025-06-05T03:37:34,816 INFO:wheel:adding 'data_juicer/ops/mapper/python_file_mapper.py' 2025-06-05T03:37:34,817 INFO:wheel:adding 'data_juicer/ops/mapper/python_lambda_mapper.py' 2025-06-05T03:37:34,819 INFO:wheel:adding 'data_juicer/ops/mapper/query_intent_detection_mapper.py' 2025-06-05T03:37:34,820 INFO:wheel:adding 'data_juicer/ops/mapper/query_sentiment_detection_mapper.py' 2025-06-05T03:37:34,821 INFO:wheel:adding 'data_juicer/ops/mapper/query_topic_detection_mapper.py' 2025-06-05T03:37:34,823 INFO:wheel:adding 'data_juicer/ops/mapper/relation_identity_mapper.py' 2025-06-05T03:37:34,824 INFO:wheel:adding 'data_juicer/ops/mapper/remove_bibliography_mapper.py' 2025-06-05T03:37:34,826 INFO:wheel:adding 'data_juicer/ops/mapper/remove_comments_mapper.py' 2025-06-05T03:37:34,827 INFO:wheel:adding 'data_juicer/ops/mapper/remove_header_mapper.py' 2025-06-05T03:37:34,829 INFO:wheel:adding 'data_juicer/ops/mapper/remove_long_words_mapper.py' 2025-06-05T03:37:34,830 INFO:wheel:adding 'data_juicer/ops/mapper/remove_non_chinese_character_mapper.py' 2025-06-05T03:37:34,831 INFO:wheel:adding 'data_juicer/ops/mapper/remove_repeat_sentences_mapper.py' 2025-06-05T03:37:34,833 INFO:wheel:adding 'data_juicer/ops/mapper/remove_specific_chars_mapper.py' 2025-06-05T03:37:34,834 INFO:wheel:adding 'data_juicer/ops/mapper/remove_table_text_mapper.py' 2025-06-05T03:37:34,836 INFO:wheel:adding 'data_juicer/ops/mapper/remove_words_with_incorrect_substrings_mapper.py' 2025-06-05T03:37:34,837 INFO:wheel:adding 'data_juicer/ops/mapper/replace_content_mapper.py' 2025-06-05T03:37:34,839 INFO:wheel:adding 'data_juicer/ops/mapper/sdxl_prompt2prompt_mapper.py' 2025-06-05T03:37:34,840 INFO:wheel:adding 'data_juicer/ops/mapper/sentence_augmentation_mapper.py' 2025-06-05T03:37:34,841 INFO:wheel:adding 'data_juicer/ops/mapper/sentence_split_mapper.py' 2025-06-05T03:37:34,843 INFO:wheel:adding 'data_juicer/ops/mapper/text_chunk_mapper.py' 2025-06-05T03:37:34,845 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_audio_mapper.py' 2025-06-05T03:37:34,847 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_frames_mapper.py' 2025-06-05T03:37:34,850 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_summarizer_mapper.py' 2025-06-05T03:37:34,852 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_video_mapper.py' 2025-06-05T03:37:34,854 INFO:wheel:adding 'data_juicer/ops/mapper/video_extract_frames_mapper.py' 2025-06-05T03:37:34,856 INFO:wheel:adding 'data_juicer/ops/mapper/video_face_blur_mapper.py' 2025-06-05T03:37:34,857 INFO:wheel:adding 'data_juicer/ops/mapper/video_ffmpeg_wrapped_mapper.py' 2025-06-05T03:37:34,859 INFO:wheel:adding 'data_juicer/ops/mapper/video_remove_watermark_mapper.py' 2025-06-05T03:37:34,861 INFO:wheel:adding 'data_juicer/ops/mapper/video_resize_aspect_ratio_mapper.py' 2025-06-05T03:37:34,862 INFO:wheel:adding 'data_juicer/ops/mapper/video_resize_resolution_mapper.py' 2025-06-05T03:37:34,864 INFO:wheel:adding 'data_juicer/ops/mapper/video_split_by_duration_mapper.py' 2025-06-05T03:37:34,866 INFO:wheel:adding 'data_juicer/ops/mapper/video_split_by_key_frame_mapper.py' 2025-06-05T03:37:34,867 INFO:wheel:adding 'data_juicer/ops/mapper/video_split_by_scene_mapper.py' 2025-06-05T03:37:34,869 INFO:wheel:adding 'data_juicer/ops/mapper/video_tagging_from_audio_mapper.py' 2025-06-05T03:37:34,871 INFO:wheel:adding 'data_juicer/ops/mapper/video_tagging_from_frames_mapper.py' 2025-06-05T03:37:34,872 INFO:wheel:adding 'data_juicer/ops/mapper/whitespace_normalization_mapper.py' 2025-06-05T03:37:34,874 INFO:wheel:adding 'data_juicer/ops/mapper/annotation/__init__.py' 2025-06-05T03:37:34,878 INFO:wheel:adding 'data_juicer/ops/mapper/annotation/annotation_mapper.py' 2025-06-05T03:37:34,880 INFO:wheel:adding 'data_juicer/ops/mapper/annotation/human_preference_annotation_mapper.py' 2025-06-05T03:37:34,882 INFO:wheel:adding 'data_juicer/ops/selector/__init__.py' 2025-06-05T03:37:34,883 INFO:wheel:adding 'data_juicer/ops/selector/frequency_specified_field_selector.py' 2025-06-05T03:37:34,885 INFO:wheel:adding 'data_juicer/ops/selector/random_selector.py' 2025-06-05T03:37:34,886 INFO:wheel:adding 'data_juicer/ops/selector/range_specified_field_selector.py' 2025-06-05T03:37:34,887 INFO:wheel:adding 'data_juicer/ops/selector/tags_specified_field_selector.py' 2025-06-05T03:37:34,889 INFO:wheel:adding 'data_juicer/ops/selector/topk_specified_field_selector.py' 2025-06-05T03:37:34,890 INFO:wheel:adding 'data_juicer/tools/__init__.py' 2025-06-05T03:37:34,891 INFO:wheel:adding 'data_juicer/tools/analyze_data.py' 2025-06-05T03:37:34,893 INFO:wheel:adding 'data_juicer/tools/data_resplit.py' 2025-06-05T03:37:34,894 INFO:wheel:adding 'data_juicer/tools/dj_install.py' 2025-06-05T03:37:34,896 INFO:wheel:adding 'data_juicer/tools/generate_smtp_cert.py' 2025-06-05T03:37:34,897 INFO:wheel:adding 'data_juicer/tools/process_data.py' 2025-06-05T03:37:34,898 INFO:wheel:adding 'data_juicer/tools/sandbox_starter.py' 2025-06-05T03:37:34,900 INFO:wheel:adding 'data_juicer/utils/__init__.py' 2025-06-05T03:37:34,902 INFO:wheel:adding 'data_juicer/utils/asset_utils.py' 2025-06-05T03:37:34,903 INFO:wheel:adding 'data_juicer/utils/auto_install_mapping.py' 2025-06-05T03:37:34,905 INFO:wheel:adding 'data_juicer/utils/auto_install_utils.py' 2025-06-05T03:37:34,906 INFO:wheel:adding 'data_juicer/utils/availability_utils.py' 2025-06-05T03:37:34,908 INFO:wheel:adding 'data_juicer/utils/cache_utils.py' 2025-06-05T03:37:34,909 INFO:wheel:adding 'data_juicer/utils/ckpt_utils.py' 2025-06-05T03:37:34,911 INFO:wheel:adding 'data_juicer/utils/common_utils.py' 2025-06-05T03:37:34,913 INFO:wheel:adding 'data_juicer/utils/compress.py' 2025-06-05T03:37:34,916 INFO:wheel:adding 'data_juicer/utils/constant.py' 2025-06-05T03:37:34,918 INFO:wheel:adding 'data_juicer/utils/file_utils.py' 2025-06-05T03:37:34,920 INFO:wheel:adding 'data_juicer/utils/fingerprint_utils.py' 2025-06-05T03:37:34,921 INFO:wheel:adding 'data_juicer/utils/lazy_loader.py' 2025-06-05T03:37:34,923 INFO:wheel:adding 'data_juicer/utils/logger_utils.py' 2025-06-05T03:37:34,928 INFO:wheel:adding 'data_juicer/utils/mm_utils.py' 2025-06-05T03:37:34,933 INFO:wheel:adding 'data_juicer/utils/model_utils.py' 2025-06-05T03:37:34,935 INFO:wheel:adding 'data_juicer/utils/nltk_utils.py' 2025-06-05T03:37:34,937 INFO:wheel:adding 'data_juicer/utils/process_utils.py' 2025-06-05T03:37:34,938 INFO:wheel:adding 'data_juicer/utils/registry.py' 2025-06-05T03:37:34,939 INFO:wheel:adding 'data_juicer/utils/resource_utils.py' 2025-06-05T03:37:34,941 INFO:wheel:adding 'data_juicer/utils/sample.py' 2025-06-05T03:37:34,943 INFO:wheel:adding 'data_juicer/utils/unittest_utils.py' 2025-06-05T03:37:34,947 INFO:wheel:adding 'py_data_juicer-1.3.2.dist-info/licenses/LICENSE' 2025-06-05T03:37:34,952 INFO:wheel:adding 'py_data_juicer-1.3.2.dist-info/METADATA' 2025-06-05T03:37:34,954 INFO:wheel:adding 'py_data_juicer-1.3.2.dist-info/WHEEL' 2025-06-05T03:37:34,955 INFO:wheel:adding 'py_data_juicer-1.3.2.dist-info/entry_points.txt' 2025-06-05T03:37:34,956 INFO:wheel:adding 'py_data_juicer-1.3.2.dist-info/top_level.txt' 2025-06-05T03:37:34,960 INFO:wheel:adding 'py_data_juicer-1.3.2.dist-info/RECORD' 2025-06-05T03:37:34,969 INFO:root:removing build/bdist.linux-armv7l/wheel 2025-06-05T03:37:35,131 Building wheel for py-data-juicer (setup.py): finished with status 'done' 2025-06-05T03:37:35,138 Created wheel for py-data-juicer: filename=py_data_juicer-1.3.2-py3-none-any.whl size=483579 sha256=f45d474e08936bc888564b816c2b2b1a7b58c241fc40c381e182efeee84238a3 2025-06-05T03:37:35,139 Stored in directory: /tmp/pip-ephem-wheel-cache-kac7qprv/wheels/0e/ba/1b/514bbfc8b8b7f93ea096b5bacb0a2dfd5adae443bf19d35077 2025-06-05T03:37:35,169 Successfully built py-data-juicer 2025-06-05T03:37:35,191 Removed build tracker: '/tmp/pip-build-tracker-5t3c6rya'