2025-05-31T14:50:38,701 Created temporary directory: /tmp/pip-build-tracker-i5jhcsyd 2025-05-31T14:50:38,702 Initialized build tracking at /tmp/pip-build-tracker-i5jhcsyd 2025-05-31T14:50:38,703 Created build tracker: /tmp/pip-build-tracker-i5jhcsyd 2025-05-31T14:50:38,703 Entered build tracker: /tmp/pip-build-tracker-i5jhcsyd 2025-05-31T14:50:38,704 Created temporary directory: /tmp/pip-wheel-i5czuh96 2025-05-31T14:50:38,709 Created temporary directory: /tmp/pip-ephem-wheel-cache-k9xi46ht 2025-05-31T14:50:38,763 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-05-31T14:50:38,765 2 location(s) to search for versions of py-data-juicer: 2025-05-31T14:50:38,765 * https://pypi.org/simple/py-data-juicer/ 2025-05-31T14:50:38,765 * https://www.piwheels.org/simple/py-data-juicer/ 2025-05-31T14:50:38,766 Fetching project page and analyzing links: https://pypi.org/simple/py-data-juicer/ 2025-05-31T14:50:38,767 Getting page https://pypi.org/simple/py-data-juicer/ 2025-05-31T14:50:38,769 Found index url https://pypi.org/simple/ 2025-05-31T14:50:38,996 Fetched page https://pypi.org/simple/py-data-juicer/ as application/vnd.pypi.simple.v1+json 2025-05-31T14:50:39,007 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/e3/d3/d0724d922e7c55a0664485fdf642124bdcd801df2697e29c9463c4360958/py_data_juicer-0.1.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,007 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/fd/c6/e1428310bf534319ddc2e362c6eee21985f097e70cedc0e9cd3e914de826/py_data_juicer-0.1.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,008 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/67/70/731e349d2a92bf59a767230b956665dd27ea24a7a948d4a1710154d77a24/py_data_juicer-0.1.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,009 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/66/08/f584efbdf8277a061ce73c9befc33fec813cbd08020761af2380dd331692/py_data_juicer-0.1.3-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,009 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/69/20/9862dfe7a94f10caa0e6387834d1265420613345c7adadd41a3ced7230e8/py_data_juicer-0.2.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,010 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/05/34/33fb401d350f1a66cdaefc224419f9d01484722829f5b0c71ef59a8365a4/py_data_juicer-1.0.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,011 Found link https://files.pythonhosted.org/packages/49/e9/cfb994255490c36554048e0b3956858e61f5a75ffacfde20849f31f04ef8/py_data_juicer-1.0.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.0 2025-05-31T14:50:39,012 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/27/37/150e2198f14349fdd6b647a63feaede809e21dbfc4617aba487f18049828/py_data_juicer-1.0.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,013 Found link https://files.pythonhosted.org/packages/4a/60/dadcbe4337a76d8f98022b60d142ac31073d568c9e06a5461b3014d16fc6/py_data_juicer-1.0.1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.1 2025-05-31T14:50:39,014 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/10/dd/f9cadd6ed2f19c4f94e61a3fd4bb5bb2029c20b2cf61ec86c3a05f89f74b/py_data_juicer-1.0.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,014 Found link https://files.pythonhosted.org/packages/95/8f/c0447ce091ca0b86283e8074a3cb350572a5436ca13dceb288887a0de332/py_data_juicer-1.0.2.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.2 2025-05-31T14:50:39,015 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/1f/dd/0f1804c5cfce50c52c8b60a1b710f30d893f681e42a6acc41bfd29a20059/py_data_juicer-1.0.3-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,016 Found link https://files.pythonhosted.org/packages/6e/d0/eefb1ca00cd4c8e62c0e8f62a1b5357b961ce50c5d2647107f51f68bf128/py_data_juicer-1.0.3.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.0.3 2025-05-31T14:50:39,017 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/5b/06/5df5581f724d49731b91db0b7c65fdde739358617af9e7ba9916c43f2a76/py_data_juicer-1.1.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,018 Found link https://files.pythonhosted.org/packages/69/43/f20a50fdbfc0ba44a4816d3c7f9bfcfaf55682ffebf78a10eeff6b40feeb/py_data_juicer-1.1.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.1.0 2025-05-31T14:50:39,019 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/a0/90/a95bb5b60200125b9cbb29bf1c39b706c3e5f50b247d3599386f7be973ea/py_data_juicer-1.1.0.post1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,019 Found link https://files.pythonhosted.org/packages/79/f8/3e29fb4eeeda4413bc746b0a661bc61d5786f8bf67ae9071976550fd0b41/py_data_juicer-1.1.0.post1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.1.0.post1 2025-05-31T14:50:39,020 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/9d/ca/e32ecfafde96a367dcb63a4c4ee104bc5e952205c55169fd1120df0e86ee/py_data_juicer-1.2.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,021 Found link https://files.pythonhosted.org/packages/3d/bc/607884c148a0b6bf2198960166743e95818d230e970cbcdbd69d02758541/py_data_juicer-1.2.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.2.0 2025-05-31T14:50:39,022 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/80/a2/4788daf084637e94c05615f237298e1d7fe13753f7622907ce0904a4b18c/py_data_juicer-1.2.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,023 Found link https://files.pythonhosted.org/packages/df/b6/0ca40521ecc5d3a79f8e3614af2974f4a50ae0c175fee658c8f2b091eb6b/py_data_juicer-1.2.1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.2.1 2025-05-31T14:50:39,023 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/2d/ec/75b61b05a4b19fe2e425f10b0e7f023c13d404be4ec7aa7b39ad85f18e3f/py_data_juicer-1.2.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,024 Found link https://files.pythonhosted.org/packages/8e/01/e4ad384e1c25acb029fd433f6bcc5d0c146cf729bd3801308cc681354ac1/py_data_juicer-1.2.2.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.2.2 2025-05-31T14:50:39,025 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/a6/3a/6b6ec164c0b6270a2dde6c03c449387f21e3f72d98ecc2e1bb98d7c6224f/py_data_juicer-1.3.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,026 Found link https://files.pythonhosted.org/packages/6c/56/b2fccdf35cfbc8bd34f90baa136bd86a81205f35e4ed1057503fb33a1ce8/py_data_juicer-1.3.0.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.0 2025-05-31T14:50:39,027 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/f7/b5/a43d80e76bc0c728fe1a380b24d5d9c4566bfc867c29191075be6cd03f1c/py_data_juicer-1.3.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,027 Found link https://files.pythonhosted.org/packages/78/dd/71fd69f0652b12c930f7cd5d72b99e36fb32a923fdae0ec393c5bf2033a8/py_data_juicer-1.3.1.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.1 2025-05-31T14:50:39,028 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/28/2b/f5005649d29bbff5235ada3c1465fc972fcc59816a874f04bcaf4963f58e/py_data_juicer-1.3.2-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,029 Found link https://files.pythonhosted.org/packages/b0/17/0a05201a5190476b0e9ea9483f3406769f2155da6c9ff1d99d78b783b621/py_data_juicer-1.3.2.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.2 2025-05-31T14:50:39,030 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/85/42/6ac1b8d0dd752bd5d4e156c1ccc6452e6f483481e1826aa923932fba7a6e/py_data_juicer-1.3.3-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) 2025-05-31T14:50:39,031 Found link https://files.pythonhosted.org/packages/22/f8/30a56cc7fd809a4892234b3cb39696013bd22e6a0ce35e52207a7951b87d/py_data_juicer-1.3.3.tar.gz (from https://pypi.org/simple/py-data-juicer/), version: 1.3.3 2025-05-31T14:50:39,032 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/44/5c/a550f1dc80743cda2513177b4d8f5a50ffdcbd320efe20078380ce2d2e0c/py_data_juicer-1.4.0-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10) 2025-05-31T14:50:39,033 Found link https://files.pythonhosted.org/packages/cb/de/07edb7f56fea68160c4107b6df3e1affe79a19f6ac64b7545a070860c130/py_data_juicer-1.4.0.tar.gz (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10), version: 1.4.0 2025-05-31T14:50:39,034 Skipping link: No binaries permitted for py-data-juicer: https://files.pythonhosted.org/packages/ca/09/ea50d9a6dbc5d00b4979d52a8ac8da1b4fa7332567a043048ac58c4cfb3e/py_data_juicer-1.4.1-py3-none-any.whl (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10) 2025-05-31T14:50:39,035 Found link https://files.pythonhosted.org/packages/c5/6f/df7cee4e71b590d05b2f935fa02f4a077cd109674561818ddca24a4c0779/py_data_juicer-1.4.1.tar.gz (from https://pypi.org/simple/py-data-juicer/) (requires-python:>=3.10), version: 1.4.1 2025-05-31T14:50:39,036 Fetching project page and analyzing links: https://www.piwheels.org/simple/py-data-juicer/ 2025-05-31T14:50:39,037 Getting page https://www.piwheels.org/simple/py-data-juicer/ 2025-05-31T14:50:39,039 Found index url https://www.piwheels.org/simple/ 2025-05-31T14:50:39,210 WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-05-31T14:50:39,867 WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-05-31T14:50:41,041 WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-05-31T14:50:43,210 WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-05-31T14:50:47,380 WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))': /simple/py-data-juicer/ 2025-05-31T14:50:47,554 Could not fetch URL https://www.piwheels.org/simple/py-data-juicer/: There was a problem confirming the ssl certificate: HTTPSConnectionPool(host='www.piwheels.org', port=443): Max retries exceeded with url: /simple/py-data-juicer/ (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate is not yet valid (_ssl.c:992)'))) - skipping 2025-05-31T14:50:47,556 Skipping link: not a file: https://www.piwheels.org/simple/py-data-juicer/ 2025-05-31T14:50:47,557 Skipping link: not a file: https://pypi.org/simple/py-data-juicer/ 2025-05-31T14:50:47,580 Given no hashes to check 1 links for project 'py-data-juicer': discarding no candidates 2025-05-31T14:50:47,582 Collecting py-data-juicer==1.3.1 2025-05-31T14:50:47,585 Created temporary directory: /tmp/pip-unpack-a05djfrw 2025-05-31T14:50:47,815 Downloading py_data_juicer-1.3.1.tar.gz (337 kB) 2025-05-31T14:50:48,474 Added py-data-juicer==1.3.1 from https://files.pythonhosted.org/packages/78/dd/71fd69f0652b12c930f7cd5d72b99e36fb32a923fdae0ec393c5bf2033a8/py_data_juicer-1.3.1.tar.gz to build tracker '/tmp/pip-build-tracker-i5jhcsyd' 2025-05-31T14:50:48,477 Running setup.py (path:/tmp/pip-wheel-i5czuh96/py-data-juicer_368c91eb02534ba69a7aa1ff867ef092/setup.py) egg_info for package py-data-juicer 2025-05-31T14:50:48,478 Created temporary directory: /tmp/pip-pip-egg-info-uw7ae4hr 2025-05-31T14:50:48,479 Preparing metadata (setup.py): started 2025-05-31T14:50:48,480 Running command python setup.py egg_info 2025-05-31T14:50:48,988 WARNING:root:target file does not exist: environments/minimal_requires.txt 2025-05-31T14:50:48,988 WARNING:root:target file does not exist: environments/science_requires.txt 2025-05-31T14:50:48,988 WARNING:root:target file does not exist: environments/dist_requires.txt 2025-05-31T14:50:48,989 WARNING:root:target file does not exist: environments/dev_requires.txt 2025-05-31T14:50:48,990 WARNING:root:target file does not exist: environments/preprocess_requires.txt 2025-05-31T14:50:48,990 WARNING:root:target file does not exist: environments/quality_classifier_requires.txt 2025-05-31T14:50:48,991 WARNING:root:target file does not exist: environments/sandbox_requires.txt 2025-05-31T14:50:49,404 /usr/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2025-05-31T14:50:49,405 !! 2025-05-31T14:50:49,407 ******************************************************************************** 2025-05-31T14:50:49,408 Please consider removing the following classifiers in favor of a SPDX license expression: 2025-05-31T14:50:49,409 License :: OSI Approved :: Apache Software License 2025-05-31T14:50:49,410 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2025-05-31T14:50:49,411 ******************************************************************************** 2025-05-31T14:50:49,412 !! 2025-05-31T14:50:49,413 self._finalize_license_expression() 2025-05-31T14:50:49,439 INFO:root:running egg_info 2025-05-31T14:50:49,474 INFO:root:creating /tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info 2025-05-31T14:50:49,475 INFO:root:writing /tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/PKG-INFO 2025-05-31T14:50:49,482 INFO:root:writing dependency_links to /tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/dependency_links.txt 2025-05-31T14:50:49,485 INFO:root:writing entry points to /tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/entry_points.txt 2025-05-31T14:50:49,487 INFO:root:writing requirements to /tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/requires.txt 2025-05-31T14:50:49,488 INFO:root:writing top-level names to /tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/top_level.txt 2025-05-31T14:50:49,492 INFO:root:writing manifest file '/tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/SOURCES.txt' 2025-05-31T14:50:49,636 INFO:root:reading manifest file '/tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/SOURCES.txt' 2025-05-31T14:50:49,638 INFO:root:adding license file 'LICENSE' 2025-05-31T14:50:49,648 INFO:root:writing manifest file '/tmp/pip-pip-egg-info-uw7ae4hr/py_data_juicer.egg-info/SOURCES.txt' 2025-05-31T14:50:49,738 Preparing metadata (setup.py): finished with status 'done' 2025-05-31T14:50:49,745 Source in /tmp/pip-wheel-i5czuh96/py-data-juicer_368c91eb02534ba69a7aa1ff867ef092 has version 1.3.1, which satisfies requirement py-data-juicer==1.3.1 from https://files.pythonhosted.org/packages/78/dd/71fd69f0652b12c930f7cd5d72b99e36fb32a923fdae0ec393c5bf2033a8/py_data_juicer-1.3.1.tar.gz 2025-05-31T14:50:49,746 Removed py-data-juicer==1.3.1 from https://files.pythonhosted.org/packages/78/dd/71fd69f0652b12c930f7cd5d72b99e36fb32a923fdae0ec393c5bf2033a8/py_data_juicer-1.3.1.tar.gz from build tracker '/tmp/pip-build-tracker-i5jhcsyd' 2025-05-31T14:50:49,755 Created temporary directory: /tmp/pip-unpack-m_ewxqr5 2025-05-31T14:50:49,756 Created temporary directory: /tmp/pip-unpack-wye0kpq3 2025-05-31T14:50:49,756 Building wheels for collected packages: py-data-juicer 2025-05-31T14:50:49,761 Created temporary directory: /tmp/pip-wheel-zt4omyqs 2025-05-31T14:50:49,762 DEPRECATION: Building 'py-data-juicer' using the legacy setup.py bdist_wheel mechanism, which will be removed in a future version. pip 25.3 will enforce this behaviour change. A possible replacement is to use the standardized build interface by setting the `--use-pep517` option, (possibly combined with `--no-build-isolation`), or adding a `pyproject.toml` file to the source tree of 'py-data-juicer'. Discussion can be found at https://github.com/pypa/pip/issues/6334 2025-05-31T14:50:49,763 Building wheel for py-data-juicer (setup.py): started 2025-05-31T14:50:49,765 Destination directory: /tmp/pip-wheel-zt4omyqs 2025-05-31T14:50:49,765 Running command python setup.py bdist_wheel 2025-05-31T14:50:50,274 WARNING:root:target file does not exist: environments/minimal_requires.txt 2025-05-31T14:50:50,275 WARNING:root:target file does not exist: environments/science_requires.txt 2025-05-31T14:50:50,275 WARNING:root:target file does not exist: environments/dist_requires.txt 2025-05-31T14:50:50,277 WARNING:root:target file does not exist: environments/dev_requires.txt 2025-05-31T14:50:50,277 WARNING:root:target file does not exist: environments/preprocess_requires.txt 2025-05-31T14:50:50,278 WARNING:root:target file does not exist: environments/quality_classifier_requires.txt 2025-05-31T14:50:50,279 WARNING:root:target file does not exist: environments/sandbox_requires.txt 2025-05-31T14:50:50,696 /usr/local/lib/python3.11/dist-packages/setuptools/dist.py:759: SetuptoolsDeprecationWarning: License classifiers are deprecated. 2025-05-31T14:50:50,697 !! 2025-05-31T14:50:50,698 ******************************************************************************** 2025-05-31T14:50:50,699 Please consider removing the following classifiers in favor of a SPDX license expression: 2025-05-31T14:50:50,700 License :: OSI Approved :: Apache Software License 2025-05-31T14:50:50,702 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2025-05-31T14:50:50,702 ******************************************************************************** 2025-05-31T14:50:50,704 !! 2025-05-31T14:50:50,705 self._finalize_license_expression() 2025-05-31T14:50:50,705 INFO:root:running bdist_wheel 2025-05-31T14:50:50,864 INFO:root:running build 2025-05-31T14:50:50,864 INFO:root:running build_py 2025-05-31T14:50:50,898 INFO:root:creating build/lib/data_juicer 2025-05-31T14:50:50,900 INFO:root:copying data_juicer/__init__.py -> build/lib/data_juicer 2025-05-31T14:50:50,903 INFO:root:creating build/lib/data_juicer/analysis 2025-05-31T14:50:50,904 INFO:root:copying data_juicer/analysis/diversity_analysis.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,907 INFO:root:copying data_juicer/analysis/measure.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,909 INFO:root:copying data_juicer/analysis/overall_analysis.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,912 INFO:root:copying data_juicer/analysis/collector.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,914 INFO:root:copying data_juicer/analysis/column_wise_analysis.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,917 INFO:root:copying data_juicer/analysis/draw.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,919 INFO:root:copying data_juicer/analysis/__init__.py -> build/lib/data_juicer/analysis 2025-05-31T14:50:50,922 INFO:root:creating build/lib/data_juicer/core 2025-05-31T14:50:50,923 INFO:root:copying data_juicer/core/tracer.py -> build/lib/data_juicer/core 2025-05-31T14:50:50,926 INFO:root:copying data_juicer/core/exporter.py -> build/lib/data_juicer/core 2025-05-31T14:50:50,929 INFO:root:copying data_juicer/core/analyzer.py -> build/lib/data_juicer/core 2025-05-31T14:50:50,931 INFO:root:copying data_juicer/core/monitor.py -> build/lib/data_juicer/core 2025-05-31T14:50:50,935 INFO:root:copying data_juicer/core/adapter.py -> build/lib/data_juicer/core 2025-05-31T14:50:50,937 INFO:root:copying data_juicer/core/__init__.py -> build/lib/data_juicer/core 2025-05-31T14:50:50,940 INFO:root:creating build/lib/data_juicer/format 2025-05-31T14:50:50,941 INFO:root:copying data_juicer/format/tsv_formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,945 INFO:root:copying data_juicer/format/csv_formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,947 INFO:root:copying data_juicer/format/parquet_formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,949 INFO:root:copying data_juicer/format/load.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,951 INFO:root:copying data_juicer/format/json_formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,953 INFO:root:copying data_juicer/format/text_formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,956 INFO:root:copying data_juicer/format/formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,959 INFO:root:copying data_juicer/format/__init__.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,961 INFO:root:copying data_juicer/format/empty_formatter.py -> build/lib/data_juicer/format 2025-05-31T14:50:50,965 INFO:root:creating build/lib/data_juicer/utils 2025-05-31T14:50:50,966 INFO:root:copying data_juicer/utils/lazy_loader.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,969 INFO:root:copying data_juicer/utils/compress.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,972 INFO:root:copying data_juicer/utils/auto_install_mapping.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,975 INFO:root:copying data_juicer/utils/fingerprint_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,978 INFO:root:copying data_juicer/utils/availability_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,980 INFO:root:copying data_juicer/utils/file_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,983 INFO:root:copying data_juicer/utils/resource_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,985 INFO:root:copying data_juicer/utils/process_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,988 INFO:root:copying data_juicer/utils/auto_install_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,990 INFO:root:copying data_juicer/utils/constant.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,993 INFO:root:copying data_juicer/utils/mm_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:50,997 INFO:root:copying data_juicer/utils/model_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,000 INFO:root:copying data_juicer/utils/logger_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,004 INFO:root:copying data_juicer/utils/ckpt_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,006 INFO:root:copying data_juicer/utils/registry.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,009 INFO:root:copying data_juicer/utils/sample.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,013 INFO:root:copying data_juicer/utils/asset_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,015 INFO:root:copying data_juicer/utils/common_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,017 INFO:root:copying data_juicer/utils/nltk_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,020 INFO:root:copying data_juicer/utils/unittest_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,023 INFO:root:copying data_juicer/utils/__init__.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,025 INFO:root:copying data_juicer/utils/cache_utils.py -> build/lib/data_juicer/utils 2025-05-31T14:50:51,027 INFO:root:creating build/lib/data_juicer/download 2025-05-31T14:50:51,029 INFO:root:copying data_juicer/download/commoncrawl.py -> build/lib/data_juicer/download 2025-05-31T14:50:51,031 INFO:root:copying data_juicer/download/arxiv.py -> build/lib/data_juicer/download 2025-05-31T14:50:51,034 INFO:root:copying data_juicer/download/wikipedia.py -> build/lib/data_juicer/download 2025-05-31T14:50:51,038 INFO:root:copying data_juicer/download/downloader.py -> build/lib/data_juicer/download 2025-05-31T14:50:51,041 INFO:root:copying data_juicer/download/__init__.py -> build/lib/data_juicer/download 2025-05-31T14:50:51,044 INFO:root:creating build/lib/data_juicer/ops 2025-05-31T14:50:51,045 INFO:root:copying data_juicer/ops/mixins.py -> build/lib/data_juicer/ops 2025-05-31T14:50:51,049 INFO:root:copying data_juicer/ops/base_op.py -> build/lib/data_juicer/ops 2025-05-31T14:50:51,052 INFO:root:copying data_juicer/ops/load.py -> build/lib/data_juicer/ops 2025-05-31T14:50:51,054 INFO:root:copying data_juicer/ops/op_fusion.py -> build/lib/data_juicer/ops 2025-05-31T14:50:51,057 INFO:root:copying data_juicer/ops/__init__.py -> build/lib/data_juicer/ops 2025-05-31T14:50:51,060 INFO:root:creating build/lib/data_juicer/config 2025-05-31T14:50:51,061 INFO:root:copying data_juicer/config/config.py -> build/lib/data_juicer/config 2025-05-31T14:50:51,065 INFO:root:copying data_juicer/config/__init__.py -> build/lib/data_juicer/config 2025-05-31T14:50:51,068 INFO:root:creating build/lib/data_juicer/core/data 2025-05-31T14:50:51,071 INFO:root:copying data_juicer/core/data/schema.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,073 INFO:root:copying data_juicer/core/data/ray_dataset.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,077 INFO:root:copying data_juicer/core/data/dataset_builder.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,080 INFO:root:copying data_juicer/core/data/config_validator.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,083 INFO:root:copying data_juicer/core/data/dj_dataset.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,086 INFO:root:copying data_juicer/core/data/data_validator.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,090 INFO:root:copying data_juicer/core/data/__init__.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,092 INFO:root:copying data_juicer/core/data/load_strategy.py -> build/lib/data_juicer/core/data 2025-05-31T14:50:51,096 INFO:root:creating build/lib/data_juicer/core/executor 2025-05-31T14:50:51,098 INFO:root:copying data_juicer/core/executor/default_executor.py -> build/lib/data_juicer/core/executor 2025-05-31T14:50:51,101 INFO:root:copying data_juicer/core/executor/base.py -> build/lib/data_juicer/core/executor 2025-05-31T14:50:51,104 INFO:root:copying data_juicer/core/executor/factory.py -> build/lib/data_juicer/core/executor 2025-05-31T14:50:51,106 INFO:root:copying data_juicer/core/executor/__init__.py -> build/lib/data_juicer/core/executor 2025-05-31T14:50:51,109 INFO:root:copying data_juicer/core/executor/ray_executor.py -> build/lib/data_juicer/core/executor 2025-05-31T14:50:51,113 INFO:root:creating build/lib/data_juicer/ops/grouper 2025-05-31T14:50:51,114 INFO:root:copying data_juicer/ops/grouper/naive_grouper.py -> build/lib/data_juicer/ops/grouper 2025-05-31T14:50:51,118 INFO:root:copying data_juicer/ops/grouper/key_value_grouper.py -> build/lib/data_juicer/ops/grouper 2025-05-31T14:50:51,120 INFO:root:copying data_juicer/ops/grouper/naive_reverse_grouper.py -> build/lib/data_juicer/ops/grouper 2025-05-31T14:50:51,123 INFO:root:copying data_juicer/ops/grouper/__init__.py -> build/lib/data_juicer/ops/grouper 2025-05-31T14:50:51,127 INFO:root:creating build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,128 INFO:root:copying data_juicer/ops/filter/stopwords_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,130 INFO:root:copying data_juicer/ops/filter/flagged_words_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,133 INFO:root:copying data_juicer/ops/filter/image_pair_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,136 INFO:root:copying data_juicer/ops/filter/image_aesthetics_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,139 INFO:root:copying data_juicer/ops/filter/word_repetition_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,142 INFO:root:copying data_juicer/ops/filter/audio_nmf_snr_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,144 INFO:root:copying data_juicer/ops/filter/video_aesthetics_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,147 INFO:root:copying data_juicer/ops/filter/audio_duration_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,150 INFO:root:copying data_juicer/ops/filter/video_nsfw_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,152 INFO:root:copying data_juicer/ops/filter/text_entity_dependency_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,155 INFO:root:copying data_juicer/ops/filter/alphanumeric_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,157 INFO:root:copying data_juicer/ops/filter/video_ocr_area_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,160 INFO:root:copying data_juicer/ops/filter/character_repetition_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,162 INFO:root:copying data_juicer/ops/filter/image_shape_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,165 INFO:root:copying data_juicer/ops/filter/llm_quality_score_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,168 INFO:root:copying data_juicer/ops/filter/image_aspect_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,171 INFO:root:copying data_juicer/ops/filter/specified_field_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,173 INFO:root:copying data_juicer/ops/filter/phrase_grounding_recall_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,177 INFO:root:copying data_juicer/ops/filter/special_characters_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,179 INFO:root:copying data_juicer/ops/filter/video_motion_score_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,182 INFO:root:copying data_juicer/ops/filter/text_length_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,184 INFO:root:copying data_juicer/ops/filter/suffix_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,186 INFO:root:copying data_juicer/ops/filter/words_num_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,189 INFO:root:copying data_juicer/ops/filter/llm_difficulty_score_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,193 INFO:root:copying data_juicer/ops/filter/text_pair_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,196 INFO:root:copying data_juicer/ops/filter/image_text_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,199 INFO:root:copying data_juicer/ops/filter/image_text_matching_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,202 INFO:root:copying data_juicer/ops/filter/specified_numeric_field_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,205 INFO:root:copying data_juicer/ops/filter/image_watermark_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,208 INFO:root:copying data_juicer/ops/filter/video_watermark_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,211 INFO:root:copying data_juicer/ops/filter/video_resolution_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,214 INFO:root:copying data_juicer/ops/filter/token_num_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,216 INFO:root:copying data_juicer/ops/filter/image_size_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,219 INFO:root:copying data_juicer/ops/filter/average_line_length_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,222 INFO:root:copying data_juicer/ops/filter/video_duration_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,225 INFO:root:copying data_juicer/ops/filter/audio_size_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,228 INFO:root:copying data_juicer/ops/filter/maximum_line_length_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,231 INFO:root:copying data_juicer/ops/filter/video_frames_text_similarity_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,234 INFO:root:copying data_juicer/ops/filter/__init__.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,237 INFO:root:copying data_juicer/ops/filter/perplexity_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,240 INFO:root:copying data_juicer/ops/filter/image_face_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,243 INFO:root:copying data_juicer/ops/filter/image_nsfw_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,246 INFO:root:copying data_juicer/ops/filter/video_tagging_from_frames_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,249 INFO:root:copying data_juicer/ops/filter/text_action_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,251 INFO:root:copying data_juicer/ops/filter/image_face_count_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,254 INFO:root:copying data_juicer/ops/filter/language_id_score_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,257 INFO:root:copying data_juicer/ops/filter/video_motion_score_raft_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,259 INFO:root:copying data_juicer/ops/filter/video_aspect_ratio_filter.py -> build/lib/data_juicer/ops/filter 2025-05-31T14:50:51,262 INFO:root:creating build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,264 INFO:root:copying data_juicer/ops/selector/frequency_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,266 INFO:root:copying data_juicer/ops/selector/random_selector.py -> build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,269 INFO:root:copying data_juicer/ops/selector/range_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,272 INFO:root:copying data_juicer/ops/selector/topk_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,275 INFO:root:copying data_juicer/ops/selector/tags_specified_field_selector.py -> build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,277 INFO:root:copying data_juicer/ops/selector/__init__.py -> build/lib/data_juicer/ops/selector 2025-05-31T14:50:51,280 INFO:root:creating build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,281 INFO:root:copying data_juicer/ops/deduplicator/ray_video_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,284 INFO:root:copying data_juicer/ops/deduplicator/document_minhash_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,287 INFO:root:copying data_juicer/ops/deduplicator/video_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,289 INFO:root:copying data_juicer/ops/deduplicator/ray_document_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,292 INFO:root:copying data_juicer/ops/deduplicator/ray_bts_minhash_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,295 INFO:root:copying data_juicer/ops/deduplicator/ray_basic_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,297 INFO:root:copying data_juicer/ops/deduplicator/image_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,300 INFO:root:copying data_juicer/ops/deduplicator/ray_image_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,302 INFO:root:copying data_juicer/ops/deduplicator/document_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,304 INFO:root:copying data_juicer/ops/deduplicator/document_simhash_deduplicator.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,307 INFO:root:copying data_juicer/ops/deduplicator/__init__.py -> build/lib/data_juicer/ops/deduplicator 2025-05-31T14:50:51,310 INFO:root:creating build/lib/data_juicer/ops/aggregator 2025-05-31T14:50:51,312 INFO:root:copying data_juicer/ops/aggregator/entity_attribute_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-05-31T14:50:51,315 INFO:root:copying data_juicer/ops/aggregator/nested_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-05-31T14:50:51,318 INFO:root:copying data_juicer/ops/aggregator/meta_tags_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-05-31T14:50:51,322 INFO:root:copying data_juicer/ops/aggregator/most_relevant_entities_aggregator.py -> build/lib/data_juicer/ops/aggregator 2025-05-31T14:50:51,325 INFO:root:copying data_juicer/ops/aggregator/__init__.py -> build/lib/data_juicer/ops/aggregator 2025-05-31T14:50:51,328 INFO:root:creating build/lib/data_juicer/ops/common 2025-05-31T14:50:51,330 INFO:root:copying data_juicer/ops/common/special_characters.py -> build/lib/data_juicer/ops/common 2025-05-31T14:50:51,332 INFO:root:copying data_juicer/ops/common/prompt2prompt_pipeline.py -> build/lib/data_juicer/ops/common 2025-05-31T14:50:51,336 INFO:root:copying data_juicer/ops/common/helper_func.py -> build/lib/data_juicer/ops/common 2025-05-31T14:50:51,339 INFO:root:copying data_juicer/ops/common/__init__.py -> build/lib/data_juicer/ops/common 2025-05-31T14:50:51,345 INFO:root:creating build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,347 INFO:root:copying data_juicer/ops/mapper/video_face_blur_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,350 INFO:root:copying data_juicer/ops/mapper/sentence_augmentation_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,353 INFO:root:copying data_juicer/ops/mapper/dialog_sentiment_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,356 INFO:root:copying data_juicer/ops/mapper/sentence_split_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,359 INFO:root:copying data_juicer/ops/mapper/generate_qa_from_examples_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,362 INFO:root:copying data_juicer/ops/mapper/dialog_topic_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,364 INFO:root:copying data_juicer/ops/mapper/video_resize_resolution_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,367 INFO:root:copying data_juicer/ops/mapper/extract_tables_from_html_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,369 INFO:root:copying data_juicer/ops/mapper/extract_keyword_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,373 INFO:root:copying data_juicer/ops/mapper/video_tagging_from_frames_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,375 INFO:root:copying data_juicer/ops/mapper/sdxl_prompt2prompt_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,378 INFO:root:copying data_juicer/ops/mapper/clean_copyright_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,381 INFO:root:copying data_juicer/ops/mapper/calibrate_response_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,383 INFO:root:copying data_juicer/ops/mapper/audio_ffmpeg_wrapped_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,386 INFO:root:copying data_juicer/ops/mapper/optimize_response_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,389 INFO:root:copying data_juicer/ops/mapper/clean_html_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,391 INFO:root:copying data_juicer/ops/mapper/remove_specific_chars_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,395 INFO:root:copying data_juicer/ops/mapper/remove_words_with_incorrect_substrings_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,399 INFO:root:copying data_juicer/ops/mapper/dialog_sentiment_intensity_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,403 INFO:root:copying data_juicer/ops/mapper/remove_table_text_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,405 INFO:root:copying data_juicer/ops/mapper/query_topic_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,408 INFO:root:copying data_juicer/ops/mapper/replace_content_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,413 INFO:root:copying data_juicer/ops/mapper/image_captioning_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,417 INFO:root:copying data_juicer/ops/mapper/nlpaug_en_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,421 INFO:root:copying data_juicer/ops/mapper/fix_unicode_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,425 INFO:root:copying data_juicer/ops/mapper/remove_repeat_sentences_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,427 INFO:root:copying data_juicer/ops/mapper/chinese_convert_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,430 INFO:root:copying data_juicer/ops/mapper/video_remove_watermark_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,434 INFO:root:copying data_juicer/ops/mapper/remove_comments_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,438 INFO:root:copying data_juicer/ops/mapper/calibrate_qa_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,441 INFO:root:copying data_juicer/ops/mapper/remove_long_words_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,444 INFO:root:copying data_juicer/ops/mapper/mllm_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,447 INFO:root:copying data_juicer/ops/mapper/python_lambda_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,452 INFO:root:copying data_juicer/ops/mapper/calibrate_query_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,455 INFO:root:copying data_juicer/ops/mapper/whitespace_normalization_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,459 INFO:root:copying data_juicer/ops/mapper/clean_links_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,463 INFO:root:copying data_juicer/ops/mapper/expand_macro_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,467 INFO:root:copying data_juicer/ops/mapper/clean_email_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,469 INFO:root:copying data_juicer/ops/mapper/extract_entity_relation_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,474 INFO:root:copying data_juicer/ops/mapper/video_split_by_scene_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,480 INFO:root:copying data_juicer/ops/mapper/extract_nickname_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,484 INFO:root:copying data_juicer/ops/mapper/optimize_qa_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,487 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_summarizer_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,492 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_frames_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,495 INFO:root:copying data_juicer/ops/mapper/pair_preference_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,500 INFO:root:copying data_juicer/ops/mapper/extract_entity_attribute_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,506 INFO:root:copying data_juicer/ops/mapper/nlpcda_zh_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,508 INFO:root:copying data_juicer/ops/mapper/image_face_blur_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,511 INFO:root:copying data_juicer/ops/mapper/image_segment_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,514 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_video_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,518 INFO:root:copying data_juicer/ops/mapper/video_ffmpeg_wrapped_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,520 INFO:root:copying data_juicer/ops/mapper/image_captioning_from_gpt4v_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,524 INFO:root:copying data_juicer/ops/mapper/image_tagging_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,527 INFO:root:copying data_juicer/ops/mapper/image_blur_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,529 INFO:root:copying data_juicer/ops/mapper/image_diffusion_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,533 INFO:root:copying data_juicer/ops/mapper/extract_event_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,536 INFO:root:copying data_juicer/ops/mapper/remove_bibliography_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,538 INFO:root:copying data_juicer/ops/mapper/dialog_intent_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,541 INFO:root:copying data_juicer/ops/mapper/text_chunk_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,544 INFO:root:copying data_juicer/ops/mapper/generate_qa_from_text_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,547 INFO:root:copying data_juicer/ops/mapper/query_intent_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,550 INFO:root:copying data_juicer/ops/mapper/video_split_by_duration_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,553 INFO:root:copying data_juicer/ops/mapper/optimize_query_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,555 INFO:root:copying data_juicer/ops/mapper/relation_identity_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,559 INFO:root:copying data_juicer/ops/mapper/extract_support_text_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,563 INFO:root:copying data_juicer/ops/mapper/video_resize_aspect_ratio_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,566 INFO:root:copying data_juicer/ops/mapper/audio_add_gaussian_noise_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,569 INFO:root:copying data_juicer/ops/mapper/image_remove_background_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,571 INFO:root:copying data_juicer/ops/mapper/video_captioning_from_audio_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,574 INFO:root:copying data_juicer/ops/mapper/video_extract_frames_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,577 INFO:root:copying data_juicer/ops/mapper/__init__.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,581 INFO:root:copying data_juicer/ops/mapper/punctuation_normalization_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,583 INFO:root:copying data_juicer/ops/mapper/video_split_by_key_frame_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,587 INFO:root:copying data_juicer/ops/mapper/remove_non_chinese_character_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,590 INFO:root:copying data_juicer/ops/mapper/python_file_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,593 INFO:root:copying data_juicer/ops/mapper/query_sentiment_detection_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,596 INFO:root:copying data_juicer/ops/mapper/remove_header_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,599 INFO:root:copying data_juicer/ops/mapper/clean_ip_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,601 INFO:root:copying data_juicer/ops/mapper/video_tagging_from_audio_mapper.py -> build/lib/data_juicer/ops/mapper 2025-05-31T14:50:51,604 INFO:root:creating build/lib/data_juicer/tools 2025-05-31T14:50:51,606 INFO:root:copying tools/process_data.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,608 INFO:root:copying tools/sandbox_starter.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,610 INFO:root:copying tools/generate_smtp_cert.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,614 INFO:root:copying tools/data_resplit.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,616 INFO:root:copying tools/analyze_data.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,618 INFO:root:copying tools/dj_install.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,621 INFO:root:copying tools/__init__.py -> build/lib/data_juicer/tools 2025-05-31T14:50:51,681 /usr/local/lib/python3.11/dist-packages/setuptools/_distutils/cmd.py:90: SetuptoolsDeprecationWarning: setup.py install is deprecated. 2025-05-31T14:50:51,682 !! 2025-05-31T14:50:51,683 ******************************************************************************** 2025-05-31T14:50:51,684 Please avoid running ``setup.py`` directly. 2025-05-31T14:50:51,684 Instead, use pypa/build, pypa/installer or other 2025-05-31T14:50:51,685 standards-based tools. 2025-05-31T14:50:51,687 By 2025-Oct-31, you need to update your project and remove deprecated calls 2025-05-31T14:50:51,687 or your builds will no longer be supported. 2025-05-31T14:50:51,689 See https://blog.ganssle.io/articles/2021/10/setup-py-deprecated.html for details. 2025-05-31T14:50:51,690 ******************************************************************************** 2025-05-31T14:50:51,692 !! 2025-05-31T14:50:51,693 self.initialize_options() 2025-05-31T14:50:51,720 INFO:root:installing to build/bdist.linux-armv7l/wheel 2025-05-31T14:50:51,721 INFO:root:running install 2025-05-31T14:50:51,748 INFO:root:running install_lib 2025-05-31T14:50:51,778 INFO:root:creating build/bdist.linux-armv7l/wheel 2025-05-31T14:50:51,781 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer 2025-05-31T14:50:51,783 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/analysis 2025-05-31T14:50:51,785 INFO:root:copying build/lib/data_juicer/analysis/diversity_analysis.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,787 INFO:root:copying build/lib/data_juicer/analysis/measure.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,790 INFO:root:copying build/lib/data_juicer/analysis/overall_analysis.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,792 INFO:root:copying build/lib/data_juicer/analysis/collector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,794 INFO:root:copying build/lib/data_juicer/analysis/column_wise_analysis.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,797 INFO:root:copying build/lib/data_juicer/analysis/draw.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,799 INFO:root:copying build/lib/data_juicer/analysis/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/analysis 2025-05-31T14:50:51,801 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/tools 2025-05-31T14:50:51,803 INFO:root:copying build/lib/data_juicer/tools/process_data.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,806 INFO:root:copying build/lib/data_juicer/tools/sandbox_starter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,808 INFO:root:copying build/lib/data_juicer/tools/generate_smtp_cert.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,812 INFO:root:copying build/lib/data_juicer/tools/data_resplit.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,814 INFO:root:copying build/lib/data_juicer/tools/analyze_data.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,817 INFO:root:copying build/lib/data_juicer/tools/dj_install.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,819 INFO:root:copying build/lib/data_juicer/tools/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/tools 2025-05-31T14:50:51,822 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/core 2025-05-31T14:50:51,824 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/core/data 2025-05-31T14:50:51,826 INFO:root:copying build/lib/data_juicer/core/data/schema.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,829 INFO:root:copying build/lib/data_juicer/core/data/ray_dataset.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,833 INFO:root:copying build/lib/data_juicer/core/data/dataset_builder.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,838 INFO:root:copying build/lib/data_juicer/core/data/config_validator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,842 INFO:root:copying build/lib/data_juicer/core/data/dj_dataset.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,849 INFO:root:copying build/lib/data_juicer/core/data/data_validator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,858 INFO:root:copying build/lib/data_juicer/core/data/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,865 INFO:root:copying build/lib/data_juicer/core/data/load_strategy.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/data 2025-05-31T14:50:51,876 INFO:root:copying build/lib/data_juicer/core/tracer.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-05-31T14:50:51,891 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/core/executor 2025-05-31T14:50:51,893 INFO:root:copying build/lib/data_juicer/core/executor/default_executor.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-05-31T14:50:51,899 INFO:root:copying build/lib/data_juicer/core/executor/base.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-05-31T14:50:51,902 INFO:root:copying build/lib/data_juicer/core/executor/factory.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-05-31T14:50:51,907 INFO:root:copying build/lib/data_juicer/core/executor/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-05-31T14:50:51,910 INFO:root:copying build/lib/data_juicer/core/executor/ray_executor.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core/executor 2025-05-31T14:50:51,915 INFO:root:copying build/lib/data_juicer/core/exporter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-05-31T14:50:51,921 INFO:root:copying build/lib/data_juicer/core/analyzer.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-05-31T14:50:51,928 INFO:root:copying build/lib/data_juicer/core/monitor.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-05-31T14:50:51,935 INFO:root:copying build/lib/data_juicer/core/adapter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-05-31T14:50:51,942 INFO:root:copying build/lib/data_juicer/core/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/core 2025-05-31T14:50:51,954 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/format 2025-05-31T14:50:51,957 INFO:root:copying build/lib/data_juicer/format/tsv_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:51,967 INFO:root:copying build/lib/data_juicer/format/csv_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:51,976 INFO:root:copying build/lib/data_juicer/format/parquet_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:51,985 INFO:root:copying build/lib/data_juicer/format/load.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:51,993 INFO:root:copying build/lib/data_juicer/format/json_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:51,996 INFO:root:copying build/lib/data_juicer/format/text_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:51,999 INFO:root:copying build/lib/data_juicer/format/formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:52,003 INFO:root:copying build/lib/data_juicer/format/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:52,006 INFO:root:copying build/lib/data_juicer/format/empty_formatter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/format 2025-05-31T14:50:52,010 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/utils 2025-05-31T14:50:52,012 INFO:root:copying build/lib/data_juicer/utils/lazy_loader.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,015 INFO:root:copying build/lib/data_juicer/utils/compress.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,019 INFO:root:copying build/lib/data_juicer/utils/auto_install_mapping.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,022 INFO:root:copying build/lib/data_juicer/utils/fingerprint_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,025 INFO:root:copying build/lib/data_juicer/utils/availability_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,028 INFO:root:copying build/lib/data_juicer/utils/file_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,031 INFO:root:copying build/lib/data_juicer/utils/resource_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,033 INFO:root:copying build/lib/data_juicer/utils/process_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,036 INFO:root:copying build/lib/data_juicer/utils/auto_install_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,038 INFO:root:copying build/lib/data_juicer/utils/constant.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,040 INFO:root:copying build/lib/data_juicer/utils/mm_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,044 INFO:root:copying build/lib/data_juicer/utils/model_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,048 INFO:root:copying build/lib/data_juicer/utils/logger_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,050 INFO:root:copying build/lib/data_juicer/utils/ckpt_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,053 INFO:root:copying build/lib/data_juicer/utils/registry.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,056 INFO:root:copying build/lib/data_juicer/utils/sample.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,059 INFO:root:copying build/lib/data_juicer/utils/asset_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,063 INFO:root:copying build/lib/data_juicer/utils/common_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,066 INFO:root:copying build/lib/data_juicer/utils/nltk_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,069 INFO:root:copying build/lib/data_juicer/utils/unittest_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,073 INFO:root:copying build/lib/data_juicer/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,076 INFO:root:copying build/lib/data_juicer/utils/cache_utils.py -> build/bdist.linux-armv7l/wheel/./data_juicer/utils 2025-05-31T14:50:52,082 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/download 2025-05-31T14:50:52,085 INFO:root:copying build/lib/data_juicer/download/commoncrawl.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-05-31T14:50:52,089 INFO:root:copying build/lib/data_juicer/download/arxiv.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-05-31T14:50:52,096 INFO:root:copying build/lib/data_juicer/download/wikipedia.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-05-31T14:50:52,106 INFO:root:copying build/lib/data_juicer/download/downloader.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-05-31T14:50:52,117 INFO:root:copying build/lib/data_juicer/download/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/download 2025-05-31T14:50:52,129 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops 2025-05-31T14:50:52,136 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/grouper 2025-05-31T14:50:52,139 INFO:root:copying build/lib/data_juicer/ops/grouper/naive_grouper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-05-31T14:50:52,147 INFO:root:copying build/lib/data_juicer/ops/grouper/key_value_grouper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-05-31T14:50:52,154 INFO:root:copying build/lib/data_juicer/ops/grouper/naive_reverse_grouper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-05-31T14:50:52,156 INFO:root:copying build/lib/data_juicer/ops/grouper/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/grouper 2025-05-31T14:50:52,159 INFO:root:copying build/lib/data_juicer/ops/mixins.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-05-31T14:50:52,164 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/filter 2025-05-31T14:50:52,166 INFO:root:copying build/lib/data_juicer/ops/filter/stopwords_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,169 INFO:root:copying build/lib/data_juicer/ops/filter/flagged_words_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,172 INFO:root:copying build/lib/data_juicer/ops/filter/image_pair_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,174 INFO:root:copying build/lib/data_juicer/ops/filter/image_aesthetics_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,177 INFO:root:copying build/lib/data_juicer/ops/filter/word_repetition_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,180 INFO:root:copying build/lib/data_juicer/ops/filter/audio_nmf_snr_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,182 INFO:root:copying build/lib/data_juicer/ops/filter/video_aesthetics_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,185 INFO:root:copying build/lib/data_juicer/ops/filter/audio_duration_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,187 INFO:root:copying build/lib/data_juicer/ops/filter/video_nsfw_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,189 INFO:root:copying build/lib/data_juicer/ops/filter/text_entity_dependency_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,192 INFO:root:copying build/lib/data_juicer/ops/filter/alphanumeric_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,194 INFO:root:copying build/lib/data_juicer/ops/filter/video_ocr_area_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,197 INFO:root:copying build/lib/data_juicer/ops/filter/character_repetition_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,199 INFO:root:copying build/lib/data_juicer/ops/filter/image_shape_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,201 INFO:root:copying build/lib/data_juicer/ops/filter/llm_quality_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,204 INFO:root:copying build/lib/data_juicer/ops/filter/image_aspect_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,206 INFO:root:copying build/lib/data_juicer/ops/filter/specified_field_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,208 INFO:root:copying build/lib/data_juicer/ops/filter/phrase_grounding_recall_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,211 INFO:root:copying build/lib/data_juicer/ops/filter/special_characters_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,213 INFO:root:copying build/lib/data_juicer/ops/filter/video_motion_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,216 INFO:root:copying build/lib/data_juicer/ops/filter/text_length_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,218 INFO:root:copying build/lib/data_juicer/ops/filter/suffix_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,220 INFO:root:copying build/lib/data_juicer/ops/filter/words_num_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,222 INFO:root:copying build/lib/data_juicer/ops/filter/llm_difficulty_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,225 INFO:root:copying build/lib/data_juicer/ops/filter/text_pair_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,227 INFO:root:copying build/lib/data_juicer/ops/filter/image_text_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,229 INFO:root:copying build/lib/data_juicer/ops/filter/image_text_matching_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,232 INFO:root:copying build/lib/data_juicer/ops/filter/specified_numeric_field_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,234 INFO:root:copying build/lib/data_juicer/ops/filter/image_watermark_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,237 INFO:root:copying build/lib/data_juicer/ops/filter/video_watermark_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,239 INFO:root:copying build/lib/data_juicer/ops/filter/video_resolution_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,242 INFO:root:copying build/lib/data_juicer/ops/filter/token_num_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,245 INFO:root:copying build/lib/data_juicer/ops/filter/image_size_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,247 INFO:root:copying build/lib/data_juicer/ops/filter/average_line_length_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,249 INFO:root:copying build/lib/data_juicer/ops/filter/video_duration_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,252 INFO:root:copying build/lib/data_juicer/ops/filter/audio_size_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,254 INFO:root:copying build/lib/data_juicer/ops/filter/maximum_line_length_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,256 INFO:root:copying build/lib/data_juicer/ops/filter/video_frames_text_similarity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,259 INFO:root:copying build/lib/data_juicer/ops/filter/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,262 INFO:root:copying build/lib/data_juicer/ops/filter/perplexity_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,264 INFO:root:copying build/lib/data_juicer/ops/filter/image_face_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,266 INFO:root:copying build/lib/data_juicer/ops/filter/image_nsfw_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,268 INFO:root:copying build/lib/data_juicer/ops/filter/video_tagging_from_frames_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,271 INFO:root:copying build/lib/data_juicer/ops/filter/text_action_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,273 INFO:root:copying build/lib/data_juicer/ops/filter/image_face_count_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,276 INFO:root:copying build/lib/data_juicer/ops/filter/language_id_score_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,278 INFO:root:copying build/lib/data_juicer/ops/filter/video_motion_score_raft_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,280 INFO:root:copying build/lib/data_juicer/ops/filter/video_aspect_ratio_filter.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/filter 2025-05-31T14:50:52,283 INFO:root:copying build/lib/data_juicer/ops/base_op.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-05-31T14:50:52,287 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/selector 2025-05-31T14:50:52,288 INFO:root:copying build/lib/data_juicer/ops/selector/frequency_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-05-31T14:50:52,291 INFO:root:copying build/lib/data_juicer/ops/selector/random_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-05-31T14:50:52,293 INFO:root:copying build/lib/data_juicer/ops/selector/range_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-05-31T14:50:52,297 INFO:root:copying build/lib/data_juicer/ops/selector/topk_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-05-31T14:50:52,299 INFO:root:copying build/lib/data_juicer/ops/selector/tags_specified_field_selector.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-05-31T14:50:52,301 INFO:root:copying build/lib/data_juicer/ops/selector/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/selector 2025-05-31T14:50:52,305 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/deduplicator 2025-05-31T14:50:52,307 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_video_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,309 INFO:root:copying build/lib/data_juicer/ops/deduplicator/document_minhash_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,313 INFO:root:copying build/lib/data_juicer/ops/deduplicator/video_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,315 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_document_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,318 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_bts_minhash_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,322 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_basic_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,325 INFO:root:copying build/lib/data_juicer/ops/deduplicator/image_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,328 INFO:root:copying build/lib/data_juicer/ops/deduplicator/ray_image_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,330 INFO:root:copying build/lib/data_juicer/ops/deduplicator/document_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,333 INFO:root:copying build/lib/data_juicer/ops/deduplicator/document_simhash_deduplicator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,335 INFO:root:copying build/lib/data_juicer/ops/deduplicator/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/deduplicator 2025-05-31T14:50:52,338 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/aggregator 2025-05-31T14:50:52,340 INFO:root:copying build/lib/data_juicer/ops/aggregator/entity_attribute_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-05-31T14:50:52,343 INFO:root:copying build/lib/data_juicer/ops/aggregator/nested_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-05-31T14:50:52,346 INFO:root:copying build/lib/data_juicer/ops/aggregator/meta_tags_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-05-31T14:50:52,349 INFO:root:copying build/lib/data_juicer/ops/aggregator/most_relevant_entities_aggregator.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-05-31T14:50:52,351 INFO:root:copying build/lib/data_juicer/ops/aggregator/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/aggregator 2025-05-31T14:50:52,354 INFO:root:copying build/lib/data_juicer/ops/load.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-05-31T14:50:52,356 INFO:root:copying build/lib/data_juicer/ops/op_fusion.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-05-31T14:50:52,360 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/common 2025-05-31T14:50:52,362 INFO:root:copying build/lib/data_juicer/ops/common/special_characters.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-05-31T14:50:52,364 INFO:root:copying build/lib/data_juicer/ops/common/prompt2prompt_pipeline.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-05-31T14:50:52,368 INFO:root:copying build/lib/data_juicer/ops/common/helper_func.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-05-31T14:50:52,371 INFO:root:copying build/lib/data_juicer/ops/common/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/common 2025-05-31T14:50:52,374 INFO:root:copying build/lib/data_juicer/ops/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops 2025-05-31T14:50:52,378 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/ops/mapper 2025-05-31T14:50:52,379 INFO:root:copying build/lib/data_juicer/ops/mapper/video_face_blur_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,382 INFO:root:copying build/lib/data_juicer/ops/mapper/sentence_augmentation_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,385 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_sentiment_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,388 INFO:root:copying build/lib/data_juicer/ops/mapper/sentence_split_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,391 INFO:root:copying build/lib/data_juicer/ops/mapper/generate_qa_from_examples_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,394 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_topic_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,396 INFO:root:copying build/lib/data_juicer/ops/mapper/video_resize_resolution_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,400 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_tables_from_html_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,403 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_keyword_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,405 INFO:root:copying build/lib/data_juicer/ops/mapper/video_tagging_from_frames_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,409 INFO:root:copying build/lib/data_juicer/ops/mapper/sdxl_prompt2prompt_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,411 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_copyright_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,415 INFO:root:copying build/lib/data_juicer/ops/mapper/calibrate_response_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,417 INFO:root:copying build/lib/data_juicer/ops/mapper/audio_ffmpeg_wrapped_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,421 INFO:root:copying build/lib/data_juicer/ops/mapper/optimize_response_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,423 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_html_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,426 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_specific_chars_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,429 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_words_with_incorrect_substrings_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,432 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_sentiment_intensity_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,435 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_table_text_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,439 INFO:root:copying build/lib/data_juicer/ops/mapper/query_topic_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,442 INFO:root:copying build/lib/data_juicer/ops/mapper/replace_content_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,445 INFO:root:copying build/lib/data_juicer/ops/mapper/image_captioning_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,448 INFO:root:copying build/lib/data_juicer/ops/mapper/nlpaug_en_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,452 INFO:root:copying build/lib/data_juicer/ops/mapper/fix_unicode_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,455 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_repeat_sentences_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,458 INFO:root:copying build/lib/data_juicer/ops/mapper/chinese_convert_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,461 INFO:root:copying build/lib/data_juicer/ops/mapper/video_remove_watermark_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,464 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_comments_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,467 INFO:root:copying build/lib/data_juicer/ops/mapper/calibrate_qa_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,471 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_long_words_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,473 INFO:root:copying build/lib/data_juicer/ops/mapper/mllm_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,477 INFO:root:copying build/lib/data_juicer/ops/mapper/python_lambda_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,479 INFO:root:copying build/lib/data_juicer/ops/mapper/calibrate_query_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,482 INFO:root:copying build/lib/data_juicer/ops/mapper/whitespace_normalization_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,485 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_links_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,487 INFO:root:copying build/lib/data_juicer/ops/mapper/expand_macro_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,490 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_email_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,493 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_entity_relation_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,496 INFO:root:copying build/lib/data_juicer/ops/mapper/video_split_by_scene_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,498 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_nickname_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,501 INFO:root:copying build/lib/data_juicer/ops/mapper/optimize_qa_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,504 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_summarizer_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,507 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_frames_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,510 INFO:root:copying build/lib/data_juicer/ops/mapper/pair_preference_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,514 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_entity_attribute_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,517 INFO:root:copying build/lib/data_juicer/ops/mapper/nlpcda_zh_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,521 INFO:root:copying build/lib/data_juicer/ops/mapper/image_face_blur_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,525 INFO:root:copying build/lib/data_juicer/ops/mapper/image_segment_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,528 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_video_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,531 INFO:root:copying build/lib/data_juicer/ops/mapper/video_ffmpeg_wrapped_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,535 INFO:root:copying build/lib/data_juicer/ops/mapper/image_captioning_from_gpt4v_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,538 INFO:root:copying build/lib/data_juicer/ops/mapper/image_tagging_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,540 INFO:root:copying build/lib/data_juicer/ops/mapper/image_blur_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,543 INFO:root:copying build/lib/data_juicer/ops/mapper/image_diffusion_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,546 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_event_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,549 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_bibliography_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,551 INFO:root:copying build/lib/data_juicer/ops/mapper/dialog_intent_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,555 INFO:root:copying build/lib/data_juicer/ops/mapper/text_chunk_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,557 INFO:root:copying build/lib/data_juicer/ops/mapper/generate_qa_from_text_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,559 INFO:root:copying build/lib/data_juicer/ops/mapper/query_intent_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,562 INFO:root:copying build/lib/data_juicer/ops/mapper/video_split_by_duration_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,564 INFO:root:copying build/lib/data_juicer/ops/mapper/optimize_query_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,566 INFO:root:copying build/lib/data_juicer/ops/mapper/relation_identity_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,569 INFO:root:copying build/lib/data_juicer/ops/mapper/extract_support_text_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,572 INFO:root:copying build/lib/data_juicer/ops/mapper/video_resize_aspect_ratio_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,575 INFO:root:copying build/lib/data_juicer/ops/mapper/audio_add_gaussian_noise_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,577 INFO:root:copying build/lib/data_juicer/ops/mapper/image_remove_background_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,579 INFO:root:copying build/lib/data_juicer/ops/mapper/video_captioning_from_audio_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,582 INFO:root:copying build/lib/data_juicer/ops/mapper/video_extract_frames_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,585 INFO:root:copying build/lib/data_juicer/ops/mapper/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,588 INFO:root:copying build/lib/data_juicer/ops/mapper/punctuation_normalization_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,590 INFO:root:copying build/lib/data_juicer/ops/mapper/video_split_by_key_frame_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,593 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_non_chinese_character_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,595 INFO:root:copying build/lib/data_juicer/ops/mapper/python_file_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,598 INFO:root:copying build/lib/data_juicer/ops/mapper/query_sentiment_detection_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,600 INFO:root:copying build/lib/data_juicer/ops/mapper/remove_header_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,602 INFO:root:copying build/lib/data_juicer/ops/mapper/clean_ip_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,605 INFO:root:copying build/lib/data_juicer/ops/mapper/video_tagging_from_audio_mapper.py -> build/bdist.linux-armv7l/wheel/./data_juicer/ops/mapper 2025-05-31T14:50:52,608 INFO:root:creating build/bdist.linux-armv7l/wheel/data_juicer/config 2025-05-31T14:50:52,610 INFO:root:copying build/lib/data_juicer/config/config.py -> build/bdist.linux-armv7l/wheel/./data_juicer/config 2025-05-31T14:50:52,614 INFO:root:copying build/lib/data_juicer/config/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer/config 2025-05-31T14:50:52,616 INFO:root:copying build/lib/data_juicer/__init__.py -> build/bdist.linux-armv7l/wheel/./data_juicer 2025-05-31T14:50:52,618 INFO:root:running install_egg_info 2025-05-31T14:50:52,660 INFO:root:running egg_info 2025-05-31T14:50:52,694 INFO:root:writing py_data_juicer.egg-info/PKG-INFO 2025-05-31T14:50:52,699 INFO:root:writing dependency_links to py_data_juicer.egg-info/dependency_links.txt 2025-05-31T14:50:52,701 INFO:root:writing entry points to py_data_juicer.egg-info/entry_points.txt 2025-05-31T14:50:52,703 INFO:root:writing requirements to py_data_juicer.egg-info/requires.txt 2025-05-31T14:50:52,704 INFO:root:writing top-level names to py_data_juicer.egg-info/top_level.txt 2025-05-31T14:50:52,763 INFO:root:reading manifest file 'py_data_juicer.egg-info/SOURCES.txt' 2025-05-31T14:50:52,777 INFO:root:adding license file 'LICENSE' 2025-05-31T14:50:52,790 INFO:root:writing manifest file 'py_data_juicer.egg-info/SOURCES.txt' 2025-05-31T14:50:52,792 INFO:root:Copying py_data_juicer.egg-info to build/bdist.linux-armv7l/wheel/./py_data_juicer-1.3.1-py3.11.egg-info 2025-05-31T14:50:52,809 INFO:root:running install_scripts 2025-05-31T14:50:52,827 INFO:root:creating build/bdist.linux-armv7l/wheel/py_data_juicer-1.3.1.dist-info/WHEEL 2025-05-31T14:50:52,832 INFO:wheel:creating '/tmp/pip-wheel-zt4omyqs/py_data_juicer-1.3.1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-05-31T14:50:52,835 INFO:wheel:adding 'data_juicer/__init__.py' 2025-05-31T14:50:52,838 INFO:wheel:adding 'data_juicer/analysis/__init__.py' 2025-05-31T14:50:52,840 INFO:wheel:adding 'data_juicer/analysis/collector.py' 2025-05-31T14:50:52,844 INFO:wheel:adding 'data_juicer/analysis/column_wise_analysis.py' 2025-05-31T14:50:52,846 INFO:wheel:adding 'data_juicer/analysis/diversity_analysis.py' 2025-05-31T14:50:52,848 INFO:wheel:adding 'data_juicer/analysis/draw.py' 2025-05-31T14:50:52,850 INFO:wheel:adding 'data_juicer/analysis/measure.py' 2025-05-31T14:50:52,852 INFO:wheel:adding 'data_juicer/analysis/overall_analysis.py' 2025-05-31T14:50:52,855 INFO:wheel:adding 'data_juicer/config/__init__.py' 2025-05-31T14:50:52,861 INFO:wheel:adding 'data_juicer/config/config.py' 2025-05-31T14:50:52,864 INFO:wheel:adding 'data_juicer/core/__init__.py' 2025-05-31T14:50:52,866 INFO:wheel:adding 'data_juicer/core/adapter.py' 2025-05-31T14:50:52,868 INFO:wheel:adding 'data_juicer/core/analyzer.py' 2025-05-31T14:50:52,871 INFO:wheel:adding 'data_juicer/core/exporter.py' 2025-05-31T14:50:52,874 INFO:wheel:adding 'data_juicer/core/monitor.py' 2025-05-31T14:50:52,875 INFO:wheel:adding 'data_juicer/core/tracer.py' 2025-05-31T14:50:52,878 INFO:wheel:adding 'data_juicer/core/data/__init__.py' 2025-05-31T14:50:52,880 INFO:wheel:adding 'data_juicer/core/data/config_validator.py' 2025-05-31T14:50:52,882 INFO:wheel:adding 'data_juicer/core/data/data_validator.py' 2025-05-31T14:50:52,885 INFO:wheel:adding 'data_juicer/core/data/dataset_builder.py' 2025-05-31T14:50:52,888 INFO:wheel:adding 'data_juicer/core/data/dj_dataset.py' 2025-05-31T14:50:52,892 INFO:wheel:adding 'data_juicer/core/data/load_strategy.py' 2025-05-31T14:50:52,894 INFO:wheel:adding 'data_juicer/core/data/ray_dataset.py' 2025-05-31T14:50:52,897 INFO:wheel:adding 'data_juicer/core/data/schema.py' 2025-05-31T14:50:52,899 INFO:wheel:adding 'data_juicer/core/executor/__init__.py' 2025-05-31T14:50:52,900 INFO:wheel:adding 'data_juicer/core/executor/base.py' 2025-05-31T14:50:52,902 INFO:wheel:adding 'data_juicer/core/executor/default_executor.py' 2025-05-31T14:50:52,904 INFO:wheel:adding 'data_juicer/core/executor/factory.py' 2025-05-31T14:50:52,906 INFO:wheel:adding 'data_juicer/core/executor/ray_executor.py' 2025-05-31T14:50:52,908 INFO:wheel:adding 'data_juicer/download/__init__.py' 2025-05-31T14:50:52,910 INFO:wheel:adding 'data_juicer/download/arxiv.py' 2025-05-31T14:50:52,912 INFO:wheel:adding 'data_juicer/download/commoncrawl.py' 2025-05-31T14:50:52,914 INFO:wheel:adding 'data_juicer/download/downloader.py' 2025-05-31T14:50:52,919 INFO:wheel:adding 'data_juicer/download/wikipedia.py' 2025-05-31T14:50:52,922 INFO:wheel:adding 'data_juicer/format/__init__.py' 2025-05-31T14:50:52,923 INFO:wheel:adding 'data_juicer/format/csv_formatter.py' 2025-05-31T14:50:52,925 INFO:wheel:adding 'data_juicer/format/empty_formatter.py' 2025-05-31T14:50:52,927 INFO:wheel:adding 'data_juicer/format/formatter.py' 2025-05-31T14:50:52,928 INFO:wheel:adding 'data_juicer/format/json_formatter.py' 2025-05-31T14:50:52,930 INFO:wheel:adding 'data_juicer/format/load.py' 2025-05-31T14:50:52,931 INFO:wheel:adding 'data_juicer/format/parquet_formatter.py' 2025-05-31T14:50:52,933 INFO:wheel:adding 'data_juicer/format/text_formatter.py' 2025-05-31T14:50:52,934 INFO:wheel:adding 'data_juicer/format/tsv_formatter.py' 2025-05-31T14:50:52,936 INFO:wheel:adding 'data_juicer/ops/__init__.py' 2025-05-31T14:50:52,939 INFO:wheel:adding 'data_juicer/ops/base_op.py' 2025-05-31T14:50:52,940 INFO:wheel:adding 'data_juicer/ops/load.py' 2025-05-31T14:50:52,943 INFO:wheel:adding 'data_juicer/ops/mixins.py' 2025-05-31T14:50:52,945 INFO:wheel:adding 'data_juicer/ops/op_fusion.py' 2025-05-31T14:50:52,947 INFO:wheel:adding 'data_juicer/ops/aggregator/__init__.py' 2025-05-31T14:50:52,949 INFO:wheel:adding 'data_juicer/ops/aggregator/entity_attribute_aggregator.py' 2025-05-31T14:50:52,951 INFO:wheel:adding 'data_juicer/ops/aggregator/meta_tags_aggregator.py' 2025-05-31T14:50:52,953 INFO:wheel:adding 'data_juicer/ops/aggregator/most_relevant_entities_aggregator.py' 2025-05-31T14:50:52,955 INFO:wheel:adding 'data_juicer/ops/aggregator/nested_aggregator.py' 2025-05-31T14:50:52,958 INFO:wheel:adding 'data_juicer/ops/common/__init__.py' 2025-05-31T14:50:52,961 INFO:wheel:adding 'data_juicer/ops/common/helper_func.py' 2025-05-31T14:50:52,967 INFO:wheel:adding 'data_juicer/ops/common/prompt2prompt_pipeline.py' 2025-05-31T14:50:52,969 INFO:wheel:adding 'data_juicer/ops/common/special_characters.py' 2025-05-31T14:50:52,971 INFO:wheel:adding 'data_juicer/ops/deduplicator/__init__.py' 2025-05-31T14:50:52,972 INFO:wheel:adding 'data_juicer/ops/deduplicator/document_deduplicator.py' 2025-05-31T14:50:52,974 INFO:wheel:adding 'data_juicer/ops/deduplicator/document_minhash_deduplicator.py' 2025-05-31T14:50:52,976 INFO:wheel:adding 'data_juicer/ops/deduplicator/document_simhash_deduplicator.py' 2025-05-31T14:50:52,978 INFO:wheel:adding 'data_juicer/ops/deduplicator/image_deduplicator.py' 2025-05-31T14:50:52,979 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_basic_deduplicator.py' 2025-05-31T14:50:52,982 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_bts_minhash_deduplicator.py' 2025-05-31T14:50:52,984 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_document_deduplicator.py' 2025-05-31T14:50:52,985 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_image_deduplicator.py' 2025-05-31T14:50:52,987 INFO:wheel:adding 'data_juicer/ops/deduplicator/ray_video_deduplicator.py' 2025-05-31T14:50:52,988 INFO:wheel:adding 'data_juicer/ops/deduplicator/video_deduplicator.py' 2025-05-31T14:50:52,991 INFO:wheel:adding 'data_juicer/ops/filter/__init__.py' 2025-05-31T14:50:52,992 INFO:wheel:adding 'data_juicer/ops/filter/alphanumeric_filter.py' 2025-05-31T14:50:52,994 INFO:wheel:adding 'data_juicer/ops/filter/audio_duration_filter.py' 2025-05-31T14:50:52,995 INFO:wheel:adding 'data_juicer/ops/filter/audio_nmf_snr_filter.py' 2025-05-31T14:50:52,997 INFO:wheel:adding 'data_juicer/ops/filter/audio_size_filter.py' 2025-05-31T14:50:52,998 INFO:wheel:adding 'data_juicer/ops/filter/average_line_length_filter.py' 2025-05-31T14:50:53,000 INFO:wheel:adding 'data_juicer/ops/filter/character_repetition_filter.py' 2025-05-31T14:50:53,001 INFO:wheel:adding 'data_juicer/ops/filter/flagged_words_filter.py' 2025-05-31T14:50:53,003 INFO:wheel:adding 'data_juicer/ops/filter/image_aesthetics_filter.py' 2025-05-31T14:50:53,004 INFO:wheel:adding 'data_juicer/ops/filter/image_aspect_ratio_filter.py' 2025-05-31T14:50:53,006 INFO:wheel:adding 'data_juicer/ops/filter/image_face_count_filter.py' 2025-05-31T14:50:53,008 INFO:wheel:adding 'data_juicer/ops/filter/image_face_ratio_filter.py' 2025-05-31T14:50:53,009 INFO:wheel:adding 'data_juicer/ops/filter/image_nsfw_filter.py' 2025-05-31T14:50:53,011 INFO:wheel:adding 'data_juicer/ops/filter/image_pair_similarity_filter.py' 2025-05-31T14:50:53,012 INFO:wheel:adding 'data_juicer/ops/filter/image_shape_filter.py' 2025-05-31T14:50:53,014 INFO:wheel:adding 'data_juicer/ops/filter/image_size_filter.py' 2025-05-31T14:50:53,016 INFO:wheel:adding 'data_juicer/ops/filter/image_text_matching_filter.py' 2025-05-31T14:50:53,017 INFO:wheel:adding 'data_juicer/ops/filter/image_text_similarity_filter.py' 2025-05-31T14:50:53,019 INFO:wheel:adding 'data_juicer/ops/filter/image_watermark_filter.py' 2025-05-31T14:50:53,020 INFO:wheel:adding 'data_juicer/ops/filter/language_id_score_filter.py' 2025-05-31T14:50:53,022 INFO:wheel:adding 'data_juicer/ops/filter/llm_difficulty_score_filter.py' 2025-05-31T14:50:53,025 INFO:wheel:adding 'data_juicer/ops/filter/llm_quality_score_filter.py' 2025-05-31T14:50:53,026 INFO:wheel:adding 'data_juicer/ops/filter/maximum_line_length_filter.py' 2025-05-31T14:50:53,027 INFO:wheel:adding 'data_juicer/ops/filter/perplexity_filter.py' 2025-05-31T14:50:53,029 INFO:wheel:adding 'data_juicer/ops/filter/phrase_grounding_recall_filter.py' 2025-05-31T14:50:53,031 INFO:wheel:adding 'data_juicer/ops/filter/special_characters_filter.py' 2025-05-31T14:50:53,032 INFO:wheel:adding 'data_juicer/ops/filter/specified_field_filter.py' 2025-05-31T14:50:53,033 INFO:wheel:adding 'data_juicer/ops/filter/specified_numeric_field_filter.py' 2025-05-31T14:50:53,035 INFO:wheel:adding 'data_juicer/ops/filter/stopwords_filter.py' 2025-05-31T14:50:53,036 INFO:wheel:adding 'data_juicer/ops/filter/suffix_filter.py' 2025-05-31T14:50:53,038 INFO:wheel:adding 'data_juicer/ops/filter/text_action_filter.py' 2025-05-31T14:50:53,039 INFO:wheel:adding 'data_juicer/ops/filter/text_entity_dependency_filter.py' 2025-05-31T14:50:53,040 INFO:wheel:adding 'data_juicer/ops/filter/text_length_filter.py' 2025-05-31T14:50:53,042 INFO:wheel:adding 'data_juicer/ops/filter/text_pair_similarity_filter.py' 2025-05-31T14:50:53,043 INFO:wheel:adding 'data_juicer/ops/filter/token_num_filter.py' 2025-05-31T14:50:53,045 INFO:wheel:adding 'data_juicer/ops/filter/video_aesthetics_filter.py' 2025-05-31T14:50:53,047 INFO:wheel:adding 'data_juicer/ops/filter/video_aspect_ratio_filter.py' 2025-05-31T14:50:53,048 INFO:wheel:adding 'data_juicer/ops/filter/video_duration_filter.py' 2025-05-31T14:50:53,050 INFO:wheel:adding 'data_juicer/ops/filter/video_frames_text_similarity_filter.py' 2025-05-31T14:50:53,052 INFO:wheel:adding 'data_juicer/ops/filter/video_motion_score_filter.py' 2025-05-31T14:50:53,053 INFO:wheel:adding 'data_juicer/ops/filter/video_motion_score_raft_filter.py' 2025-05-31T14:50:53,055 INFO:wheel:adding 'data_juicer/ops/filter/video_nsfw_filter.py' 2025-05-31T14:50:53,057 INFO:wheel:adding 'data_juicer/ops/filter/video_ocr_area_ratio_filter.py' 2025-05-31T14:50:53,059 INFO:wheel:adding 'data_juicer/ops/filter/video_resolution_filter.py' 2025-05-31T14:50:53,061 INFO:wheel:adding 'data_juicer/ops/filter/video_tagging_from_frames_filter.py' 2025-05-31T14:50:53,062 INFO:wheel:adding 'data_juicer/ops/filter/video_watermark_filter.py' 2025-05-31T14:50:53,064 INFO:wheel:adding 'data_juicer/ops/filter/word_repetition_filter.py' 2025-05-31T14:50:53,065 INFO:wheel:adding 'data_juicer/ops/filter/words_num_filter.py' 2025-05-31T14:50:53,067 INFO:wheel:adding 'data_juicer/ops/grouper/__init__.py' 2025-05-31T14:50:53,069 INFO:wheel:adding 'data_juicer/ops/grouper/key_value_grouper.py' 2025-05-31T14:50:53,070 INFO:wheel:adding 'data_juicer/ops/grouper/naive_grouper.py' 2025-05-31T14:50:53,071 INFO:wheel:adding 'data_juicer/ops/grouper/naive_reverse_grouper.py' 2025-05-31T14:50:53,075 INFO:wheel:adding 'data_juicer/ops/mapper/__init__.py' 2025-05-31T14:50:53,077 INFO:wheel:adding 'data_juicer/ops/mapper/audio_add_gaussian_noise_mapper.py' 2025-05-31T14:50:53,078 INFO:wheel:adding 'data_juicer/ops/mapper/audio_ffmpeg_wrapped_mapper.py' 2025-05-31T14:50:53,079 INFO:wheel:adding 'data_juicer/ops/mapper/calibrate_qa_mapper.py' 2025-05-31T14:50:53,081 INFO:wheel:adding 'data_juicer/ops/mapper/calibrate_query_mapper.py' 2025-05-31T14:50:53,082 INFO:wheel:adding 'data_juicer/ops/mapper/calibrate_response_mapper.py' 2025-05-31T14:50:53,083 INFO:wheel:adding 'data_juicer/ops/mapper/chinese_convert_mapper.py' 2025-05-31T14:50:53,085 INFO:wheel:adding 'data_juicer/ops/mapper/clean_copyright_mapper.py' 2025-05-31T14:50:53,086 INFO:wheel:adding 'data_juicer/ops/mapper/clean_email_mapper.py' 2025-05-31T14:50:53,088 INFO:wheel:adding 'data_juicer/ops/mapper/clean_html_mapper.py' 2025-05-31T14:50:53,089 INFO:wheel:adding 'data_juicer/ops/mapper/clean_ip_mapper.py' 2025-05-31T14:50:53,090 INFO:wheel:adding 'data_juicer/ops/mapper/clean_links_mapper.py' 2025-05-31T14:50:53,093 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_intent_detection_mapper.py' 2025-05-31T14:50:53,095 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_sentiment_detection_mapper.py' 2025-05-31T14:50:53,097 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_sentiment_intensity_mapper.py' 2025-05-31T14:50:53,099 INFO:wheel:adding 'data_juicer/ops/mapper/dialog_topic_detection_mapper.py' 2025-05-31T14:50:53,100 INFO:wheel:adding 'data_juicer/ops/mapper/expand_macro_mapper.py' 2025-05-31T14:50:53,102 INFO:wheel:adding 'data_juicer/ops/mapper/extract_entity_attribute_mapper.py' 2025-05-31T14:50:53,105 INFO:wheel:adding 'data_juicer/ops/mapper/extract_entity_relation_mapper.py' 2025-05-31T14:50:53,107 INFO:wheel:adding 'data_juicer/ops/mapper/extract_event_mapper.py' 2025-05-31T14:50:53,109 INFO:wheel:adding 'data_juicer/ops/mapper/extract_keyword_mapper.py' 2025-05-31T14:50:53,110 INFO:wheel:adding 'data_juicer/ops/mapper/extract_nickname_mapper.py' 2025-05-31T14:50:53,112 INFO:wheel:adding 'data_juicer/ops/mapper/extract_support_text_mapper.py' 2025-05-31T14:50:53,113 INFO:wheel:adding 'data_juicer/ops/mapper/extract_tables_from_html_mapper.py' 2025-05-31T14:50:53,115 INFO:wheel:adding 'data_juicer/ops/mapper/fix_unicode_mapper.py' 2025-05-31T14:50:53,117 INFO:wheel:adding 'data_juicer/ops/mapper/generate_qa_from_examples_mapper.py' 2025-05-31T14:50:53,119 INFO:wheel:adding 'data_juicer/ops/mapper/generate_qa_from_text_mapper.py' 2025-05-31T14:50:53,120 INFO:wheel:adding 'data_juicer/ops/mapper/image_blur_mapper.py' 2025-05-31T14:50:53,122 INFO:wheel:adding 'data_juicer/ops/mapper/image_captioning_from_gpt4v_mapper.py' 2025-05-31T14:50:53,124 INFO:wheel:adding 'data_juicer/ops/mapper/image_captioning_mapper.py' 2025-05-31T14:50:53,126 INFO:wheel:adding 'data_juicer/ops/mapper/image_diffusion_mapper.py' 2025-05-31T14:50:53,128 INFO:wheel:adding 'data_juicer/ops/mapper/image_face_blur_mapper.py' 2025-05-31T14:50:53,130 INFO:wheel:adding 'data_juicer/ops/mapper/image_remove_background_mapper.py' 2025-05-31T14:50:53,131 INFO:wheel:adding 'data_juicer/ops/mapper/image_segment_mapper.py' 2025-05-31T14:50:53,133 INFO:wheel:adding 'data_juicer/ops/mapper/image_tagging_mapper.py' 2025-05-31T14:50:53,134 INFO:wheel:adding 'data_juicer/ops/mapper/mllm_mapper.py' 2025-05-31T14:50:53,136 INFO:wheel:adding 'data_juicer/ops/mapper/nlpaug_en_mapper.py' 2025-05-31T14:50:53,138 INFO:wheel:adding 'data_juicer/ops/mapper/nlpcda_zh_mapper.py' 2025-05-31T14:50:53,140 INFO:wheel:adding 'data_juicer/ops/mapper/optimize_qa_mapper.py' 2025-05-31T14:50:53,141 INFO:wheel:adding 'data_juicer/ops/mapper/optimize_query_mapper.py' 2025-05-31T14:50:53,143 INFO:wheel:adding 'data_juicer/ops/mapper/optimize_response_mapper.py' 2025-05-31T14:50:53,145 INFO:wheel:adding 'data_juicer/ops/mapper/pair_preference_mapper.py' 2025-05-31T14:50:53,146 INFO:wheel:adding 'data_juicer/ops/mapper/punctuation_normalization_mapper.py' 2025-05-31T14:50:53,148 INFO:wheel:adding 'data_juicer/ops/mapper/python_file_mapper.py' 2025-05-31T14:50:53,149 INFO:wheel:adding 'data_juicer/ops/mapper/python_lambda_mapper.py' 2025-05-31T14:50:53,151 INFO:wheel:adding 'data_juicer/ops/mapper/query_intent_detection_mapper.py' 2025-05-31T14:50:53,153 INFO:wheel:adding 'data_juicer/ops/mapper/query_sentiment_detection_mapper.py' 2025-05-31T14:50:53,154 INFO:wheel:adding 'data_juicer/ops/mapper/query_topic_detection_mapper.py' 2025-05-31T14:50:53,156 INFO:wheel:adding 'data_juicer/ops/mapper/relation_identity_mapper.py' 2025-05-31T14:50:53,157 INFO:wheel:adding 'data_juicer/ops/mapper/remove_bibliography_mapper.py' 2025-05-31T14:50:53,158 INFO:wheel:adding 'data_juicer/ops/mapper/remove_comments_mapper.py' 2025-05-31T14:50:53,160 INFO:wheel:adding 'data_juicer/ops/mapper/remove_header_mapper.py' 2025-05-31T14:50:53,161 INFO:wheel:adding 'data_juicer/ops/mapper/remove_long_words_mapper.py' 2025-05-31T14:50:53,162 INFO:wheel:adding 'data_juicer/ops/mapper/remove_non_chinese_character_mapper.py' 2025-05-31T14:50:53,164 INFO:wheel:adding 'data_juicer/ops/mapper/remove_repeat_sentences_mapper.py' 2025-05-31T14:50:53,165 INFO:wheel:adding 'data_juicer/ops/mapper/remove_specific_chars_mapper.py' 2025-05-31T14:50:53,166 INFO:wheel:adding 'data_juicer/ops/mapper/remove_table_text_mapper.py' 2025-05-31T14:50:53,168 INFO:wheel:adding 'data_juicer/ops/mapper/remove_words_with_incorrect_substrings_mapper.py' 2025-05-31T14:50:53,169 INFO:wheel:adding 'data_juicer/ops/mapper/replace_content_mapper.py' 2025-05-31T14:50:53,171 INFO:wheel:adding 'data_juicer/ops/mapper/sdxl_prompt2prompt_mapper.py' 2025-05-31T14:50:53,173 INFO:wheel:adding 'data_juicer/ops/mapper/sentence_augmentation_mapper.py' 2025-05-31T14:50:53,174 INFO:wheel:adding 'data_juicer/ops/mapper/sentence_split_mapper.py' 2025-05-31T14:50:53,176 INFO:wheel:adding 'data_juicer/ops/mapper/text_chunk_mapper.py' 2025-05-31T14:50:53,177 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_audio_mapper.py' 2025-05-31T14:50:53,180 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_frames_mapper.py' 2025-05-31T14:50:53,182 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_summarizer_mapper.py' 2025-05-31T14:50:53,185 INFO:wheel:adding 'data_juicer/ops/mapper/video_captioning_from_video_mapper.py' 2025-05-31T14:50:53,187 INFO:wheel:adding 'data_juicer/ops/mapper/video_extract_frames_mapper.py' 2025-05-31T14:50:53,188 INFO:wheel:adding 'data_juicer/ops/mapper/video_face_blur_mapper.py' 2025-05-31T14:50:53,190 INFO:wheel:adding 'data_juicer/ops/mapper/video_ffmpeg_wrapped_mapper.py' 2025-05-31T14:50:53,192 INFO:wheel:adding 'data_juicer/ops/mapper/video_remove_watermark_mapper.py' 2025-05-31T14:50:53,193 INFO:wheel:adding 'data_juicer/ops/mapper/video_resize_aspect_ratio_mapper.py' 2025-05-31T14:50:53,195 INFO:wheel:adding 'data_juicer/ops/mapper/video_resize_resolution_mapper.py' 2025-05-31T14:50:53,197 INFO:wheel:adding 'data_juicer/ops/mapper/video_split_by_duration_mapper.py' 2025-05-31T14:50:53,199 INFO:wheel:adding 'data_juicer/ops/mapper/video_split_by_key_frame_mapper.py' 2025-05-31T14:50:53,200 INFO:wheel:adding 'data_juicer/ops/mapper/video_split_by_scene_mapper.py' 2025-05-31T14:50:53,202 INFO:wheel:adding 'data_juicer/ops/mapper/video_tagging_from_audio_mapper.py' 2025-05-31T14:50:53,204 INFO:wheel:adding 'data_juicer/ops/mapper/video_tagging_from_frames_mapper.py' 2025-05-31T14:50:53,205 INFO:wheel:adding 'data_juicer/ops/mapper/whitespace_normalization_mapper.py' 2025-05-31T14:50:53,207 INFO:wheel:adding 'data_juicer/ops/selector/__init__.py' 2025-05-31T14:50:53,208 INFO:wheel:adding 'data_juicer/ops/selector/frequency_specified_field_selector.py' 2025-05-31T14:50:53,210 INFO:wheel:adding 'data_juicer/ops/selector/random_selector.py' 2025-05-31T14:50:53,211 INFO:wheel:adding 'data_juicer/ops/selector/range_specified_field_selector.py' 2025-05-31T14:50:53,213 INFO:wheel:adding 'data_juicer/ops/selector/tags_specified_field_selector.py' 2025-05-31T14:50:53,214 INFO:wheel:adding 'data_juicer/ops/selector/topk_specified_field_selector.py' 2025-05-31T14:50:53,216 INFO:wheel:adding 'data_juicer/tools/__init__.py' 2025-05-31T14:50:53,217 INFO:wheel:adding 'data_juicer/tools/analyze_data.py' 2025-05-31T14:50:53,218 INFO:wheel:adding 'data_juicer/tools/data_resplit.py' 2025-05-31T14:50:53,220 INFO:wheel:adding 'data_juicer/tools/dj_install.py' 2025-05-31T14:50:53,221 INFO:wheel:adding 'data_juicer/tools/generate_smtp_cert.py' 2025-05-31T14:50:53,223 INFO:wheel:adding 'data_juicer/tools/process_data.py' 2025-05-31T14:50:53,224 INFO:wheel:adding 'data_juicer/tools/sandbox_starter.py' 2025-05-31T14:50:53,226 INFO:wheel:adding 'data_juicer/utils/__init__.py' 2025-05-31T14:50:53,227 INFO:wheel:adding 'data_juicer/utils/asset_utils.py' 2025-05-31T14:50:53,228 INFO:wheel:adding 'data_juicer/utils/auto_install_mapping.py' 2025-05-31T14:50:53,230 INFO:wheel:adding 'data_juicer/utils/auto_install_utils.py' 2025-05-31T14:50:53,231 INFO:wheel:adding 'data_juicer/utils/availability_utils.py' 2025-05-31T14:50:53,233 INFO:wheel:adding 'data_juicer/utils/cache_utils.py' 2025-05-31T14:50:53,235 INFO:wheel:adding 'data_juicer/utils/ckpt_utils.py' 2025-05-31T14:50:53,236 INFO:wheel:adding 'data_juicer/utils/common_utils.py' 2025-05-31T14:50:53,239 INFO:wheel:adding 'data_juicer/utils/compress.py' 2025-05-31T14:50:53,241 INFO:wheel:adding 'data_juicer/utils/constant.py' 2025-05-31T14:50:53,243 INFO:wheel:adding 'data_juicer/utils/file_utils.py' 2025-05-31T14:50:53,245 INFO:wheel:adding 'data_juicer/utils/fingerprint_utils.py' 2025-05-31T14:50:53,246 INFO:wheel:adding 'data_juicer/utils/lazy_loader.py' 2025-05-31T14:50:53,248 INFO:wheel:adding 'data_juicer/utils/logger_utils.py' 2025-05-31T14:50:53,254 INFO:wheel:adding 'data_juicer/utils/mm_utils.py' 2025-05-31T14:50:53,260 INFO:wheel:adding 'data_juicer/utils/model_utils.py' 2025-05-31T14:50:53,263 INFO:wheel:adding 'data_juicer/utils/nltk_utils.py' 2025-05-31T14:50:53,265 INFO:wheel:adding 'data_juicer/utils/process_utils.py' 2025-05-31T14:50:53,267 INFO:wheel:adding 'data_juicer/utils/registry.py' 2025-05-31T14:50:53,268 INFO:wheel:adding 'data_juicer/utils/resource_utils.py' 2025-05-31T14:50:53,270 INFO:wheel:adding 'data_juicer/utils/sample.py' 2025-05-31T14:50:53,271 INFO:wheel:adding 'data_juicer/utils/unittest_utils.py' 2025-05-31T14:50:53,275 INFO:wheel:adding 'py_data_juicer-1.3.1.dist-info/licenses/LICENSE' 2025-05-31T14:50:53,281 INFO:wheel:adding 'py_data_juicer-1.3.1.dist-info/METADATA' 2025-05-31T14:50:53,282 INFO:wheel:adding 'py_data_juicer-1.3.1.dist-info/WHEEL' 2025-05-31T14:50:53,283 INFO:wheel:adding 'py_data_juicer-1.3.1.dist-info/entry_points.txt' 2025-05-31T14:50:53,284 INFO:wheel:adding 'py_data_juicer-1.3.1.dist-info/top_level.txt' 2025-05-31T14:50:53,288 INFO:wheel:adding 'py_data_juicer-1.3.1.dist-info/RECORD' 2025-05-31T14:50:53,299 INFO:root:removing build/bdist.linux-armv7l/wheel 2025-05-31T14:50:53,454 Building wheel for py-data-juicer (setup.py): finished with status 'done' 2025-05-31T14:50:53,461 Created wheel for py-data-juicer: filename=py_data_juicer-1.3.1-py3-none-any.whl size=473483 sha256=2a2fb315917638bd22836efd277c3d2813f1984f218077b85a771be2e8a05bd9 2025-05-31T14:50:53,463 Stored in directory: /tmp/pip-ephem-wheel-cache-k9xi46ht/wheels/63/11/3f/9b2e9f24411da7bab9b1412159d51507a5937452c98a5420cd 2025-05-31T14:50:53,494 Successfully built py-data-juicer 2025-05-31T14:50:53,511 Removed build tracker: '/tmp/pip-build-tracker-i5jhcsyd'