2026-02-28T02:14:58,977 Created temporary directory: /tmp/pip-ephem-wheel-cache-mdmwex6y 2026-02-28T02:14:58,979 Created temporary directory: /tmp/pip-build-tracker-ftj7x1td 2026-02-28T02:14:58,979 Initialized build tracking at /tmp/pip-build-tracker-ftj7x1td 2026-02-28T02:14:58,980 Created build tracker: /tmp/pip-build-tracker-ftj7x1td 2026-02-28T02:14:58,980 Entered build tracker: /tmp/pip-build-tracker-ftj7x1td 2026-02-28T02:14:58,981 Created temporary directory: /tmp/pip-wheel-dkf2u_iw 2026-02-28T02:14:58,984 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-02-28T02:14:58,987 Created temporary directory: /tmp/pip-ephem-wheel-cache-kxdj5waf 2026-02-28T02:14:59,008 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-02-28T02:14:59,012 2 location(s) to search for versions of torchtitan: 2026-02-28T02:14:59,012 * https://pypi.org/simple/torchtitan/ 2026-02-28T02:14:59,012 * https://www.piwheels.org/simple/torchtitan/ 2026-02-28T02:14:59,013 Fetching project page and analyzing links: https://pypi.org/simple/torchtitan/ 2026-02-28T02:14:59,013 Getting page https://pypi.org/simple/torchtitan/ 2026-02-28T02:14:59,015 Found index url https://pypi.org/simple 2026-02-28T02:14:59,243 Fetched page https://pypi.org/simple/torchtitan/ as application/vnd.pypi.simple.v1+json 2026-02-28T02:14:59,246 Skipping link: No binaries permitted for torchtitan: https://files.pythonhosted.org/packages/c6/96/007977f62a02259e3cff0400cd43a9cf6d357a4d851232cf29951048bf43/torchtitan-0.0.2-py3-none-any.whl (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.8) 
2026-02-28T02:14:59,247 Found link https://files.pythonhosted.org/packages/fb/73/83d7c481a9ee1d97d44da7cc3b0dee4c09b04c9fe99cf055ee47f9e18aae/torchtitan-0.0.2.tar.gz (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.8), version: 0.0.2 2026-02-28T02:14:59,248 Skipping link: No binaries permitted for torchtitan: https://files.pythonhosted.org/packages/3e/74/a64d1b45c2e51fdc3c9cb77a3e11a224bfd60479a9055937f20e1ee0c753/torchtitan-0.1.0-py3-none-any.whl (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,249 Found link https://files.pythonhosted.org/packages/db/bd/370326cb9bb7aeaa569d3a632769edf47e97ca407112930f963d654bf999/torchtitan-0.1.0.tar.gz (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10), version: 0.1.0 2026-02-28T02:14:59,250 Skipping link: No binaries permitted for torchtitan: https://files.pythonhosted.org/packages/eb/f2/af09570c6ce7a4d9616d14215f839e0db11ef584a9a3f47dfa63847edc21/torchtitan-0.2.0-py3-none-any.whl (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,251 Found link https://files.pythonhosted.org/packages/34/c9/70a500b5b29c8209d60673c449f5bcb119adaf1f7085dc70442524621e13/torchtitan-0.2.0.tar.gz (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10), version: 0.2.0 2026-02-28T02:14:59,252 Skipping link: No binaries permitted for torchtitan: https://files.pythonhosted.org/packages/a4/b9/5b2783f0630ab4c5ed971291649ffc3d8299d7287ab05639445ab2ba3934/torchtitan-0.2.1-py3-none-any.whl (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,253 Found link https://files.pythonhosted.org/packages/f8/97/43465aca4e1c1a0a430d432dcefeae01f72ffb77317d63eb676da70124fb/torchtitan-0.2.1.tar.gz (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10), version: 0.2.1 2026-02-28T02:14:59,253 Skipping link: No binaries permitted for torchtitan: 
https://files.pythonhosted.org/packages/c7/06/a8908cb4fd0185be48a640615ee2ec9055c35cd7aaafdb13859b7fb723a7/torchtitan-0.2.2-py3-none-any.whl (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,254 Found link https://files.pythonhosted.org/packages/89/fc/de6756235ae3b44ae2cf65ea817b94bf5bc0c104a0024e4d9a11f2d11a02/torchtitan-0.2.2.tar.gz (from https://pypi.org/simple/torchtitan/) (requires-python:>=3.10), version: 0.2.2 2026-02-28T02:14:59,256 Fetching project page and analyzing links: https://www.piwheels.org/simple/torchtitan/ 2026-02-28T02:14:59,256 Getting page https://www.piwheels.org/simple/torchtitan/ 2026-02-28T02:14:59,257 Found index url https://www.piwheels.org/simple 2026-02-28T02:14:59,436 Fetched page https://www.piwheels.org/simple/torchtitan/ as text/html 2026-02-28T02:14:59,438 Skipping link: No binaries permitted for torchtitan: https://www.piwheels.org/simple/torchtitan/torchtitan-0.2.1-py3-none-any.whl#sha256=394e766b84a2681293c92a7564016da86dbd3feecec3c4512bf50433db4d2786 (from https://www.piwheels.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,439 Skipping link: No binaries permitted for torchtitan: https://archive1.piwheels.org/simple/torchtitan/torchtitan-0.2.0-py3-none-any.whl#sha256=388dd105af83ea42213c0b7c33e3babda660502dcf3d7344b29e6dc9f6d1cfb9 (from https://www.piwheels.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,439 Skipping link: No binaries permitted for torchtitan: https://archive1.piwheels.org/simple/torchtitan/torchtitan-0.1.0-py3-none-any.whl#sha256=07d7d3feb4a6344130177fa42b0e2712fa4aaa60719300bc97f74bc7113bf6d2 (from https://www.piwheels.org/simple/torchtitan/) (requires-python:>=3.10) 2026-02-28T02:14:59,440 Skipping link: No binaries permitted for torchtitan: https://archive1.piwheels.org/simple/torchtitan/torchtitan-0.0.2-py3-none-any.whl#sha256=6afcb2648f9b6e8134671b05914281754f2735aa5208e10c8d9585b3ca2b8695 (from 
https://www.piwheels.org/simple/torchtitan/) (requires-python:>=3.8) 2026-02-28T02:14:59,440 Skipping link: not a file: https://www.piwheels.org/simple/torchtitan/ 2026-02-28T02:14:59,441 Skipping link: not a file: https://pypi.org/simple/torchtitan/ 2026-02-28T02:14:59,462 Given no hashes to check 1 links for project 'torchtitan': discarding no candidates 2026-02-28T02:14:59,481 Collecting torchtitan==0.2.2 2026-02-28T02:14:59,484 Created temporary directory: /tmp/pip-unpack-rm3dhfcg 2026-02-28T02:14:59,707 Downloading torchtitan-0.2.2.tar.gz (344 kB) 2026-02-28T02:15:00,271 Added torchtitan==0.2.2 from https://files.pythonhosted.org/packages/89/fc/de6756235ae3b44ae2cf65ea817b94bf5bc0c104a0024e4d9a11f2d11a02/torchtitan-0.2.2.tar.gz to build tracker '/tmp/pip-build-tracker-ftj7x1td' 2026-02-28T02:15:00,276 Created temporary directory: /tmp/pip-build-env-zxu6zbb_ 2026-02-28T02:15:00,281 Installing build dependencies: started 2026-02-28T02:15:00,282 Running command pip subprocess to install build dependencies 2026-02-28T02:15:00,314 Error processing line 1 of /home/piwheels/.local/lib/python3.11/site-packages/cntimer.pth: 2026-02-28T02:15:00,355 Traceback (most recent call last): 2026-02-28T02:15:00,356 File "<frozen site>", line 192, in addpackage 2026-02-28T02:15:00,356 File "<string>", line 1, in <module> 2026-02-28T02:15:00,357 ModuleNotFoundError: No module named 'cntimer' 2026-02-28T02:15:00,359 Remainder of file ignored 2026-02-28T02:15:01,418 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-02-28T02:15:02,043 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. 
Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-02-28T02:15:02,065 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-02-28T02:15:03,798 Collecting setuptools>=61.0 2026-02-28T02:15:03,872 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.0-py3-none-any.whl (1.0 MB) 2026-02-28T02:15:06,889 Installing collected packages: setuptools 2026-02-28T02:15:09,932 Successfully installed setuptools-82.0.0 2026-02-28T02:15:10,201 Installing build dependencies: finished with status 'done' 2026-02-28T02:15:10,207 Getting requirements to build wheel: started 2026-02-28T02:15:10,209 Running command Getting requirements to build wheel 2026-02-28T02:15:10,851 /tmp/pip-build-env-zxu6zbb_/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-02-28T02:15:10,851 !! 2026-02-28T02:15:10,853 ******************************************************************************** 2026-02-28T02:15:10,853 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-02-28T02:15:10,854 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-02-28T02:15:10,855 or your builds will no longer be supported. 2026-02-28T02:15:10,856 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-28T02:15:10,856 ******************************************************************************** 2026-02-28T02:15:10,857 !! 
2026-02-28T02:15:10,858 corresp(dist, value, root_dir) 2026-02-28T02:15:10,939 running egg_info 2026-02-28T02:15:10,945 writing torchtitan.egg-info/PKG-INFO 2026-02-28T02:15:10,950 writing dependency_links to torchtitan.egg-info/dependency_links.txt 2026-02-28T02:15:10,953 writing requirements to torchtitan.egg-info/requires.txt 2026-02-28T02:15:10,955 writing top-level names to torchtitan.egg-info/top_level.txt 2026-02-28T02:15:11,053 reading manifest file 'torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:11,065 adding license file 'LICENSE' 2026-02-28T02:15:11,076 writing manifest file 'torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:11,175 Getting requirements to build wheel: finished with status 'done' 2026-02-28T02:15:11,178 Created temporary directory: /tmp/pip-modern-metadata-vq27x2bi 2026-02-28T02:15:11,180 Preparing metadata (pyproject.toml): started 2026-02-28T02:15:11,182 Running command Preparing metadata (pyproject.toml) 2026-02-28T02:15:11,765 /tmp/pip-build-env-zxu6zbb_/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-02-28T02:15:11,765 !! 2026-02-28T02:15:11,767 ******************************************************************************** 2026-02-28T02:15:11,768 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-02-28T02:15:11,769 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-02-28T02:15:11,770 or your builds will no longer be supported. 2026-02-28T02:15:11,771 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-28T02:15:11,771 ******************************************************************************** 2026-02-28T02:15:11,772 !! 
2026-02-28T02:15:11,773 corresp(dist, value, root_dir) 2026-02-28T02:15:11,851 running dist_info 2026-02-28T02:15:11,862 creating /tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info 2026-02-28T02:15:11,863 writing /tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/PKG-INFO 2026-02-28T02:15:11,868 writing dependency_links to /tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/dependency_links.txt 2026-02-28T02:15:11,871 writing requirements to /tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/requires.txt 2026-02-28T02:15:11,872 writing top-level names to /tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/top_level.txt 2026-02-28T02:15:11,873 writing manifest file '/tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:11,951 reading manifest file '/tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:11,953 adding license file 'LICENSE' 2026-02-28T02:15:11,962 writing manifest file '/tmp/pip-modern-metadata-vq27x2bi/torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:11,963 creating '/tmp/pip-modern-metadata-vq27x2bi/torchtitan-0.2.2.dist-info' 2026-02-28T02:15:12,085 Preparing metadata (pyproject.toml): finished with status 'done' 2026-02-28T02:15:12,090 Source in /tmp/pip-wheel-dkf2u_iw/torchtitan_797585ac51b845d8a0b8198c154c9f7d has version 0.2.2, which satisfies requirement torchtitan==0.2.2 from https://files.pythonhosted.org/packages/89/fc/de6756235ae3b44ae2cf65ea817b94bf5bc0c104a0024e4d9a11f2d11a02/torchtitan-0.2.2.tar.gz 2026-02-28T02:15:12,091 Removed torchtitan==0.2.2 from https://files.pythonhosted.org/packages/89/fc/de6756235ae3b44ae2cf65ea817b94bf5bc0c104a0024e4d9a11f2d11a02/torchtitan-0.2.2.tar.gz from build tracker '/tmp/pip-build-tracker-ftj7x1td' 2026-02-28T02:15:12,098 Created temporary directory: /tmp/pip-unpack-o86uy9n0 2026-02-28T02:15:12,099 Building wheels for collected packages: torchtitan 2026-02-28T02:15:12,103 Created temporary directory: /tmp/pip-wheel-j41ln14x 
2026-02-28T02:15:12,103 Destination directory: /tmp/pip-wheel-j41ln14x 2026-02-28T02:15:12,105 Building wheel for torchtitan (pyproject.toml): started 2026-02-28T02:15:12,106 Running command Building wheel for torchtitan (pyproject.toml) 2026-02-28T02:15:12,681 /tmp/pip-build-env-zxu6zbb_/overlay/local/lib/python3.11/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated 2026-02-28T02:15:12,681 !! 2026-02-28T02:15:12,682 ******************************************************************************** 2026-02-28T02:15:12,683 Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0). 2026-02-28T02:15:12,684 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-02-28T02:15:12,685 or your builds will no longer be supported. 2026-02-28T02:15:12,686 See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details. 2026-02-28T02:15:12,686 ******************************************************************************** 2026-02-28T02:15:12,688 !! 
2026-02-28T02:15:12,688 corresp(dist, value, root_dir) 2026-02-28T02:15:12,757 running bdist_wheel 2026-02-28T02:15:12,775 running build 2026-02-28T02:15:12,776 running build_py 2026-02-28T02:15:12,782 creating build/lib/torchtitan 2026-02-28T02:15:12,784 copying torchtitan/train.py -> build/lib/torchtitan 2026-02-28T02:15:12,787 copying torchtitan/__init__.py -> build/lib/torchtitan 2026-02-28T02:15:12,789 creating build/lib/torchtitan/hf_datasets 2026-02-28T02:15:12,790 copying torchtitan/hf_datasets/text_datasets.py -> build/lib/torchtitan/hf_datasets 2026-02-28T02:15:12,793 copying torchtitan/hf_datasets/__init__.py -> build/lib/torchtitan/hf_datasets 2026-02-28T02:15:12,795 creating build/lib/torchtitan/tools 2026-02-28T02:15:12,796 copying torchtitan/tools/profiling.py -> build/lib/torchtitan/tools 2026-02-28T02:15:12,799 copying torchtitan/tools/logging.py -> build/lib/torchtitan/tools 2026-02-28T02:15:12,801 copying torchtitan/tools/utils.py -> build/lib/torchtitan/tools 2026-02-28T02:15:12,804 creating build/lib/torchtitan/protocols 2026-02-28T02:15:12,805 copying torchtitan/protocols/model_converter.py -> build/lib/torchtitan/protocols 2026-02-28T02:15:12,807 copying torchtitan/protocols/state_dict_adapter.py -> build/lib/torchtitan/protocols 2026-02-28T02:15:12,809 copying torchtitan/protocols/train_spec.py -> build/lib/torchtitan/protocols 2026-02-28T02:15:12,811 copying torchtitan/protocols/model.py -> build/lib/torchtitan/protocols 2026-02-28T02:15:12,813 copying torchtitan/protocols/__init__.py -> build/lib/torchtitan/protocols 2026-02-28T02:15:12,816 creating build/lib/torchtitan/components 2026-02-28T02:15:12,817 copying torchtitan/components/optimizer.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,819 copying torchtitan/components/metrics.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,822 copying torchtitan/components/lr_scheduler.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,824 copying 
torchtitan/components/loss.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,826 copying torchtitan/components/validate.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,828 copying torchtitan/components/tokenizer.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,831 copying torchtitan/components/checkpoint.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,834 copying torchtitan/components/dataloader.py -> build/lib/torchtitan/components 2026-02-28T02:15:12,837 creating build/lib/torchtitan/distributed 2026-02-28T02:15:12,838 copying torchtitan/distributed/activation_checkpoint.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,840 copying torchtitan/distributed/dual_pipe_v.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,843 copying torchtitan/distributed/tensor_parallel.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,845 copying torchtitan/distributed/parallel_dims.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,847 copying torchtitan/distributed/pipeline_parallel.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,850 copying torchtitan/distributed/context_parallel.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,852 copying torchtitan/distributed/utils.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,855 copying torchtitan/distributed/__init__.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,857 copying torchtitan/distributed/expert_parallel.py -> build/lib/torchtitan/distributed 2026-02-28T02:15:12,860 creating build/lib/torchtitan/models 2026-02-28T02:15:12,861 copying torchtitan/models/utils.py -> build/lib/torchtitan/models 2026-02-28T02:15:12,864 copying torchtitan/models/__init__.py -> build/lib/torchtitan/models 2026-02-28T02:15:12,866 copying torchtitan/models/attention.py -> build/lib/torchtitan/models 2026-02-28T02:15:12,869 creating build/lib/torchtitan/experiments 2026-02-28T02:15:12,870 copying torchtitan/experiments/__init__.py -> 
build/lib/torchtitan/experiments 2026-02-28T02:15:12,873 creating build/lib/torchtitan/config 2026-02-28T02:15:12,874 copying torchtitan/config/job_config.py -> build/lib/torchtitan/config 2026-02-28T02:15:12,878 copying torchtitan/config/__init__.py -> build/lib/torchtitan/config 2026-02-28T02:15:12,880 copying torchtitan/config/manager.py -> build/lib/torchtitan/config 2026-02-28T02:15:12,883 creating build/lib/torchtitan/components/quantization 2026-02-28T02:15:12,884 copying torchtitan/components/quantization/float8.py -> build/lib/torchtitan/components/quantization 2026-02-28T02:15:12,886 copying torchtitan/components/quantization/utils.py -> build/lib/torchtitan/components/quantization 2026-02-28T02:15:12,888 copying torchtitan/components/quantization/__init__.py -> build/lib/torchtitan/components/quantization 2026-02-28T02:15:12,890 copying torchtitan/components/quantization/mx.py -> build/lib/torchtitan/components/quantization 2026-02-28T02:15:12,893 creating build/lib/torchtitan/components/ft 2026-02-28T02:15:12,894 copying torchtitan/components/ft/__init__.py -> build/lib/torchtitan/components/ft 2026-02-28T02:15:12,896 copying torchtitan/components/ft/manager.py -> build/lib/torchtitan/components/ft 2026-02-28T02:15:12,899 creating build/lib/torchtitan/components/ft/diloco 2026-02-28T02:15:12,900 copying torchtitan/components/ft/diloco/protocol.py -> build/lib/torchtitan/components/ft/diloco 2026-02-28T02:15:12,902 copying torchtitan/components/ft/diloco/utils.py -> build/lib/torchtitan/components/ft/diloco 2026-02-28T02:15:12,905 copying torchtitan/components/ft/diloco/__init__.py -> build/lib/torchtitan/components/ft/diloco 2026-02-28T02:15:12,908 creating build/lib/torchtitan/components/ft/config 2026-02-28T02:15:12,909 copying torchtitan/components/ft/config/job_config.py -> build/lib/torchtitan/components/ft/config 2026-02-28T02:15:12,911 copying torchtitan/components/ft/config/__init__.py -> build/lib/torchtitan/components/ft/config 
2026-02-28T02:15:12,913 creating build/lib/torchtitan/distributed/deepep 2026-02-28T02:15:12,914 copying torchtitan/distributed/deepep/deepep.py -> build/lib/torchtitan/distributed/deepep 2026-02-28T02:15:12,917 copying torchtitan/distributed/deepep/__init__.py -> build/lib/torchtitan/distributed/deepep 2026-02-28T02:15:12,919 creating build/lib/torchtitan/models/llama4 2026-02-28T02:15:12,920 copying torchtitan/models/llama4/__init__.py -> build/lib/torchtitan/models/llama4 2026-02-28T02:15:12,923 creating build/lib/torchtitan/models/llama3_ft 2026-02-28T02:15:12,924 copying torchtitan/models/llama3_ft/__init__.py -> build/lib/torchtitan/models/llama3_ft 2026-02-28T02:15:12,927 creating build/lib/torchtitan/models/flux 2026-02-28T02:15:12,928 copying torchtitan/models/flux/job_config.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,930 copying torchtitan/models/flux/flux_datasets.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,932 copying torchtitan/models/flux/train.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,935 copying torchtitan/models/flux/validate.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,937 copying torchtitan/models/flux/tokenizer.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,940 copying torchtitan/models/flux/utils.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,942 copying torchtitan/models/flux/__init__.py -> build/lib/torchtitan/models/flux 2026-02-28T02:15:12,944 creating build/lib/torchtitan/models/gpt_oss 2026-02-28T02:15:12,946 copying torchtitan/models/gpt_oss/__init__.py -> build/lib/torchtitan/models/gpt_oss 2026-02-28T02:15:12,949 creating build/lib/torchtitan/models/moe 2026-02-28T02:15:12,950 copying torchtitan/models/moe/moe_deepep.py -> build/lib/torchtitan/models/moe 2026-02-28T02:15:12,952 copying torchtitan/models/moe/moe.py -> build/lib/torchtitan/models/moe 2026-02-28T02:15:12,955 copying torchtitan/models/moe/utils.py -> build/lib/torchtitan/models/moe 
2026-02-28T02:15:12,957 copying torchtitan/models/moe/__init__.py -> build/lib/torchtitan/models/moe 2026-02-28T02:15:12,959 copying torchtitan/models/moe/kernels.py -> build/lib/torchtitan/models/moe 2026-02-28T02:15:12,961 creating build/lib/torchtitan/models/llama3 2026-02-28T02:15:12,962 copying torchtitan/models/llama3/__init__.py -> build/lib/torchtitan/models/llama3 2026-02-28T02:15:12,965 creating build/lib/torchtitan/models/deepseek_v3 2026-02-28T02:15:12,966 copying torchtitan/models/deepseek_v3/__init__.py -> build/lib/torchtitan/models/deepseek_v3 2026-02-28T02:15:12,969 creating build/lib/torchtitan/models/qwen3 2026-02-28T02:15:12,970 copying torchtitan/models/qwen3/__init__.py -> build/lib/torchtitan/models/qwen3 2026-02-28T02:15:12,973 creating build/lib/torchtitan/models/llama4/infra 2026-02-28T02:15:12,974 copying torchtitan/models/llama4/infra/parallelize.py -> build/lib/torchtitan/models/llama4/infra 2026-02-28T02:15:12,978 creating build/lib/torchtitan/models/llama4/model 2026-02-28T02:15:12,979 copying torchtitan/models/llama4/model/state_dict_adapter.py -> build/lib/torchtitan/models/llama4/model 2026-02-28T02:15:12,981 copying torchtitan/models/llama4/model/args.py -> build/lib/torchtitan/models/llama4/model 2026-02-28T02:15:12,983 copying torchtitan/models/llama4/model/model.py -> build/lib/torchtitan/models/llama4/model 2026-02-28T02:15:12,987 creating build/lib/torchtitan/models/flux/infra 2026-02-28T02:15:12,988 copying torchtitan/models/flux/infra/parallelize.py -> build/lib/torchtitan/models/flux/infra 2026-02-28T02:15:12,991 creating build/lib/torchtitan/models/flux/model 2026-02-28T02:15:12,992 copying torchtitan/models/flux/model/state_dict_adapter.py -> build/lib/torchtitan/models/flux/model 2026-02-28T02:15:12,995 copying torchtitan/models/flux/model/args.py -> build/lib/torchtitan/models/flux/model 2026-02-28T02:15:12,997 copying torchtitan/models/flux/model/layers.py -> build/lib/torchtitan/models/flux/model 
2026-02-28T02:15:13,000 copying torchtitan/models/flux/model/model.py -> build/lib/torchtitan/models/flux/model 2026-02-28T02:15:13,002 copying torchtitan/models/flux/model/hf_embedder.py -> build/lib/torchtitan/models/flux/model 2026-02-28T02:15:13,004 copying torchtitan/models/flux/model/autoencoder.py -> build/lib/torchtitan/models/flux/model 2026-02-28T02:15:13,007 creating build/lib/torchtitan/models/flux/inference 2026-02-28T02:15:13,008 copying torchtitan/models/flux/inference/infer.py -> build/lib/torchtitan/models/flux/inference 2026-02-28T02:15:13,010 copying torchtitan/models/flux/inference/sampling.py -> build/lib/torchtitan/models/flux/inference 2026-02-28T02:15:13,013 creating build/lib/torchtitan/models/gpt_oss/infra 2026-02-28T02:15:13,014 copying torchtitan/models/gpt_oss/infra/parallelize.py -> build/lib/torchtitan/models/gpt_oss/infra 2026-02-28T02:15:13,016 copying torchtitan/models/gpt_oss/infra/expert_parallel.py -> build/lib/torchtitan/models/gpt_oss/infra 2026-02-28T02:15:13,019 creating build/lib/torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,020 copying torchtitan/models/gpt_oss/model/state_dict_adapter.py -> build/lib/torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,022 copying torchtitan/models/gpt_oss/model/args.py -> build/lib/torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,024 copying torchtitan/models/gpt_oss/model/model.py -> build/lib/torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,027 copying torchtitan/models/gpt_oss/model/moe.py -> build/lib/torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,030 creating build/lib/torchtitan/models/llama3/infra 2026-02-28T02:15:13,031 copying torchtitan/models/llama3/infra/parallelize.py -> build/lib/torchtitan/models/llama3/infra 2026-02-28T02:15:13,034 creating build/lib/torchtitan/models/llama3/model 2026-02-28T02:15:13,035 copying torchtitan/models/llama3/model/state_dict_adapter.py -> build/lib/torchtitan/models/llama3/model 2026-02-28T02:15:13,038 copying 
torchtitan/models/llama3/model/args.py -> build/lib/torchtitan/models/llama3/model 2026-02-28T02:15:13,040 copying torchtitan/models/llama3/model/model.py -> build/lib/torchtitan/models/llama3/model 2026-02-28T02:15:13,043 creating build/lib/torchtitan/models/deepseek_v3/infra 2026-02-28T02:15:13,044 copying torchtitan/models/deepseek_v3/infra/parallelize.py -> build/lib/torchtitan/models/deepseek_v3/infra 2026-02-28T02:15:13,047 creating build/lib/torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,048 copying torchtitan/models/deepseek_v3/model/state_dict_adapter.py -> build/lib/torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,051 copying torchtitan/models/deepseek_v3/model/args.py -> build/lib/torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,053 copying torchtitan/models/deepseek_v3/model/model.py -> build/lib/torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,056 creating build/lib/torchtitan/models/qwen3/infra 2026-02-28T02:15:13,057 copying torchtitan/models/qwen3/infra/parallelize.py -> build/lib/torchtitan/models/qwen3/infra 2026-02-28T02:15:13,061 creating build/lib/torchtitan/models/qwen3/model 2026-02-28T02:15:13,061 copying torchtitan/models/qwen3/model/state_dict_adapter.py -> build/lib/torchtitan/models/qwen3/model 2026-02-28T02:15:13,064 copying torchtitan/models/qwen3/model/args.py -> build/lib/torchtitan/models/qwen3/model 2026-02-28T02:15:13,066 copying torchtitan/models/qwen3/model/model.py -> build/lib/torchtitan/models/qwen3/model 2026-02-28T02:15:13,069 creating build/lib/torchtitan/experiments/moe_symm_mem_kernels 2026-02-28T02:15:13,070 copying torchtitan/experiments/moe_symm_mem_kernels/combine.py -> build/lib/torchtitan/experiments/moe_symm_mem_kernels 2026-02-28T02:15:13,074 copying torchtitan/experiments/moe_symm_mem_kernels/dispatch.py -> build/lib/torchtitan/experiments/moe_symm_mem_kernels 2026-02-28T02:15:13,077 creating build/lib/torchtitan/experiments/transformers_modeling_backend 2026-02-28T02:15:13,077 
copying torchtitan/experiments/transformers_modeling_backend/job_config.py -> build/lib/torchtitan/experiments/transformers_modeling_backend 2026-02-28T02:15:13,079 copying torchtitan/experiments/transformers_modeling_backend/__init__.py -> build/lib/torchtitan/experiments/transformers_modeling_backend 2026-02-28T02:15:13,083 creating build/lib/torchtitan/experiments/torchcomms 2026-02-28T02:15:13,084 copying torchtitan/experiments/torchcomms/train.py -> build/lib/torchtitan/experiments/torchcomms 2026-02-28T02:15:13,086 copying torchtitan/experiments/torchcomms/parallel_dims.py -> build/lib/torchtitan/experiments/torchcomms 2026-02-28T02:15:13,088 copying torchtitan/experiments/torchcomms/integration_tests.py -> build/lib/torchtitan/experiments/torchcomms 2026-02-28T02:15:13,091 creating build/lib/torchtitan/experiments/autoparallel 2026-02-28T02:15:13,092 copying torchtitan/experiments/autoparallel/job_config.py -> build/lib/torchtitan/experiments/autoparallel 2026-02-28T02:15:13,094 creating build/lib/torchtitan/experiments/vlm 2026-02-28T02:15:13,095 copying torchtitan/experiments/vlm/job_config.py -> build/lib/torchtitan/experiments/vlm 2026-02-28T02:15:13,097 copying torchtitan/experiments/vlm/__init__.py -> build/lib/torchtitan/experiments/vlm 2026-02-28T02:15:13,099 creating build/lib/torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,100 copying torchtitan/experiments/simple_fsdp/job_config.py -> build/lib/torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,102 copying torchtitan/experiments/simple_fsdp/reshard_after_forward.py -> build/lib/torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,104 copying torchtitan/experiments/simple_fsdp/simple_fsdp.py -> build/lib/torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,107 copying torchtitan/experiments/simple_fsdp/backend.py -> build/lib/torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,110 creating build/lib/torchtitan/experiments/ft 2026-02-28T02:15:13,111 copying 
torchtitan/experiments/ft/train.py -> build/lib/torchtitan/experiments/ft 2026-02-28T02:15:13,114 creating build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,115 copying torchtitan/experiments/compiler_toolkit/job_config.py -> build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,117 copying torchtitan/experiments/compiler_toolkit/cudagraph.py -> build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,120 copying torchtitan/experiments/compiler_toolkit/train.py -> build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,122 copying torchtitan/experiments/compiler_toolkit/passes.py -> build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,124 copying torchtitan/experiments/compiler_toolkit/common_utils.py -> build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,126 copying torchtitan/experiments/compiler_toolkit/graph_utils.py -> build/lib/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:13,129 creating build/lib/torchtitan/experiments/forge 2026-02-28T02:15:13,130 copying torchtitan/experiments/forge/job_config.py -> build/lib/torchtitan/experiments/forge 2026-02-28T02:15:13,132 copying torchtitan/experiments/forge/example_train.py -> build/lib/torchtitan/experiments/forge 2026-02-28T02:15:13,135 copying torchtitan/experiments/forge/engine.py -> build/lib/torchtitan/experiments/forge 2026-02-28T02:15:13,137 copying torchtitan/experiments/forge/train_spec.py -> build/lib/torchtitan/experiments/forge 2026-02-28T02:15:13,139 copying torchtitan/experiments/forge/__init__.py -> build/lib/torchtitan/experiments/forge 2026-02-28T02:15:13,141 creating build/lib/torchtitan/experiments/transformers_modeling_backend/infra 2026-02-28T02:15:13,142 copying torchtitan/experiments/transformers_modeling_backend/infra/parallelize.py -> build/lib/torchtitan/experiments/transformers_modeling_backend/infra 2026-02-28T02:15:13,145 copying 
torchtitan/experiments/transformers_modeling_backend/infra/pipeline.py -> build/lib/torchtitan/experiments/transformers_modeling_backend/infra 2026-02-28T02:15:13,148 creating build/lib/torchtitan/experiments/transformers_modeling_backend/tests 2026-02-28T02:15:13,149 copying torchtitan/experiments/transformers_modeling_backend/tests/integration_tests.py -> build/lib/torchtitan/experiments/transformers_modeling_backend/tests 2026-02-28T02:15:13,151 creating build/lib/torchtitan/experiments/transformers_modeling_backend/model 2026-02-28T02:15:13,152 copying torchtitan/experiments/transformers_modeling_backend/model/args.py -> build/lib/torchtitan/experiments/transformers_modeling_backend/model 2026-02-28T02:15:13,155 copying torchtitan/experiments/transformers_modeling_backend/model/model.py -> build/lib/torchtitan/experiments/transformers_modeling_backend/model 2026-02-28T02:15:13,158 creating build/lib/torchtitan/experiments/autoparallel/tests 2026-02-28T02:15:13,159 copying torchtitan/experiments/autoparallel/tests/integration_tests.py -> build/lib/torchtitan/experiments/autoparallel/tests 2026-02-28T02:15:13,161 copying torchtitan/experiments/autoparallel/tests/__init__.py -> build/lib/torchtitan/experiments/autoparallel/tests 2026-02-28T02:15:13,163 creating build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,164 copying torchtitan/experiments/autoparallel/local_map_deepseek_v3/args.py -> build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,166 copying torchtitan/experiments/autoparallel/local_map_deepseek_v3/parallelize_deepseekv3.py -> build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,168 copying torchtitan/experiments/autoparallel/local_map_deepseek_v3/model.py -> build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,170 copying torchtitan/experiments/autoparallel/local_map_deepseek_v3/__init__.py -> 
build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,172 creating build/lib/torchtitan/experiments/autoparallel/llama3 2026-02-28T02:15:13,173 copying torchtitan/experiments/autoparallel/llama3/parallelize_llama.py -> build/lib/torchtitan/experiments/autoparallel/llama3 2026-02-28T02:15:13,176 copying torchtitan/experiments/autoparallel/llama3/__init__.py -> build/lib/torchtitan/experiments/autoparallel/llama3 2026-02-28T02:15:13,178 creating build/lib/torchtitan/experiments/autoparallel/deepseek_v3 2026-02-28T02:15:13,179 copying torchtitan/experiments/autoparallel/deepseek_v3/parallelize_deepseekv3.py -> build/lib/torchtitan/experiments/autoparallel/deepseek_v3 2026-02-28T02:15:13,181 copying torchtitan/experiments/autoparallel/deepseek_v3/__init__.py -> build/lib/torchtitan/experiments/autoparallel/deepseek_v3 2026-02-28T02:15:13,184 creating build/lib/torchtitan/experiments/vlm/infra 2026-02-28T02:15:13,185 copying torchtitan/experiments/vlm/infra/parallelize.py -> build/lib/torchtitan/experiments/vlm/infra 2026-02-28T02:15:13,187 creating build/lib/torchtitan/experiments/vlm/datasets 2026-02-28T02:15:13,188 copying torchtitan/experiments/vlm/datasets/mm_datasets.py -> build/lib/torchtitan/experiments/vlm/datasets 2026-02-28T02:15:13,191 copying torchtitan/experiments/vlm/datasets/mm_collator_nld.py -> build/lib/torchtitan/experiments/vlm/datasets 2026-02-28T02:15:13,194 creating build/lib/torchtitan/experiments/vlm/tests 2026-02-28T02:15:13,195 copying torchtitan/experiments/vlm/tests/integration_tests.py -> build/lib/torchtitan/experiments/vlm/tests 2026-02-28T02:15:13,197 creating build/lib/torchtitan/experiments/vlm/model 2026-02-28T02:15:13,198 copying torchtitan/experiments/vlm/model/args.py -> build/lib/torchtitan/experiments/vlm/model 2026-02-28T02:15:13,200 copying torchtitan/experiments/vlm/model/siglip2.py -> build/lib/torchtitan/experiments/vlm/model 2026-02-28T02:15:13,203 copying 
torchtitan/experiments/vlm/model/model.py -> build/lib/torchtitan/experiments/vlm/model 2026-02-28T02:15:13,206 creating build/lib/torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,207 copying torchtitan/experiments/vlm/datasets/utils/text.py -> build/lib/torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,209 copying torchtitan/experiments/vlm/datasets/utils/image.py -> build/lib/torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,211 copying torchtitan/experiments/vlm/datasets/utils/packing.py -> build/lib/torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,214 creating build/lib/torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,215 copying torchtitan/experiments/simple_fsdp/tests/test_numerics.py -> build/lib/torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,217 copying torchtitan/experiments/simple_fsdp/tests/integration_tests.py -> build/lib/torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,220 copying torchtitan/experiments/simple_fsdp/tests/__init__.py -> build/lib/torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,222 creating build/lib/torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,223 copying torchtitan/experiments/simple_fsdp/llama3/parallelize.py -> build/lib/torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,225 copying torchtitan/experiments/simple_fsdp/llama3/model.py -> build/lib/torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,227 copying torchtitan/experiments/simple_fsdp/llama3/__init__.py -> build/lib/torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,229 creating build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,230 copying torchtitan/experiments/simple_fsdp/deepseek_v3/parallelize.py -> build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,233 copying torchtitan/experiments/simple_fsdp/deepseek_v3/model.py -> 
build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,235 copying torchtitan/experiments/simple_fsdp/deepseek_v3/__init__.py -> build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,237 creating build/lib/torchtitan/experiments/rl/unified 2026-02-28T02:15:13,239 copying torchtitan/experiments/rl/unified/infer.py -> build/lib/torchtitan/experiments/rl/unified 2026-02-28T02:15:13,241 copying torchtitan/experiments/rl/unified/simple_rl_multiprocess.py -> build/lib/torchtitan/experiments/rl/unified 2026-02-28T02:15:13,243 copying torchtitan/experiments/rl/unified/__init__.py -> build/lib/torchtitan/experiments/rl/unified 2026-02-28T02:15:13,246 creating build/lib/torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:13,247 copying torchtitan/experiments/rl/vllm_compat/batch_invariant_backward.py -> build/lib/torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:13,250 copying torchtitan/experiments/rl/vllm_compat/weights_vllm_compat.py -> build/lib/torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:13,252 copying torchtitan/experiments/rl/vllm_compat/simple_rl.py -> build/lib/torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:13,255 copying torchtitan/experiments/rl/vllm_compat/__init__.py -> build/lib/torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:13,257 creating build/lib/torchtitan/experiments/rl/unified/infra 2026-02-28T02:15:13,258 copying torchtitan/experiments/rl/unified/infra/parallelize.py -> build/lib/torchtitan/experiments/rl/unified/infra 2026-02-28T02:15:13,261 copying torchtitan/experiments/rl/unified/infra/parallelism_utils.py -> build/lib/torchtitan/experiments/rl/unified/infra 2026-02-28T02:15:13,264 creating build/lib/torchtitan/experiments/rl/unified/actors 2026-02-28T02:15:13,265 copying torchtitan/experiments/rl/unified/actors/trainer.py -> build/lib/torchtitan/experiments/rl/unified/actors 2026-02-28T02:15:13,267 copying torchtitan/experiments/rl/unified/actors/generator.py -> 
build/lib/torchtitan/experiments/rl/unified/actors 2026-02-28T02:15:13,270 creating build/lib/torchtitan/experiments/rl/unified/models 2026-02-28T02:15:13,271 copying torchtitan/experiments/rl/unified/models/vllm_wrapper.py -> build/lib/torchtitan/experiments/rl/unified/models 2026-02-28T02:15:13,274 copying torchtitan/experiments/rl/unified/models/utils.py -> build/lib/torchtitan/experiments/rl/unified/models 2026-02-28T02:15:13,276 copying torchtitan/experiments/rl/unified/models/attention.py -> build/lib/torchtitan/experiments/rl/unified/models 2026-02-28T02:15:13,279 creating build/lib/torchtitan/experiments/rl/vllm_compat/weights 2026-02-28T02:15:13,280 copying torchtitan/experiments/rl/vllm_compat/weights/converter.py -> build/lib/torchtitan/experiments/rl/vllm_compat/weights 2026-02-28T02:15:13,282 copying torchtitan/experiments/rl/vllm_compat/weights/__init__.py -> build/lib/torchtitan/experiments/rl/vllm_compat/weights 2026-02-28T02:15:13,285 creating build/lib/torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:13,286 copying torchtitan/experiments/rl/vllm_compat/tests/test_exact_determinism.py -> build/lib/torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:13,288 copying torchtitan/experiments/rl/vllm_compat/tests/test_batch_invariant_backward.py -> build/lib/torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:13,290 copying torchtitan/experiments/rl/vllm_compat/tests/__init__.py -> build/lib/torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:13,293 creating build/lib/torchtitan/experiments/rl/vllm_compat/models 2026-02-28T02:15:13,294 copying torchtitan/experiments/rl/vllm_compat/models/__init__.py -> build/lib/torchtitan/experiments/rl/vllm_compat/models 2026-02-28T02:15:13,296 copying torchtitan/experiments/rl/vllm_compat/models/attention.py -> build/lib/torchtitan/experiments/rl/vllm_compat/models 2026-02-28T02:15:13,299 creating build/lib/torchtitan/experiments/rl/vllm_compat/models/qwen3 2026-02-28T02:15:13,300 
copying torchtitan/experiments/rl/vllm_compat/models/qwen3/model_vllm_compat.py -> build/lib/torchtitan/experiments/rl/vllm_compat/models/qwen3 2026-02-28T02:15:13,302 copying torchtitan/experiments/rl/vllm_compat/models/qwen3/__init__.py -> build/lib/torchtitan/experiments/rl/vllm_compat/models/qwen3 2026-02-28T02:15:13,305 creating build/lib/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:13,306 copying torchtitan/experiments/compiler_toolkit/tests/test_numerics.py -> build/lib/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:13,308 copying torchtitan/experiments/compiler_toolkit/tests/test_passes.py -> build/lib/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:13,310 copying torchtitan/experiments/compiler_toolkit/tests/numerics_utils.py -> build/lib/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:13,312 copying torchtitan/experiments/compiler_toolkit/tests/integration_tests.py -> build/lib/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:13,314 copying torchtitan/experiments/compiler_toolkit/tests/__init__.py -> build/lib/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:13,317 creating build/lib/torchtitan/experiments/compiler_toolkit/llama3 2026-02-28T02:15:13,318 copying torchtitan/experiments/compiler_toolkit/llama3/parallelize.py -> build/lib/torchtitan/experiments/compiler_toolkit/llama3 2026-02-28T02:15:13,320 copying torchtitan/experiments/compiler_toolkit/llama3/__init__.py -> build/lib/torchtitan/experiments/compiler_toolkit/llama3 2026-02-28T02:15:13,322 creating build/lib/torchtitan/experiments/compiler_toolkit/deepseek_v3 2026-02-28T02:15:13,323 copying torchtitan/experiments/compiler_toolkit/deepseek_v3/parallelize.py -> build/lib/torchtitan/experiments/compiler_toolkit/deepseek_v3 2026-02-28T02:15:13,325 copying torchtitan/experiments/compiler_toolkit/deepseek_v3/__init__.py -> build/lib/torchtitan/experiments/compiler_toolkit/deepseek_v3 2026-02-28T02:15:13,327 
creating build/lib/torchtitan/experiments/compiler_toolkit/scripts 2026-02-28T02:15:13,328 copying torchtitan/experiments/compiler_toolkit/scripts/check_numerics.py -> build/lib/torchtitan/experiments/compiler_toolkit/scripts 2026-02-28T02:15:13,330 running egg_info 2026-02-28T02:15:13,340 writing torchtitan.egg-info/PKG-INFO 2026-02-28T02:15:13,344 writing dependency_links to torchtitan.egg-info/dependency_links.txt 2026-02-28T02:15:13,346 writing requirements to torchtitan.egg-info/requires.txt 2026-02-28T02:15:13,347 writing top-level names to torchtitan.egg-info/top_level.txt 2026-02-28T02:15:13,436 reading manifest file 'torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:13,452 adding license file 'LICENSE' 2026-02-28T02:15:13,463 writing manifest file 'torchtitan.egg-info/SOURCES.txt' 2026-02-28T02:15:13,532 installing to build/bdist.linux-armv7l/wheel 2026-02-28T02:15:13,532 running install 2026-02-28T02:15:13,556 running install_lib 2026-02-28T02:15:13,561 creating build/bdist.linux-armv7l/wheel 2026-02-28T02:15:13,563 creating build/bdist.linux-armv7l/wheel/torchtitan 2026-02-28T02:15:13,565 creating build/bdist.linux-armv7l/wheel/torchtitan/hf_datasets 2026-02-28T02:15:13,566 copying build/lib/torchtitan/hf_datasets/text_datasets.py -> build/bdist.linux-armv7l/wheel/./torchtitan/hf_datasets 2026-02-28T02:15:13,568 copying build/lib/torchtitan/hf_datasets/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/hf_datasets 2026-02-28T02:15:13,570 copying build/lib/torchtitan/train.py -> build/bdist.linux-armv7l/wheel/./torchtitan 2026-02-28T02:15:13,573 creating build/bdist.linux-armv7l/wheel/torchtitan/tools 2026-02-28T02:15:13,574 copying build/lib/torchtitan/tools/profiling.py -> build/bdist.linux-armv7l/wheel/./torchtitan/tools 2026-02-28T02:15:13,577 copying build/lib/torchtitan/tools/logging.py -> build/bdist.linux-armv7l/wheel/./torchtitan/tools 2026-02-28T02:15:13,579 copying build/lib/torchtitan/tools/utils.py -> 
build/bdist.linux-armv7l/wheel/./torchtitan/tools 2026-02-28T02:15:13,582 creating build/bdist.linux-armv7l/wheel/torchtitan/protocols 2026-02-28T02:15:13,583 copying build/lib/torchtitan/protocols/model_converter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/protocols 2026-02-28T02:15:13,585 copying build/lib/torchtitan/protocols/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/protocols 2026-02-28T02:15:13,587 copying build/lib/torchtitan/protocols/train_spec.py -> build/bdist.linux-armv7l/wheel/./torchtitan/protocols 2026-02-28T02:15:13,589 copying build/lib/torchtitan/protocols/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/protocols 2026-02-28T02:15:13,591 copying build/lib/torchtitan/protocols/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/protocols 2026-02-28T02:15:13,593 creating build/bdist.linux-armv7l/wheel/torchtitan/components 2026-02-28T02:15:13,594 copying build/lib/torchtitan/components/optimizer.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,597 copying build/lib/torchtitan/components/metrics.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,601 copying build/lib/torchtitan/components/lr_scheduler.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,604 creating build/bdist.linux-armv7l/wheel/torchtitan/components/quantization 2026-02-28T02:15:13,605 copying build/lib/torchtitan/components/quantization/float8.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/quantization 2026-02-28T02:15:13,608 copying build/lib/torchtitan/components/quantization/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/quantization 2026-02-28T02:15:13,610 copying build/lib/torchtitan/components/quantization/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/quantization 2026-02-28T02:15:13,612 copying build/lib/torchtitan/components/quantization/mx.py -> 
build/bdist.linux-armv7l/wheel/./torchtitan/components/quantization 2026-02-28T02:15:13,614 copying build/lib/torchtitan/components/loss.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,616 copying build/lib/torchtitan/components/validate.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,619 creating build/bdist.linux-armv7l/wheel/torchtitan/components/ft 2026-02-28T02:15:13,621 creating build/bdist.linux-armv7l/wheel/torchtitan/components/ft/diloco 2026-02-28T02:15:13,622 copying build/lib/torchtitan/components/ft/diloco/protocol.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft/diloco 2026-02-28T02:15:13,624 copying build/lib/torchtitan/components/ft/diloco/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft/diloco 2026-02-28T02:15:13,626 copying build/lib/torchtitan/components/ft/diloco/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft/diloco 2026-02-28T02:15:13,627 copying build/lib/torchtitan/components/ft/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft 2026-02-28T02:15:13,629 copying build/lib/torchtitan/components/ft/manager.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft 2026-02-28T02:15:13,632 creating build/bdist.linux-armv7l/wheel/torchtitan/components/ft/config 2026-02-28T02:15:13,633 copying build/lib/torchtitan/components/ft/config/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft/config 2026-02-28T02:15:13,635 copying build/lib/torchtitan/components/ft/config/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components/ft/config 2026-02-28T02:15:13,637 copying build/lib/torchtitan/components/tokenizer.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,639 copying build/lib/torchtitan/components/checkpoint.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,642 copying 
build/lib/torchtitan/components/dataloader.py -> build/bdist.linux-armv7l/wheel/./torchtitan/components 2026-02-28T02:15:13,645 creating build/bdist.linux-armv7l/wheel/torchtitan/distributed 2026-02-28T02:15:13,646 copying build/lib/torchtitan/distributed/activation_checkpoint.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,649 copying build/lib/torchtitan/distributed/dual_pipe_v.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,651 copying build/lib/torchtitan/distributed/tensor_parallel.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,653 copying build/lib/torchtitan/distributed/parallel_dims.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,656 copying build/lib/torchtitan/distributed/pipeline_parallel.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,658 creating build/bdist.linux-armv7l/wheel/torchtitan/distributed/deepep 2026-02-28T02:15:13,659 copying build/lib/torchtitan/distributed/deepep/deepep.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed/deepep 2026-02-28T02:15:13,662 copying build/lib/torchtitan/distributed/deepep/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed/deepep 2026-02-28T02:15:13,664 copying build/lib/torchtitan/distributed/context_parallel.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,666 copying build/lib/torchtitan/distributed/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,668 copying build/lib/torchtitan/distributed/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,670 copying build/lib/torchtitan/distributed/expert_parallel.py -> build/bdist.linux-armv7l/wheel/./torchtitan/distributed 2026-02-28T02:15:13,673 creating build/bdist.linux-armv7l/wheel/torchtitan/models 2026-02-28T02:15:13,675 creating 
build/bdist.linux-armv7l/wheel/torchtitan/models/llama4 2026-02-28T02:15:13,677 creating build/bdist.linux-armv7l/wheel/torchtitan/models/llama4/infra 2026-02-28T02:15:13,678 copying build/lib/torchtitan/models/llama4/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama4/infra 2026-02-28T02:15:13,681 creating build/bdist.linux-armv7l/wheel/torchtitan/models/llama4/model 2026-02-28T02:15:13,682 copying build/lib/torchtitan/models/llama4/model/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama4/model 2026-02-28T02:15:13,684 copying build/lib/torchtitan/models/llama4/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama4/model 2026-02-28T02:15:13,686 copying build/lib/torchtitan/models/llama4/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama4/model 2026-02-28T02:15:13,689 copying build/lib/torchtitan/models/llama4/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama4 2026-02-28T02:15:13,691 creating build/bdist.linux-armv7l/wheel/torchtitan/models/llama3_ft 2026-02-28T02:15:13,692 copying build/lib/torchtitan/models/llama3_ft/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama3_ft 2026-02-28T02:15:13,695 creating build/bdist.linux-armv7l/wheel/torchtitan/models/flux 2026-02-28T02:15:13,696 creating build/bdist.linux-armv7l/wheel/torchtitan/models/flux/infra 2026-02-28T02:15:13,697 copying build/lib/torchtitan/models/flux/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/infra 2026-02-28T02:15:13,700 copying build/lib/torchtitan/models/flux/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 2026-02-28T02:15:13,702 copying build/lib/torchtitan/models/flux/flux_datasets.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 2026-02-28T02:15:13,704 copying build/lib/torchtitan/models/flux/train.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 
2026-02-28T02:15:13,707 creating build/bdist.linux-armv7l/wheel/torchtitan/models/flux/model 2026-02-28T02:15:13,708 copying build/lib/torchtitan/models/flux/model/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/model 2026-02-28T02:15:13,711 copying build/lib/torchtitan/models/flux/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/model 2026-02-28T02:15:13,713 copying build/lib/torchtitan/models/flux/model/layers.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/model 2026-02-28T02:15:13,716 copying build/lib/torchtitan/models/flux/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/model 2026-02-28T02:15:13,718 copying build/lib/torchtitan/models/flux/model/hf_embedder.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/model 2026-02-28T02:15:13,720 copying build/lib/torchtitan/models/flux/model/autoencoder.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/model 2026-02-28T02:15:13,723 creating build/bdist.linux-armv7l/wheel/torchtitan/models/flux/inference 2026-02-28T02:15:13,724 copying build/lib/torchtitan/models/flux/inference/infer.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/inference 2026-02-28T02:15:13,727 copying build/lib/torchtitan/models/flux/inference/sampling.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux/inference 2026-02-28T02:15:13,729 copying build/lib/torchtitan/models/flux/validate.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 2026-02-28T02:15:13,732 copying build/lib/torchtitan/models/flux/tokenizer.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 2026-02-28T02:15:13,734 copying build/lib/torchtitan/models/flux/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 2026-02-28T02:15:13,736 copying build/lib/torchtitan/models/flux/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/flux 2026-02-28T02:15:13,738 creating 
build/bdist.linux-armv7l/wheel/torchtitan/models/gpt_oss 2026-02-28T02:15:13,740 creating build/bdist.linux-armv7l/wheel/torchtitan/models/gpt_oss/infra 2026-02-28T02:15:13,741 copying build/lib/torchtitan/models/gpt_oss/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss/infra 2026-02-28T02:15:13,744 copying build/lib/torchtitan/models/gpt_oss/infra/expert_parallel.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss/infra 2026-02-28T02:15:13,746 creating build/bdist.linux-armv7l/wheel/torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,747 copying build/lib/torchtitan/models/gpt_oss/model/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,750 copying build/lib/torchtitan/models/gpt_oss/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,751 copying build/lib/torchtitan/models/gpt_oss/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,754 copying build/lib/torchtitan/models/gpt_oss/model/moe.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss/model 2026-02-28T02:15:13,756 copying build/lib/torchtitan/models/gpt_oss/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/gpt_oss 2026-02-28T02:15:13,759 creating build/bdist.linux-armv7l/wheel/torchtitan/models/moe 2026-02-28T02:15:13,760 copying build/lib/torchtitan/models/moe/moe_deepep.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/moe 2026-02-28T02:15:13,762 copying build/lib/torchtitan/models/moe/moe.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/moe 2026-02-28T02:15:13,764 copying build/lib/torchtitan/models/moe/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/moe 2026-02-28T02:15:13,766 copying build/lib/torchtitan/models/moe/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/moe 2026-02-28T02:15:13,768 copying 
build/lib/torchtitan/models/moe/kernels.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/moe 2026-02-28T02:15:13,771 creating build/bdist.linux-armv7l/wheel/torchtitan/models/llama3 2026-02-28T02:15:13,773 creating build/bdist.linux-armv7l/wheel/torchtitan/models/llama3/infra 2026-02-28T02:15:13,774 copying build/lib/torchtitan/models/llama3/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama3/infra 2026-02-28T02:15:13,777 creating build/bdist.linux-armv7l/wheel/torchtitan/models/llama3/model 2026-02-28T02:15:13,778 copying build/lib/torchtitan/models/llama3/model/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama3/model 2026-02-28T02:15:13,781 copying build/lib/torchtitan/models/llama3/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama3/model 2026-02-28T02:15:13,783 copying build/lib/torchtitan/models/llama3/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama3/model 2026-02-28T02:15:13,785 copying build/lib/torchtitan/models/llama3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/llama3 2026-02-28T02:15:13,788 creating build/bdist.linux-armv7l/wheel/torchtitan/models/deepseek_v3 2026-02-28T02:15:13,790 creating build/bdist.linux-armv7l/wheel/torchtitan/models/deepseek_v3/infra 2026-02-28T02:15:13,791 copying build/lib/torchtitan/models/deepseek_v3/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/deepseek_v3/infra 2026-02-28T02:15:13,794 creating build/bdist.linux-armv7l/wheel/torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,795 copying build/lib/torchtitan/models/deepseek_v3/model/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,798 copying build/lib/torchtitan/models/deepseek_v3/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,800 copying 
build/lib/torchtitan/models/deepseek_v3/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/deepseek_v3/model 2026-02-28T02:15:13,803 copying build/lib/torchtitan/models/deepseek_v3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/deepseek_v3 2026-02-28T02:15:13,805 copying build/lib/torchtitan/models/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models 2026-02-28T02:15:13,808 copying build/lib/torchtitan/models/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models 2026-02-28T02:15:13,810 creating build/bdist.linux-armv7l/wheel/torchtitan/models/qwen3 2026-02-28T02:15:13,811 creating build/bdist.linux-armv7l/wheel/torchtitan/models/qwen3/infra 2026-02-28T02:15:13,812 copying build/lib/torchtitan/models/qwen3/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/qwen3/infra 2026-02-28T02:15:13,815 creating build/bdist.linux-armv7l/wheel/torchtitan/models/qwen3/model 2026-02-28T02:15:13,816 copying build/lib/torchtitan/models/qwen3/model/state_dict_adapter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/qwen3/model 2026-02-28T02:15:13,819 copying build/lib/torchtitan/models/qwen3/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/qwen3/model 2026-02-28T02:15:13,821 copying build/lib/torchtitan/models/qwen3/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/qwen3/model 2026-02-28T02:15:13,824 copying build/lib/torchtitan/models/qwen3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models/qwen3 2026-02-28T02:15:13,826 copying build/lib/torchtitan/models/attention.py -> build/bdist.linux-armv7l/wheel/./torchtitan/models 2026-02-28T02:15:13,830 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments 2026-02-28T02:15:13,831 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/moe_symm_mem_kernels 2026-02-28T02:15:13,832 copying build/lib/torchtitan/experiments/moe_symm_mem_kernels/combine.py -> 
build/bdist.linux-armv7l/wheel/./torchtitan/experiments/moe_symm_mem_kernels 2026-02-28T02:15:13,835 copying build/lib/torchtitan/experiments/moe_symm_mem_kernels/dispatch.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/moe_symm_mem_kernels 2026-02-28T02:15:13,838 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/transformers_modeling_backend 2026-02-28T02:15:13,840 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/transformers_modeling_backend/infra 2026-02-28T02:15:13,841 copying build/lib/torchtitan/experiments/transformers_modeling_backend/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend/infra 2026-02-28T02:15:13,844 copying build/lib/torchtitan/experiments/transformers_modeling_backend/infra/pipeline.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend/infra 2026-02-28T02:15:13,846 copying build/lib/torchtitan/experiments/transformers_modeling_backend/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend 2026-02-28T02:15:13,849 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/transformers_modeling_backend/tests 2026-02-28T02:15:13,850 copying build/lib/torchtitan/experiments/transformers_modeling_backend/tests/integration_tests.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend/tests 2026-02-28T02:15:13,852 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/transformers_modeling_backend/model 2026-02-28T02:15:13,853 copying build/lib/torchtitan/experiments/transformers_modeling_backend/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend/model 2026-02-28T02:15:13,856 copying build/lib/torchtitan/experiments/transformers_modeling_backend/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend/model 
2026-02-28T02:15:13,859 copying build/lib/torchtitan/experiments/transformers_modeling_backend/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/transformers_modeling_backend 2026-02-28T02:15:13,861 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/torchcomms 2026-02-28T02:15:13,863 copying build/lib/torchtitan/experiments/torchcomms/train.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/torchcomms 2026-02-28T02:15:13,865 copying build/lib/torchtitan/experiments/torchcomms/parallel_dims.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/torchcomms 2026-02-28T02:15:13,868 copying build/lib/torchtitan/experiments/torchcomms/integration_tests.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/torchcomms 2026-02-28T02:15:13,870 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/autoparallel 2026-02-28T02:15:13,871 copying build/lib/torchtitan/experiments/autoparallel/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel 2026-02-28T02:15:13,874 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/autoparallel/tests 2026-02-28T02:15:13,875 copying build/lib/torchtitan/experiments/autoparallel/tests/integration_tests.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/tests 2026-02-28T02:15:13,877 copying build/lib/torchtitan/experiments/autoparallel/tests/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/tests 2026-02-28T02:15:13,879 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,880 copying build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,882 copying build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3/parallelize_deepseekv3.py -> 
build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,884 copying build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,886 copying build/lib/torchtitan/experiments/autoparallel/local_map_deepseek_v3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/local_map_deepseek_v3 2026-02-28T02:15:13,889 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/autoparallel/llama3 2026-02-28T02:15:13,890 copying build/lib/torchtitan/experiments/autoparallel/llama3/parallelize_llama.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/llama3 2026-02-28T02:15:13,892 copying build/lib/torchtitan/experiments/autoparallel/llama3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/llama3 2026-02-28T02:15:13,895 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/autoparallel/deepseek_v3 2026-02-28T02:15:13,896 copying build/lib/torchtitan/experiments/autoparallel/deepseek_v3/parallelize_deepseekv3.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/deepseek_v3 2026-02-28T02:15:13,899 copying build/lib/torchtitan/experiments/autoparallel/deepseek_v3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/autoparallel/deepseek_v3 2026-02-28T02:15:13,901 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/vlm 2026-02-28T02:15:13,903 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/vlm/infra 2026-02-28T02:15:13,904 copying build/lib/torchtitan/experiments/vlm/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/infra 2026-02-28T02:15:13,907 copying build/lib/torchtitan/experiments/vlm/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm 2026-02-28T02:15:13,909 creating 
build/bdist.linux-armv7l/wheel/torchtitan/experiments/vlm/datasets 2026-02-28T02:15:13,910 copying build/lib/torchtitan/experiments/vlm/datasets/mm_datasets.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/datasets 2026-02-28T02:15:13,913 copying build/lib/torchtitan/experiments/vlm/datasets/mm_collator_nld.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/datasets 2026-02-28T02:15:13,917 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,918 copying build/lib/torchtitan/experiments/vlm/datasets/utils/text.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,920 copying build/lib/torchtitan/experiments/vlm/datasets/utils/image.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,923 copying build/lib/torchtitan/experiments/vlm/datasets/utils/packing.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/datasets/utils 2026-02-28T02:15:13,926 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/vlm/tests 2026-02-28T02:15:13,927 copying build/lib/torchtitan/experiments/vlm/tests/integration_tests.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/tests 2026-02-28T02:15:13,929 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/vlm/model 2026-02-28T02:15:13,931 copying build/lib/torchtitan/experiments/vlm/model/args.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/model 2026-02-28T02:15:13,933 copying build/lib/torchtitan/experiments/vlm/model/siglip2.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/model 2026-02-28T02:15:13,935 copying build/lib/torchtitan/experiments/vlm/model/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm/model 2026-02-28T02:15:13,938 copying build/lib/torchtitan/experiments/vlm/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/vlm 
2026-02-28T02:15:13,940 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,941 copying build/lib/torchtitan/experiments/simple_fsdp/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,944 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,945 copying build/lib/torchtitan/experiments/simple_fsdp/tests/test_numerics.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,947 copying build/lib/torchtitan/experiments/simple_fsdp/tests/integration_tests.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,950 copying build/lib/torchtitan/experiments/simple_fsdp/tests/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/tests 2026-02-28T02:15:13,951 copying build/lib/torchtitan/experiments/simple_fsdp/reshard_after_forward.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,954 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,956 copying build/lib/torchtitan/experiments/simple_fsdp/llama3/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,958 copying build/lib/torchtitan/experiments/simple_fsdp/llama3/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,960 copying build/lib/torchtitan/experiments/simple_fsdp/llama3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/llama3 2026-02-28T02:15:13,963 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,964 copying build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/deepseek_v3 
2026-02-28T02:15:13,966 copying build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3/model.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,968 copying build/lib/torchtitan/experiments/simple_fsdp/deepseek_v3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp/deepseek_v3 2026-02-28T02:15:13,970 copying build/lib/torchtitan/experiments/simple_fsdp/simple_fsdp.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,973 copying build/lib/torchtitan/experiments/simple_fsdp/backend.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/simple_fsdp 2026-02-28T02:15:13,976 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/ft 2026-02-28T02:15:13,977 copying build/lib/torchtitan/experiments/ft/train.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/ft 2026-02-28T02:15:13,980 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl 2026-02-28T02:15:13,982 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/unified 2026-02-28T02:15:13,983 copying build/lib/torchtitan/experiments/rl/unified/infer.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified 2026-02-28T02:15:13,986 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/unified/infra 2026-02-28T02:15:13,987 copying build/lib/torchtitan/experiments/rl/unified/infra/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/infra 2026-02-28T02:15:13,989 copying build/lib/torchtitan/experiments/rl/unified/infra/parallelism_utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/infra 2026-02-28T02:15:13,992 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/unified/actors 2026-02-28T02:15:13,993 copying build/lib/torchtitan/experiments/rl/unified/actors/trainer.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/actors 
2026-02-28T02:15:13,996 copying build/lib/torchtitan/experiments/rl/unified/actors/generator.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/actors 2026-02-28T02:15:13,999 copying build/lib/torchtitan/experiments/rl/unified/simple_rl_multiprocess.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified 2026-02-28T02:15:14,002 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/unified/models 2026-02-28T02:15:14,003 copying build/lib/torchtitan/experiments/rl/unified/models/vllm_wrapper.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/models 2026-02-28T02:15:14,006 copying build/lib/torchtitan/experiments/rl/unified/models/utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/models 2026-02-28T02:15:14,008 copying build/lib/torchtitan/experiments/rl/unified/models/attention.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified/models 2026-02-28T02:15:14,010 copying build/lib/torchtitan/experiments/rl/unified/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/unified 2026-02-28T02:15:14,013 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:14,014 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/vllm_compat/weights 2026-02-28T02:15:14,016 copying build/lib/torchtitan/experiments/rl/vllm_compat/weights/converter.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/weights 2026-02-28T02:15:14,018 copying build/lib/torchtitan/experiments/rl/vllm_compat/weights/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/weights 2026-02-28T02:15:14,021 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:14,022 copying build/lib/torchtitan/experiments/rl/vllm_compat/tests/test_exact_determinism.py -> 
build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:14,024 copying build/lib/torchtitan/experiments/rl/vllm_compat/tests/test_batch_invariant_backward.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:14,026 copying build/lib/torchtitan/experiments/rl/vllm_compat/tests/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/tests 2026-02-28T02:15:14,028 copying build/lib/torchtitan/experiments/rl/vllm_compat/batch_invariant_backward.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:14,031 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/vllm_compat/models 2026-02-28T02:15:14,033 copying build/lib/torchtitan/experiments/rl/vllm_compat/models/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/models 2026-02-28T02:15:14,035 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/rl/vllm_compat/models/qwen3 2026-02-28T02:15:14,036 copying build/lib/torchtitan/experiments/rl/vllm_compat/models/qwen3/model_vllm_compat.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/models/qwen3 2026-02-28T02:15:14,039 copying build/lib/torchtitan/experiments/rl/vllm_compat/models/qwen3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/models/qwen3 2026-02-28T02:15:14,041 copying build/lib/torchtitan/experiments/rl/vllm_compat/models/attention.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat/models 2026-02-28T02:15:14,043 copying build/lib/torchtitan/experiments/rl/vllm_compat/weights_vllm_compat.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:14,045 copying build/lib/torchtitan/experiments/rl/vllm_compat/simple_rl.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:14,049 copying 
build/lib/torchtitan/experiments/rl/vllm_compat/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/rl/vllm_compat 2026-02-28T02:15:14,051 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,052 copying build/lib/torchtitan/experiments/compiler_toolkit/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,054 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:14,056 copying build/lib/torchtitan/experiments/compiler_toolkit/tests/test_numerics.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:14,058 copying build/lib/torchtitan/experiments/compiler_toolkit/tests/test_passes.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:14,060 copying build/lib/torchtitan/experiments/compiler_toolkit/tests/numerics_utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:14,062 copying build/lib/torchtitan/experiments/compiler_toolkit/tests/integration_tests.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:14,065 copying build/lib/torchtitan/experiments/compiler_toolkit/tests/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/tests 2026-02-28T02:15:14,067 copying build/lib/torchtitan/experiments/compiler_toolkit/cudagraph.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,069 copying build/lib/torchtitan/experiments/compiler_toolkit/train.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,071 copying build/lib/torchtitan/experiments/compiler_toolkit/passes.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,073 copying 
build/lib/torchtitan/experiments/compiler_toolkit/common_utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,075 copying build/lib/torchtitan/experiments/compiler_toolkit/graph_utils.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit 2026-02-28T02:15:14,079 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/compiler_toolkit/llama3 2026-02-28T02:15:14,080 copying build/lib/torchtitan/experiments/compiler_toolkit/llama3/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/llama3 2026-02-28T02:15:14,082 copying build/lib/torchtitan/experiments/compiler_toolkit/llama3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/llama3 2026-02-28T02:15:14,085 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/compiler_toolkit/deepseek_v3 2026-02-28T02:15:14,086 copying build/lib/torchtitan/experiments/compiler_toolkit/deepseek_v3/parallelize.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/deepseek_v3 2026-02-28T02:15:14,088 copying build/lib/torchtitan/experiments/compiler_toolkit/deepseek_v3/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/deepseek_v3 2026-02-28T02:15:14,090 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/compiler_toolkit/scripts 2026-02-28T02:15:14,091 copying build/lib/torchtitan/experiments/compiler_toolkit/scripts/check_numerics.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/compiler_toolkit/scripts 2026-02-28T02:15:14,093 copying build/lib/torchtitan/experiments/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments 2026-02-28T02:15:14,096 creating build/bdist.linux-armv7l/wheel/torchtitan/experiments/forge 2026-02-28T02:15:14,097 copying build/lib/torchtitan/experiments/forge/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/forge 
2026-02-28T02:15:14,099 copying build/lib/torchtitan/experiments/forge/example_train.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/forge 2026-02-28T02:15:14,101 copying build/lib/torchtitan/experiments/forge/engine.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/forge 2026-02-28T02:15:14,103 copying build/lib/torchtitan/experiments/forge/train_spec.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/forge 2026-02-28T02:15:14,105 copying build/lib/torchtitan/experiments/forge/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/experiments/forge 2026-02-28T02:15:14,108 copying build/lib/torchtitan/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan 2026-02-28T02:15:14,111 creating build/bdist.linux-armv7l/wheel/torchtitan/config 2026-02-28T02:15:14,112 copying build/lib/torchtitan/config/job_config.py -> build/bdist.linux-armv7l/wheel/./torchtitan/config 2026-02-28T02:15:14,115 copying build/lib/torchtitan/config/__init__.py -> build/bdist.linux-armv7l/wheel/./torchtitan/config 2026-02-28T02:15:14,117 copying build/lib/torchtitan/config/manager.py -> build/bdist.linux-armv7l/wheel/./torchtitan/config 2026-02-28T02:15:14,120 running install_egg_info 2026-02-28T02:15:14,124 Copying torchtitan.egg-info to build/bdist.linux-armv7l/wheel/./torchtitan-0.2.2-py3.11.egg-info 2026-02-28T02:15:14,136 running install_scripts 2026-02-28T02:15:14,147 creating build/bdist.linux-armv7l/wheel/torchtitan-0.2.2.dist-info/WHEEL 2026-02-28T02:15:14,150 creating '/tmp/pip-wheel-j41ln14x/.tmp-em23gbna/torchtitan-0.2.2-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-02-28T02:15:14,153 adding 'torchtitan/__init__.py' 2026-02-28T02:15:14,158 adding 'torchtitan/train.py' 2026-02-28T02:15:14,163 adding 'torchtitan/components/checkpoint.py' 2026-02-28T02:15:14,165 adding 'torchtitan/components/dataloader.py' 2026-02-28T02:15:14,167 adding 'torchtitan/components/loss.py' 2026-02-28T02:15:14,170 adding 
'torchtitan/components/lr_scheduler.py' 2026-02-28T02:15:14,173 adding 'torchtitan/components/metrics.py' 2026-02-28T02:15:14,176 adding 'torchtitan/components/optimizer.py' 2026-02-28T02:15:14,179 adding 'torchtitan/components/tokenizer.py' 2026-02-28T02:15:14,181 adding 'torchtitan/components/validate.py' 2026-02-28T02:15:14,183 adding 'torchtitan/components/ft/__init__.py' 2026-02-28T02:15:14,185 adding 'torchtitan/components/ft/manager.py' 2026-02-28T02:15:14,187 adding 'torchtitan/components/ft/config/__init__.py' 2026-02-28T02:15:14,189 adding 'torchtitan/components/ft/config/job_config.py' 2026-02-28T02:15:14,191 adding 'torchtitan/components/ft/diloco/__init__.py' 2026-02-28T02:15:14,193 adding 'torchtitan/components/ft/diloco/protocol.py' 2026-02-28T02:15:14,195 adding 'torchtitan/components/ft/diloco/utils.py' 2026-02-28T02:15:14,197 adding 'torchtitan/components/quantization/__init__.py' 2026-02-28T02:15:14,199 adding 'torchtitan/components/quantization/float8.py' 2026-02-28T02:15:14,201 adding 'torchtitan/components/quantization/mx.py' 2026-02-28T02:15:14,203 adding 'torchtitan/components/quantization/utils.py' 2026-02-28T02:15:14,205 adding 'torchtitan/config/__init__.py' 2026-02-28T02:15:14,211 adding 'torchtitan/config/job_config.py' 2026-02-28T02:15:14,214 adding 'torchtitan/config/manager.py' 2026-02-28T02:15:14,217 adding 'torchtitan/distributed/__init__.py' 2026-02-28T02:15:14,219 adding 'torchtitan/distributed/activation_checkpoint.py' 2026-02-28T02:15:14,221 adding 'torchtitan/distributed/context_parallel.py' 2026-02-28T02:15:14,223 adding 'torchtitan/distributed/dual_pipe_v.py' 2026-02-28T02:15:14,226 adding 'torchtitan/distributed/expert_parallel.py' 2026-02-28T02:15:14,229 adding 'torchtitan/distributed/parallel_dims.py' 2026-02-28T02:15:14,232 adding 'torchtitan/distributed/pipeline_parallel.py' 2026-02-28T02:15:14,234 adding 'torchtitan/distributed/tensor_parallel.py' 2026-02-28T02:15:14,237 adding 'torchtitan/distributed/utils.py' 
2026-02-28T02:15:14,239 adding 'torchtitan/distributed/deepep/__init__.py' 2026-02-28T02:15:14,242 adding 'torchtitan/distributed/deepep/deepep.py' 2026-02-28T02:15:14,245 adding 'torchtitan/experiments/__init__.py' 2026-02-28T02:15:14,247 adding 'torchtitan/experiments/autoparallel/job_config.py' 2026-02-28T02:15:14,249 adding 'torchtitan/experiments/autoparallel/deepseek_v3/__init__.py' 2026-02-28T02:15:14,252 adding 'torchtitan/experiments/autoparallel/deepseek_v3/parallelize_deepseekv3.py' 2026-02-28T02:15:14,254 adding 'torchtitan/experiments/autoparallel/llama3/__init__.py' 2026-02-28T02:15:14,256 adding 'torchtitan/experiments/autoparallel/llama3/parallelize_llama.py' 2026-02-28T02:15:14,258 adding 'torchtitan/experiments/autoparallel/local_map_deepseek_v3/__init__.py' 2026-02-28T02:15:14,260 adding 'torchtitan/experiments/autoparallel/local_map_deepseek_v3/args.py' 2026-02-28T02:15:14,262 adding 'torchtitan/experiments/autoparallel/local_map_deepseek_v3/model.py' 2026-02-28T02:15:14,264 adding 'torchtitan/experiments/autoparallel/local_map_deepseek_v3/parallelize_deepseekv3.py' 2026-02-28T02:15:14,266 adding 'torchtitan/experiments/autoparallel/tests/__init__.py' 2026-02-28T02:15:14,268 adding 'torchtitan/experiments/autoparallel/tests/integration_tests.py' 2026-02-28T02:15:14,270 adding 'torchtitan/experiments/compiler_toolkit/common_utils.py' 2026-02-28T02:15:14,272 adding 'torchtitan/experiments/compiler_toolkit/cudagraph.py' 2026-02-28T02:15:14,275 adding 'torchtitan/experiments/compiler_toolkit/graph_utils.py' 2026-02-28T02:15:14,277 adding 'torchtitan/experiments/compiler_toolkit/job_config.py' 2026-02-28T02:15:14,279 adding 'torchtitan/experiments/compiler_toolkit/passes.py' 2026-02-28T02:15:14,281 adding 'torchtitan/experiments/compiler_toolkit/train.py' 2026-02-28T02:15:14,283 adding 'torchtitan/experiments/compiler_toolkit/deepseek_v3/__init__.py' 2026-02-28T02:15:14,285 adding 'torchtitan/experiments/compiler_toolkit/deepseek_v3/parallelize.py' 
2026-02-28T02:15:14,287 adding 'torchtitan/experiments/compiler_toolkit/llama3/__init__.py' 2026-02-28T02:15:14,289 adding 'torchtitan/experiments/compiler_toolkit/llama3/parallelize.py' 2026-02-28T02:15:14,291 adding 'torchtitan/experiments/compiler_toolkit/scripts/check_numerics.py' 2026-02-28T02:15:14,294 adding 'torchtitan/experiments/compiler_toolkit/tests/__init__.py' 2026-02-28T02:15:14,296 adding 'torchtitan/experiments/compiler_toolkit/tests/integration_tests.py' 2026-02-28T02:15:14,298 adding 'torchtitan/experiments/compiler_toolkit/tests/numerics_utils.py' 2026-02-28T02:15:14,300 adding 'torchtitan/experiments/compiler_toolkit/tests/test_numerics.py' 2026-02-28T02:15:14,302 adding 'torchtitan/experiments/compiler_toolkit/tests/test_passes.py' 2026-02-28T02:15:14,304 adding 'torchtitan/experiments/forge/__init__.py' 2026-02-28T02:15:14,306 adding 'torchtitan/experiments/forge/engine.py' 2026-02-28T02:15:14,309 adding 'torchtitan/experiments/forge/example_train.py' 2026-02-28T02:15:14,311 adding 'torchtitan/experiments/forge/job_config.py' 2026-02-28T02:15:14,313 adding 'torchtitan/experiments/forge/train_spec.py' 2026-02-28T02:15:14,316 adding 'torchtitan/experiments/ft/train.py' 2026-02-28T02:15:14,319 adding 'torchtitan/experiments/moe_symm_mem_kernels/combine.py' 2026-02-28T02:15:14,322 adding 'torchtitan/experiments/moe_symm_mem_kernels/dispatch.py' 2026-02-28T02:15:14,325 adding 'torchtitan/experiments/rl/unified/__init__.py' 2026-02-28T02:15:14,326 adding 'torchtitan/experiments/rl/unified/infer.py' 2026-02-28T02:15:14,328 adding 'torchtitan/experiments/rl/unified/simple_rl_multiprocess.py' 2026-02-28T02:15:14,331 adding 'torchtitan/experiments/rl/unified/actors/generator.py' 2026-02-28T02:15:14,332 adding 'torchtitan/experiments/rl/unified/actors/trainer.py' 2026-02-28T02:15:14,334 adding 'torchtitan/experiments/rl/unified/infra/parallelism_utils.py' 2026-02-28T02:15:14,336 adding 'torchtitan/experiments/rl/unified/infra/parallelize.py' 
2026-02-28T02:15:14,338 adding 'torchtitan/experiments/rl/unified/models/attention.py' 2026-02-28T02:15:14,339 adding 'torchtitan/experiments/rl/unified/models/utils.py' 2026-02-28T02:15:14,341 adding 'torchtitan/experiments/rl/unified/models/vllm_wrapper.py' 2026-02-28T02:15:14,343 adding 'torchtitan/experiments/rl/vllm_compat/__init__.py' 2026-02-28T02:15:14,345 adding 'torchtitan/experiments/rl/vllm_compat/batch_invariant_backward.py' 2026-02-28T02:15:14,350 adding 'torchtitan/experiments/rl/vllm_compat/simple_rl.py' 2026-02-28T02:15:14,352 adding 'torchtitan/experiments/rl/vllm_compat/weights_vllm_compat.py' 2026-02-28T02:15:14,354 adding 'torchtitan/experiments/rl/vllm_compat/models/__init__.py' 2026-02-28T02:15:14,356 adding 'torchtitan/experiments/rl/vllm_compat/models/attention.py' 2026-02-28T02:15:14,357 adding 'torchtitan/experiments/rl/vllm_compat/models/qwen3/__init__.py' 2026-02-28T02:15:14,359 adding 'torchtitan/experiments/rl/vllm_compat/models/qwen3/model_vllm_compat.py' 2026-02-28T02:15:14,361 adding 'torchtitan/experiments/rl/vllm_compat/tests/__init__.py' 2026-02-28T02:15:14,362 adding 'torchtitan/experiments/rl/vllm_compat/tests/test_batch_invariant_backward.py' 2026-02-28T02:15:14,364 adding 'torchtitan/experiments/rl/vllm_compat/tests/test_exact_determinism.py' 2026-02-28T02:15:14,366 adding 'torchtitan/experiments/rl/vllm_compat/weights/__init__.py' 2026-02-28T02:15:14,367 adding 'torchtitan/experiments/rl/vllm_compat/weights/converter.py' 2026-02-28T02:15:14,370 adding 'torchtitan/experiments/simple_fsdp/backend.py' 2026-02-28T02:15:14,371 adding 'torchtitan/experiments/simple_fsdp/job_config.py' 2026-02-28T02:15:14,372 adding 'torchtitan/experiments/simple_fsdp/reshard_after_forward.py' 2026-02-28T02:15:14,374 adding 'torchtitan/experiments/simple_fsdp/simple_fsdp.py' 2026-02-28T02:15:14,376 adding 'torchtitan/experiments/simple_fsdp/deepseek_v3/__init__.py' 2026-02-28T02:15:14,377 adding 
'torchtitan/experiments/simple_fsdp/deepseek_v3/model.py' 2026-02-28T02:15:14,379 adding 'torchtitan/experiments/simple_fsdp/deepseek_v3/parallelize.py' 2026-02-28T02:15:14,381 adding 'torchtitan/experiments/simple_fsdp/llama3/__init__.py' 2026-02-28T02:15:14,382 adding 'torchtitan/experiments/simple_fsdp/llama3/model.py' 2026-02-28T02:15:14,384 adding 'torchtitan/experiments/simple_fsdp/llama3/parallelize.py' 2026-02-28T02:15:14,386 adding 'torchtitan/experiments/simple_fsdp/tests/__init__.py' 2026-02-28T02:15:14,387 adding 'torchtitan/experiments/simple_fsdp/tests/integration_tests.py' 2026-02-28T02:15:14,389 adding 'torchtitan/experiments/simple_fsdp/tests/test_numerics.py' 2026-02-28T02:15:14,391 adding 'torchtitan/experiments/torchcomms/integration_tests.py' 2026-02-28T02:15:14,393 adding 'torchtitan/experiments/torchcomms/parallel_dims.py' 2026-02-28T02:15:14,394 adding 'torchtitan/experiments/torchcomms/train.py' 2026-02-28T02:15:14,396 adding 'torchtitan/experiments/transformers_modeling_backend/__init__.py' 2026-02-28T02:15:14,397 adding 'torchtitan/experiments/transformers_modeling_backend/job_config.py' 2026-02-28T02:15:14,400 adding 'torchtitan/experiments/transformers_modeling_backend/infra/parallelize.py' 2026-02-28T02:15:14,402 adding 'torchtitan/experiments/transformers_modeling_backend/infra/pipeline.py' 2026-02-28T02:15:14,405 adding 'torchtitan/experiments/transformers_modeling_backend/model/args.py' 2026-02-28T02:15:14,407 adding 'torchtitan/experiments/transformers_modeling_backend/model/model.py' 2026-02-28T02:15:14,409 adding 'torchtitan/experiments/transformers_modeling_backend/tests/integration_tests.py' 2026-02-28T02:15:14,411 adding 'torchtitan/experiments/vlm/__init__.py' 2026-02-28T02:15:14,412 adding 'torchtitan/experiments/vlm/job_config.py' 2026-02-28T02:15:14,414 adding 'torchtitan/experiments/vlm/datasets/mm_collator_nld.py' 2026-02-28T02:15:14,417 adding 'torchtitan/experiments/vlm/datasets/mm_datasets.py' 2026-02-28T02:15:14,419 
adding 'torchtitan/experiments/vlm/datasets/utils/image.py' 2026-02-28T02:15:14,421 adding 'torchtitan/experiments/vlm/datasets/utils/packing.py' 2026-02-28T02:15:14,422 adding 'torchtitan/experiments/vlm/datasets/utils/text.py' 2026-02-28T02:15:14,424 adding 'torchtitan/experiments/vlm/infra/parallelize.py' 2026-02-28T02:15:14,426 adding 'torchtitan/experiments/vlm/model/args.py' 2026-02-28T02:15:14,428 adding 'torchtitan/experiments/vlm/model/model.py' 2026-02-28T02:15:14,430 adding 'torchtitan/experiments/vlm/model/siglip2.py' 2026-02-28T02:15:14,432 adding 'torchtitan/experiments/vlm/tests/integration_tests.py' 2026-02-28T02:15:14,434 adding 'torchtitan/hf_datasets/__init__.py' 2026-02-28T02:15:14,435 adding 'torchtitan/hf_datasets/text_datasets.py' 2026-02-28T02:15:14,437 adding 'torchtitan/models/__init__.py' 2026-02-28T02:15:14,439 adding 'torchtitan/models/attention.py' 2026-02-28T02:15:14,442 adding 'torchtitan/models/utils.py' 2026-02-28T02:15:14,444 adding 'torchtitan/models/deepseek_v3/__init__.py' 2026-02-28T02:15:14,447 adding 'torchtitan/models/deepseek_v3/infra/parallelize.py' 2026-02-28T02:15:14,449 adding 'torchtitan/models/deepseek_v3/model/args.py' 2026-02-28T02:15:14,451 adding 'torchtitan/models/deepseek_v3/model/model.py' 2026-02-28T02:15:14,453 adding 'torchtitan/models/deepseek_v3/model/state_dict_adapter.py' 2026-02-28T02:15:14,455 adding 'torchtitan/models/flux/__init__.py' 2026-02-28T02:15:14,457 adding 'torchtitan/models/flux/flux_datasets.py' 2026-02-28T02:15:14,459 adding 'torchtitan/models/flux/job_config.py' 2026-02-28T02:15:14,460 adding 'torchtitan/models/flux/tokenizer.py' 2026-02-28T02:15:14,462 adding 'torchtitan/models/flux/train.py' 2026-02-28T02:15:14,464 adding 'torchtitan/models/flux/utils.py' 2026-02-28T02:15:14,466 adding 'torchtitan/models/flux/validate.py' 2026-02-28T02:15:14,468 adding 'torchtitan/models/flux/inference/infer.py' 2026-02-28T02:15:14,470 adding 'torchtitan/models/flux/inference/sampling.py' 
2026-02-28T02:15:14,472 adding 'torchtitan/models/flux/infra/parallelize.py' 2026-02-28T02:15:14,474 adding 'torchtitan/models/flux/model/args.py' 2026-02-28T02:15:14,476 adding 'torchtitan/models/flux/model/autoencoder.py' 2026-02-28T02:15:14,477 adding 'torchtitan/models/flux/model/hf_embedder.py' 2026-02-28T02:15:14,479 adding 'torchtitan/models/flux/model/layers.py' 2026-02-28T02:15:14,481 adding 'torchtitan/models/flux/model/model.py' 2026-02-28T02:15:14,483 adding 'torchtitan/models/flux/model/state_dict_adapter.py' 2026-02-28T02:15:14,485 adding 'torchtitan/models/gpt_oss/__init__.py' 2026-02-28T02:15:14,487 adding 'torchtitan/models/gpt_oss/infra/expert_parallel.py' 2026-02-28T02:15:14,489 adding 'torchtitan/models/gpt_oss/infra/parallelize.py' 2026-02-28T02:15:14,491 adding 'torchtitan/models/gpt_oss/model/args.py' 2026-02-28T02:15:14,493 adding 'torchtitan/models/gpt_oss/model/model.py' 2026-02-28T02:15:14,495 adding 'torchtitan/models/gpt_oss/model/moe.py' 2026-02-28T02:15:14,496 adding 'torchtitan/models/gpt_oss/model/state_dict_adapter.py' 2026-02-28T02:15:14,498 adding 'torchtitan/models/llama3/__init__.py' 2026-02-28T02:15:14,500 adding 'torchtitan/models/llama3/infra/parallelize.py' 2026-02-28T02:15:14,502 adding 'torchtitan/models/llama3/model/args.py' 2026-02-28T02:15:14,505 adding 'torchtitan/models/llama3/model/model.py' 2026-02-28T02:15:14,507 adding 'torchtitan/models/llama3/model/state_dict_adapter.py' 2026-02-28T02:15:14,509 adding 'torchtitan/models/llama3_ft/__init__.py' 2026-02-28T02:15:14,510 adding 'torchtitan/models/llama4/__init__.py' 2026-02-28T02:15:14,514 adding 'torchtitan/models/llama4/infra/parallelize.py' 2026-02-28T02:15:14,516 adding 'torchtitan/models/llama4/model/args.py' 2026-02-28T02:15:14,519 adding 'torchtitan/models/llama4/model/model.py' 2026-02-28T02:15:14,521 adding 'torchtitan/models/llama4/model/state_dict_adapter.py' 2026-02-28T02:15:14,522 adding 'torchtitan/models/moe/__init__.py' 2026-02-28T02:15:14,524 adding 
'torchtitan/models/moe/kernels.py' 2026-02-28T02:15:14,527 adding 'torchtitan/models/moe/moe.py' 2026-02-28T02:15:14,529 adding 'torchtitan/models/moe/moe_deepep.py' 2026-02-28T02:15:14,530 adding 'torchtitan/models/moe/utils.py' 2026-02-28T02:15:14,532 adding 'torchtitan/models/qwen3/__init__.py' 2026-02-28T02:15:14,535 adding 'torchtitan/models/qwen3/infra/parallelize.py' 2026-02-28T02:15:14,537 adding 'torchtitan/models/qwen3/model/args.py' 2026-02-28T02:15:14,539 adding 'torchtitan/models/qwen3/model/model.py' 2026-02-28T02:15:14,541 adding 'torchtitan/models/qwen3/model/state_dict_adapter.py' 2026-02-28T02:15:14,543 adding 'torchtitan/protocols/__init__.py' 2026-02-28T02:15:14,545 adding 'torchtitan/protocols/model.py' 2026-02-28T02:15:14,546 adding 'torchtitan/protocols/model_converter.py' 2026-02-28T02:15:14,547 adding 'torchtitan/protocols/state_dict_adapter.py' 2026-02-28T02:15:14,549 adding 'torchtitan/protocols/train_spec.py' 2026-02-28T02:15:14,550 adding 'torchtitan/tools/logging.py' 2026-02-28T02:15:14,552 adding 'torchtitan/tools/profiling.py' 2026-02-28T02:15:14,554 adding 'torchtitan/tools/utils.py' 2026-02-28T02:15:14,556 adding 'torchtitan-0.2.2.dist-info/licenses/LICENSE' 2026-02-28T02:15:14,558 adding 'torchtitan-0.2.2.dist-info/METADATA' 2026-02-28T02:15:14,560 adding 'torchtitan-0.2.2.dist-info/WHEEL' 2026-02-28T02:15:14,561 adding 'torchtitan-0.2.2.dist-info/top_level.txt' 2026-02-28T02:15:14,564 adding 'torchtitan-0.2.2.dist-info/RECORD' 2026-02-28T02:15:14,572 removing build/bdist.linux-armv7l/wheel 2026-02-28T02:15:14,748 Building wheel for torchtitan (pyproject.toml): finished with status 'done' 2026-02-28T02:15:14,761 Created wheel for torchtitan: filename=torchtitan-0.2.2-py3-none-any.whl size=450309 sha256=5a5638149689b3aa418f318dd014ec7c401b3365a10ac784f8e0450cabeab736 2026-02-28T02:15:14,762 Stored in directory: /tmp/pip-ephem-wheel-cache-kxdj5waf/wheels/35/70/be/b2c1d1801ac27665cc6f4492a41024345fb782f37fd989a4a0 
2026-02-28T02:15:14,781 Successfully built torchtitan 2026-02-28T02:15:14,796 Removed build tracker: '/tmp/pip-build-tracker-ftj7x1td'