2024-03-05T10:54:49,485 Created temporary directory: /tmp/pip-build-tracker-jo7odftx
2024-03-05T10:54:49,486 Initialized build tracking at /tmp/pip-build-tracker-jo7odftx
2024-03-05T10:54:49,487 Created build tracker: /tmp/pip-build-tracker-jo7odftx
2024-03-05T10:54:49,487 Entered build tracker: /tmp/pip-build-tracker-jo7odftx
2024-03-05T10:54:49,488 Created temporary directory: /tmp/pip-wheel-42sf7mt9
2024-03-05T10:54:49,491 Created temporary directory: /tmp/pip-ephem-wheel-cache-td6a07na
2024-03-05T10:54:49,513 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2024-03-05T10:54:49,516 2 location(s) to search for versions of nanotron:
2024-03-05T10:54:49,516 * https://pypi.org/simple/nanotron/
2024-03-05T10:54:49,516 * https://www.piwheels.org/simple/nanotron/
2024-03-05T10:54:49,517 Fetching project page and analyzing links: https://pypi.org/simple/nanotron/
2024-03-05T10:54:49,518 Getting page https://pypi.org/simple/nanotron/
2024-03-05T10:54:49,519 Found index url https://pypi.org/simple/
2024-03-05T10:54:49,730 Fetched page https://pypi.org/simple/nanotron/ as application/vnd.pypi.simple.v1+json
2024-03-05T10:54:49,732 Skipping link: No binaries permitted for nanotron: https://files.pythonhosted.org/packages/b4/c6/dfc412e8d4fd4acf19bcfac20753f2512562ee25873661bd338543595357/nanotron-0.1.0-py3-none-any.whl (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10)
2024-03-05T10:54:49,733 Found link https://files.pythonhosted.org/packages/94/1c/185c251ce8c00d25f8dcf2fc9ff70153a6279524b8d87e58f5ce8b3a9e4f/nanotron-0.1.0.tar.gz (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10), version: 0.1.0
2024-03-05T10:54:49,734 Skipping link: No binaries permitted for nanotron: https://files.pythonhosted.org/packages/fd/d2/ed1fb1b103af9167ef2fd295c6373a27f4a279a2ccb102101ef86cefd023/nanotron-0.2-py3-none-any.whl (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10)
2024-03-05T10:54:49,735 Found link https://files.pythonhosted.org/packages/9a/0f/5cd64e3cd26251f737848559bff24584c2a53487d73247ecc704ac4ac7fc/nanotron-0.2.tar.gz (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10), version: 0.2
2024-03-05T10:54:49,736 Skipping link: No binaries permitted for nanotron: https://files.pythonhosted.org/packages/48/54/895f2bb2121ff5dd8dddc68491ccb67840d34cd45c30b1f7a5887cdbc311/nanotron-0.4-py3-none-any.whl (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10)
2024-03-05T10:54:49,737 Found link https://files.pythonhosted.org/packages/4c/25/07e627d9432d503f58af6e1eda61e1c7d2a1da9ef107d92f04919004c142/nanotron-0.4.tar.gz (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10), version: 0.4
2024-03-05T10:54:49,737 Fetching project page and analyzing links: https://www.piwheels.org/simple/nanotron/
2024-03-05T10:54:49,738 Getting page https://www.piwheels.org/simple/nanotron/
2024-03-05T10:54:49,740 Found index url https://www.piwheels.org/simple/
2024-03-05T10:54:50,072 Fetched page https://www.piwheels.org/simple/nanotron/ as text/html
2024-03-05T10:54:50,074 Skipping link: No binaries permitted for nanotron: https://www.piwheels.org/simple/nanotron/nanotron-0.2-py3-none-any.whl#sha256=55fbcfdcb4d7a8dba025e21531a8aeeba206080140d6e28b0eb4571013fb30e5 (from https://www.piwheels.org/simple/nanotron/) (requires-python:~=3.10)
2024-03-05T10:54:50,074 Skipping link: No binaries permitted for nanotron: https://www.piwheels.org/simple/nanotron/nanotron-0.1.0-py3-none-any.whl#sha256=52e73f190d4974eb202a4859449ea2751370aa15b1d92506eb2ff70df23be083 (from https://www.piwheels.org/simple/nanotron/) (requires-python:~=3.10)
2024-03-05T10:54:50,075 Skipping link: not a file: https://www.piwheels.org/simple/nanotron/
2024-03-05T10:54:50,076 Skipping link: not a file: https://pypi.org/simple/nanotron/
2024-03-05T10:54:50,096 Given no hashes to check 1 links for project 'nanotron': discarding no candidates
2024-03-05T10:54:50,115 Collecting nanotron==0.4
2024-03-05T10:54:50,117 Created temporary directory: /tmp/pip-unpack-cfxdtvq2
2024-03-05T10:54:50,324 Downloading nanotron-0.4.tar.gz (160 kB)
2024-03-05T10:54:50,581 Added nanotron==0.4 from https://files.pythonhosted.org/packages/4c/25/07e627d9432d503f58af6e1eda61e1c7d2a1da9ef107d92f04919004c142/nanotron-0.4.tar.gz to build tracker '/tmp/pip-build-tracker-jo7odftx'
2024-03-05T10:54:50,586 Created temporary directory: /tmp/pip-build-env-cz973lpq
2024-03-05T10:54:50,590 Installing build dependencies: started
2024-03-05T10:54:50,592 Running command pip subprocess to install build dependencies
2024-03-05T10:54:51,755 Using pip 23.3.1 from /home/piwheels/.local/lib/python3.11/site-packages/pip (python 3.11)
2024-03-05T10:54:52,290 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2024-03-05T10:54:54,247 Collecting setuptools
2024-03-05T10:54:55,379 Using cached https://www.piwheels.org/simple/setuptools/setuptools-69.1.1-py3-none-any.whl (819 kB)
2024-03-05T10:54:58,145 Installing collected packages: setuptools
2024-03-05T10:55:00,484 Successfully installed setuptools-69.1.1
2024-03-05T10:55:00,747 [notice] A new release of pip is available: 23.3.1 -> 24.0
2024-03-05T10:55:00,748 [notice] To update, run: python3 -m pip install --upgrade pip
2024-03-05T10:55:01,017 Installing build dependencies: finished with status 'done'
2024-03-05T10:55:01,027 Getting requirements to build wheel: started
2024-03-05T10:55:01,029 Running command Getting requirements to build wheel
2024-03-05T10:55:01,775 running egg_info
2024-03-05T10:55:01,779 writing src/nanotron.egg-info/PKG-INFO
2024-03-05T10:55:01,784 writing dependency_links to src/nanotron.egg-info/dependency_links.txt
2024-03-05T10:55:01,787 writing requirements to src/nanotron.egg-info/requires.txt
2024-03-05T10:55:01,788 writing top-level names to src/nanotron.egg-info/top_level.txt
2024-03-05T10:55:01,816 reading manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:01,820 adding license file 'LICENSE'
2024-03-05T10:55:01,825 writing manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:01,936 Getting requirements to build wheel: finished with status 'done'
2024-03-05T10:55:01,944 Installing backend dependencies: started
2024-03-05T10:55:01,945 Running command pip subprocess to install backend dependencies
2024-03-05T10:55:03,091 Using pip 23.3.1 from /home/piwheels/.local/lib/python3.11/site-packages/pip (python 3.11)
2024-03-05T10:55:03,631 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2024-03-05T10:55:04,111 Collecting wheel
2024-03-05T10:55:04,126 Using cached https://www.piwheels.org/simple/wheel/wheel-0.42.0-py3-none-any.whl (65 kB)
2024-03-05T10:55:06,473 Installing collected packages: wheel
2024-03-05T10:55:06,694 Creating /tmp/pip-build-env-cz973lpq/normal/local/bin
2024-03-05T10:55:06,696 changing mode of /tmp/pip-build-env-cz973lpq/normal/local/bin/wheel to 755
2024-03-05T10:55:06,707 Successfully installed wheel-0.42.0
2024-03-05T10:55:06,972 [notice] A new release of pip is available: 23.3.1 -> 24.0
2024-03-05T10:55:06,972 [notice] To update, run: python3 -m pip install --upgrade pip
2024-03-05T10:55:07,204 Installing backend dependencies: finished with status 'done'
2024-03-05T10:55:07,206 Created temporary directory: /tmp/pip-modern-metadata-lqeejz60
2024-03-05T10:55:07,208 Preparing metadata (pyproject.toml): started
2024-03-05T10:55:07,209 Running command Preparing metadata (pyproject.toml)
2024-03-05T10:55:08,010 running dist_info
2024-03-05T10:55:08,015 creating /tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info
2024-03-05T10:55:08,019 writing /tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/PKG-INFO
2024-03-05T10:55:08,024 writing dependency_links to /tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/dependency_links.txt
2024-03-05T10:55:08,027 writing requirements to /tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/requires.txt
2024-03-05T10:55:08,028 writing top-level names to /tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/top_level.txt
2024-03-05T10:55:08,029 writing manifest file '/tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:08,065 reading manifest file '/tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:08,067 adding license file 'LICENSE'
2024-03-05T10:55:08,071 writing manifest file '/tmp/pip-modern-metadata-lqeejz60/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:08,072 creating '/tmp/pip-modern-metadata-lqeejz60/nanotron-0.4.dist-info'
2024-03-05T10:55:08,244 Preparing metadata (pyproject.toml): finished with status 'done'
2024-03-05T10:55:08,253 Source in /tmp/pip-wheel-42sf7mt9/nanotron_05abf143b5bd40658e1e2cc6e41eb473 has version 0.4, which satisfies requirement nanotron==0.4 from https://files.pythonhosted.org/packages/4c/25/07e627d9432d503f58af6e1eda61e1c7d2a1da9ef107d92f04919004c142/nanotron-0.4.tar.gz
2024-03-05T10:55:08,255 Removed nanotron==0.4 from https://files.pythonhosted.org/packages/4c/25/07e627d9432d503f58af6e1eda61e1c7d2a1da9ef107d92f04919004c142/nanotron-0.4.tar.gz from build tracker '/tmp/pip-build-tracker-jo7odftx'
2024-03-05T10:55:08,263 Created temporary directory: /tmp/pip-unpack-y6i_e9ee
2024-03-05T10:55:08,266 Created temporary directory: /tmp/pip-unpack-8qert6t5
2024-03-05T10:55:08,290 Building wheels for collected packages: nanotron
2024-03-05T10:55:08,294 Created temporary directory: /tmp/pip-wheel-lvwkb5l5
2024-03-05T10:55:08,295 Destination directory: /tmp/pip-wheel-lvwkb5l5
2024-03-05T10:55:08,297 Building wheel for nanotron (pyproject.toml): started
2024-03-05T10:55:08,298 Running command Building wheel for nanotron (pyproject.toml)
2024-03-05T10:55:09,038 running bdist_wheel
2024-03-05T10:55:09,054 running build
2024-03-05T10:55:09,054 running build_py
2024-03-05T10:55:09,059 creating build
2024-03-05T10:55:09,059 creating build/lib
2024-03-05T10:55:09,060 creating build/lib/nanotron
2024-03-05T10:55:09,061 copying src/nanotron/logging.py -> build/lib/nanotron
2024-03-05T10:55:09,064 copying src/nanotron/helpers.py -> build/lib/nanotron
2024-03-05T10:55:09,066 copying src/nanotron/__init__.py -> build/lib/nanotron
2024-03-05T10:55:09,068 copying src/nanotron/trainer.py -> build/lib/nanotron
2024-03-05T10:55:09,070 copying src/nanotron/random.py -> build/lib/nanotron
2024-03-05T10:55:09,072 copying src/nanotron/utils.py -> build/lib/nanotron
2024-03-05T10:55:09,075 copying src/nanotron/distributed.py -> build/lib/nanotron
2024-03-05T10:55:09,077 copying src/nanotron/constants.py -> build/lib/nanotron
2024-03-05T10:55:09,078 copying src/nanotron/sanity_checks.py -> build/lib/nanotron
2024-03-05T10:55:09,080 copying src/nanotron/dataloader.py -> build/lib/nanotron
2024-03-05T10:55:09,083 creating build/lib/nanotron/parallel
2024-03-05T10:55:09,084 copying src/nanotron/parallel/__init__.py -> build/lib/nanotron/parallel
2024-03-05T10:55:09,086 copying src/nanotron/parallel/utils.py -> build/lib/nanotron/parallel
2024-03-05T10:55:09,088 copying src/nanotron/parallel/sharded_parameters.py -> build/lib/nanotron/parallel
2024-03-05T10:55:09,090 copying src/nanotron/parallel/context.py -> build/lib/nanotron/parallel
2024-03-05T10:55:09,092 copying src/nanotron/parallel/parameters.py -> build/lib/nanotron/parallel
2024-03-05T10:55:09,094 copying src/nanotron/parallel/tied_parameters.py -> build/lib/nanotron/parallel
2024-03-05T10:55:09,097 creating build/lib/nanotron/generation
2024-03-05T10:55:09,098 copying src/nanotron/generation/__init__.py -> build/lib/nanotron/generation
2024-03-05T10:55:09,100 copying src/nanotron/generation/generate_store.py -> build/lib/nanotron/generation
2024-03-05T10:55:09,101 copying src/nanotron/generation/sampler.py -> build/lib/nanotron/generation
2024-03-05T10:55:09,104 copying src/nanotron/generation/decode.py -> build/lib/nanotron/generation
2024-03-05T10:55:09,107 creating build/lib/nanotron/serialize
2024-03-05T10:55:09,108 copying src/nanotron/serialize/optimizer.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,111 copying src/nanotron/serialize/__init__.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,112 copying src/nanotron/serialize/random.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,114 copying src/nanotron/serialize/utils.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,116 copying src/nanotron/serialize/metadata.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,118 copying src/nanotron/serialize/main.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,120 copying src/nanotron/serialize/weights.py -> build/lib/nanotron/serialize
2024-03-05T10:55:09,123 creating build/lib/nanotron/optim
2024-03-05T10:55:09,123 copying src/nanotron/optim/clip_grads.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,125 copying src/nanotron/optim/__init__.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,127 copying src/nanotron/optim/inherit_from_other_optimizer.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,129 copying src/nanotron/optim/optimizer_from_gradient_accumulator.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,130 copying src/nanotron/optim/gradient_accumulator.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,133 copying src/nanotron/optim/named_optimizer.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,134 copying src/nanotron/optim/zero.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,137 copying src/nanotron/optim/base.py -> build/lib/nanotron/optim
2024-03-05T10:55:09,139 creating build/lib/nanotron/models
2024-03-05T10:55:09,140 copying src/nanotron/models/__init__.py -> build/lib/nanotron/models
2024-03-05T10:55:09,142 copying src/nanotron/models/llama.py -> build/lib/nanotron/models
2024-03-05T10:55:09,145 copying src/nanotron/models/starcoder2.py -> build/lib/nanotron/models
2024-03-05T10:55:09,148 copying src/nanotron/models/base.py -> build/lib/nanotron/models
2024-03-05T10:55:09,151 creating build/lib/nanotron/fp8
2024-03-05T10:55:09,152 copying src/nanotron/fp8/__init__.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,153 copying src/nanotron/fp8/tensor.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,156 copying src/nanotron/fp8/utils.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,157 copying src/nanotron/fp8/dtypes.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,159 copying src/nanotron/fp8/linear.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,161 copying src/nanotron/fp8/parameter.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,163 copying src/nanotron/fp8/kernel.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,165 copying src/nanotron/fp8/constants.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,167 copying src/nanotron/fp8/meta.py -> build/lib/nanotron/fp8
2024-03-05T10:55:09,170 creating build/lib/nanotron/config
2024-03-05T10:55:09,171 copying src/nanotron/config/__init__.py -> build/lib/nanotron/config
2024-03-05T10:55:09,172 copying src/nanotron/config/lighteval_config.py -> build/lib/nanotron/config
2024-03-05T10:55:09,174 copying src/nanotron/config/config.py -> build/lib/nanotron/config
2024-03-05T10:55:09,176 copying src/nanotron/config/parallelism_config.py -> build/lib/nanotron/config
2024-03-05T10:55:09,178 copying src/nanotron/config/models_config.py -> build/lib/nanotron/config
2024-03-05T10:55:09,180 copying src/nanotron/config/utils_config.py -> build/lib/nanotron/config
2024-03-05T10:55:09,183 creating build/lib/nanotron/nn
2024-03-05T10:55:09,184 copying src/nanotron/nn/__init__.py -> build/lib/nanotron/nn
2024-03-05T10:55:09,185 copying src/nanotron/nn/layer_norm.py -> build/lib/nanotron/nn
2024-03-05T10:55:09,187 copying src/nanotron/nn/activations.py -> build/lib/nanotron/nn
2024-03-05T10:55:09,190 creating build/lib/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,191 copying src/nanotron/parallel/tensor_parallel/nn.py -> build/lib/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,193 copying src/nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py -> build/lib/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,195 copying src/nanotron/parallel/tensor_parallel/__init__.py -> build/lib/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,197 copying src/nanotron/parallel/tensor_parallel/enum.py -> build/lib/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,198 copying src/nanotron/parallel/tensor_parallel/functional.py -> build/lib/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,202 creating build/lib/nanotron/parallel/data_parallel
2024-03-05T10:55:09,203 copying src/nanotron/parallel/data_parallel/utils.py -> build/lib/nanotron/parallel/data_parallel
2024-03-05T10:55:09,206 creating build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,207 copying src/nanotron/parallel/pipeline_parallel/__init__.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,209 copying src/nanotron/parallel/pipeline_parallel/p2p.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,212 copying src/nanotron/parallel/pipeline_parallel/utils.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,214 copying src/nanotron/parallel/pipeline_parallel/block.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,216 copying src/nanotron/parallel/pipeline_parallel/context_manager.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,217 copying src/nanotron/parallel/pipeline_parallel/state.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,220 copying src/nanotron/parallel/pipeline_parallel/tensor_pointer.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,221 copying src/nanotron/parallel/pipeline_parallel/engine.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,223 copying src/nanotron/parallel/pipeline_parallel/functional.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,225 running egg_info
2024-03-05T10:55:09,229 writing src/nanotron.egg-info/PKG-INFO
2024-03-05T10:55:09,234 writing dependency_links to src/nanotron.egg-info/dependency_links.txt
2024-03-05T10:55:09,237 writing requirements to src/nanotron.egg-info/requires.txt
2024-03-05T10:55:09,238 writing top-level names to src/nanotron.egg-info/top_level.txt
2024-03-05T10:55:09,258 reading manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:09,261 adding license file 'LICENSE'
2024-03-05T10:55:09,266 writing manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-03-05T10:55:09,285 installing to build/bdist.linux-armv7l/wheel
2024-03-05T10:55:09,286 running install
2024-03-05T10:55:09,309 running install_lib
2024-03-05T10:55:09,313 creating build/bdist.linux-armv7l
2024-03-05T10:55:09,314 creating build/bdist.linux-armv7l/wheel
2024-03-05T10:55:09,316 creating build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,317 copying build/lib/nanotron/logging.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,319 copying build/lib/nanotron/helpers.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,322 copying build/lib/nanotron/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,324 copying build/lib/nanotron/trainer.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,327 copying build/lib/nanotron/random.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,329 copying build/lib/nanotron/utils.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,331 copying build/lib/nanotron/distributed.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,334 creating build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,335 copying build/lib/nanotron/parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,338 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,339 copying build/lib/nanotron/parallel/tensor_parallel/nn.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,341 copying build/lib/nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,343 copying build/lib/nanotron/parallel/tensor_parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,345 copying build/lib/nanotron/parallel/tensor_parallel/enum.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,346 copying build/lib/nanotron/parallel/tensor_parallel/functional.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-03-05T10:55:09,350 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/data_parallel
2024-03-05T10:55:09,351 copying build/lib/nanotron/parallel/data_parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/data_parallel
2024-03-05T10:55:09,353 copying build/lib/nanotron/parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,354 copying build/lib/nanotron/parallel/sharded_parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,357 copying build/lib/nanotron/parallel/context.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,359 copying build/lib/nanotron/parallel/parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,361 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,362 copying build/lib/nanotron/parallel/pipeline_parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,364 copying build/lib/nanotron/parallel/pipeline_parallel/p2p.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,367 copying build/lib/nanotron/parallel/pipeline_parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,369 copying build/lib/nanotron/parallel/pipeline_parallel/block.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,371 copying build/lib/nanotron/parallel/pipeline_parallel/context_manager.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,372 copying build/lib/nanotron/parallel/pipeline_parallel/state.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,375 copying build/lib/nanotron/parallel/pipeline_parallel/tensor_pointer.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,376 copying build/lib/nanotron/parallel/pipeline_parallel/engine.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,379 copying build/lib/nanotron/parallel/pipeline_parallel/functional.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-03-05T10:55:09,381 copying build/lib/nanotron/parallel/tied_parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-03-05T10:55:09,384 creating build/bdist.linux-armv7l/wheel/nanotron/generation
2024-03-05T10:55:09,385 copying build/lib/nanotron/generation/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-03-05T10:55:09,387 copying build/lib/nanotron/generation/generate_store.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-03-05T10:55:09,388 copying build/lib/nanotron/generation/sampler.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-03-05T10:55:09,391 copying build/lib/nanotron/generation/decode.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-03-05T10:55:09,394 creating build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,395 copying build/lib/nanotron/serialize/optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,398 copying build/lib/nanotron/serialize/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,400 copying build/lib/nanotron/serialize/random.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,402 copying build/lib/nanotron/serialize/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,404 copying build/lib/nanotron/serialize/metadata.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,406 copying build/lib/nanotron/serialize/main.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,408 copying build/lib/nanotron/serialize/weights.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-03-05T10:55:09,410 copying build/lib/nanotron/constants.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,412 creating build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,413 copying build/lib/nanotron/optim/clip_grads.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,415 copying build/lib/nanotron/optim/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,417 copying build/lib/nanotron/optim/inherit_from_other_optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,419 copying build/lib/nanotron/optim/optimizer_from_gradient_accumulator.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,421 copying build/lib/nanotron/optim/gradient_accumulator.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,423 copying build/lib/nanotron/optim/named_optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,425 copying build/lib/nanotron/optim/zero.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,428 copying build/lib/nanotron/optim/base.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-03-05T10:55:09,430 creating build/bdist.linux-armv7l/wheel/nanotron/models
2024-03-05T10:55:09,431 copying build/lib/nanotron/models/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-03-05T10:55:09,433 copying build/lib/nanotron/models/llama.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-03-05T10:55:09,436 copying build/lib/nanotron/models/starcoder2.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-03-05T10:55:09,440 copying build/lib/nanotron/models/base.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-03-05T10:55:09,442 copying build/lib/nanotron/sanity_checks.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,445 creating build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,446 copying build/lib/nanotron/fp8/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,448 copying build/lib/nanotron/fp8/tensor.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,450 copying build/lib/nanotron/fp8/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,452 copying build/lib/nanotron/fp8/dtypes.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,453 copying build/lib/nanotron/fp8/linear.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,455 copying build/lib/nanotron/fp8/parameter.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,457 copying build/lib/nanotron/fp8/kernel.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,459 copying build/lib/nanotron/fp8/constants.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,461 copying build/lib/nanotron/fp8/meta.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-03-05T10:55:09,462 copying build/lib/nanotron/dataloader.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-03-05T10:55:09,465 creating build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,466 copying build/lib/nanotron/config/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,468 copying build/lib/nanotron/config/lighteval_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,470 copying build/lib/nanotron/config/config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,477 copying build/lib/nanotron/config/parallelism_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,479 copying build/lib/nanotron/config/models_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,481 copying build/lib/nanotron/config/utils_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-03-05T10:55:09,483 creating build/bdist.linux-armv7l/wheel/nanotron/nn
2024-03-05T10:55:09,484 copying build/lib/nanotron/nn/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/nn
2024-03-05T10:55:09,485 copying build/lib/nanotron/nn/layer_norm.py -> build/bdist.linux-armv7l/wheel/nanotron/nn
2024-03-05T10:55:09,487 copying build/lib/nanotron/nn/activations.py -> build/bdist.linux-armv7l/wheel/nanotron/nn
2024-03-05T10:55:09,489 running install_egg_info
2024-03-05T10:55:09,493 Copying src/nanotron.egg-info to build/bdist.linux-armv7l/wheel/nanotron-0.4-py3.11.egg-info
2024-03-05T10:55:09,502 running install_scripts
2024-03-05T10:55:09,522 creating build/bdist.linux-armv7l/wheel/nanotron-0.4.dist-info/WHEEL
2024-03-05T10:55:09,524 creating '/tmp/pip-wheel-lvwkb5l5/.tmp-l65908s4/nanotron-0.4-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it
2024-03-05T10:55:09,526 adding 'nanotron/__init__.py'
2024-03-05T10:55:09,527 adding 'nanotron/constants.py'
2024-03-05T10:55:09,530 adding 'nanotron/dataloader.py'
2024-03-05T10:55:09,532 adding 'nanotron/distributed.py'
2024-03-05T10:55:09,535 adding 'nanotron/helpers.py'
2024-03-05T10:55:09,537 adding 'nanotron/logging.py'
2024-03-05T10:55:09,539 adding 'nanotron/random.py'
2024-03-05T10:55:09,541 adding 'nanotron/sanity_checks.py'
2024-03-05T10:55:09,545 adding 'nanotron/trainer.py'
2024-03-05T10:55:09,547 adding 'nanotron/utils.py'
2024-03-05T10:55:09,548 adding 'nanotron/config/__init__.py'
2024-03-05T10:55:09,550 adding 'nanotron/config/config.py'
2024-03-05T10:55:09,552 adding 'nanotron/config/lighteval_config.py'
2024-03-05T10:55:09,553 adding 'nanotron/config/models_config.py'
2024-03-05T10:55:09,555 adding 'nanotron/config/parallelism_config.py'
2024-03-05T10:55:09,556 adding 'nanotron/config/utils_config.py'
2024-03-05T10:55:09,558 adding 'nanotron/fp8/__init__.py'
2024-03-05T10:55:09,559 adding 'nanotron/fp8/constants.py'
2024-03-05T10:55:09,560 adding 'nanotron/fp8/dtypes.py'
2024-03-05T10:55:09,561 adding 'nanotron/fp8/kernel.py'
2024-03-05T10:55:09,563 adding 'nanotron/fp8/linear.py'
2024-03-05T10:55:09,564 adding 'nanotron/fp8/meta.py'
2024-03-05T10:55:09,565 adding 'nanotron/fp8/parameter.py'
2024-03-05T10:55:09,567 adding 'nanotron/fp8/tensor.py'
2024-03-05T10:55:09,568 adding 'nanotron/fp8/utils.py'
2024-03-05T10:55:09,570 adding 'nanotron/generation/__init__.py'
2024-03-05T10:55:09,573 adding 'nanotron/generation/decode.py'
2024-03-05T10:55:09,574 adding 'nanotron/generation/generate_store.py'
2024-03-05T10:55:09,576 adding 'nanotron/generation/sampler.py'
2024-03-05T10:55:09,578 adding 'nanotron/models/__init__.py'
2024-03-05T10:55:09,580 adding 'nanotron/models/base.py'
2024-03-05T10:55:09,585 adding 'nanotron/models/llama.py'
2024-03-05T10:55:09,593 adding 'nanotron/models/starcoder2.py'
2024-03-05T10:55:09,595 adding 'nanotron/nn/__init__.py'
2024-03-05T10:55:09,596 adding 'nanotron/nn/activations.py'
2024-03-05T10:55:09,598 adding 'nanotron/nn/layer_norm.py'
2024-03-05T10:55:09,600 adding 'nanotron/optim/__init__.py'
2024-03-05T10:55:09,601 adding 'nanotron/optim/base.py'
2024-03-05T10:55:09,602 adding 'nanotron/optim/clip_grads.py'
2024-03-05T10:55:09,605 adding 'nanotron/optim/gradient_accumulator.py'
2024-03-05T10:55:09,606 adding 'nanotron/optim/inherit_from_other_optimizer.py'
2024-03-05T10:55:09,608 adding 'nanotron/optim/named_optimizer.py'
2024-03-05T10:55:09,609 adding 'nanotron/optim/optimizer_from_gradient_accumulator.py'
2024-03-05T10:55:09,612 adding 'nanotron/optim/zero.py'
2024-03-05T10:55:09,614 adding 'nanotron/parallel/__init__.py'
2024-03-05T10:55:09,615 adding 'nanotron/parallel/context.py'
2024-03-05T10:55:09,617 adding 'nanotron/parallel/parameters.py'
2024-03-05T10:55:09,619 adding 'nanotron/parallel/sharded_parameters.py'
2024-03-05T10:55:09,620 adding 'nanotron/parallel/tied_parameters.py'
2024-03-05T10:55:09,622 adding 'nanotron/parallel/utils.py'
2024-03-05T10:55:09,623 adding 'nanotron/parallel/data_parallel/utils.py'
2024-03-05T10:55:09,625 adding 'nanotron/parallel/pipeline_parallel/__init__.py'
2024-03-05T10:55:09,627 adding 'nanotron/parallel/pipeline_parallel/block.py'
2024-03-05T10:55:09,628 adding 'nanotron/parallel/pipeline_parallel/context_manager.py'
2024-03-05T10:55:09,629 adding 'nanotron/parallel/pipeline_parallel/engine.py'
2024-03-05T10:55:09,631 adding 'nanotron/parallel/pipeline_parallel/functional.py'
2024-03-05T10:55:09,633 adding 'nanotron/parallel/pipeline_parallel/p2p.py'
2024-03-05T10:55:09,635 adding 'nanotron/parallel/pipeline_parallel/state.py'
2024-03-05T10:55:09,636 adding 'nanotron/parallel/pipeline_parallel/tensor_pointer.py'
2024-03-05T10:55:09,637 adding 'nanotron/parallel/pipeline_parallel/utils.py'
2024-03-05T10:55:09,639 adding 'nanotron/parallel/tensor_parallel/__init__.py'
2024-03-05T10:55:09,640 adding 'nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py'
2024-03-05T10:55:09,642 adding 'nanotron/parallel/tensor_parallel/enum.py'
2024-03-05T10:55:09,644 adding 'nanotron/parallel/tensor_parallel/functional.py'
2024-03-05T10:55:09,646 adding 'nanotron/parallel/tensor_parallel/nn.py'
2024-03-05T10:55:09,647 adding 'nanotron/serialize/__init__.py'
2024-03-05T10:55:09,649 adding 'nanotron/serialize/main.py'
2024-03-05T10:55:09,650 adding 'nanotron/serialize/metadata.py'
2024-03-05T10:55:09,653 adding 'nanotron/serialize/optimizer.py'
2024-03-05T10:55:09,654 adding 'nanotron/serialize/random.py'
2024-03-05T10:55:09,655 adding 'nanotron/serialize/utils.py'
2024-03-05T10:55:09,657 adding 'nanotron/serialize/weights.py'
2024-03-05T10:55:09,660 adding 'nanotron-0.4.dist-info/LICENSE'
2024-03-05T10:55:09,662 adding 'nanotron-0.4.dist-info/METADATA'
2024-03-05T10:55:09,663 adding 'nanotron-0.4.dist-info/WHEEL'
2024-03-05T10:55:09,664 adding 'nanotron-0.4.dist-info/top_level.txt'
2024-03-05T10:55:09,665 adding 'nanotron-0.4.dist-info/RECORD'
2024-03-05T10:55:09,669 removing build/bdist.linux-armv7l/wheel
2024-03-05T10:55:09,808 Building wheel for nanotron (pyproject.toml): finished with status 'done'
2024-03-05T10:55:09,816 Created wheel for nanotron: filename=nanotron-0.4-py3-none-any.whl size=163614 sha256=bd287b47aaed47544c9ef22067a3b34727025ffe0e129f44ae08a165ab50957f
2024-03-05T10:55:09,817 Stored in directory: /tmp/pip-ephem-wheel-cache-td6a07na/wheels/12/58/0a/e8d49f60ca8f8f6b0c40f39a9300cafa0275c2a2899837da94
2024-03-05T10:55:09,830 Successfully built nanotron
2024-03-05T10:55:09,839 Removed build tracker: '/tmp/pip-build-tracker-jo7odftx'