2024-02-16T20:07:22,605 Created temporary directory: /tmp/pip-build-tracker-z1rlixsm
2024-02-16T20:07:22,607 Initialized build tracking at /tmp/pip-build-tracker-z1rlixsm
2024-02-16T20:07:22,607 Created build tracker: /tmp/pip-build-tracker-z1rlixsm
2024-02-16T20:07:22,608 Entered build tracker: /tmp/pip-build-tracker-z1rlixsm
2024-02-16T20:07:22,608 Created temporary directory: /tmp/pip-wheel-ih4h2bgn
2024-02-16T20:07:22,611 Created temporary directory: /tmp/pip-ephem-wheel-cache-k2ls_49j
2024-02-16T20:07:22,633 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2024-02-16T20:07:22,637 2 location(s) to search for versions of nanotron:
2024-02-16T20:07:22,637 * https://pypi.org/simple/nanotron/
2024-02-16T20:07:22,637 * https://www.piwheels.org/simple/nanotron/
2024-02-16T20:07:22,637 Fetching project page and analyzing links: https://pypi.org/simple/nanotron/
2024-02-16T20:07:22,638 Getting page https://pypi.org/simple/nanotron/
2024-02-16T20:07:22,640 Found index url https://pypi.org/simple/
2024-02-16T20:07:22,859 Fetched page https://pypi.org/simple/nanotron/ as application/vnd.pypi.simple.v1+json
2024-02-16T20:07:22,861 Skipping link: No binaries permitted for nanotron: https://files.pythonhosted.org/packages/b4/c6/dfc412e8d4fd4acf19bcfac20753f2512562ee25873661bd338543595357/nanotron-0.1.0-py3-none-any.whl (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10)
2024-02-16T20:07:22,862 Found link https://files.pythonhosted.org/packages/94/1c/185c251ce8c00d25f8dcf2fc9ff70153a6279524b8d87e58f5ce8b3a9e4f/nanotron-0.1.0.tar.gz (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10), version: 0.1.0
2024-02-16T20:07:22,863 Skipping link: No binaries permitted for nanotron: https://files.pythonhosted.org/packages/fd/d2/ed1fb1b103af9167ef2fd295c6373a27f4a279a2ccb102101ef86cefd023/nanotron-0.2-py3-none-any.whl (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10)
2024-02-16T20:07:22,864 Found link https://files.pythonhosted.org/packages/9a/0f/5cd64e3cd26251f737848559bff24584c2a53487d73247ecc704ac4ac7fc/nanotron-0.2.tar.gz (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10), version: 0.2
2024-02-16T20:07:22,864 Fetching project page and analyzing links: https://www.piwheels.org/simple/nanotron/
2024-02-16T20:07:22,865 Getting page https://www.piwheels.org/simple/nanotron/
2024-02-16T20:07:22,866 Found index url https://www.piwheels.org/simple/
2024-02-16T20:07:23,016 Fetched page https://www.piwheels.org/simple/nanotron/ as text/html
2024-02-16T20:07:23,017 Skipping link: No binaries permitted for nanotron: https://www.piwheels.org/simple/nanotron/nanotron-0.1.0-py3-none-any.whl#sha256=52e73f190d4974eb202a4859449ea2751370aa15b1d92506eb2ff70df23be083 (from https://www.piwheels.org/simple/nanotron/) (requires-python:~=3.10)
2024-02-16T20:07:23,018 Skipping link: not a file: https://www.piwheels.org/simple/nanotron/
2024-02-16T20:07:23,019 Skipping link: not a file: https://pypi.org/simple/nanotron/
2024-02-16T20:07:23,039 Given no hashes to check 1 links for project 'nanotron': discarding no candidates
2024-02-16T20:07:23,058 Collecting nanotron==0.2
2024-02-16T20:07:23,060 Created temporary directory: /tmp/pip-unpack-xyu5775s
2024-02-16T20:07:23,277 Downloading nanotron-0.2.tar.gz (160 kB)
2024-02-16T20:07:23,543 Added nanotron==0.2 from https://files.pythonhosted.org/packages/9a/0f/5cd64e3cd26251f737848559bff24584c2a53487d73247ecc704ac4ac7fc/nanotron-0.2.tar.gz to build tracker '/tmp/pip-build-tracker-z1rlixsm'
2024-02-16T20:07:23,548 Created temporary directory: /tmp/pip-build-env-52lc6_fh
2024-02-16T20:07:23,552 Installing build dependencies: started
2024-02-16T20:07:23,554 Running command pip subprocess to install build dependencies
2024-02-16T20:07:24,775 Using pip 23.3.1 from /home/piwheels/.local/lib/python3.11/site-packages/pip (python 3.11)
2024-02-16T20:07:25,338 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2024-02-16T20:07:26,749 Collecting setuptools
2024-02-16T20:07:26,763 Using cached https://www.piwheels.org/simple/setuptools/setuptools-69.1.0-py3-none-any.whl (819 kB)
2024-02-16T20:07:29,484 Installing collected packages: setuptools
2024-02-16T20:07:31,732 Successfully installed setuptools-69.1.0
2024-02-16T20:07:31,995 [notice] A new release of pip is available: 23.3.1 -> 24.0
2024-02-16T20:07:31,995 [notice] To update, run: python3 -m pip install --upgrade pip
2024-02-16T20:07:32,260 Installing build dependencies: finished with status 'done'
2024-02-16T20:07:32,271 Getting requirements to build wheel: started
2024-02-16T20:07:32,273 Running command Getting requirements to build wheel
2024-02-16T20:07:33,013 running egg_info
2024-02-16T20:07:33,017 writing src/nanotron.egg-info/PKG-INFO
2024-02-16T20:07:33,022 writing dependency_links to src/nanotron.egg-info/dependency_links.txt
2024-02-16T20:07:33,025 writing requirements to src/nanotron.egg-info/requires.txt
2024-02-16T20:07:33,026 writing top-level names to src/nanotron.egg-info/top_level.txt
2024-02-16T20:07:33,053 reading manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:33,057 adding license file 'LICENSE'
2024-02-16T20:07:33,062 writing manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:33,174 Getting requirements to build wheel: finished with status 'done'
2024-02-16T20:07:33,181 Installing backend dependencies: started
2024-02-16T20:07:33,182 Running command pip subprocess to install backend dependencies
2024-02-16T20:07:34,323 Using pip 23.3.1 from /home/piwheels/.local/lib/python3.11/site-packages/pip (python 3.11)
2024-02-16T20:07:34,860 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2024-02-16T20:07:35,330 Collecting wheel
2024-02-16T20:07:35,344 Using cached https://www.piwheels.org/simple/wheel/wheel-0.42.0-py3-none-any.whl (65 kB)
2024-02-16T20:07:37,685 Installing collected packages: wheel
2024-02-16T20:07:37,901 Creating /tmp/pip-build-env-52lc6_fh/normal/local/bin
2024-02-16T20:07:37,903 changing mode of /tmp/pip-build-env-52lc6_fh/normal/local/bin/wheel to 755
2024-02-16T20:07:37,914 Successfully installed wheel-0.42.0
2024-02-16T20:07:38,175 [notice] A new release of pip is available: 23.3.1 -> 24.0
2024-02-16T20:07:38,176 [notice] To update, run: python3 -m pip install --upgrade pip
2024-02-16T20:07:38,406 Installing backend dependencies: finished with status 'done'
2024-02-16T20:07:38,407 Created temporary directory: /tmp/pip-modern-metadata-fwly58k1
2024-02-16T20:07:38,410 Preparing metadata (pyproject.toml): started
2024-02-16T20:07:38,411 Running command Preparing metadata (pyproject.toml)
2024-02-16T20:07:39,187 running dist_info
2024-02-16T20:07:39,192 creating /tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info
2024-02-16T20:07:39,195 writing /tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/PKG-INFO
2024-02-16T20:07:39,201 writing dependency_links to /tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/dependency_links.txt
2024-02-16T20:07:39,204 writing requirements to /tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/requires.txt
2024-02-16T20:07:39,205 writing top-level names to /tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/top_level.txt
2024-02-16T20:07:39,207 writing manifest file '/tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:39,243 reading manifest file '/tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:39,245 adding license file 'LICENSE'
2024-02-16T20:07:39,249 writing manifest file '/tmp/pip-modern-metadata-fwly58k1/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:39,250 creating '/tmp/pip-modern-metadata-fwly58k1/nanotron-0.2.dist-info'
2024-02-16T20:07:39,400 Preparing metadata (pyproject.toml): finished with status 'done'
2024-02-16T20:07:39,417 Source in /tmp/pip-wheel-ih4h2bgn/nanotron_61369439d1e742f29e1e8de57472bfcd has version 0.2, which satisfies requirement nanotron==0.2 from https://files.pythonhosted.org/packages/9a/0f/5cd64e3cd26251f737848559bff24584c2a53487d73247ecc704ac4ac7fc/nanotron-0.2.tar.gz
2024-02-16T20:07:39,421 Removed nanotron==0.2 from https://files.pythonhosted.org/packages/9a/0f/5cd64e3cd26251f737848559bff24584c2a53487d73247ecc704ac4ac7fc/nanotron-0.2.tar.gz from build tracker '/tmp/pip-build-tracker-z1rlixsm'
2024-02-16T20:07:39,435 Created temporary directory: /tmp/pip-unpack-hj2qo25o
2024-02-16T20:07:39,436 Created temporary directory: /tmp/pip-unpack-pvwqm7z2
2024-02-16T20:07:39,459 Building wheels for collected packages: nanotron
2024-02-16T20:07:39,463 Created temporary directory: /tmp/pip-wheel-boyavm1g
2024-02-16T20:07:39,464 Destination directory: /tmp/pip-wheel-boyavm1g
2024-02-16T20:07:39,466 Building wheel for nanotron (pyproject.toml): started
2024-02-16T20:07:39,467 Running command Building wheel for nanotron (pyproject.toml)
2024-02-16T20:07:40,204 running bdist_wheel
2024-02-16T20:07:40,220 running build
2024-02-16T20:07:40,220 running build_py
2024-02-16T20:07:40,225 creating build
2024-02-16T20:07:40,226 creating build/lib
2024-02-16T20:07:40,226 creating build/lib/nanotron
2024-02-16T20:07:40,227 copying src/nanotron/logging.py -> build/lib/nanotron
2024-02-16T20:07:40,230 copying src/nanotron/helpers.py -> build/lib/nanotron
2024-02-16T20:07:40,232 copying src/nanotron/__init__.py -> build/lib/nanotron
2024-02-16T20:07:40,234 copying src/nanotron/trainer.py -> build/lib/nanotron
2024-02-16T20:07:40,236 copying src/nanotron/random.py -> build/lib/nanotron
2024-02-16T20:07:40,238 copying src/nanotron/utils.py -> build/lib/nanotron
2024-02-16T20:07:40,240 copying src/nanotron/distributed.py -> build/lib/nanotron
2024-02-16T20:07:40,243 copying src/nanotron/constants.py -> build/lib/nanotron
2024-02-16T20:07:40,244 copying src/nanotron/sanity_checks.py -> build/lib/nanotron
2024-02-16T20:07:40,246 copying src/nanotron/dataloader.py -> build/lib/nanotron
2024-02-16T20:07:40,250 creating build/lib/nanotron/parallel
2024-02-16T20:07:40,250 copying src/nanotron/parallel/__init__.py -> build/lib/nanotron/parallel
2024-02-16T20:07:40,252 copying src/nanotron/parallel/utils.py -> build/lib/nanotron/parallel
2024-02-16T20:07:40,254 copying src/nanotron/parallel/sharded_parameters.py -> build/lib/nanotron/parallel
2024-02-16T20:07:40,256 copying src/nanotron/parallel/context.py -> build/lib/nanotron/parallel
2024-02-16T20:07:40,258 copying src/nanotron/parallel/parameters.py -> build/lib/nanotron/parallel
2024-02-16T20:07:40,260 copying src/nanotron/parallel/tied_parameters.py -> build/lib/nanotron/parallel
2024-02-16T20:07:40,263 creating build/lib/nanotron/generation
2024-02-16T20:07:40,264 copying src/nanotron/generation/__init__.py -> build/lib/nanotron/generation
2024-02-16T20:07:40,266 copying src/nanotron/generation/generate_store.py -> build/lib/nanotron/generation
2024-02-16T20:07:40,267 copying src/nanotron/generation/sampler.py -> build/lib/nanotron/generation
2024-02-16T20:07:40,270 copying src/nanotron/generation/decode.py -> build/lib/nanotron/generation
2024-02-16T20:07:40,273 creating build/lib/nanotron/serialize
2024-02-16T20:07:40,274 copying src/nanotron/serialize/optimizer.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,276 copying src/nanotron/serialize/__init__.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,278 copying src/nanotron/serialize/random.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,280 copying src/nanotron/serialize/utils.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,281 copying src/nanotron/serialize/metadata.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,283 copying src/nanotron/serialize/main.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,286 copying src/nanotron/serialize/weights.py -> build/lib/nanotron/serialize
2024-02-16T20:07:40,289 creating build/lib/nanotron/optim
2024-02-16T20:07:40,290 copying src/nanotron/optim/clip_grads.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,292 copying src/nanotron/optim/__init__.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,293 copying src/nanotron/optim/inherit_from_other_optimizer.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,295 copying src/nanotron/optim/optimizer_from_gradient_accumulator.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,297 copying src/nanotron/optim/gradient_accumulator.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,299 copying src/nanotron/optim/named_optimizer.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,301 copying src/nanotron/optim/zero.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,303 copying src/nanotron/optim/base.py -> build/lib/nanotron/optim
2024-02-16T20:07:40,305 creating build/lib/nanotron/models
2024-02-16T20:07:40,306 copying src/nanotron/models/__init__.py -> build/lib/nanotron/models
2024-02-16T20:07:40,308 copying src/nanotron/models/llama.py -> build/lib/nanotron/models
2024-02-16T20:07:40,311 copying src/nanotron/models/starcoder2.py -> build/lib/nanotron/models
2024-02-16T20:07:40,314 copying src/nanotron/models/base.py -> build/lib/nanotron/models
2024-02-16T20:07:40,317 creating build/lib/nanotron/fp8
2024-02-16T20:07:40,318 copying src/nanotron/fp8/__init__.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,320 copying src/nanotron/fp8/tensor.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,322 copying src/nanotron/fp8/utils.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,323 copying src/nanotron/fp8/dtypes.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,325 copying src/nanotron/fp8/linear.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,327 copying src/nanotron/fp8/parameter.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,328 copying src/nanotron/fp8/kernel.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,330 copying src/nanotron/fp8/constants.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,331 copying src/nanotron/fp8/meta.py -> build/lib/nanotron/fp8
2024-02-16T20:07:40,334 creating build/lib/nanotron/config
2024-02-16T20:07:40,334 copying src/nanotron/config/__init__.py -> build/lib/nanotron/config
2024-02-16T20:07:40,336 copying src/nanotron/config/lighteval_config.py -> build/lib/nanotron/config
2024-02-16T20:07:40,338 copying src/nanotron/config/config.py -> build/lib/nanotron/config
2024-02-16T20:07:40,340 copying src/nanotron/config/parallelism_config.py -> build/lib/nanotron/config
2024-02-16T20:07:40,342 copying src/nanotron/config/models_config.py -> build/lib/nanotron/config
2024-02-16T20:07:40,344 copying src/nanotron/config/utils_config.py -> build/lib/nanotron/config
2024-02-16T20:07:40,346 creating build/lib/nanotron/nn
2024-02-16T20:07:40,347 copying src/nanotron/nn/__init__.py -> build/lib/nanotron/nn
2024-02-16T20:07:40,348 copying src/nanotron/nn/layer_norm.py -> build/lib/nanotron/nn
2024-02-16T20:07:40,350 copying src/nanotron/nn/activations.py -> build/lib/nanotron/nn
2024-02-16T20:07:40,353 creating build/lib/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,354 copying src/nanotron/parallel/tensor_parallel/nn.py -> build/lib/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,356 copying src/nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py -> build/lib/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,358 copying src/nanotron/parallel/tensor_parallel/__init__.py -> build/lib/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,360 copying src/nanotron/parallel/tensor_parallel/enum.py -> build/lib/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,361 copying src/nanotron/parallel/tensor_parallel/functional.py -> build/lib/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,364 creating build/lib/nanotron/parallel/data_parallel
2024-02-16T20:07:40,365 copying src/nanotron/parallel/data_parallel/utils.py -> build/lib/nanotron/parallel/data_parallel
2024-02-16T20:07:40,368 creating build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,369 copying src/nanotron/parallel/pipeline_parallel/p2p.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,371 copying src/nanotron/parallel/pipeline_parallel/utils.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,373 copying src/nanotron/parallel/pipeline_parallel/block.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,375 copying src/nanotron/parallel/pipeline_parallel/context_manager.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,377 copying src/nanotron/parallel/pipeline_parallel/state.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,379 copying src/nanotron/parallel/pipeline_parallel/tensor_pointer.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,380 copying src/nanotron/parallel/pipeline_parallel/engine.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,383 copying src/nanotron/parallel/pipeline_parallel/functional.py -> build/lib/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,385 running egg_info
2024-02-16T20:07:40,388 writing src/nanotron.egg-info/PKG-INFO
2024-02-16T20:07:40,393 writing dependency_links to src/nanotron.egg-info/dependency_links.txt
2024-02-16T20:07:40,395 writing requirements to src/nanotron.egg-info/requires.txt
2024-02-16T20:07:40,396 writing top-level names to src/nanotron.egg-info/top_level.txt
2024-02-16T20:07:40,415 reading manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:40,418 adding license file 'LICENSE'
2024-02-16T20:07:40,423 writing manifest file 'src/nanotron.egg-info/SOURCES.txt'
2024-02-16T20:07:40,442 installing to build/bdist.linux-armv7l/wheel
2024-02-16T20:07:40,442 running install
2024-02-16T20:07:40,466 running install_lib
2024-02-16T20:07:40,470 creating build/bdist.linux-armv7l
2024-02-16T20:07:40,471 creating build/bdist.linux-armv7l/wheel
2024-02-16T20:07:40,472 creating build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,473 copying build/lib/nanotron/logging.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,476 copying build/lib/nanotron/helpers.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,478 copying build/lib/nanotron/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,480 copying build/lib/nanotron/trainer.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,483 copying build/lib/nanotron/random.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,485 copying build/lib/nanotron/utils.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,487 copying build/lib/nanotron/distributed.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,490 creating build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,491 copying build/lib/nanotron/parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,493 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,494 copying build/lib/nanotron/parallel/tensor_parallel/nn.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,496 copying build/lib/nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,499 copying build/lib/nanotron/parallel/tensor_parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,500 copying build/lib/nanotron/parallel/tensor_parallel/enum.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,502 copying build/lib/nanotron/parallel/tensor_parallel/functional.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel
2024-02-16T20:07:40,505 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/data_parallel
2024-02-16T20:07:40,506 copying build/lib/nanotron/parallel/data_parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/data_parallel
2024-02-16T20:07:40,508 copying build/lib/nanotron/parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,510 copying build/lib/nanotron/parallel/sharded_parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,512 copying build/lib/nanotron/parallel/context.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,514 copying build/lib/nanotron/parallel/parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,517 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,517 copying build/lib/nanotron/parallel/pipeline_parallel/p2p.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,520 copying build/lib/nanotron/parallel/pipeline_parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,522 copying build/lib/nanotron/parallel/pipeline_parallel/block.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,524 copying build/lib/nanotron/parallel/pipeline_parallel/context_manager.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,525 copying build/lib/nanotron/parallel/pipeline_parallel/state.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,528 copying build/lib/nanotron/parallel/pipeline_parallel/tensor_pointer.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,529 copying build/lib/nanotron/parallel/pipeline_parallel/engine.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,531 copying build/lib/nanotron/parallel/pipeline_parallel/functional.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel
2024-02-16T20:07:40,533 copying build/lib/nanotron/parallel/tied_parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel
2024-02-16T20:07:40,536 creating build/bdist.linux-armv7l/wheel/nanotron/generation
2024-02-16T20:07:40,537 copying build/lib/nanotron/generation/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-02-16T20:07:40,539 copying build/lib/nanotron/generation/generate_store.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-02-16T20:07:40,541 copying build/lib/nanotron/generation/sampler.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-02-16T20:07:40,543 copying build/lib/nanotron/generation/decode.py -> build/bdist.linux-armv7l/wheel/nanotron/generation
2024-02-16T20:07:40,546 creating build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,547 copying build/lib/nanotron/serialize/optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,550 copying build/lib/nanotron/serialize/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,552 copying build/lib/nanotron/serialize/random.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,553 copying build/lib/nanotron/serialize/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,555 copying build/lib/nanotron/serialize/metadata.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,557 copying build/lib/nanotron/serialize/main.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,560 copying build/lib/nanotron/serialize/weights.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize
2024-02-16T20:07:40,562 copying build/lib/nanotron/constants.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,564 creating build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,565 copying build/lib/nanotron/optim/clip_grads.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,567 copying build/lib/nanotron/optim/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,569 copying build/lib/nanotron/optim/inherit_from_other_optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,571 copying build/lib/nanotron/optim/optimizer_from_gradient_accumulator.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,573 copying build/lib/nanotron/optim/gradient_accumulator.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,575 copying build/lib/nanotron/optim/named_optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,577 copying build/lib/nanotron/optim/zero.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,579 copying build/lib/nanotron/optim/base.py -> build/bdist.linux-armv7l/wheel/nanotron/optim
2024-02-16T20:07:40,582 creating build/bdist.linux-armv7l/wheel/nanotron/models
2024-02-16T20:07:40,582 copying build/lib/nanotron/models/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-02-16T20:07:40,584 copying build/lib/nanotron/models/llama.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-02-16T20:07:40,587 copying build/lib/nanotron/models/starcoder2.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-02-16T20:07:40,591 copying build/lib/nanotron/models/base.py -> build/bdist.linux-armv7l/wheel/nanotron/models
2024-02-16T20:07:40,593 copying build/lib/nanotron/sanity_checks.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,595 creating build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,596 copying build/lib/nanotron/fp8/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,598 copying build/lib/nanotron/fp8/tensor.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,600 copying build/lib/nanotron/fp8/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,601 copying build/lib/nanotron/fp8/dtypes.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,603 copying build/lib/nanotron/fp8/linear.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,605 copying build/lib/nanotron/fp8/parameter.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,607 copying build/lib/nanotron/fp8/kernel.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,609 copying build/lib/nanotron/fp8/constants.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,610 copying build/lib/nanotron/fp8/meta.py -> build/bdist.linux-armv7l/wheel/nanotron/fp8
2024-02-16T20:07:40,612 copying build/lib/nanotron/dataloader.py -> build/bdist.linux-armv7l/wheel/nanotron
2024-02-16T20:07:40,615 creating build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,616 copying build/lib/nanotron/config/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,618 copying build/lib/nanotron/config/lighteval_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,620 copying build/lib/nanotron/config/config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,622 copying build/lib/nanotron/config/parallelism_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,624 copying build/lib/nanotron/config/models_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,626 copying build/lib/nanotron/config/utils_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config
2024-02-16T20:07:40,628 creating build/bdist.linux-armv7l/wheel/nanotron/nn
2024-02-16T20:07:40,629 copying build/lib/nanotron/nn/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/nn
2024-02-16T20:07:40,631 copying build/lib/nanotron/nn/layer_norm.py -> build/bdist.linux-armv7l/wheel/nanotron/nn
2024-02-16T20:07:40,632 copying build/lib/nanotron/nn/activations.py -> build/bdist.linux-armv7l/wheel/nanotron/nn
2024-02-16T20:07:40,634 running install_egg_info
2024-02-16T20:07:40,638 Copying src/nanotron.egg-info to build/bdist.linux-armv7l/wheel/nanotron-0.2-py3.11.egg-info
2024-02-16T20:07:40,647 running install_scripts
2024-02-16T20:07:40,663 creating build/bdist.linux-armv7l/wheel/nanotron-0.2.dist-info/WHEEL
2024-02-16T20:07:40,665 creating '/tmp/pip-wheel-boyavm1g/.tmp-pzflgv68/nanotron-0.2-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it
2024-02-16T20:07:40,667 adding 'nanotron/__init__.py'
2024-02-16T20:07:40,669 adding 'nanotron/constants.py'
2024-02-16T20:07:40,672 adding 'nanotron/dataloader.py'
2024-02-16T20:07:40,674 adding 'nanotron/distributed.py'
2024-02-16T20:07:40,677 adding 'nanotron/helpers.py'
2024-02-16T20:07:40,679 adding 'nanotron/logging.py'
2024-02-16T20:07:40,681 adding 'nanotron/random.py'
2024-02-16T20:07:40,682 adding 'nanotron/sanity_checks.py'
2024-02-16T20:07:40,686 adding 'nanotron/trainer.py'
2024-02-16T20:07:40,688 adding 'nanotron/utils.py'
2024-02-16T20:07:40,690 adding 'nanotron/config/__init__.py'
2024-02-16T20:07:40,692 adding 'nanotron/config/config.py'
2024-02-16T20:07:40,693 adding 'nanotron/config/lighteval_config.py'
2024-02-16T20:07:40,695 adding 'nanotron/config/models_config.py'
2024-02-16T20:07:40,696 adding 'nanotron/config/parallelism_config.py'
2024-02-16T20:07:40,697 adding 'nanotron/config/utils_config.py'
2024-02-16T20:07:40,699 adding 'nanotron/fp8/__init__.py'
2024-02-16T20:07:40,700 adding 'nanotron/fp8/constants.py'
2024-02-16T20:07:40,701 adding 'nanotron/fp8/dtypes.py'
2024-02-16T20:07:40,703 adding 'nanotron/fp8/kernel.py'
2024-02-16T20:07:40,704 adding 'nanotron/fp8/linear.py'
2024-02-16T20:07:40,705 adding 'nanotron/fp8/meta.py'
2024-02-16T20:07:40,707 adding 'nanotron/fp8/parameter.py'
2024-02-16T20:07:40,708 adding 'nanotron/fp8/tensor.py'
2024-02-16T20:07:40,709 adding 'nanotron/fp8/utils.py'
2024-02-16T20:07:40,711 adding 'nanotron/generation/__init__.py'
2024-02-16T20:07:40,714 adding 'nanotron/generation/decode.py'
2024-02-16T20:07:40,716 adding 'nanotron/generation/generate_store.py'
2024-02-16T20:07:40,718 adding 'nanotron/generation/sampler.py'
2024-02-16T20:07:40,719 adding 'nanotron/models/__init__.py'
2024-02-16T20:07:40,721 adding 'nanotron/models/base.py'
2024-02-16T20:07:40,727 adding 'nanotron/models/llama.py'
2024-02-16T20:07:40,735 adding 'nanotron/models/starcoder2.py'
2024-02-16T20:07:40,737 adding 'nanotron/nn/__init__.py'
2024-02-16T20:07:40,739 adding 'nanotron/nn/activations.py'
2024-02-16T20:07:40,740 adding 'nanotron/nn/layer_norm.py'
2024-02-16T20:07:40,742 adding 'nanotron/optim/__init__.py'
2024-02-16T20:07:40,743 adding 'nanotron/optim/base.py'
2024-02-16T20:07:40,744 adding 'nanotron/optim/clip_grads.py'
2024-02-16T20:07:40,747 adding 'nanotron/optim/gradient_accumulator.py'
2024-02-16T20:07:40,748 adding 'nanotron/optim/inherit_from_other_optimizer.py'
2024-02-16T20:07:40,750 adding 'nanotron/optim/named_optimizer.py'
2024-02-16T20:07:40,751 adding 'nanotron/optim/optimizer_from_gradient_accumulator.py'
2024-02-16T20:07:40,754 adding 'nanotron/optim/zero.py'
2024-02-16T20:07:40,756 adding 'nanotron/parallel/__init__.py'
2024-02-16T20:07:40,757 adding 'nanotron/parallel/context.py'
2024-02-16T20:07:40,759 adding 'nanotron/parallel/parameters.py'
2024-02-16T20:07:40,760 adding 'nanotron/parallel/sharded_parameters.py'
2024-02-16T20:07:40,762 adding 'nanotron/parallel/tied_parameters.py'
2024-02-16T20:07:40,763 adding 'nanotron/parallel/utils.py'
2024-02-16T20:07:40,765 adding 'nanotron/parallel/data_parallel/utils.py'
2024-02-16T20:07:40,767 adding 'nanotron/parallel/pipeline_parallel/block.py'
2024-02-16T20:07:40,769 adding 'nanotron/parallel/pipeline_parallel/context_manager.py'
2024-02-16T20:07:40,770 adding 'nanotron/parallel/pipeline_parallel/engine.py'
2024-02-16T20:07:40,772 adding 'nanotron/parallel/pipeline_parallel/functional.py'
2024-02-16T20:07:40,775 adding 'nanotron/parallel/pipeline_parallel/p2p.py'
2024-02-16T20:07:40,777 adding 'nanotron/parallel/pipeline_parallel/state.py'
2024-02-16T20:07:40,778 adding 'nanotron/parallel/pipeline_parallel/tensor_pointer.py'
2024-02-16T20:07:40,779 adding 'nanotron/parallel/pipeline_parallel/utils.py'
2024-02-16T20:07:40,781 adding 'nanotron/parallel/tensor_parallel/__init__.py'
2024-02-16T20:07:40,782 adding 'nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py'
2024-02-16T20:07:40,784 adding 'nanotron/parallel/tensor_parallel/enum.py'
2024-02-16T20:07:40,786 adding 'nanotron/parallel/tensor_parallel/functional.py'
2024-02-16T20:07:40,787 adding 'nanotron/parallel/tensor_parallel/nn.py'
2024-02-16T20:07:40,789 adding 'nanotron/serialize/__init__.py'
2024-02-16T20:07:40,791 adding 'nanotron/serialize/main.py'
2024-02-16T20:07:40,792 adding 'nanotron/serialize/metadata.py'
2024-02-16T20:07:40,795 adding 'nanotron/serialize/optimizer.py'
2024-02-16T20:07:40,796 adding 'nanotron/serialize/random.py'
2024-02-16T20:07:40,797 adding 'nanotron/serialize/utils.py'
2024-02-16T20:07:40,799 adding 'nanotron/serialize/weights.py'
2024-02-16T20:07:40,802 adding 'nanotron-0.2.dist-info/LICENSE'
2024-02-16T20:07:40,803 adding 'nanotron-0.2.dist-info/METADATA'
2024-02-16T20:07:40,804 adding 'nanotron-0.2.dist-info/WHEEL'
2024-02-16T20:07:40,805 adding 'nanotron-0.2.dist-info/top_level.txt'
2024-02-16T20:07:40,807 adding 'nanotron-0.2.dist-info/RECORD'
2024-02-16T20:07:40,810 removing build/bdist.linux-armv7l/wheel
2024-02-16T20:07:40,948 Building wheel for nanotron (pyproject.toml): finished with status 'done'
2024-02-16T20:07:40,956 Created wheel for nanotron: filename=nanotron-0.2-py3-none-any.whl size=163894 sha256=55fbcfdcb4d7a8dba025e21531a8aeeba206080140d6e28b0eb4571013fb30e5
2024-02-16T20:07:40,957 Stored in directory: /tmp/pip-ephem-wheel-cache-k2ls_49j/wheels/10/c8/d2/7a941319fd9bc301b3b16ce4154a49f0e26c0b774acb8e9582
2024-02-16T20:07:40,970 Successfully built nanotron
2024-02-16T20:07:40,976 Removed build tracker: '/tmp/pip-build-tracker-z1rlixsm'