2024-02-08T10:44:47,915 Created temporary directory: /tmp/pip-build-tracker-fbregr5y 2024-02-08T10:44:47,916 Initialized build tracking at /tmp/pip-build-tracker-fbregr5y 2024-02-08T10:44:47,916 Created build tracker: /tmp/pip-build-tracker-fbregr5y 2024-02-08T10:44:47,917 Entered build tracker: /tmp/pip-build-tracker-fbregr5y 2024-02-08T10:44:47,918 Created temporary directory: /tmp/pip-wheel-qf2ny435 2024-02-08T10:44:47,920 Created temporary directory: /tmp/pip-ephem-wheel-cache-ej6i8vom 2024-02-08T10:44:47,944 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2024-02-08T10:44:47,948 2 location(s) to search for versions of nanotron: 2024-02-08T10:44:47,948 * https://pypi.org/simple/nanotron/ 2024-02-08T10:44:47,948 * https://www.piwheels.org/simple/nanotron/ 2024-02-08T10:44:47,949 Fetching project page and analyzing links: https://pypi.org/simple/nanotron/ 2024-02-08T10:44:47,949 Getting page https://pypi.org/simple/nanotron/ 2024-02-08T10:44:47,951 Found index url https://pypi.org/simple/ 2024-02-08T10:44:48,175 Fetched page https://pypi.org/simple/nanotron/ as application/vnd.pypi.simple.v1+json 2024-02-08T10:44:48,176 Skipping link: No binaries permitted for nanotron: https://files.pythonhosted.org/packages/b4/c6/dfc412e8d4fd4acf19bcfac20753f2512562ee25873661bd338543595357/nanotron-0.1.0-py3-none-any.whl (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10) 2024-02-08T10:44:48,177 Found link https://files.pythonhosted.org/packages/94/1c/185c251ce8c00d25f8dcf2fc9ff70153a6279524b8d87e58f5ce8b3a9e4f/nanotron-0.1.0.tar.gz (from https://pypi.org/simple/nanotron/) (requires-python:~=3.10), version: 0.1.0 2024-02-08T10:44:48,178 Fetching project page and analyzing links: https://www.piwheels.org/simple/nanotron/ 2024-02-08T10:44:48,178 Getting page https://www.piwheels.org/simple/nanotron/ 2024-02-08T10:44:48,180 Found index url https://www.piwheels.org/simple/ 2024-02-08T10:44:48,327 Fetched page https://www.piwheels.org/simple/nanotron/ as text/html 2024-02-08T10:44:48,329 Skipping link: not a file: https://www.piwheels.org/simple/nanotron/ 2024-02-08T10:44:48,329 Skipping link: not a file: https://pypi.org/simple/nanotron/ 2024-02-08T10:44:48,349 Given no hashes to check 1 links for project 'nanotron': discarding no candidates 2024-02-08T10:44:48,368 Collecting nanotron==0.1.0 2024-02-08T10:44:48,370 Created temporary directory: /tmp/pip-unpack-xyvimwhc 2024-02-08T10:44:48,589 Downloading nanotron-0.1.0.tar.gz (154 kB) 2024-02-08T10:44:48,827 Added nanotron==0.1.0 from https://files.pythonhosted.org/packages/94/1c/185c251ce8c00d25f8dcf2fc9ff70153a6279524b8d87e58f5ce8b3a9e4f/nanotron-0.1.0.tar.gz to build tracker '/tmp/pip-build-tracker-fbregr5y' 2024-02-08T10:44:48,831 Created temporary directory: /tmp/pip-build-env-i3vaocrz 2024-02-08T10:44:48,835 Installing build dependencies: started 2024-02-08T10:44:48,837 Running command pip subprocess to install build dependencies 2024-02-08T10:44:49,995 Using pip 23.3.1 from /usr/local/lib/python3.11/dist-packages/pip (python 3.11) 2024-02-08T10:44:50,598 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2024-02-08T10:44:52,041 Collecting setuptools 2024-02-08T10:44:52,057 Using cached https://www.piwheels.org/simple/setuptools/setuptools-69.0.3-py3-none-any.whl (819 kB) 2024-02-08T10:44:55,580 Installing collected packages: setuptools 2024-02-08T10:44:57,803 Successfully installed setuptools-69.0.3 2024-02-08T10:44:58,273 [notice] A new release of pip is available: 23.3.1 -> 24.0 2024-02-08T10:44:58,273 [notice] To update, run: python3 -m pip install --upgrade pip 2024-02-08T10:44:58,536 Installing build dependencies: finished with status 'done' 2024-02-08T10:44:58,547 Getting requirements to build wheel: started 2024-02-08T10:44:58,549 Running command Getting requirements to build wheel 2024-02-08T10:44:59,295 running egg_info 2024-02-08T10:44:59,298 writing src/nanotron.egg-info/PKG-INFO 2024-02-08T10:44:59,303 writing dependency_links to src/nanotron.egg-info/dependency_links.txt 2024-02-08T10:44:59,306 writing requirements to src/nanotron.egg-info/requires.txt 2024-02-08T10:44:59,308 writing top-level names to src/nanotron.egg-info/top_level.txt 2024-02-08T10:44:59,334 reading manifest file 'src/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:44:59,337 adding license file 'LICENSE' 2024-02-08T10:44:59,342 writing manifest file 'src/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:44:59,453 Getting requirements to build wheel: finished with status 'done' 2024-02-08T10:44:59,460 Installing backend dependencies: started 2024-02-08T10:44:59,461 Running command pip subprocess to install backend dependencies 2024-02-08T10:45:00,585 Using pip 23.3.1 from /usr/local/lib/python3.11/dist-packages/pip (python 3.11) 2024-02-08T10:45:01,179 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2024-02-08T10:45:01,670 Collecting wheel 2024-02-08T10:45:01,684 Using cached https://www.piwheels.org/simple/wheel/wheel-0.42.0-py3-none-any.whl (65 kB) 2024-02-08T10:45:04,838 Installing collected packages: wheel 2024-02-08T10:45:05,064 Creating /tmp/pip-build-env-i3vaocrz/normal/local/bin 2024-02-08T10:45:05,066 changing mode of /tmp/pip-build-env-i3vaocrz/normal/local/bin/wheel to 755 2024-02-08T10:45:05,077 Successfully installed wheel-0.42.0 2024-02-08T10:45:05,552 [notice] A new release of pip is available: 23.3.1 -> 24.0 2024-02-08T10:45:05,553 [notice] To update, run: python3 -m pip install --upgrade pip 2024-02-08T10:45:05,782 Installing backend dependencies: finished with status 'done' 2024-02-08T10:45:05,783 Created temporary directory: /tmp/pip-modern-metadata-8ajet7l3 2024-02-08T10:45:05,786 Preparing metadata (pyproject.toml): started 2024-02-08T10:45:05,787 Running command Preparing metadata (pyproject.toml) 2024-02-08T10:45:06,571 running dist_info 2024-02-08T10:45:06,575 creating /tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info 2024-02-08T10:45:06,579 writing /tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/PKG-INFO 2024-02-08T10:45:06,584 writing dependency_links to /tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/dependency_links.txt 2024-02-08T10:45:06,587 writing requirements to /tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/requires.txt 2024-02-08T10:45:06,588 writing top-level names to /tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/top_level.txt 2024-02-08T10:45:06,589 writing manifest file '/tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:45:06,621 reading manifest file '/tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:45:06,623 adding license file 'LICENSE' 2024-02-08T10:45:06,627 writing manifest file '/tmp/pip-modern-metadata-8ajet7l3/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:45:06,628 creating '/tmp/pip-modern-metadata-8ajet7l3/nanotron-0.1.0.dist-info' 2024-02-08T10:45:06,775 Preparing metadata (pyproject.toml): finished with status 'done' 2024-02-08T10:45:06,779 Source in /tmp/pip-wheel-qf2ny435/nanotron_f6524d8cee044ffab51458f261d5b571 has version 0.1.0, which satisfies requirement nanotron==0.1.0 from https://files.pythonhosted.org/packages/94/1c/185c251ce8c00d25f8dcf2fc9ff70153a6279524b8d87e58f5ce8b3a9e4f/nanotron-0.1.0.tar.gz 2024-02-08T10:45:06,780 Removed nanotron==0.1.0 from https://files.pythonhosted.org/packages/94/1c/185c251ce8c00d25f8dcf2fc9ff70153a6279524b8d87e58f5ce8b3a9e4f/nanotron-0.1.0.tar.gz from build tracker '/tmp/pip-build-tracker-fbregr5y' 2024-02-08T10:45:06,786 Created temporary directory: /tmp/pip-unpack-137im157 2024-02-08T10:45:06,787 Created temporary directory: /tmp/pip-unpack-eti9s0d1 2024-02-08T10:45:06,812 Building wheels for collected packages: nanotron 2024-02-08T10:45:06,816 Created temporary directory: /tmp/pip-wheel-7gmx5zrv 2024-02-08T10:45:06,817 Destination directory: /tmp/pip-wheel-7gmx5zrv 2024-02-08T10:45:06,819 Building wheel for nanotron (pyproject.toml): started 2024-02-08T10:45:06,820 Running command Building wheel for nanotron (pyproject.toml) 2024-02-08T10:45:07,561 running bdist_wheel 2024-02-08T10:45:07,577 running build 2024-02-08T10:45:07,578 running build_py 2024-02-08T10:45:07,582 creating build 2024-02-08T10:45:07,583 creating build/lib 2024-02-08T10:45:07,583 creating build/lib/nanotron 2024-02-08T10:45:07,585 copying src/nanotron/trainer.py -> build/lib/nanotron 2024-02-08T10:45:07,587 copying src/nanotron/logging.py -> build/lib/nanotron 2024-02-08T10:45:07,589 copying src/nanotron/helpers.py -> build/lib/nanotron 2024-02-08T10:45:07,592 copying src/nanotron/dataloader.py -> build/lib/nanotron 2024-02-08T10:45:07,594 copying src/nanotron/distributed.py -> build/lib/nanotron 2024-02-08T10:45:07,596 copying src/nanotron/sanity_checks.py -> build/lib/nanotron 2024-02-08T10:45:07,599 copying src/nanotron/__init__.py -> build/lib/nanotron 2024-02-08T10:45:07,600 copying src/nanotron/utils.py -> build/lib/nanotron 2024-02-08T10:45:07,602 copying src/nanotron/constants.py -> build/lib/nanotron 2024-02-08T10:45:07,604 copying src/nanotron/random.py -> build/lib/nanotron 2024-02-08T10:45:07,607 creating build/lib/nanotron/parallel 2024-02-08T10:45:07,607 copying src/nanotron/parallel/sharded_parameters.py -> build/lib/nanotron/parallel 2024-02-08T10:45:07,610 copying src/nanotron/parallel/tied_parameters.py -> build/lib/nanotron/parallel 2024-02-08T10:45:07,612 copying src/nanotron/parallel/__init__.py -> build/lib/nanotron/parallel 2024-02-08T10:45:07,613 copying src/nanotron/parallel/parameters.py -> build/lib/nanotron/parallel 2024-02-08T10:45:07,615 copying src/nanotron/parallel/utils.py -> build/lib/nanotron/parallel 2024-02-08T10:45:07,617 copying src/nanotron/parallel/context.py -> build/lib/nanotron/parallel 2024-02-08T10:45:07,619 creating build/lib/nanotron/serialize 2024-02-08T10:45:07,620 copying src/nanotron/serialize/main.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,623 copying src/nanotron/serialize/weights.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,626 copying src/nanotron/serialize/optimizer.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,628 copying src/nanotron/serialize/metadata.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,630 copying src/nanotron/serialize/__init__.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,631 copying src/nanotron/serialize/utils.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,633 copying src/nanotron/serialize/random.py -> build/lib/nanotron/serialize 2024-02-08T10:45:07,635 creating build/lib/nanotron/nn 2024-02-08T10:45:07,636 copying src/nanotron/nn/layer_norm.py -> build/lib/nanotron/nn 2024-02-08T10:45:07,638 copying src/nanotron/nn/activations.py -> build/lib/nanotron/nn 2024-02-08T10:45:07,640 copying src/nanotron/nn/__init__.py -> build/lib/nanotron/nn 2024-02-08T10:45:07,642 creating build/lib/nanotron/models 2024-02-08T10:45:07,643 copying src/nanotron/models/starcoder2.py -> build/lib/nanotron/models 2024-02-08T10:45:07,647 copying src/nanotron/models/llama.py -> build/lib/nanotron/models 2024-02-08T10:45:07,649 copying src/nanotron/models/base.py -> build/lib/nanotron/models 2024-02-08T10:45:07,652 copying src/nanotron/models/__init__.py -> build/lib/nanotron/models 2024-02-08T10:45:07,654 creating build/lib/nanotron/optim 2024-02-08T10:45:07,655 copying src/nanotron/optim/inherit_from_other_optimizer.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,657 copying src/nanotron/optim/zero.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,660 copying src/nanotron/optim/clip_grads.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,662 copying src/nanotron/optim/named_optimizer.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,664 copying src/nanotron/optim/base.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,667 copying src/nanotron/optim/__init__.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,668 copying src/nanotron/optim/optimizer_from_gradient_accumulator.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,670 copying src/nanotron/optim/gradient_accumulator.py -> build/lib/nanotron/optim 2024-02-08T10:45:07,673 creating build/lib/nanotron/config 2024-02-08T10:45:07,674 copying src/nanotron/config/config.py -> build/lib/nanotron/config 2024-02-08T10:45:07,676 copying src/nanotron/config/parallelism_config.py -> build/lib/nanotron/config 2024-02-08T10:45:07,678 copying src/nanotron/config/utils_config.py -> build/lib/nanotron/config 2024-02-08T10:45:07,680 copying src/nanotron/config/models_config.py -> build/lib/nanotron/config 2024-02-08T10:45:07,681 copying src/nanotron/config/__init__.py -> build/lib/nanotron/config 2024-02-08T10:45:07,683 copying src/nanotron/config/lighteval_config.py -> build/lib/nanotron/config 2024-02-08T10:45:07,685 creating build/lib/nanotron/generation 2024-02-08T10:45:07,686 copying src/nanotron/generation/decode.py -> build/lib/nanotron/generation 2024-02-08T10:45:07,689 copying src/nanotron/generation/__init__.py -> build/lib/nanotron/generation 2024-02-08T10:45:07,691 copying src/nanotron/generation/generate_store.py -> build/lib/nanotron/generation 2024-02-08T10:45:07,693 copying src/nanotron/generation/sampler.py -> build/lib/nanotron/generation 2024-02-08T10:45:07,696 creating build/lib/nanotron/parallel/data_parallel 2024-02-08T10:45:07,697 copying src/nanotron/parallel/data_parallel/utils.py -> build/lib/nanotron/parallel/data_parallel 2024-02-08T10:45:07,700 creating build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,700 copying src/nanotron/parallel/pipeline_parallel/context_manager.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,702 copying src/nanotron/parallel/pipeline_parallel/tensor_pointer.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,704 copying src/nanotron/parallel/pipeline_parallel/state.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,706 copying src/nanotron/parallel/pipeline_parallel/engine.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,708 copying src/nanotron/parallel/pipeline_parallel/functional.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,710 copying src/nanotron/parallel/pipeline_parallel/block.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,712 copying src/nanotron/parallel/pipeline_parallel/utils.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,714 copying src/nanotron/parallel/pipeline_parallel/p2p.py -> build/lib/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,717 creating build/lib/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,718 copying src/nanotron/parallel/tensor_parallel/enum.py -> build/lib/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,720 copying src/nanotron/parallel/tensor_parallel/nn.py -> build/lib/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,722 copying src/nanotron/parallel/tensor_parallel/functional.py -> build/lib/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,725 copying src/nanotron/parallel/tensor_parallel/__init__.py -> build/lib/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,726 copying src/nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py -> build/lib/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,728 running egg_info 2024-02-08T10:45:07,732 writing src/nanotron.egg-info/PKG-INFO 2024-02-08T10:45:07,736 writing dependency_links to src/nanotron.egg-info/dependency_links.txt 2024-02-08T10:45:07,739 writing requirements to src/nanotron.egg-info/requires.txt 2024-02-08T10:45:07,740 writing top-level names to src/nanotron.egg-info/top_level.txt 2024-02-08T10:45:07,756 reading manifest file 'src/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:45:07,759 adding license file 'LICENSE' 2024-02-08T10:45:07,764 writing manifest file 'src/nanotron.egg-info/SOURCES.txt' 2024-02-08T10:45:07,782 installing to build/bdist.linux-armv7l/wheel 2024-02-08T10:45:07,782 running install 2024-02-08T10:45:07,806 running install_lib 2024-02-08T10:45:07,810 creating build/bdist.linux-armv7l 2024-02-08T10:45:07,811 creating build/bdist.linux-armv7l/wheel 2024-02-08T10:45:07,813 creating build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,815 creating build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,816 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/data_parallel 2024-02-08T10:45:07,817 copying build/lib/nanotron/parallel/data_parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/data_parallel 2024-02-08T10:45:07,819 copying build/lib/nanotron/parallel/sharded_parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,821 copying build/lib/nanotron/parallel/tied_parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,824 copying build/lib/nanotron/parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,826 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,826 copying build/lib/nanotron/parallel/pipeline_parallel/context_manager.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,828 copying build/lib/nanotron/parallel/pipeline_parallel/tensor_pointer.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,830 copying build/lib/nanotron/parallel/pipeline_parallel/state.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,832 copying build/lib/nanotron/parallel/pipeline_parallel/engine.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,834 copying build/lib/nanotron/parallel/pipeline_parallel/functional.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,836 copying build/lib/nanotron/parallel/pipeline_parallel/block.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,839 copying build/lib/nanotron/parallel/pipeline_parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,841 copying build/lib/nanotron/parallel/pipeline_parallel/p2p.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/pipeline_parallel 2024-02-08T10:45:07,844 copying build/lib/nanotron/parallel/parameters.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,847 copying build/lib/nanotron/parallel/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,849 copying build/lib/nanotron/parallel/context.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel 2024-02-08T10:45:07,851 creating build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,852 copying build/lib/nanotron/parallel/tensor_parallel/enum.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,854 copying build/lib/nanotron/parallel/tensor_parallel/nn.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,856 copying build/lib/nanotron/parallel/tensor_parallel/functional.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,859 copying build/lib/nanotron/parallel/tensor_parallel/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,860 copying build/lib/nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py -> build/bdist.linux-armv7l/wheel/nanotron/parallel/tensor_parallel 2024-02-08T10:45:07,863 creating build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,864 copying build/lib/nanotron/serialize/main.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,867 copying build/lib/nanotron/serialize/weights.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,870 copying build/lib/nanotron/serialize/optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,873 copying build/lib/nanotron/serialize/metadata.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,875 copying build/lib/nanotron/serialize/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,877 copying build/lib/nanotron/serialize/utils.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,879 copying build/lib/nanotron/serialize/random.py -> build/bdist.linux-armv7l/wheel/nanotron/serialize 2024-02-08T10:45:07,880 creating build/bdist.linux-armv7l/wheel/nanotron/nn 2024-02-08T10:45:07,881 copying build/lib/nanotron/nn/layer_norm.py -> build/bdist.linux-armv7l/wheel/nanotron/nn 2024-02-08T10:45:07,883 copying build/lib/nanotron/nn/activations.py -> build/bdist.linux-armv7l/wheel/nanotron/nn 2024-02-08T10:45:07,885 copying build/lib/nanotron/nn/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/nn 2024-02-08T10:45:07,887 copying build/lib/nanotron/trainer.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,889 copying build/lib/nanotron/logging.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,891 copying build/lib/nanotron/helpers.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,894 copying build/lib/nanotron/dataloader.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,898 copying build/lib/nanotron/distributed.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,901 copying build/lib/nanotron/sanity_checks.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,905 creating build/bdist.linux-armv7l/wheel/nanotron/models 2024-02-08T10:45:07,906 copying build/lib/nanotron/models/starcoder2.py -> build/bdist.linux-armv7l/wheel/nanotron/models 2024-02-08T10:45:07,909 copying build/lib/nanotron/models/llama.py -> build/bdist.linux-armv7l/wheel/nanotron/models 2024-02-08T10:45:07,913 copying build/lib/nanotron/models/base.py -> build/bdist.linux-armv7l/wheel/nanotron/models 2024-02-08T10:45:07,915 copying build/lib/nanotron/models/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/models 2024-02-08T10:45:07,917 copying build/lib/nanotron/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,919 creating build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,920 copying build/lib/nanotron/optim/inherit_from_other_optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,922 copying build/lib/nanotron/optim/zero.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,925 copying build/lib/nanotron/optim/clip_grads.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,927 copying build/lib/nanotron/optim/named_optimizer.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,929 copying build/lib/nanotron/optim/base.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,931 copying build/lib/nanotron/optim/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,932 copying build/lib/nanotron/optim/optimizer_from_gradient_accumulator.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,934 copying build/lib/nanotron/optim/gradient_accumulator.py -> build/bdist.linux-armv7l/wheel/nanotron/optim 2024-02-08T10:45:07,937 copying build/lib/nanotron/utils.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,939 creating build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,940 copying build/lib/nanotron/config/config.py -> build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,943 copying build/lib/nanotron/config/parallelism_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,944 copying build/lib/nanotron/config/utils_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,946 copying build/lib/nanotron/config/models_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,948 copying build/lib/nanotron/config/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,949 copying build/lib/nanotron/config/lighteval_config.py -> build/bdist.linux-armv7l/wheel/nanotron/config 2024-02-08T10:45:07,952 copying build/lib/nanotron/constants.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,953 copying build/lib/nanotron/random.py -> build/bdist.linux-armv7l/wheel/nanotron 2024-02-08T10:45:07,955 creating build/bdist.linux-armv7l/wheel/nanotron/generation 2024-02-08T10:45:07,956 copying build/lib/nanotron/generation/decode.py -> build/bdist.linux-armv7l/wheel/nanotron/generation 2024-02-08T10:45:07,959 copying build/lib/nanotron/generation/__init__.py -> build/bdist.linux-armv7l/wheel/nanotron/generation 2024-02-08T10:45:07,961 copying build/lib/nanotron/generation/generate_store.py -> build/bdist.linux-armv7l/wheel/nanotron/generation 2024-02-08T10:45:07,963 copying build/lib/nanotron/generation/sampler.py -> build/bdist.linux-armv7l/wheel/nanotron/generation 2024-02-08T10:45:07,965 running install_egg_info 2024-02-08T10:45:07,968 Copying src/nanotron.egg-info to build/bdist.linux-armv7l/wheel/nanotron-0.1.0-py3.11.egg-info 2024-02-08T10:45:07,978 running install_scripts 2024-02-08T10:45:07,995 creating build/bdist.linux-armv7l/wheel/nanotron-0.1.0.dist-info/WHEEL 2024-02-08T10:45:07,997 creating '/tmp/pip-wheel-7gmx5zrv/.tmp-wu4iw2qx/nanotron-0.1.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2024-02-08T10:45:07,999 adding 'nanotron/__init__.py' 2024-02-08T10:45:08,001 adding 'nanotron/constants.py' 2024-02-08T10:45:08,003 adding 'nanotron/dataloader.py' 2024-02-08T10:45:08,005 adding 'nanotron/distributed.py' 2024-02-08T10:45:08,008 adding 'nanotron/helpers.py' 2024-02-08T10:45:08,010 adding 'nanotron/logging.py' 2024-02-08T10:45:08,012 adding 'nanotron/random.py' 2024-02-08T10:45:08,013 adding 'nanotron/sanity_checks.py' 2024-02-08T10:45:08,018 adding 'nanotron/trainer.py' 2024-02-08T10:45:08,020 adding 'nanotron/utils.py' 2024-02-08T10:45:08,021 adding 'nanotron/config/__init__.py' 2024-02-08T10:45:08,023 adding 'nanotron/config/config.py' 2024-02-08T10:45:08,025 adding 'nanotron/config/lighteval_config.py' 2024-02-08T10:45:08,026 adding 'nanotron/config/models_config.py' 2024-02-08T10:45:08,027 adding 'nanotron/config/parallelism_config.py' 2024-02-08T10:45:08,029 adding 'nanotron/config/utils_config.py' 2024-02-08T10:45:08,031 adding 'nanotron/generation/__init__.py' 2024-02-08T10:45:08,035 adding 'nanotron/generation/decode.py' 2024-02-08T10:45:08,037 adding 'nanotron/generation/generate_store.py' 2024-02-08T10:45:08,039 adding 'nanotron/generation/sampler.py' 2024-02-08T10:45:08,040 adding 'nanotron/models/__init__.py' 2024-02-08T10:45:08,042 adding 'nanotron/models/base.py' 2024-02-08T10:45:08,047 adding 'nanotron/models/llama.py' 2024-02-08T10:45:08,055 adding 'nanotron/models/starcoder2.py' 2024-02-08T10:45:08,057 adding 'nanotron/nn/__init__.py' 2024-02-08T10:45:08,059 adding 'nanotron/nn/activations.py' 2024-02-08T10:45:08,060 adding 'nanotron/nn/layer_norm.py' 2024-02-08T10:45:08,062 adding 'nanotron/optim/__init__.py' 2024-02-08T10:45:08,063 adding 'nanotron/optim/base.py' 2024-02-08T10:45:08,065 adding 'nanotron/optim/clip_grads.py' 2024-02-08T10:45:08,067 adding 'nanotron/optim/gradient_accumulator.py' 2024-02-08T10:45:08,069 adding 'nanotron/optim/inherit_from_other_optimizer.py' 2024-02-08T10:45:08,070 adding 'nanotron/optim/named_optimizer.py' 2024-02-08T10:45:08,071 adding 'nanotron/optim/optimizer_from_gradient_accumulator.py' 2024-02-08T10:45:08,074 adding 'nanotron/optim/zero.py' 2024-02-08T10:45:08,076 adding 'nanotron/parallel/__init__.py' 2024-02-08T10:45:08,078 adding 'nanotron/parallel/context.py' 2024-02-08T10:45:08,079 adding 'nanotron/parallel/parameters.py' 2024-02-08T10:45:08,081 adding 'nanotron/parallel/sharded_parameters.py' 2024-02-08T10:45:08,082 adding 'nanotron/parallel/tied_parameters.py' 2024-02-08T10:45:08,084 adding 'nanotron/parallel/utils.py' 2024-02-08T10:45:08,085 adding 'nanotron/parallel/data_parallel/utils.py' 2024-02-08T10:45:08,088 adding 'nanotron/parallel/pipeline_parallel/block.py' 2024-02-08T10:45:08,089 adding 'nanotron/parallel/pipeline_parallel/context_manager.py' 2024-02-08T10:45:08,091 adding 'nanotron/parallel/pipeline_parallel/engine.py' 2024-02-08T10:45:08,092 adding 'nanotron/parallel/pipeline_parallel/functional.py' 2024-02-08T10:45:08,095 adding 'nanotron/parallel/pipeline_parallel/p2p.py' 2024-02-08T10:45:08,096 adding 'nanotron/parallel/pipeline_parallel/state.py' 2024-02-08T10:45:08,098 adding 'nanotron/parallel/pipeline_parallel/tensor_pointer.py' 2024-02-08T10:45:08,099 adding 'nanotron/parallel/pipeline_parallel/utils.py' 2024-02-08T10:45:08,101 adding 'nanotron/parallel/tensor_parallel/__init__.py' 2024-02-08T10:45:08,102 adding 'nanotron/parallel/tensor_parallel/distributed_differentiable_primitives.py' 2024-02-08T10:45:08,103 adding 'nanotron/parallel/tensor_parallel/enum.py' 2024-02-08T10:45:08,106 adding 'nanotron/parallel/tensor_parallel/functional.py' 2024-02-08T10:45:08,108 adding 'nanotron/parallel/tensor_parallel/nn.py' 2024-02-08T10:45:08,109 adding 'nanotron/serialize/__init__.py' 2024-02-08T10:45:08,111 adding 'nanotron/serialize/main.py' 2024-02-08T10:45:08,113 adding 'nanotron/serialize/metadata.py' 2024-02-08T10:45:08,115 adding 'nanotron/serialize/optimizer.py' 2024-02-08T10:45:08,116 adding 'nanotron/serialize/random.py' 2024-02-08T10:45:08,117 adding 'nanotron/serialize/utils.py' 2024-02-08T10:45:08,120 adding 'nanotron/serialize/weights.py' 2024-02-08T10:45:08,122 adding 'nanotron-0.1.0.dist-info/LICENSE' 2024-02-08T10:45:08,124 adding 'nanotron-0.1.0.dist-info/METADATA' 2024-02-08T10:45:08,125 adding 'nanotron-0.1.0.dist-info/WHEEL' 2024-02-08T10:45:08,126 adding 'nanotron-0.1.0.dist-info/top_level.txt' 2024-02-08T10:45:08,127 adding 'nanotron-0.1.0.dist-info/RECORD' 2024-02-08T10:45:08,130 removing build/bdist.linux-armv7l/wheel 2024-02-08T10:45:08,267 Building wheel for nanotron (pyproject.toml): finished with status 'done' 2024-02-08T10:45:08,275 Created wheel for nanotron: filename=nanotron-0.1.0-py3-none-any.whl size=155012 sha256=52e73f190d4974eb202a4859449ea2751370aa15b1d92506eb2ff70df23be083 2024-02-08T10:45:08,276 Stored in directory: /tmp/pip-ephem-wheel-cache-ej6i8vom/wheels/83/8a/a8/10c3f104e64d926b80173c4457e39affa6416ca0da49ef7d8d 2024-02-08T10:45:08,289 Successfully built nanotron 2024-02-08T10:45:08,297 Removed build tracker: '/tmp/pip-build-tracker-fbregr5y'