2026-04-16T04:55:38,837 Created temporary directory: /tmp/pip-ephem-wheel-cache-ukd5d1ue 2026-04-16T04:55:38,839 Created temporary directory: /tmp/pip-build-tracker-2hn1r72n 2026-04-16T04:55:38,840 Initialized build tracking at /tmp/pip-build-tracker-2hn1r72n 2026-04-16T04:55:38,840 Created build tracker: /tmp/pip-build-tracker-2hn1r72n 2026-04-16T04:55:38,841 Entered build tracker: /tmp/pip-build-tracker-2hn1r72n 2026-04-16T04:55:38,842 Created temporary directory: /tmp/pip-wheel-lp_5eu38 2026-04-16T04:55:38,845 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-16T04:55:38,847 Created temporary directory: /tmp/pip-ephem-wheel-cache-q06utz8s 2026-04-16T04:55:38,872 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-16T04:55:38,876 2 location(s) to search for versions of flashinfer-python: 2026-04-16T04:55:38,876 * https://pypi.org/simple/flashinfer-python/ 2026-04-16T04:55:38,876 * https://www.piwheels.org/simple/flashinfer-python/ 2026-04-16T04:55:38,877 Fetching project page and analyzing links: https://pypi.org/simple/flashinfer-python/ 2026-04-16T04:55:38,878 Getting page https://pypi.org/simple/flashinfer-python/ 2026-04-16T04:55:38,879 Found index url https://pypi.org/simple 2026-04-16T04:55:39,035 Fetched page https://pypi.org/simple/flashinfer-python/ as application/vnd.pypi.simple.v1+json 2026-04-16T04:55:39,051 Found link https://files.pythonhosted.org/packages/6c/e9/5d6adcf888922a17c6fc52a0e5bed78785239af1219f41e1073b063a07ff/flashinfer_python-0.2.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post1 2026-04-16T04:55:39,053 Found link https://files.pythonhosted.org/packages/c8/39/bac839234a3beaab4292e489b4d8941cc97ba4f76474aff0407d7b05a84f/flashinfer_python-0.2.0.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post2 2026-04-16T04:55:39,054 Found link https://files.pythonhosted.org/packages/94/74/4dda2a7a7aa08bcfb8039faf2202bf0fea6b378d0d4968864737400fc329/flashinfer_python-0.2.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1 2026-04-16T04:55:39,055 Found link https://files.pythonhosted.org/packages/7f/3d/aab500609825108d3f6a4b440a7eeb6436d578d3e781e97ea015fd49a530/flashinfer_python-0.2.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post1 2026-04-16T04:55:39,057 Found link https://files.pythonhosted.org/packages/30/ac/afd1d2c472857be8f83389eb506e1413a2ac3a603889bea3cf24d5ab5be5/flashinfer_python-0.2.1.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post2 2026-04-16T04:55:39,058 Found link https://files.pythonhosted.org/packages/90/00/833dd50745bc15bb7a7451b77589d444ce963d48c0cb730b4760bfebffad/flashinfer_python-0.2.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2 2026-04-16T04:55:39,059 Found link https://files.pythonhosted.org/packages/02/cc/db9635c56653d3fa5a28f14ac858e0801de621aa33d3b528e4781aee906f/flashinfer_python-0.2.2.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2.post1 2026-04-16T04:55:39,060 Found link https://files.pythonhosted.org/packages/b6/10/2a63f1d09c5b337705236005dc9ccce513dcc08b7fd037cb40426f1695b1/flashinfer_python-0.2.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.3 2026-04-16T04:55:39,062 Found link https://files.pythonhosted.org/packages/a4/e5/8d193ccf65b92c009c4be50fdffa88fa0edc8fd6e6169bacaca6bab84d89/flashinfer_python-0.2.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.4 2026-04-16T04:55:39,063 Found link https://files.pythonhosted.org/packages/b2/c4/9ec0f79e2480fc5c93307c4a1ac903e5cf33c551c0eaeb648196234b55af/flashinfer_python-0.2.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.5 2026-04-16T04:55:39,064 Found link https://files.pythonhosted.org/packages/95/4a/a3109d57463d25a153b16c0d0f06495e4d18b727c81f8e08e42e97faaf45/flashinfer_python-0.2.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6 2026-04-16T04:55:39,065 Found link https://files.pythonhosted.org/packages/34/26/3c6f12ffaefbfa0c453030d6e15941269b3a4ffcd267daec32d1a10dda96/flashinfer_python-0.2.6.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6.post1 2026-04-16T04:55:39,066 Found link https://files.pythonhosted.org/packages/f9/a0/5e700751f2393a504bc5eb2879e77d783a5b70778a254289711323126abc/flashinfer_python-0.2.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7 2026-04-16T04:55:39,067 Found link https://files.pythonhosted.org/packages/c0/10/43cf1ea7a03ca8e75a185190708e48286e1583d781e93d1de130e5d450ca/flashinfer_python-0.2.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7.post1 2026-04-16T04:55:39,068 Found link https://files.pythonhosted.org/packages/f1/80/8dfae62d04af4597d7615b892f346ace68bcb07dfbef2a9e614219d96a8a/flashinfer_python-0.2.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8rc1 2026-04-16T04:55:39,069 Found link https://files.pythonhosted.org/packages/72/0e/827624993516e80f62ba88dd368ad5e180c41324f063c00d27fa638a430e/flashinfer_python-0.2.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8 2026-04-16T04:55:39,070 Found link https://files.pythonhosted.org/packages/17/50/42afc9a81031939140fcbfd93e5a3652dc4995e338b4e6d007b0dda04f93/flashinfer_python-0.2.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc1 2026-04-16T04:55:39,072 Found link https://files.pythonhosted.org/packages/ed/1a/9f30eda3178ed2f5f7e311ae0011d02c4542d087f84c9247e4b30668b767/flashinfer_python-0.2.9rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc2 2026-04-16T04:55:39,073 Found link https://files.pythonhosted.org/packages/45/fc/4deff13f1420cc6e5871b7505a6c0d9031eb49cd09571ae576aec59bed61/flashinfer_python-0.2.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9 2026-04-16T04:55:39,074 Found link https://files.pythonhosted.org/packages/74/e4/2c6d6a19d13ed13d4863f6900febe72b502334e43292d5fe9a1ac2f6c5be/flashinfer_python-0.2.10.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.10 2026-04-16T04:55:39,075 Found link https://files.pythonhosted.org/packages/72/8b/f315dda5993d1c018ca5ecfef0775c6a3c7a8f59ac426fabb7f3f6b93482/flashinfer_python-0.2.11.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11 2026-04-16T04:55:39,076 Found link https://files.pythonhosted.org/packages/37/e3/2e8e31f7f7ee26f39968264e4fcf74f9810d90e940859016d974106ed5c6/flashinfer_python-0.2.11.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post1 2026-04-16T04:55:39,077 Found link https://files.pythonhosted.org/packages/b6/01/fa069f076cfe5bed34ddc3b7f772aa09c70e03e572dd9d3569ff887f33b1/flashinfer_python-0.2.11.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post2 2026-04-16T04:55:39,078 Found link https://files.pythonhosted.org/packages/a3/09/5d89ef0bc2d19d3ebcf3b9fa621c945909f681818c9d55aa3181921db874/flashinfer_python-0.2.11.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post3 2026-04-16T04:55:39,080 Found link https://files.pythonhosted.org/packages/b9/5a/7a839afb07af313549b9d9f1057b02aaf067f020267d5a9d128e50596bf4/flashinfer_python-0.2.12.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.12 2026-04-16T04:55:39,081 Found link https://files.pythonhosted.org/packages/f2/20/e79142a9f26aab61b17e2c906a49e9a3d3c656d97608c8773785c3b13140/flashinfer_python-0.2.13.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.13 2026-04-16T04:55:39,082 Found link https://files.pythonhosted.org/packages/ed/26/d1eac56b37d225cb3f84495bd897829dece21f62463487f3c1d9cafe78a0/flashinfer_python-0.2.14.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14 2026-04-16T04:55:39,083 Found link https://files.pythonhosted.org/packages/94/d4/4a2bf3d49f84b2d975925c1c024790b4e4768bdefbc5e27529d68368355a/flashinfer_python-0.2.14.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14.post1 2026-04-16T04:55:39,084 Found link https://files.pythonhosted.org/packages/56/e3/7c0a4df2640a97ecfed45fe9110ecc6a67d4967278723abf8e6531b6bc1f/flashinfer_python-0.3.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0rc1 2026-04-16T04:55:39,085 Found link https://files.pythonhosted.org/packages/1f/b4/5c4cbb0f3cbc5e8d4c19b3f163c048eed959a0ac0c603cfb3939a3079c52/flashinfer_python-0.3.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0 2026-04-16T04:55:39,086 Found link https://files.pythonhosted.org/packages/59/1b/83a9c58432b4a5d6ff04b97d4873bedfb5e35d38972ca8946b3acdbffeb4/flashinfer_python-0.3.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0.post1 2026-04-16T04:55:39,087 Found link https://files.pythonhosted.org/packages/ba/71/dd3001b8be8174d90561764a5f3be4ca219517bde2841189ea6973a3873f/flashinfer_python-0.3.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1 2026-04-16T04:55:39,088 Found link https://files.pythonhosted.org/packages/49/a7/f5bd3878f94fc47e25ecc0828f910233022366f7e832dfa02f3617fad41f/flashinfer_python-0.3.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1.post1 2026-04-16T04:55:39,089 Found link https://files.pythonhosted.org/packages/df/b4/f113bb950e5244d1c72c3d73c03fac0db939f085670e3a45a41fe92ffde0/flashinfer_python-0.4.0rc0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc0 2026-04-16T04:55:39,091 Found link https://files.pythonhosted.org/packages/2e/a8/adceccda3aae01b7bdb5f99c68a2b401c58600f34a6386d9489ff736cdbc/flashinfer_python-0.4.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc1 2026-04-16T04:55:39,092 Found link https://files.pythonhosted.org/packages/15/c0/5fb88fc273fed23dbf3b0ef0bffa7db26e2df24e016202df1b4e98b95879/flashinfer_python-0.4.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc2 2026-04-16T04:55:39,093 Found link https://files.pythonhosted.org/packages/65/91/cf9e3a0a2626711bfab18ea4a4c739e0eb823e9513addc0e9e1b8f929538/flashinfer_python-0.4.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc3 2026-04-16T04:55:39,094 Found link https://files.pythonhosted.org/packages/94/ec/bdcc0ec502994d544cbe69763d999458ae2deda67e58c1cb2d85867677c4/flashinfer_python-0.4.0rc4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc4 2026-04-16T04:55:39,095 Found link https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0 2026-04-16T04:55:39,097 Found link https://files.pythonhosted.org/packages/64/cf/f82142abd7c819fb84a53f18fe1ac9e7cf1af8790b93c06dbf430001473b/flashinfer_python-0.4.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.1 2026-04-16T04:55:39,098 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/c7/92/126dacc3476fab07478bdfc9944abd22aafa1000088d93bf86fb9ec78a29/flashinfer_python-0.5.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,099 Found link https://files.pythonhosted.org/packages/53/47/a759f1ae9ef4ceb4e12895665b65dfacea2085494626e764627dd3548fa8/flashinfer_python-0.5.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc1 2026-04-16T04:55:39,099 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/fb/aa/7b5d28c2aec11acfce18f2655d0b4614c7e34547fab218b4f2fd0d57bdce/flashinfer_python-0.5.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,100 Found link https://files.pythonhosted.org/packages/3d/5a/58a7b60f79a1ac9c652b4055b06e88b5f57e8ef4c7dd4830ef48fa4cc265/flashinfer_python-0.5.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc2 2026-04-16T04:55:39,101 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/5f/8f/7077cf0a44056a65045a793d6d55845d95818fb6455bfebb44ddea7e1f12/flashinfer_python-0.5.0rc3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,102 Found link https://files.pythonhosted.org/packages/60/d1/8c90d6dfc95ab609028e9d541a6cdb3483f5c1475b07d97465ff3f0db14c/flashinfer_python-0.5.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc3 2026-04-16T04:55:39,103 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/eb/8a/425b75b44ce5eeefe01dd61d4ee260b8e5f9dcf1a500d5f08d6cd4095d3a/flashinfer_python-0.5.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,104 Found link https://files.pythonhosted.org/packages/e3/1d/b82cd2606f4f0033e2fb28194dc3b04fd8101643e4ceb1d13fb1466cfd28/flashinfer_python-0.5.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0 2026-04-16T04:55:39,105 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f4/f1/33dedad087a2bc3d66244126bd5d1c79721ea22d1f2124299f9e5bdaf3b1/flashinfer_python-0.5.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,106 Found link https://files.pythonhosted.org/packages/6c/bb/897c3b9d683dcf6490f70e468efb585eebcd673970b13a04ed947b491982/flashinfer_python-0.5.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.1 2026-04-16T04:55:39,107 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/8d/0c/4a8ffbbc0d85e314f534cf5c32711f2af5d5e6e49225a5a414400a67b684/flashinfer_python-0.5.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,108 Found link https://files.pythonhosted.org/packages/d8/04/e357eaa50238e12c49e66fcf47f83e066e741ef19a117c136782b32eafbb/flashinfer_python-0.5.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.2 2026-04-16T04:55:39,108 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/78/6dc7e7da8cb87c9965644ea0d2439457a1bc9256c45ceda0044595be4143/flashinfer_python-0.5.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,110 Found link https://files.pythonhosted.org/packages/b4/91/cca69baeff24bb3efd12c7479a026432c8717ee47193694010494c528b22/flashinfer_python-0.5.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.5.3 2026-04-16T04:55:39,111 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/b2/0c/cb2d60eb86f0171451d676f17b90484ab66baf73c54cefe15c9a7c800739/flashinfer_python-0.6.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,112 Found link https://files.pythonhosted.org/packages/53/2a/e855be4851ad6bfcebed929807fb541715f9a3a7d7b239b696e635b49d0e/flashinfer_python-0.6.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc1 2026-04-16T04:55:39,112 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/05/22/9193f1da2468acec8ba99c4bee8aeacbda489777acf00b5871a73209acf7/flashinfer_python-0.6.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,113 Found link https://files.pythonhosted.org/packages/1b/71/dd1bb86ea531e5c1a34f8ad851901bf2e2ce500618b5a4da19bd69f7de11/flashinfer_python-0.6.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc2 2026-04-16T04:55:39,114 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/90/5834597488f5ea62b1cc874338125c79ce21c11d777ac6f7b47f12cf2bb3/flashinfer_python-0.6.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,115 Found link https://files.pythonhosted.org/packages/ad/8d/c7330f27f09b9110af2f6c44c6f68d7b536f525f8ac539210073bfcdb965/flashinfer_python-0.6.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0 2026-04-16T04:55:39,116 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/d5/bca632bb5781689415186421bbee2ad39ae8a39b0996d579c76901e5c66f/flashinfer_python-0.6.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,117 Found link https://files.pythonhosted.org/packages/68/81/5a84e14df7358d2c2903b18c6f2779bd4b4a6739076d01a847d4c18fb102/flashinfer_python-0.6.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.1 2026-04-16T04:55:39,118 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/aa/c0/ee819d16f6b40e287727bb3db471f4eaa9e0372e233bf2f7343faaa3009f/flashinfer_python-0.6.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,119 Found link https://files.pythonhosted.org/packages/89/86/b25115177606ae3b6cec373d290798c28e185d033b66f6b80a89589e7786/flashinfer_python-0.6.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.2 2026-04-16T04:55:39,119 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/13/2d95248101d8cb978db9000a4dceafb5b122484a694b53e84df1ac2a7b3d/flashinfer_python-0.6.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,120 Found link https://files.pythonhosted.org/packages/d6/aa/c564313b42dee7573da4ed0e441844f0c2bd827aecc9f29ea02c3838ffae/flashinfer_python-0.6.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.3 2026-04-16T04:55:39,121 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/17/9a/d2bab76d2bb15062c6a2329614653e4f8bec9c78eec9069856ef0c7c0a79/flashinfer_python-0.6.4-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,122 Found link https://files.pythonhosted.org/packages/77/45/15645d2a4ee81d08206f3e132a77323e48312f510462415d7cd1122eba43/flashinfer_python-0.6.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.4 2026-04-16T04:55:39,123 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/4f/83/eea2a74700b5fcae36ee2b748db9c3554a83a3f9e2dc4f3816369c5cb653/flashinfer_python-0.6.5-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,124 Found link https://files.pythonhosted.org/packages/e2/2f/5c52276af3cc40ac1f6eaf823ccd8e257f77e2fe5d465fa641ad3dba4d1b/flashinfer_python-0.6.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.5 2026-04-16T04:55:39,125 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/e0/61/385d06755f3ab66333018285657adf0daf8a90a129448231fd09e315bd2e/flashinfer_python-0.6.6-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,126 Found link https://files.pythonhosted.org/packages/03/70/c5a235297351021f5d3d3233523a85f5a6468495587489ad2f257e8eafe2/flashinfer_python-0.6.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.6 2026-04-16T04:55:39,126 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f1/e8/91361a5f07667f36181cfd08e2d7d28be4cae2aa5a24016339174b308c38/flashinfer_python-0.6.7-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,127 Found link https://files.pythonhosted.org/packages/d9/2d/aa36fa1fee744c46fef99436baea5cda4a34244846c1df0fea97eaa9a856/flashinfer_python-0.6.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7 2026-04-16T04:55:39,128 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/16/92/516c79e5d8d1f0b41793e499c37a9299115ac8bc05171661b30d4a94beb8/flashinfer_python-0.6.7.post1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,129 Found link https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post1 2026-04-16T04:55:39,130 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/62/9e/bf26a95bb219eb3d43cc6f3cd1dde6f560081fbcb50f846535c9f571a807/flashinfer_python-0.6.7.post2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,131 Found link https://files.pythonhosted.org/packages/cc/95/81eafb78574312db79ef7144a4e77f2fee015343f413ef3000f279c8a118/flashinfer_python-0.6.7.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post2 2026-04-16T04:55:39,131 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/01/6b/4117cd7cbeff07818ae7c6b8bf5a6d1ee3eed29356672b731b55af3d4453/flashinfer_python-0.6.7.post3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,132 Found link https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post3 2026-04-16T04:55:39,133 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/6a/0a/e8ae05fd59f800e74ec24fa6a58a04c6c0d9308917880c42f2b53cfe36bb/flashinfer_python-0.6.8rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,134 Found link https://files.pythonhosted.org/packages/68/e1/67b0b5eb9f3ea23e05e7d454571ad7a186ede6a9c30fec55e51291bfa461/flashinfer_python-0.6.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.8rc1 2026-04-16T04:55:39,135 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/9e/f8/54f8764748f1ba7d45a1915a1a51ad08f63b68a2f2141e399bdb0379d146/flashinfer_python-0.6.8-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,136 Found link https://files.pythonhosted.org/packages/7e/14/869ae016b4249db0b312203e4ba19b86406ce98417abf80fd2003af0a1a7/flashinfer_python-0.6.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.8 2026-04-16T04:55:39,137 Fetching project page and analyzing links: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-16T04:55:39,138 Getting page https://www.piwheels.org/simple/flashinfer-python/ 2026-04-16T04:55:39,139 Found index url https://www.piwheels.org/simple 2026-04-16T04:55:39,312 Fetched page https://www.piwheels.org/simple/flashinfer-python/ as text/html 2026-04-16T04:55:39,322 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.8rc1-py3-none-any.whl#sha256=03391613bde22d44aa6cc1c6e53ba651f46f264a4dee2b4b6b1088a35f872db7 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,322 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post3-py3-none-any.whl#sha256=7a81720af5bdc04efcb67207f3867adb1b068f961d2e048e55baf32fb8e2cfc5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,323 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post1-py3-none-any.whl#sha256=c9bf5183228f6636ddb26d7354f250af4b2385876527538a0ff7f94fd48207d2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,323 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7-py3-none-any.whl#sha256=9b349825a2d26c3e4653c594d7a1d7b2126a43b29a4a70a6d48f3aaac23b96f3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,324 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.6-py3-none-any.whl#sha256=94791e01c31510c057b4decabff24cbc62466682667867e84214c62c45d9b343 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,325 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.5-py3-none-any.whl#sha256=4b0a6c246959ca2dbc232fa1fe2f17ff857fd258de5dfacfa45033f21b6b7b93 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,325 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.4-py3-none-any.whl#sha256=22ee7972266bb31ce1583330769efc0ecd001fb70371531ce4c77f2d6eda0d59 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,326 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.3-py3-none-any.whl#sha256=ed3282188580afd663819924a772b2b531ac5bb88438bbe89d0baf67fe8c9fa5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,327 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.1-py3-none-any.whl#sha256=9e0e308062a81d4e4c462313bfe33edce7712309e8c89aed722065249e644833 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,327 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0-py3-none-any.whl#sha256=7ebc0582df714a933fc4c58ed4d12f4e61b4ad30b22b9155f290e96ee3eee3a0 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,328 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc2-py3-none-any.whl#sha256=63057b7ee43a4f6764c6ed8fe4c4c6de5a94da058fe0975bf279db0567c26204 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,328 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc1-py3-none-any.whl#sha256=e30a125bf89f8155f83aca80e5fb88a3d81224225485ce70f0f4c4c3a27da92c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,329 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.3-py3-none-any.whl#sha256=1de562233dfbd8de835c2eb757275a7759eda034460093c1eb9ff3c7d5c0845d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-16T04:55:39,329 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.2-py3-none-any.whl#sha256=bd3d206d1243bee523cf6cda27e0219e8fdf9026ade2e32045c8d9d4b7f7bf7a (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,330 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.1-py3-none-any.whl#sha256=8d73e4b66b7eb7fc4500f7f7e61aa194efebc769e7da1635a86506c97bf6fa0d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,330 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0-py3-none-any.whl#sha256=ac991d1911cff4a7453f02d88922803e7ca794a0af1dceaa920e33b81c78f5c8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,331 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc3-py3-none-any.whl#sha256=8799f4a93afc14042ac6f521f6fb682e4d62d738dc18a1e8798b7a2ba5b2e4ec (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,332 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc2-py3-none-any.whl#sha256=4ee4d438c8c7fdc242a917c3f97076562f3c44411dcaceb4f7d29082c41c0f8c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,332 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc1-py3-none-any.whl#sha256=a9d675075f3cb79ac1b5cba9e8430496d3983127609dc780a117b2b44bdb025d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,333 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.1-py3-none-any.whl#sha256=8fc8fc3233781e384689c5f202124ae7d266cb8dee14055cbb3c90fca530bf7f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,333 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.0-py3-none-any.whl#sha256=da0141b2163f9703e49972728eeb502d45eda60c25529a460d0d0d61963eedb2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-16T04:55:39,334 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.5-py3-none-any.whl#sha256=cb2a17c3ea5f47f8129f6410e2892f30051e15665f2ae54db540c8677c187d31 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,335 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.4-py3-none-any.whl#sha256=4a85bd6ac785f106f0ad9fe213abf42f96ab84ccd04aec3ab9acf76d47d2aa3f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,335 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.3-py3-none-any.whl#sha256=b8ead688a4857a2b360c992fb46ae2930fc4c43b50a092b7e42a13b40ee195da (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,336 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2.post1-py3-none-any.whl#sha256=0097a08376ae147084ea6bd0848fc2ea1764f524c510a48755aa8c63259b4466 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,336 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2-py3-none-any.whl#sha256=c109a340b7e60cb57d8c9ccec2c10e303a36b82a56ba8dcaaa0efbee2a48b97f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,337 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post2-py3-none-any.whl#sha256=dc91f387ba09e4df899238705ec37bbe3648395d828240b77db84378d1b91e9e (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,337 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post1-py3-none-any.whl#sha256=a44b9d872cf2ba6812d3c0750d98ad01b73e9ccbede933c7eade01b6c27b6232 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,338 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1-py3-none-any.whl#sha256=e07427d9eff1b8d091b5837c3ffc4fe7885dbf01d271d7225f7a89a2e3925f27 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,339 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post2-py3-none-any.whl#sha256=52c20b84ef1e848dd49c726ffc27801df8acccb4038aea61a2d73fa685bf75f8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,339 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post1-py3-none-any.whl#sha256=783c1039e0a7db0478a579d5cc54894def70ae601b1e5b90a3c3de2209334bf3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-16T04:55:39,340 Skipping link: not a file: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-16T04:55:39,340 Skipping link: not a file: https://pypi.org/simple/flashinfer-python/ 2026-04-16T04:55:39,371 Given no hashes to check 1 links for project 'flashinfer-python': discarding no candidates 2026-04-16T04:55:39,391 Collecting flashinfer-python==0.6.8 2026-04-16T04:55:39,393 Created temporary directory: /tmp/pip-unpack-e17dm92p 2026-04-16T04:55:39,622 Downloading flashinfer_python-0.6.8.tar.gz (6.7 MB) 2026-04-16T04:55:46,801 Added flashinfer-python==0.6.8 from https://files.pythonhosted.org/packages/7e/14/869ae016b4249db0b312203e4ba19b86406ce98417abf80fd2003af0a1a7/flashinfer_python-0.6.8.tar.gz to build tracker '/tmp/pip-build-tracker-2hn1r72n' 2026-04-16T04:55:46,814 Created temporary directory: /tmp/pip-build-env-631cyict 2026-04-16T04:55:46,820 Installing build dependencies: started 2026-04-16T04:55:46,822 Running command pip subprocess to install build dependencies 2026-04-16T04:55:47,995 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-16T04:55:48,440 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-16T04:55:48,463 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-16T04:55:50,215 Collecting setuptools>=77 2026-04-16T04:55:50,290 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-16T04:55:50,512 Collecting packaging>=24 2026-04-16T04:55:50,529 Using cached https://www.piwheels.org/simple/packaging/packaging-26.1-py3-none-any.whl (95 kB) 2026-04-16T04:55:51,173 Collecting apache-tvm-ffi!=0.1.8,!=0.1.8.post0,<0.2,>=0.1.6 2026-04-16T04:55:51,403 Downloading https://archive1.piwheels.org/simple/apache-tvm-ffi/apache_tvm_ffi-0.1.10-cp311-cp311-linux_armv7l.whl (2.6 MB) 2026-04-16T04:55:51,607 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.6/2.6 MB 13.0 MB/s eta 0:00:00 2026-04-16T04:55:51,835 Collecting typing-extensions>=4.5 2026-04-16T04:55:51,850 Using cached https://www.piwheels.org/simple/typing-extensions/typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2026-04-16T04:55:54,847 Installing collected packages: typing-extensions, setuptools, packaging, apache-tvm-ffi 2026-04-16T04:55:59,255 Creating /tmp/pip-build-env-631cyict/overlay/local/bin 2026-04-16T04:55:59,257 changing mode of /tmp/pip-build-env-631cyict/overlay/local/bin/tvm-ffi-config to 755 2026-04-16T04:55:59,260 changing mode of /tmp/pip-build-env-631cyict/overlay/local/bin/tvm-ffi-stubgen to 755 2026-04-16T04:55:59,293 Successfully installed apache-tvm-ffi-0.1.10 packaging-26.1 setuptools-82.0.1 typing-extensions-4.15.0 2026-04-16T04:55:59,598 Installing build dependencies: finished with status 'done' 2026-04-16T04:55:59,605 Getting requirements to build wheel: started 2026-04-16T04:55:59,606 Running command Getting requirements to build wheel 2026-04-16T04:56:05,269 Build metadata file already exists (not in git repo), keeping it 2026-04-16T04:56:05,337 Getting requirements to build wheel: finished with status 'done' 2026-04-16T04:56:05,341 Created temporary directory: /tmp/pip-modern-metadata-hii0plv3 2026-04-16T04:56:05,343 Preparing metadata (pyproject.toml): started 2026-04-16T04:56:05,345 Running command Preparing metadata (pyproject.toml) 2026-04-16T04:56:11,890 /tmp/pip-build-env-631cyict/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-16T04:56:11,891 !! 2026-04-16T04:56:11,893 ******************************************************************************** 2026-04-16T04:56:11,893 Pattern 'LICENSE*.txt' did not match any files. 2026-04-16T04:56:11,895 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-16T04:56:11,895 or your builds will no longer be supported. 2026-04-16T04:56:11,896 ******************************************************************************** 2026-04-16T04:56:11,896 !! 2026-04-16T04:56:11,897 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-16T04:56:11,900 Build metadata file already exists (not in git repo), keeping it 2026-04-16T04:56:11,901 running dist_info 2026-04-16T04:56:11,913 creating /tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info 2026-04-16T04:56:11,914 writing /tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/PKG-INFO 2026-04-16T04:56:11,919 writing dependency_links to /tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/dependency_links.txt 2026-04-16T04:56:11,921 writing entry points to /tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/entry_points.txt 2026-04-16T04:56:11,923 writing requirements to /tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/requires.txt 2026-04-16T04:56:11,924 writing top-level names to /tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/top_level.txt 2026-04-16T04:56:11,926 writing manifest file '/tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/SOURCES.txt' 2026-04-16T04:56:12,766 reading manifest file '/tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/SOURCES.txt' 2026-04-16T04:56:12,768 adding license file 'LICENSE' 2026-04-16T04:56:12,845 writing manifest file '/tmp/pip-modern-metadata-hii0plv3/flashinfer_python.egg-info/SOURCES.txt' 2026-04-16T04:56:12,851 creating '/tmp/pip-modern-metadata-hii0plv3/flashinfer_python-0.6.8.dist-info' 2026-04-16T04:56:12,984 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-16T04:56:12,989 Source in /tmp/pip-wheel-lp_5eu38/flashinfer-python_1cbd03571e0c4b43a8da35788411737c has version 0.6.8, which satisfies requirement flashinfer-python==0.6.8 from https://files.pythonhosted.org/packages/7e/14/869ae016b4249db0b312203e4ba19b86406ce98417abf80fd2003af0a1a7/flashinfer_python-0.6.8.tar.gz 2026-04-16T04:56:12,990 Removed flashinfer-python==0.6.8 from https://files.pythonhosted.org/packages/7e/14/869ae016b4249db0b312203e4ba19b86406ce98417abf80fd2003af0a1a7/flashinfer_python-0.6.8.tar.gz from build tracker '/tmp/pip-build-tracker-2hn1r72n' 2026-04-16T04:56:12,997 Created temporary directory: /tmp/pip-unpack-jpm0avvm 2026-04-16T04:56:12,998 Building wheels for collected packages: flashinfer-python 2026-04-16T04:56:13,002 Created temporary directory: /tmp/pip-wheel-asfseyvc 2026-04-16T04:56:13,003 Destination directory: /tmp/pip-wheel-asfseyvc 2026-04-16T04:56:13,005 Building wheel for flashinfer-python (pyproject.toml): started 2026-04-16T04:56:13,007 Running command Building wheel for flashinfer-python (pyproject.toml) 2026-04-16T04:56:18,937 /tmp/pip-build-env-631cyict/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-16T04:56:18,937 !! 2026-04-16T04:56:18,938 ******************************************************************************** 2026-04-16T04:56:18,939 Pattern 'LICENSE*.txt' did not match any files. 2026-04-16T04:56:18,940 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-16T04:56:18,941 or your builds will no longer be supported. 2026-04-16T04:56:18,941 ******************************************************************************** 2026-04-16T04:56:18,942 !! 2026-04-16T04:56:18,943 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-16T04:56:18,944 Build metadata file already exists (not in git repo), keeping it 2026-04-16T04:56:18,944 running bdist_wheel 2026-04-16T04:56:18,962 running build 2026-04-16T04:56:18,963 running build_py 2026-04-16T04:56:18,970 creating build/lib 2026-04-16T04:56:18,971 copying build_backend.py -> build/lib 2026-04-16T04:56:18,974 copying build_utils.py -> build/lib 2026-04-16T04:56:18,977 creating build/lib/flashinfer 2026-04-16T04:56:18,979 copying flashinfer/pod.py -> build/lib/flashinfer 2026-04-16T04:56:18,981 copying flashinfer/sampling.py -> build/lib/flashinfer 2026-04-16T04:56:18,985 copying flashinfer/concat_ops.py -> build/lib/flashinfer 2026-04-16T04:56:18,986 copying flashinfer/tllm_utils.py -> build/lib/flashinfer 2026-04-16T04:56:18,988 copying flashinfer/green_ctx.py -> build/lib/flashinfer 2026-04-16T04:56:18,990 copying flashinfer/autotuner.py -> build/lib/flashinfer 2026-04-16T04:56:18,993 copying flashinfer/gdn_prefill.py -> build/lib/flashinfer 2026-04-16T04:56:18,995 copying flashinfer/page.py -> build/lib/flashinfer 2026-04-16T04:56:18,998 copying flashinfer/cuda_utils.py -> build/lib/flashinfer 2026-04-16T04:56:18,999 copying flashinfer/activation.py -> build/lib/flashinfer 2026-04-16T04:56:19,002 copying flashinfer/attention.py -> build/lib/flashinfer 2026-04-16T04:56:19,004 copying flashinfer/api_logging.py -> build/lib/flashinfer 2026-04-16T04:56:19,007 copying flashinfer/decode.py -> build/lib/flashinfer 2026-04-16T04:56:19,011 copying flashinfer/cascade.py -> build/lib/flashinfer 2026-04-16T04:56:19,013 copying flashinfer/__init__.py -> build/lib/flashinfer 2026-04-16T04:56:19,015 copying flashinfer/fp8_quantization.py -> build/lib/flashinfer 2026-04-16T04:56:19,017 copying flashinfer/utils.py -> build/lib/flashinfer 2026-04-16T04:56:19,020 copying flashinfer/rope.py -> build/lib/flashinfer 2026-04-16T04:56:19,023 copying flashinfer/gdn_decode.py -> build/lib/flashinfer 2026-04-16T04:56:19,026 copying flashinfer/aot.py -> build/lib/flashinfer 2026-04-16T04:56:19,028 copying flashinfer/deep_gemm.py -> build/lib/flashinfer 2026-04-16T04:56:19,031 copying flashinfer/_build_meta.py -> build/lib/flashinfer 2026-04-16T04:56:19,032 copying flashinfer/trtllm_low_latency_gemm.py -> build/lib/flashinfer 2026-04-16T04:56:19,035 copying flashinfer/tllm_enums.py -> build/lib/flashinfer 2026-04-16T04:56:19,037 copying flashinfer/xqa.py -> build/lib/flashinfer 2026-04-16T04:56:19,040 copying flashinfer/compilation_context.py -> build/lib/flashinfer 2026-04-16T04:56:19,041 copying flashinfer/version.py -> build/lib/flashinfer 2026-04-16T04:56:19,043 copying flashinfer/prefill.py -> build/lib/flashinfer 2026-04-16T04:56:19,049 copying flashinfer/topk.py -> build/lib/flashinfer 2026-04-16T04:56:19,051 copying flashinfer/sparse.py -> build/lib/flashinfer 2026-04-16T04:56:19,054 copying flashinfer/__main__.py -> build/lib/flashinfer 2026-04-16T04:56:19,057 copying flashinfer/fp4_quantization.py -> build/lib/flashinfer 2026-04-16T04:56:19,059 copying flashinfer/artifacts.py -> build/lib/flashinfer 2026-04-16T04:56:19,061 creating build/lib/flashinfer/mamba 2026-04-16T04:56:19,063 copying flashinfer/mamba/ssd_tile_scheduler.py -> build/lib/flashinfer/mamba 2026-04-16T04:56:19,065 copying flashinfer/mamba/ssd_combined.py -> build/lib/flashinfer/mamba 2026-04-16T04:56:19,068 copying flashinfer/mamba/selective_state_update.py -> build/lib/flashinfer/mamba 2026-04-16T04:56:19,070 copying flashinfer/mamba/__init__.py -> build/lib/flashinfer/mamba 2026-04-16T04:56:19,072 copying flashinfer/mamba/ssd_kernel.py -> build/lib/flashinfer/mamba 2026-04-16T04:56:19,078 creating build/lib/flashinfer/quantization 2026-04-16T04:56:19,079 copying flashinfer/quantization/packbits.py -> build/lib/flashinfer/quantization 2026-04-16T04:56:19,081 copying flashinfer/quantization/quantization_cute_dsl_utils.py -> build/lib/flashinfer/quantization 2026-04-16T04:56:19,084 copying flashinfer/quantization/__init__.py -> build/lib/flashinfer/quantization 2026-04-16T04:56:19,086 copying flashinfer/quantization/fp8_quantization.py -> build/lib/flashinfer/quantization 2026-04-16T04:56:19,089 copying flashinfer/quantization/fp4_quantization.py -> build/lib/flashinfer/quantization 2026-04-16T04:56:19,092 creating build/lib/flashinfer/gemm 2026-04-16T04:56:19,094 copying flashinfer/gemm/routergemm.py -> build/lib/flashinfer/gemm 2026-04-16T04:56:19,096 copying flashinfer/gemm/gemm_base.py -> build/lib/flashinfer/gemm 2026-04-16T04:56:19,104 copying flashinfer/gemm/__init__.py -> build/lib/flashinfer/gemm 2026-04-16T04:56:19,106 creating build/lib/flashinfer/tuning_configs 2026-04-16T04:56:19,107 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/lib/flashinfer/tuning_configs 2026-04-16T04:56:19,110 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/lib/flashinfer/tuning_configs 2026-04-16T04:56:19,113 creating build/lib/flashinfer/cudnn 2026-04-16T04:56:19,114 copying flashinfer/cudnn/decode.py -> build/lib/flashinfer/cudnn 2026-04-16T04:56:19,116 copying flashinfer/cudnn/__init__.py -> build/lib/flashinfer/cudnn 2026-04-16T04:56:19,118 copying flashinfer/cudnn/utils.py -> build/lib/flashinfer/cudnn 2026-04-16T04:56:19,120 copying flashinfer/cudnn/prefill.py -> build/lib/flashinfer/cudnn 2026-04-16T04:56:19,123 creating build/lib/flashinfer/comm 2026-04-16T04:56:19,124 copying flashinfer/comm/trtllm_ar.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,128 copying flashinfer/comm/trtllm_mnnvl_ar.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,130 copying flashinfer/comm/trtllm_alltoall.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,133 copying flashinfer/comm/mapping.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,135 copying flashinfer/comm/vllm_ar.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,137 copying flashinfer/comm/trtllm_moe_alltoall.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,140 copying flashinfer/comm/__init__.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,142 copying flashinfer/comm/allreduce.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,145 copying flashinfer/comm/workspace_base.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,147 copying flashinfer/comm/mnnvl.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,150 copying flashinfer/comm/nvshmem_allreduce.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,153 copying flashinfer/comm/nvshmem.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,155 copying flashinfer/comm/dlpack_utils.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,157 copying flashinfer/comm/cuda_ipc.py -> build/lib/flashinfer/comm 2026-04-16T04:56:19,160 creating build/lib/flashinfer/profiler 2026-04-16T04:56:19,161 copying flashinfer/profiler/__init__.py -> build/lib/flashinfer/profiler 2026-04-16T04:56:19,164 creating build/lib/flashinfer/norm 2026-04-16T04:56:19,164 copying flashinfer/norm/__init__.py -> build/lib/flashinfer/norm 2026-04-16T04:56:19,167 copying flashinfer/norm/utils.py -> build/lib/flashinfer/norm 2026-04-16T04:56:19,171 creating build/lib/flashinfer/gdn_kernels 2026-04-16T04:56:19,172 copying flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/lib/flashinfer/gdn_kernels 2026-04-16T04:56:19,176 copying flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-16T04:56:19,179 copying flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-16T04:56:19,182 copying flashinfer/gdn_kernels/__init__.py -> build/lib/flashinfer/gdn_kernels 2026-04-16T04:56:19,184 copying flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/lib/flashinfer/gdn_kernels 2026-04-16T04:56:19,189 creating build/lib/flashinfer/dsv3_ops 2026-04-16T04:56:19,190 copying flashinfer/dsv3_ops/__init__.py -> build/lib/flashinfer/dsv3_ops 2026-04-16T04:56:19,192 creating build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,193 copying flashinfer/logits_processor/fusion_rules.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,196 copying flashinfer/logits_processor/compiler.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,198 copying flashinfer/logits_processor/op.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,200 copying flashinfer/logits_processor/types.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,202 copying flashinfer/logits_processor/__init__.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,203 copying flashinfer/logits_processor/operators.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,206 copying flashinfer/logits_processor/validators.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,208 copying flashinfer/logits_processor/legalization.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,210 copying flashinfer/logits_processor/pipeline.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,212 copying flashinfer/logits_processor/processors.py -> build/lib/flashinfer/logits_processor 2026-04-16T04:56:19,215 creating build/lib/flashinfer/jit 2026-04-16T04:56:19,216 copying flashinfer/jit/spdlog.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,218 copying flashinfer/jit/sampling.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,220 copying flashinfer/jit/comm.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,222 copying flashinfer/jit/quantization.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,224 copying flashinfer/jit/tllm_utils.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,226 copying flashinfer/jit/fused_moe.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,228 copying flashinfer/jit/dsv3_optimizations.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,230 copying flashinfer/jit/page.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,232 copying flashinfer/jit/env.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,234 copying flashinfer/jit/gdn.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,236 copying flashinfer/jit/activation.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,238 copying flashinfer/jit/norm.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,240 copying flashinfer/jit/cascade.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,241 copying flashinfer/jit/__init__.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,244 copying flashinfer/jit/fp8_quantization.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,246 copying flashinfer/jit/moe_utils.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,248 copying flashinfer/jit/fp4_kv_quantization.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,250 copying flashinfer/jit/mla.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,252 copying flashinfer/jit/utils.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,254 copying flashinfer/jit/rope.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,256 copying flashinfer/jit/rmsnorm_silu.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,258 copying flashinfer/jit/fp4_kv_dequantization.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,260 copying flashinfer/jit/xqa.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,262 copying flashinfer/jit/topk.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,264 copying flashinfer/jit/cubin_loader.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,267 copying flashinfer/jit/tinygemm2.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,269 copying flashinfer/jit/core.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,271 copying flashinfer/jit/cpp_ext.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,274 copying flashinfer/jit/fp4_quantization.py -> build/lib/flashinfer/jit 2026-04-16T04:56:19,276 creating build/lib/flashinfer/mla 2026-04-16T04:56:19,277 copying flashinfer/mla/__init__.py -> build/lib/flashinfer/mla 2026-04-16T04:56:19,279 copying flashinfer/mla/_core.py -> build/lib/flashinfer/mla 2026-04-16T04:56:19,283 creating build/lib/flashinfer/triton 2026-04-16T04:56:19,284 copying flashinfer/triton/page.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,286 copying flashinfer/triton/sm_constraint_gemm.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,289 copying flashinfer/triton/activation.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,290 copying flashinfer/triton/norm.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,293 copying flashinfer/triton/cascade.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,294 copying flashinfer/triton/__init__.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,296 copying flashinfer/triton/utils.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,298 copying flashinfer/triton/gemm.py -> build/lib/flashinfer/triton 2026-04-16T04:56:19,301 creating build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,302 copying flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,306 copying flashinfer/cute_dsl/__init__.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,308 copying flashinfer/cute_dsl/utils.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,310 copying flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,313 copying flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,316 copying flashinfer/cute_dsl/fp4_common.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,319 copying flashinfer/cute_dsl/blockscaled_gemm.py -> build/lib/flashinfer/cute_dsl 2026-04-16T04:56:19,322 creating build/lib/flashinfer/data 2026-04-16T04:56:19,323 copying ./build_utils.py -> build/lib/flashinfer/data 2026-04-16T04:56:19,325 copying ./build_backend.py -> build/lib/flashinfer/data 2026-04-16T04:56:19,327 creating build/lib/flashinfer/fused_moe 2026-04-16T04:56:19,329 copying flashinfer/fused_moe/fused_routing_dsv3.py -> build/lib/flashinfer/fused_moe 2026-04-16T04:56:19,331 copying flashinfer/fused_moe/__init__.py -> build/lib/flashinfer/fused_moe 2026-04-16T04:56:19,333 copying flashinfer/fused_moe/utils.py -> build/lib/flashinfer/fused_moe 2026-04-16T04:56:19,335 copying flashinfer/fused_moe/core.py -> build/lib/flashinfer/fused_moe 2026-04-16T04:56:19,340 creating build/lib/flashinfer/testing 2026-04-16T04:56:19,341 copying flashinfer/testing/__init__.py -> build/lib/flashinfer/testing 2026-04-16T04:56:19,343 copying flashinfer/testing/utils.py -> build/lib/flashinfer/testing 2026-04-16T04:56:19,347 creating build/lib/flashinfer/quantization/kernels 2026-04-16T04:56:19,348 copying flashinfer/quantization/kernels/nvfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-16T04:56:19,351 copying flashinfer/quantization/kernels/mxfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-16T04:56:19,354 copying flashinfer/quantization/kernels/mxfp8_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-16T04:56:19,357 copying flashinfer/quantization/kernels/__init__.py -> build/lib/flashinfer/quantization/kernels 2026-04-16T04:56:19,360 creating build/lib/flashinfer/gemm/kernels 2026-04-16T04:56:19,361 copying flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/lib/flashinfer/gemm/kernels 2026-04-16T04:56:19,366 copying flashinfer/gemm/kernels/__init__.py -> build/lib/flashinfer/gemm/kernels 2026-04-16T04:56:19,368 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/lib/flashinfer/gemm/kernels 2026-04-16T04:56:19,372 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/lib/flashinfer/gemm/kernels 2026-04-16T04:56:19,376 creating build/lib/flashinfer/norm/kernels 2026-04-16T04:56:19,378 copying flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-16T04:56:19,381 copying flashinfer/norm/kernels/rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-16T04:56:19,384 copying flashinfer/norm/kernels/__init__.py -> build/lib/flashinfer/norm/kernels 2026-04-16T04:56:19,386 copying flashinfer/norm/kernels/layernorm.py -> build/lib/flashinfer/norm/kernels 2026-04-16T04:56:19,390 creating build/lib/flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:19,391 copying flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:19,393 copying flashinfer/gdn_kernels/blackwell/gdn_prefill.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:19,396 copying flashinfer/gdn_kernels/blackwell/__init__.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:19,398 copying flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:19,403 creating build/lib/flashinfer/jit/attention 2026-04-16T04:56:19,404 copying flashinfer/jit/attention/variants.py -> build/lib/flashinfer/jit/attention 2026-04-16T04:56:19,407 copying flashinfer/jit/attention/modules.py -> build/lib/flashinfer/jit/attention 2026-04-16T04:56:19,410 copying flashinfer/jit/attention/__init__.py -> build/lib/flashinfer/jit/attention 2026-04-16T04:56:19,412 copying flashinfer/jit/attention/utils.py -> build/lib/flashinfer/jit/attention 2026-04-16T04:56:19,414 creating build/lib/flashinfer/jit/mamba 2026-04-16T04:56:19,415 copying flashinfer/jit/mamba/selective_state_update.py -> build/lib/flashinfer/jit/mamba 2026-04-16T04:56:19,418 copying flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/lib/flashinfer/jit/mamba 2026-04-16T04:56:19,420 copying flashinfer/jit/mamba/__init__.py -> build/lib/flashinfer/jit/mamba 2026-04-16T04:56:19,423 creating build/lib/flashinfer/jit/gemm 2026-04-16T04:56:19,424 copying flashinfer/jit/gemm/fp8_blockscale.py -> build/lib/flashinfer/jit/gemm 2026-04-16T04:56:19,426 copying flashinfer/jit/gemm/__init__.py -> build/lib/flashinfer/jit/gemm 2026-04-16T04:56:19,428 copying flashinfer/jit/gemm/deepgemm.py -> build/lib/flashinfer/jit/gemm 2026-04-16T04:56:19,430 copying flashinfer/jit/gemm/core.py -> build/lib/flashinfer/jit/gemm 2026-04-16T04:56:19,434 creating build/lib/flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:19,435 copying flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:19,441 copying flashinfer/jit/attention/fmha_v2/utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:19,444 copying flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:19,448 copying flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:19,451 creating build/lib/flashinfer/jit/gemm/cutlass 2026-04-16T04:56:19,453 copying flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-16T04:56:19,456 copying flashinfer/jit/gemm/cutlass/__init__.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-16T04:56:19,458 copying flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-16T04:56:19,462 creating build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,464 copying flashinfer/triton/kernels/ssd_chunk_state.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,466 copying flashinfer/triton/kernels/sm_constraint_gemm.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,469 copying flashinfer/triton/kernels/activation.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,471 copying flashinfer/triton/kernels/norm.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,474 copying flashinfer/triton/kernels/cascade.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,476 copying flashinfer/triton/kernels/__init__.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,478 copying flashinfer/triton/kernels/quant.py -> build/lib/flashinfer/triton/kernels 2026-04-16T04:56:19,481 creating build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,483 copying flashinfer/cute_dsl/attention/config.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,486 copying flashinfer/cute_dsl/attention/tmem_layout.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,488 copying flashinfer/cute_dsl/attention/pipeline_topology.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,491 copying flashinfer/cute_dsl/attention/collective_builder.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,494 copying flashinfer/cute_dsl/attention/__init__.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,496 copying flashinfer/cute_dsl/attention/mainloop_spec.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,499 copying flashinfer/cute_dsl/attention/mla_warp_schedule.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,501 copying flashinfer/cute_dsl/attention/compat.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,504 copying flashinfer/cute_dsl/attention/mla_decode_fp8.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,507 copying flashinfer/cute_dsl/attention/mla_config.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,509 copying flashinfer/cute_dsl/attention/prefill.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,512 copying flashinfer/cute_dsl/attention/warp_schedule.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,514 copying flashinfer/cute_dsl/attention/mla_decode.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-16T04:56:19,518 creating build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:19,519 copying flashinfer/cute_dsl/attention/scheduler/mla_persistent.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:19,522 copying flashinfer/cute_dsl/attention/scheduler/__init__.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:19,524 copying flashinfer/cute_dsl/attention/scheduler/persistent.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:19,527 creating build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,529 copying flashinfer/cute_dsl/attention/roles/mma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,531 copying flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,534 copying flashinfer/cute_dsl/attention/roles/loader_tma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,537 copying flashinfer/cute_dsl/attention/roles/mla_mma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,540 copying flashinfer/cute_dsl/attention/roles/mla_compute.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,543 copying flashinfer/cute_dsl/attention/roles/mla_pt_loader.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,545 copying flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,548 copying flashinfer/cute_dsl/attention/roles/epilogue.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,550 copying flashinfer/cute_dsl/attention/roles/__init__.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,552 copying flashinfer/cute_dsl/attention/roles/mla_correction.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,555 copying flashinfer/cute_dsl/attention/roles/mla_loader.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,558 copying flashinfer/cute_dsl/attention/roles/softmax.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,561 copying flashinfer/cute_dsl/attention/roles/correction.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,563 copying flashinfer/cute_dsl/attention/roles/softmax_math.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:19,566 creating build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:19,567 copying flashinfer/cute_dsl/attention/fusion/mask.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:19,570 copying flashinfer/cute_dsl/attention/fusion/variant.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:19,572 copying flashinfer/cute_dsl/attention/fusion/__init__.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:19,575 creating build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:19,576 copying flashinfer/cute_dsl/attention/wrappers/batch_prefill.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:19,578 copying flashinfer/cute_dsl/attention/wrappers/__init__.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:19,580 copying flashinfer/cute_dsl/attention/wrappers/batch_mla.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:19,585 creating build/lib/flashinfer/data/cutlass/python 2026-04-16T04:56:19,587 copying 3rdparty/cutlass/python/setup_cutlass.py -> build/lib/flashinfer/data/cutlass/python 2026-04-16T04:56:19,589 copying 3rdparty/cutlass/python/setup_library.py -> build/lib/flashinfer/data/cutlass/python 2026-04-16T04:56:19,590 copying 3rdparty/cutlass/python/setup_pycute.py -> build/lib/flashinfer/data/cutlass/python 2026-04-16T04:56:19,593 creating build/lib/flashinfer/data/cutlass/test/utils 2026-04-16T04:56:19,595 copying 3rdparty/cutlass/test/utils/test_sharding.py -> build/lib/flashinfer/data/cutlass/test/utils 2026-04-16T04:56:19,599 creating build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-16T04:56:19,601 copying 3rdparty/cutlass/test/python/cutlass/installation.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-16T04:56:19,604 creating build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,605 copying 3rdparty/cutlass/test/python/pycute/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,607 copying 3rdparty/cutlass/test/python/pycute/test_coalesce.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,610 copying 3rdparty/cutlass/test/python/pycute/test_complement.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,612 copying 3rdparty/cutlass/test/python/pycute/test_typing.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,614 copying 3rdparty/cutlass/test/python/pycute/test_right_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,616 copying 3rdparty/cutlass/test/python/pycute/test_composition.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,618 copying 3rdparty/cutlass/test/python/pycute/test_int_tuple.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,621 copying 3rdparty/cutlass/test/python/pycute/test_left_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:19,624 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,625 copying 3rdparty/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,627 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,630 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,632 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,634 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,637 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,639 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,642 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,645 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,648 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,650 copying 3rdparty/cutlass/test/python/cutlass/gemm/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,653 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,656 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:19,659 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:19,660 copying 3rdparty/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:19,663 copying 3rdparty/cutlass/test/python/cutlass/interface/evt_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:19,666 copying 3rdparty/cutlass/test/python/cutlass/interface/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:19,668 copying 3rdparty/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:19,672 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:19,673 copying 3rdparty/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:19,675 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:19,678 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:19,681 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:19,684 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,686 copying 3rdparty/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,688 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,691 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,693 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,696 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,699 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:19,702 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-16T04:56:19,703 copying 3rdparty/cutlass/test/python/cutlass/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-16T04:56:19,706 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-16T04:56:19,708 copying 3rdparty/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-16T04:56:19,711 creating build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-16T04:56:19,714 copying 3rdparty/cutlass/test/unit/gemm/device/simt_sm50.py -> build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-16T04:56:19,717 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-16T04:56:19,719 copying 3rdparty/cutlass/test/examples/CuTeDSL/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-16T04:56:19,723 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:19,724 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:19,727 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:19,730 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:19,732 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:19,734 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:19,737 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:19,738 copying 3rdparty/cutlass/python/cutlass_cppgen/shape.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:19,741 copying 3rdparty/cutlass/python/cutlass_cppgen/library_defaults.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:19,744 copying 3rdparty/cutlass/python/cutlass_cppgen/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:19,746 copying 3rdparty/cutlass/python/cutlass_cppgen/swizzle.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:19,750 creating build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,751 copying 3rdparty/cutlass/python/cutlass_library/sm90_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,754 copying 3rdparty/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,757 copying 3rdparty/cutlass/python/cutlass_library/sm100_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,760 copying 3rdparty/cutlass/python/cutlass_library/manifest.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,763 copying 3rdparty/cutlass/python/cutlass_library/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,766 copying 3rdparty/cutlass/python/cutlass_library/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,769 copying 3rdparty/cutlass/python/cutlass_library/sm100_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,772 copying 3rdparty/cutlass/python/cutlass_library/rank_k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,775 copying 3rdparty/cutlass/python/cutlass_library/rank_2k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,778 copying 3rdparty/cutlass/python/cutlass_library/heuristics.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,781 copying 3rdparty/cutlass/python/cutlass_library/generator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,794 copying 3rdparty/cutlass/python/cutlass_library/conv3d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,797 copying 3rdparty/cutlass/python/cutlass_library/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,801 copying 3rdparty/cutlass/python/cutlass_library/conv3x_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,804 copying 3rdparty/cutlass/python/cutlass_library/sm90_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,807 copying 3rdparty/cutlass/python/cutlass_library/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,811 copying 3rdparty/cutlass/python/cutlass_library/trmm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,813 copying 3rdparty/cutlass/python/cutlass_library/heuristics_provider.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,816 copying 3rdparty/cutlass/python/cutlass_library/symm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:19,818 creating build/lib/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:19,819 copying 3rdparty/cutlass/python/pycute/__init__.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:19,821 copying 3rdparty/cutlass/python/pycute/typing.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:19,823 copying 3rdparty/cutlass/python/pycute/int_tuple.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:19,825 copying 3rdparty/cutlass/python/pycute/layout.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:19,828 copying 3rdparty/cutlass/python/pycute/swizzle.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:19,830 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-16T04:56:19,831 copying 3rdparty/cutlass/python/CuTeDSL/prep_editable_install.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-16T04:56:19,834 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:19,835 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/check.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:19,838 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:19,840 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:19,841 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:19,844 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:19,847 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:19,849 copying 3rdparty/cutlass/python/cutlass_cppgen/op/op.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:19,852 copying 3rdparty/cutlass/python/cutlass_cppgen/op/conv.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:19,855 copying 3rdparty/cutlass/python/cutlass_cppgen/op/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:19,857 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:19,860 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:19,864 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:19,865 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:19,868 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:19,870 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:19,873 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,875 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,877 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,880 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,882 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,885 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,887 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,890 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,893 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,896 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,898 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,900 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,904 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,907 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:19,910 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:19,911 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/common.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:19,914 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:19,916 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:19,920 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-16T04:56:19,921 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-16T04:56:19,924 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-16T04:56:19,926 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-16T04:56:19,928 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-16T04:56:19,930 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-16T04:56:19,933 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,934 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,936 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,939 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,941 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,944 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,946 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,948 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,950 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,952 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:19,955 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:19,956 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:19,958 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:19,960 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:19,963 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,964 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,966 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,968 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,970 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,972 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,974 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,976 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,978 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:19,981 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,982 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,984 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,986 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,988 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,991 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,993 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,995 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,997 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:19,999 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:20,001 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:20,003 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:20,005 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:20,006 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:20,009 creating build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-16T04:56:20,010 copying 3rdparty/cutlass/python/docs_src/source/conf.py -> build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-16T04:56:20,013 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:20,014 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/torch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:20,016 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:20,018 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:20,020 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,021 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,025 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,027 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,029 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,031 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,033 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:20,036 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,037 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,040 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,042 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,044 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,047 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,049 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:20,052 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,053 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,056 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,058 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,061 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,064 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,066 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,068 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,071 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,075 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,077 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,080 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,083 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,086 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,088 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,091 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,094 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,097 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:20,101 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,102 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,107 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,110 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,114 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,117 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,121 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,124 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,127 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,130 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,133 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,136 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,142 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:20,146 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:20,147 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:20,151 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:20,154 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:20,156 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:20,161 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,162 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,164 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,167 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,170 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,173 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,176 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,179 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,182 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,186 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,188 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,192 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,195 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:20,199 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-16T04:56:20,200 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-16T04:56:20,202 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-16T04:56:20,206 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,207 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,210 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,212 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,214 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,217 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,219 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,222 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:20,225 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:20,227 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:20,229 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:20,232 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:20,235 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,236 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,239 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,242 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,246 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,248 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,251 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,253 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,255 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:20,258 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:20,259 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:20,262 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:20,264 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:20,266 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:20,269 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:20,271 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:20,273 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:20,275 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:20,279 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:20,281 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:20,284 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:20,286 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:20,289 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:20,291 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:20,294 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:20,296 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:20,299 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:20,302 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:20,304 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:20,306 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:20,308 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:20,311 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:20,314 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,316 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,318 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,321 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,324 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,326 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,328 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:20,331 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:20,332 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:20,335 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:20,338 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:20,341 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:20,343 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:20,347 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:20,348 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:20,350 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:20,353 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:20,356 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:20,358 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:20,361 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,362 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,365 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,367 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,370 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,372 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,374 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,376 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:20,379 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:20,381 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:20,384 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:20,386 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:20,388 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:20,392 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:20,394 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:20,396 copying 3rdparty/cutlass/examples/40_cutlass_py/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:20,399 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:20,401 creating build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-16T04:56:20,403 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-16T04:56:20,405 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-16T04:56:20,408 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:20,411 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:20,413 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:20,416 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:20,419 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:20,422 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:20,423 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:20,426 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:20,429 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:20,431 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:20,435 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,436 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,439 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,443 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,446 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,450 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,453 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,455 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:20,460 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,461 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,466 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,471 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,476 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,481 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,486 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,490 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,493 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,498 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,504 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,509 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,517 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,521 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,524 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,529 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,532 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,536 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,540 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:20,545 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-16T04:56:20,547 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-16T04:56:20,549 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-16T04:56:20,552 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-16T04:56:20,553 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-16T04:56:20,558 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,559 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,563 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,566 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,568 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,571 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,574 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,577 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,580 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,584 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,587 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,590 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,593 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,594 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:20,598 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:20,599 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:20,603 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:20,606 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:20,609 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:20,612 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-16T04:56:20,614 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-16T04:56:20,615 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-16T04:56:20,619 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:20,620 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:20,623 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:20,626 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:20,629 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:20,633 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:20,637 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-16T04:56:20,638 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-16T04:56:20,641 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:20,642 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:20,645 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:20,647 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:20,651 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:20,654 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:20,657 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:20,658 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:20,661 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:20,664 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:20,667 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:20,669 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:20,672 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:20,674 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:20,679 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:20,683 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:20,687 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:20,688 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:20,692 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:20,695 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:20,699 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:20,702 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:20,703 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:20,709 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:20,714 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:20,717 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:20,719 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:20,723 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:20,727 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:20,729 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:20,734 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:20,735 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:20,740 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:20,743 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:20,745 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,747 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,749 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,751 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,752 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,754 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,756 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,758 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,760 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:20,762 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-16T04:56:20,763 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-16T04:56:20,766 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-16T04:56:20,767 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-16T04:56:20,769 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-16T04:56:20,772 creating build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,773 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,776 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,778 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,781 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,782 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,785 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,788 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,790 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,792 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,795 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,797 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,800 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:20,803 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:20,804 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:20,806 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:20,809 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:20,812 creating build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-16T04:56:20,814 copying 3rdparty/cutlass/tools/util/scripts/split_test_cmake.py -> build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-16T04:56:20,844 creating build/lib/flashinfer/data/spdlog/scripts 2026-04-16T04:56:20,845 copying 3rdparty/spdlog/scripts/extract_version.py -> build/lib/flashinfer/data/spdlog/scripts 2026-04-16T04:56:20,884 creating build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,885 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,889 copying flashinfer/fused_moe/cute_dsl/tuner.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,893 copying flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,897 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,900 copying flashinfer/fused_moe/cute_dsl/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,903 copying flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:20,908 creating build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:20,909 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:20,918 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:20,923 copying flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:20,925 copying flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:20,929 copying flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:21,487 copying flashinfer/py.typed -> build/lib/flashinfer 2026-04-16T04:56:21,490 creating build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,491 copying ./csrc/single_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,493 copying ./csrc/trtllm_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,496 copying ./csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,499 copying ./csrc/mxfp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,501 copying ./csrc/trtllm_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,504 copying ./csrc/selective_state_update_dtype_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,506 copying ./csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,508 copying ./csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,510 copying ./csrc/flashinfer_rope_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,512 copying ./csrc/batch_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,514 copying ./csrc/fmha_v2_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,517 copying ./csrc/cudnn_sdpa_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,520 copying ./csrc/fmha_v2_run.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,523 copying ./csrc/page.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,525 copying ./csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,527 copying ./csrc/batch_prefill_ragged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,529 copying ./csrc/fp4_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,532 copying ./csrc/batch_decode_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,534 copying ./csrc/batch_decode_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,536 copying ./csrc/mxfp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,538 copying ./csrc/pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,540 copying ./csrc/flashinfer_page_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,542 copying ./csrc/fp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,545 copying ./csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,547 copying ./csrc/gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,549 copying ./csrc/single_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,551 copying ./csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,554 copying ./csrc/gemm_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,556 copying ./csrc/trtllm_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:21,559 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:21,561 copying ./csrc/nv_internal/cpp/common/logger.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:21,564 copying ./csrc/nv_internal/cpp/common/tllmException.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:21,566 copying ./csrc/nv_internal/cpp/common/memoryUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:21,569 copying ./csrc/nv_internal/cpp/common/stringUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:21,571 copying ./csrc/nv_internal/cpp/common/envUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:21,574 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-16T04:56:21,575 copying ./csrc/nv_internal/cpp/kernels/quantization.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-16T04:56:21,578 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,579 copying ./csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,582 copying ./csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,584 copying ./csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,587 copying ./csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,589 copying ./csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,591 copying ./csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,594 copying ./csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,596 copying ./csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,599 copying ./csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:21,601 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,602 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,605 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,608 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,610 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,614 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,618 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,620 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,623 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,626 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,628 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,631 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:21,633 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,634 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,637 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,639 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,641 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,643 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-16T04:56:21,644 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-16T04:56:21,648 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-16T04:56:21,651 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-16T04:56:21,652 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-16T04:56:21,655 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-16T04:56:21,657 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,659 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,661 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,663 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,666 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,668 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,670 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,673 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,675 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,677 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,679 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,681 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,682 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,684 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,689 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,691 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,694 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,697 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:21,699 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,701 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,703 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,705 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,708 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,710 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,712 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,715 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,717 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,720 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,722 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:21,725 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:21,727 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:21,729 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:21,730 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:21,733 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-16T04:56:21,735 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-16T04:56:21,738 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-16T04:56:21,741 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:21,744 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:21,747 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:21,752 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:21,755 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:21,758 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,759 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,762 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,764 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,766 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,768 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,770 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,773 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,775 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,777 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,779 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,782 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,784 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,787 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,790 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-16T04:56:21,791 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-16T04:56:21,794 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-16T04:56:21,796 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,798 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,800 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,802 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,804 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,806 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,809 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:21,811 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:21,812 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:21,814 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:21,818 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:21,821 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:21,823 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:21,825 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,828 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-16T04:56:21,829 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-16T04:56:21,832 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-16T04:56:21,834 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:21,837 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,839 copying ./csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,841 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,843 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,845 copying ./csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,847 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,850 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,853 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:21,855 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-16T04:56:21,858 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-16T04:56:21,860 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:21,862 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:21,865 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:21,867 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:21,870 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,871 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,874 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,877 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,880 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,882 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,885 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,887 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,890 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,893 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,895 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,898 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,900 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:21,903 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,904 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,907 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,910 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,912 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,915 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,918 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,921 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,923 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,926 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,929 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,931 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,934 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:21,937 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:21,938 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:21,941 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:21,944 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:21,947 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,950 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,954 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,956 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,958 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,961 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,964 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,968 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,970 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,972 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:21,975 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:21,977 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-16T04:56:21,978 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-16T04:56:21,981 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-16T04:56:21,982 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-16T04:56:21,985 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-16T04:56:21,986 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-16T04:56:21,989 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-16T04:56:21,992 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:21,994 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-16T04:56:21,996 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-16T04:56:21,999 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:22,002 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-16T04:56:22,004 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-16T04:56:22,006 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:22,008 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-16T04:56:22,010 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-16T04:56:22,012 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:22,014 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:22,015 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:22,018 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:22,021 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:22,023 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:22,025 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:22,028 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:22,031 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:22,033 creating build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,036 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,039 copying ./csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,042 copying ./csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,044 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,046 copying ./csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,048 copying ./csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,050 copying ./csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,053 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,055 copying ./csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,058 copying ./csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,060 copying ./csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:22,062 copying ./csrc/flashinfer_cascade_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,064 copying ./csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,066 copying ./csrc/flashinfer_mamba_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,068 copying ./csrc/batch_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,071 copying ./csrc/batch_pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,073 copying ./csrc/batch_prefill_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,075 copying ./csrc/batch_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,077 copying ./csrc/batch_attention_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,079 copying ./csrc/batch_decode_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,081 copying ./csrc/trtllm_alltoall_prepare.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,084 copying ./csrc/norm.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,086 copying ./csrc/selective_state_update.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,089 copying ./csrc/quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,091 copying ./csrc/batch_attention_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,094 copying ./csrc/batch_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,096 copying ./csrc/single_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,098 copying ./csrc/flashinfer_rmsnorm_silu_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,100 copying ./csrc/batch_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,103 copying ./csrc/group_gemm_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,105 copying ./csrc/flashinfer_quantization_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,107 copying ./csrc/bmm_fp8.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,110 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,111 copying ./csrc/fmha_v2/fmha/paged_kv_cache.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,114 copying ./csrc/fmha_v2/fmha/softmax.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,119 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,123 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,125 copying ./csrc/fmha_v2/fmha/utils.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,129 copying ./csrc/fmha_v2/fmha/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,132 copying ./csrc/fmha_v2/fmha/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,135 copying ./csrc/fmha_v2/fmha/alibi_params.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,137 copying ./csrc/fmha_v2/fmha/smem_tile_v.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,140 copying ./csrc/fmha_v2/fmha/gemm.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,142 copying ./csrc/fmha_v2/fmha/mask.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,145 copying ./csrc/fmha_v2/fmha/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,149 copying ./csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,152 copying ./csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,155 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:22,156 copying ./csrc/fmha_v2/fmha/warpspec/compute.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:22,159 copying ./csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:22,162 copying ./csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:22,165 copying ./csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:22,167 copying ./csrc/fmha_v2/fmha/warpspec/dma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:22,170 copying ./csrc/fmha_v2/fmha/traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,173 copying ./csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,176 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,177 copying ./csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,180 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,183 copying ./csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,185 copying ./csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,188 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,191 copying ./csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,194 copying ./csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,196 copying ./csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,199 copying ./csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,203 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,206 copying ./csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,209 copying ./csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,212 copying ./csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,217 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,220 copying ./csrc/fmha_v2/fmha/hopper/tma_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,222 copying ./csrc/fmha_v2/fmha/hopper/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,225 copying ./csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,228 copying ./csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:22,230 copying ./csrc/fmha_v2/fmha/numeric_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,232 copying ./csrc/fmha_v2/fmha/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,235 copying ./csrc/fmha_v2/fmha/gmem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:22,238 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,241 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,243 creating build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:22,244 copying ./csrc/fmha_v2/templates/kernel_hopper.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:22,247 copying ./csrc/fmha_v2/templates/fa_kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:22,249 copying ./csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:22,252 copying ./csrc/fmha_v2/templates/kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:22,255 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,257 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,260 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,263 copying ./csrc/fmha_v2/fused_multihead_cross_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,265 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,267 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,270 copying ./csrc/fmha_v2/fused_multihead_attention_utils.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,274 copying ./csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,276 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,279 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,281 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,284 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,286 copying ./csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,288 copying ./csrc/fmha_v2/fused_multihead_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,291 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,293 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:22,296 copying ./csrc/fp4_kv_quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,298 copying ./csrc/flashinfer_topk_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,300 copying ./csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,302 copying ./csrc/single_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,304 copying ./csrc/trtllm_batched_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,307 copying ./csrc/group_gemm_nvfp4_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,309 copying ./csrc/blackwell_fmha_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,311 copying ./csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,313 copying ./csrc/batch_pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,315 copying ./csrc/batch_mla_sm90_run.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,317 copying ./csrc/batch_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,320 copying ./csrc/single_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,322 copying ./csrc/logging.cc -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,324 copying ./csrc/trtllm_low_latency_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,326 copying ./csrc/flashinfer_xqa_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,328 copying ./csrc/fp4_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,330 copying ./csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,332 copying ./csrc/cutlass_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,334 copying ./csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,336 copying ./csrc/batch_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,338 copying ./csrc/single_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,340 copying ./csrc/group_gemm_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,342 copying ./csrc/batch_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,344 copying ./csrc/trtllm_moe_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,346 copying ./csrc/mxfp8_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,348 copying ./csrc/seq_chunk_cumsum.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,350 copying ./csrc/tvm_ffi_utils.h -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,353 copying ./csrc/renorm.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,355 copying ./csrc/trtllm_fused_moe_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,358 copying ./csrc/group_gemm_fp8_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,360 copying ./csrc/batch_mla_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,362 copying ./csrc/sampling_utils.h -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,364 copying ./csrc/batch_decode_mla_cute_sm80.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,366 copying ./csrc/runtime_utils.h -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,368 copying ./csrc/seq_chunk_cumsum_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,370 copying ./csrc/single_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,372 copying ./csrc/batch_pod.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,375 copying ./csrc/batch_mla_sm90_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,377 copying ./csrc/cascade.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,379 copying ./csrc/cudnn_sdpa_utils.h -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,382 copying ./csrc/fmhaReduction.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,384 copying ./csrc/moe_utils_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,387 copying ./csrc/fp4_kv_dequantization.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,391 copying ./csrc/group_gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,393 copying ./csrc/batch_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,396 copying ./csrc/batch_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,399 copying ./csrc/fp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,402 copying ./csrc/batch_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,405 copying ./csrc/selective_state_update_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,408 copying ./csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,410 copying ./csrc/bf16_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,413 copying ./csrc/trtllm_moe_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,416 creating build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,418 copying ./csrc/xqa/tma.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,421 copying ./csrc/xqa/barriers.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,425 copying ./csrc/xqa/mha.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,428 copying ./csrc/xqa/utils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,431 copying ./csrc/xqa/mla_sm120.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,436 copying ./csrc/xqa/tensorMap.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,439 copying ./csrc/xqa/specDec.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,441 copying ./csrc/xqa/mha.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,447 copying ./csrc/xqa/platform.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,450 copying ./csrc/xqa/tensorMap.cpp -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,453 copying ./csrc/xqa/gmma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,456 copying ./csrc/xqa/gmma_impl.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,467 copying ./csrc/xqa/mhaUtils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,471 copying ./csrc/xqa/hostUtils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,474 copying ./csrc/xqa/mha_components.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,477 copying ./csrc/xqa/ldgsts.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,479 copying ./csrc/xqa/mha_sm90.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,486 copying ./csrc/xqa/mma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,488 copying ./csrc/xqa/utils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,491 copying ./csrc/xqa/mha_stdheaders.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,494 copying ./csrc/xqa/mla_sm120.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,496 copying ./csrc/xqa/defines.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,498 copying ./csrc/xqa/cuda_hint.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,500 copying ./csrc/xqa/xqa_wrapper.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-16T04:56:22,503 copying ./csrc/bf16_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,505 copying ./csrc/fp4_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,508 copying ./csrc/vllm_custom_all_reduce.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,510 copying ./csrc/batch_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,513 copying ./csrc/single_prefill_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,515 copying ./csrc/group_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,517 copying ./csrc/group_gemm_fp8_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,520 copying ./csrc/single_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,522 copying ./csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,524 copying ./csrc/trtllm_fmha_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,528 copying ./csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,531 copying ./csrc/flashinfer_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,533 copying ./csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,535 copying ./csrc/batch_pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,537 copying ./csrc/batch_attention.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,540 copying ./csrc/batch_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,542 copying ./csrc/trtllm_fused_moe_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,546 copying ./csrc/group_gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,548 copying ./csrc/flashinfer_gemm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,550 copying ./csrc/sampling.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,553 copying ./csrc/single_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,555 copying ./csrc/gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,557 copying ./csrc/batch_attention_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,559 copying ./csrc/trtllm_fmha_v2_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,562 copying ./csrc/selective_state_update_kernel_inst.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,564 copying ./csrc/flashinfer_sampling_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,566 copying ./csrc/rope.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,569 copying ./csrc/pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,571 copying ./csrc/prefill_kernel_delta_rule_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,573 copying ./csrc/rmsnorm_silu.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,575 creating build/lib/flashinfer/data/csrc/fused_moe 2026-04-16T04:56:22,576 copying ./csrc/fused_moe/noAuxTcKernels.cu -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-16T04:56:22,579 copying ./csrc/fused_moe/moeTopKFuncs.cuh -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-16T04:56:22,581 creating build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:22,582 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:22,593 copying ./csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:22,598 copying ./csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:22,605 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:22,607 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:22,608 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:22,612 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:22,616 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:22,623 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:22,628 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:22,635 copying ./csrc/fp4_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,640 copying ./csrc/trtllm_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,643 copying ./csrc/fp4_gemm_cutlass_sm103.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,645 copying ./csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,647 copying ./csrc/single_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,649 copying ./csrc/concat_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,651 copying ./csrc/fmha_cutlass_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,653 copying ./csrc/flashinfer_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,655 copying ./csrc/flashinfer_norm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,657 copying ./csrc/single_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,659 copying ./csrc/fmha_cutlass_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,662 copying ./csrc/mxfp8_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,664 copying ./csrc/tgv_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,666 copying ./csrc/batch_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,669 copying ./csrc/gemm_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,671 copying ./csrc/pod.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,674 copying ./csrc/pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,676 copying ./csrc/tgv_gemm.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,678 copying ./csrc/group_gemm_mxfp4_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,680 copying ./csrc/fp4_gemm_cutlass_sm103.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,682 copying ./csrc/gdn_prefill_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,684 copying ./csrc/tinygemm2.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,687 copying ./csrc/single_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,689 copying ./csrc/trtllm_mnnvl_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,691 copying ./csrc/topk.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,694 copying ./csrc/batch_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,696 copying ./csrc/dsv3_router_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,698 copying ./csrc/batch_decode_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-16T04:56:22,700 creating build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,702 copying ./include/flashinfer/attention/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,704 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-16T04:56:22,706 copying ./include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-16T04:56:22,708 copying ./include/flashinfer/attention/blackwell/plan.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-16T04:56:22,710 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,712 copying ./include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,714 copying ./include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,716 copying ./include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,719 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,721 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,725 copying ./include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,727 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,730 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:22,733 copying ./include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-16T04:56:22,735 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-16T04:56:22,736 copying ./include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-16T04:56:22,739 copying ./include/flashinfer/attention/blackwell/device/fmha.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-16T04:56:22,741 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,742 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,745 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,747 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,749 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,752 copying ./include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,754 copying ./include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,757 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,760 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:22,762 copying ./include/flashinfer/attention/persistent.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,765 copying ./include/flashinfer/attention/mask.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,767 copying ./include/flashinfer/attention/default_prefill_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,770 copying ./include/flashinfer/attention/heap.h -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,772 copying ./include/flashinfer/attention/pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,774 copying ./include/flashinfer/attention/cutlass_mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,777 copying ./include/flashinfer/attention/default_decode_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,779 copying ./include/flashinfer/attention/decode.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,782 copying ./include/flashinfer/attention/persistent_template.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,784 copying ./include/flashinfer/attention/batch_pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,787 copying ./include/flashinfer/attention/prefill.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,792 copying ./include/flashinfer/attention/cascade.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,794 copying ./include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,797 copying ./include/flashinfer/attention/scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,801 copying ./include/flashinfer/attention/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,804 copying ./include/flashinfer/attention/mla_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,805 copying ./include/flashinfer/attention/mla_hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,809 copying ./include/flashinfer/attention/state.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,810 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,811 copying ./include/flashinfer/attention/hopper/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,814 copying ./include/flashinfer/attention/hopper/named_barrier.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,816 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,816 copying ./include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,819 copying ./include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,822 copying ./include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,824 copying ./include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,826 copying ./include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,829 copying ./include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:22,831 copying ./include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,834 copying ./include/flashinfer/attention/hopper/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,836 copying ./include/flashinfer/attention/hopper/default_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,838 copying ./include/flashinfer/attention/hopper/attention_updater.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,841 copying ./include/flashinfer/attention/hopper/mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,843 copying ./include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,846 copying ./include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,849 copying ./include/flashinfer/attention/hopper/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,851 copying ./include/flashinfer/attention/hopper/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,853 copying ./include/flashinfer/attention/hopper/utils.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,856 copying ./include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:22,858 copying ./include/flashinfer/attention/hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,861 copying ./include/flashinfer/attention/mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:22,864 creating build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,865 copying ./include/flashinfer/mamba/common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,867 copying ./include/flashinfer/mamba/conversion.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,870 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,872 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,875 copying ./include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,878 copying ./include/flashinfer/mamba/selective_state_update.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,880 copying ./include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,882 copying ./include/flashinfer/mamba/create_tensor_map.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,884 copying ./include/flashinfer/mamba/ssu_mtp_common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,886 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,889 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,892 copying ./include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:22,895 copying ./include/flashinfer/fp4_layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:22,897 creating build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,898 copying ./include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,901 copying ./include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,903 copying ./include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,906 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,908 copying ./include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,910 copying ./include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,912 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,915 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,917 copying ./include/flashinfer/gemm/cutlass_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,920 copying ./include/flashinfer/gemm/group_gemv.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,922 copying ./include/flashinfer/gemm/tgv_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,926 copying ./include/flashinfer/gemm/group_gemm_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,928 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,930 copying ./include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,933 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,936 copying ./include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,938 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,940 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,942 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,945 copying ./include/flashinfer/gemm/tgv_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,947 copying ./include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,949 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,952 copying ./include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,955 copying ./include/flashinfer/gemm/bmm_fp8.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,957 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,959 copying ./include/flashinfer/gemm/group_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,962 copying ./include/flashinfer/gemm/group_gemm_lora.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,964 copying ./include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,966 copying ./include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,968 copying ./include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,970 copying ./include/flashinfer/gemm/tgv_gemm_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,972 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,974 copying ./include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,977 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,979 copying ./include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:22,982 copying ./include/flashinfer/fp16.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:22,984 copying ./include/flashinfer/page.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:22,987 copying ./include/flashinfer/allocator.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:22,989 creating build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:22,990 copying ./include/flashinfer/flat/type_traits.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:22,992 copying ./include/flashinfer/flat/debug.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:22,994 copying ./include/flashinfer/flat/cute_ext.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:22,996 copying ./include/flashinfer/flat/unused.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:22,998 creating build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-16T04:56:22,999 copying ./include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-16T04:56:23,002 copying ./include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-16T04:56:23,004 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:23,006 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:23,008 copying ./include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:23,011 copying ./include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:23,013 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:23,015 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-16T04:56:23,016 copying ./include/flashinfer/flat/hopper/device/device_universal.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-16T04:56:23,019 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:23,020 copying ./include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:23,023 copying ./include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:23,025 copying ./include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:23,028 copying ./include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:23,030 copying ./include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:23,033 copying ./include/flashinfer/flat/math_order_barrier.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:23,035 creating build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-16T04:56:23,036 copying ./include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-16T04:56:23,038 copying ./include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-16T04:56:23,041 copying ./include/flashinfer/flat/math.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:23,043 copying ./include/flashinfer/flat/common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:23,045 copying ./include/flashinfer/layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,047 copying ./include/flashinfer/topk.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,052 copying ./include/flashinfer/permuted_smem.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,054 copying ./include/flashinfer/pos_enc.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,058 copying ./include/flashinfer/math.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,061 copying ./include/flashinfer/profiler.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,063 creating build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,065 copying ./include/flashinfer/comm/trtllm_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,069 copying ./include/flashinfer/comm/trtllm_alltoall.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,073 copying ./include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,076 copying ./include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,079 copying ./include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,082 copying ./include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,086 copying ./include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:23,090 copying ./include/flashinfer/cubin_loader.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,093 creating build/lib/flashinfer/data/include/flashinfer/norm 2026-04-16T04:56:23,094 copying ./include/flashinfer/norm/ln_fwd_silu_kernel.cuh -> build/lib/flashinfer/data/include/flashinfer/norm 2026-04-16T04:56:23,097 copying ./include/flashinfer/norm/ln_silu_headers.cuh -> build/lib/flashinfer/data/include/flashinfer/norm 2026-04-16T04:56:23,102 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,104 copying ./include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,107 copying ./include/flashinfer/trtllm/fmha/lse.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,109 copying ./include/flashinfer/trtllm/fmha/kernelUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,112 copying ./include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,115 copying ./include/flashinfer/trtllm/fmha/decoder_params.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,117 copying ./include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,120 copying ./include/flashinfer/trtllm/fmha/kernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,124 copying ./include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,126 copying ./include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:23,130 copying ./include/flashinfer/trtllm/common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm 2026-04-16T04:56:23,133 creating build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,134 copying ./include/flashinfer/trtllm/common/cudaUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,137 copying ./include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,140 copying ./include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,143 copying ./include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,145 copying ./include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,148 copying ./include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:23,151 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-16T04:56:23,153 copying ./include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-16T04:56:23,155 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,157 copying ./include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,161 copying ./include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,164 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,168 copying ./include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,171 copying ./include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,174 copying ./include/flashinfer/trtllm/fused_moe/runner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,177 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,180 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,183 copying ./include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:23,186 copying ./include/flashinfer/fastdiv.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,188 copying ./include/flashinfer/exception.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,191 copying ./include/flashinfer/attention_impl.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,193 copying ./include/flashinfer/concat_mla.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,197 copying ./include/flashinfer/cutlass_utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,200 copying ./include/flashinfer/frag_layout_swizzle.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,203 copying ./include/flashinfer/activation.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,206 copying ./include/flashinfer/mma.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,210 copying ./include/flashinfer/cp_async.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,214 copying ./include/flashinfer/air_top_p.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,218 copying ./include/flashinfer/norm.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,222 copying ./include/flashinfer/logging.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,225 copying ./include/flashinfer/utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,229 copying ./include/flashinfer/arch_condition.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,232 copying ./include/flashinfer/sampling.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,239 copying ./include/flashinfer/quantization.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,242 copying ./include/flashinfer/vec_dtypes.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-16T04:56:23,248 creating build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,251 copying 3rdparty/cutlass/include/cute/util/print_tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,255 copying 3rdparty/cutlass/include/cute/util/type_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,259 copying 3rdparty/cutlass/include/cute/util/print_svg.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,262 copying 3rdparty/cutlass/include/cute/util/debug.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,266 copying 3rdparty/cutlass/include/cute/util/print_latex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,269 copying 3rdparty/cutlass/include/cute/util/print.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:23,272 copying 3rdparty/cutlass/include/cute/swizzle_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,276 copying 3rdparty/cutlass/include/cute/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,278 copying 3rdparty/cutlass/include/cute/pointer_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,281 copying 3rdparty/cutlass/include/cute/pointer_base.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,284 copying 3rdparty/cutlass/include/cute/pointer_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,287 copying 3rdparty/cutlass/include/cute/swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,290 copying 3rdparty/cutlass/include/cute/pointer_flagged.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,292 copying 3rdparty/cutlass/include/cute/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,296 copying 3rdparty/cutlass/include/cute/tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,298 copying 3rdparty/cutlass/include/cute/underscore.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,300 creating build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,301 copying 3rdparty/cutlass/include/cute/container/tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,304 copying 3rdparty/cutlass/include/cute/container/bit_field.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,306 copying 3rdparty/cutlass/include/cute/container/alignment.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,308 copying 3rdparty/cutlass/include/cute/container/array_aligned.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,310 copying 3rdparty/cutlass/include/cute/container/type_list.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,313 copying 3rdparty/cutlass/include/cute/container/array.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,315 copying 3rdparty/cutlass/include/cute/container/array_subbyte.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,317 copying 3rdparty/cutlass/include/cute/container/cuda_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:23,320 creating build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,321 copying 3rdparty/cutlass/include/cute/atom/partitioner.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,323 copying 3rdparty/cutlass/include/cute/atom/mma_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,326 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,330 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,333 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,335 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,338 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,341 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,347 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,349 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,360 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,366 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,368 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,370 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,373 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,375 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,382 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,398 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,401 copying 3rdparty/cutlass/include/cute/atom/mma_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,404 copying 3rdparty/cutlass/include/cute/atom/copy_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,407 copying 3rdparty/cutlass/include/cute/atom/copy_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,410 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,412 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,415 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,417 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,420 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,429 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,432 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:23,435 creating build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,436 copying 3rdparty/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,438 copying 3rdparty/cutlass/include/cute/arch/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,441 copying 3rdparty/cutlass/include/cute/arch/mma_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,444 copying 3rdparty/cutlass/include/cute/arch/copy_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,454 copying 3rdparty/cutlass/include/cute/arch/copy_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,456 copying 3rdparty/cutlass/include/cute/arch/mma_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,460 copying 3rdparty/cutlass/include/cute/arch/mma_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,462 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,466 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,483 copying 3rdparty/cutlass/include/cute/arch/mma_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,487 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,533 copying 3rdparty/cutlass/include/cute/arch/copy_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,536 copying 3rdparty/cutlass/include/cute/arch/cluster_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,539 copying 3rdparty/cutlass/include/cute/arch/mma_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,541 copying 3rdparty/cutlass/include/cute/arch/mma_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,550 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,553 copying 3rdparty/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,558 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,560 copying 3rdparty/cutlass/include/cute/arch/mma_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,562 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,581 copying 3rdparty/cutlass/include/cute/arch/cluster_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,583 copying 3rdparty/cutlass/include/cute/arch/simd_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,585 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,588 copying 3rdparty/cutlass/include/cute/arch/copy_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,591 copying 3rdparty/cutlass/include/cute/arch/copy_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,594 copying 3rdparty/cutlass/include/cute/arch/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,596 copying 3rdparty/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,599 copying 3rdparty/cutlass/include/cute/arch/util.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,602 copying 3rdparty/cutlass/include/cute/arch/mma_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,604 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,647 copying 3rdparty/cutlass/include/cute/arch/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,649 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:23,652 copying 3rdparty/cutlass/include/cute/tensor_impl.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,655 copying 3rdparty/cutlass/include/cute/int_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,658 copying 3rdparty/cutlass/include/cute/stride.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,661 copying 3rdparty/cutlass/include/cute/tensor_zip.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,664 creating build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,665 copying 3rdparty/cutlass/include/cute/algorithm/gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,668 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,671 copying 3rdparty/cutlass/include/cute/algorithm/clear.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,673 copying 3rdparty/cutlass/include/cute/algorithm/prefetch.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,676 copying 3rdparty/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,679 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,681 copying 3rdparty/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,685 copying 3rdparty/cutlass/include/cute/algorithm/fill.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,687 copying 3rdparty/cutlass/include/cute/algorithm/prefer.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,690 copying 3rdparty/cutlass/include/cute/algorithm/axpby.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,692 copying 3rdparty/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,694 copying 3rdparty/cutlass/include/cute/algorithm/functional.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,697 copying 3rdparty/cutlass/include/cute/algorithm/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:23,700 copying 3rdparty/cutlass/include/cute/pointer.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,703 creating build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,704 copying 3rdparty/cutlass/include/cute/numeric/complex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,707 copying 3rdparty/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,710 copying 3rdparty/cutlass/include/cute/numeric/integral_constant.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,713 copying 3rdparty/cutlass/include/cute/numeric/integer_sequence.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,716 copying 3rdparty/cutlass/include/cute/numeric/real.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,719 copying 3rdparty/cutlass/include/cute/numeric/numeric_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,722 copying 3rdparty/cutlass/include/cute/numeric/integral_ratio.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,724 copying 3rdparty/cutlass/include/cute/numeric/int.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,727 copying 3rdparty/cutlass/include/cute/numeric/math.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:23,730 copying 3rdparty/cutlass/include/cute/layout_composed.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:23,733 creating build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,734 copying 3rdparty/cutlass/include/cutlass/predicate_vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,738 copying 3rdparty/cutlass/include/cutlass/fast_math.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,742 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-16T04:56:23,744 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-16T04:56:23,748 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-16T04:56:23,752 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:23,754 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:23,756 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:23,759 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:23,761 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:23,763 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:23,765 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:23,768 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:23,771 copying 3rdparty/cutlass/include/cutlass/tensor_ref.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,774 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,777 copying 3rdparty/cutlass/include/cutlass/bfloat16.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,780 copying 3rdparty/cutlass/include/cutlass/gemm_coord.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,782 copying 3rdparty/cutlass/include/cutlass/blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:23,785 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,787 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,789 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,792 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,794 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,797 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,802 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,804 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,808 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,811 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,814 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,817 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,822 copying 3rdparty/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,824 copying 3rdparty/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,827 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,830 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,833 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,836 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,838 copying 3rdparty/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,841 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,844 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,847 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,850 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,853 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,856 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,859 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,862 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,868 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,871 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,874 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,877 copying 3rdparty/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,879 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,882 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,885 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,888 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,891 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:23,894 copying 3rdparty/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:23,897 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,898 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,900 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,903 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,905 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,908 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,911 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,914 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,916 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,919 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,922 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,925 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,928 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,932 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,935 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,938 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,941 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,944 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,947 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,950 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,953 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,956 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,959 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,962 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,966 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,969 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,972 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,975 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,978 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,981 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,983 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,986 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,989 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,992 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:23,995 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,000 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,003 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,006 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,009 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,012 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,016 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,019 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,023 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,026 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,030 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:24,032 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,034 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,038 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,041 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,045 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,048 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,052 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,055 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,059 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,063 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,067 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,071 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,075 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,078 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,082 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,086 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,089 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,093 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,097 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,101 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,104 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,107 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,109 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,113 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,115 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,119 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,122 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,125 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,128 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,131 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,134 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,137 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,140 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,143 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,146 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,150 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,153 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,156 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,160 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,163 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,167 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,170 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,173 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,176 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,179 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,182 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,186 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,188 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,191 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,195 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,197 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,200 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,203 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,207 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,209 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,212 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,214 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,217 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,220 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,223 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,226 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,229 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,232 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,234 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,237 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,239 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,242 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,245 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,248 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,250 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,253 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,255 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,258 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,261 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,263 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,266 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,269 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,272 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,274 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,277 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,280 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,282 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,285 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,287 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,290 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,292 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,295 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,297 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,300 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,303 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,306 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,309 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,311 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,314 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,316 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,319 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,322 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,324 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,326 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,329 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,332 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,334 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,337 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,339 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,341 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,343 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,345 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,348 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,351 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,354 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,357 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,362 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,364 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,367 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,370 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,373 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:24,376 copying 3rdparty/cutlass/include/cutlass/gemm/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:24,379 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,380 copying 3rdparty/cutlass/include/cutlass/gemm/device/trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,383 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,386 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,388 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,391 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,393 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,396 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,399 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,401 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,404 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,407 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,410 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,412 copying 3rdparty/cutlass/include/cutlass/gemm/device/symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,415 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,418 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,421 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,424 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,426 copying 3rdparty/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,429 copying 3rdparty/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,432 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,435 copying 3rdparty/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,438 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,441 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,444 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,447 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,449 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,452 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,454 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,457 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,459 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:24,462 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,462 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,466 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,468 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,471 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,474 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,477 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,478 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,480 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,483 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,485 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,488 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,491 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,493 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,496 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,498 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,501 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,504 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,506 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,509 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,512 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,515 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,517 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,519 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,522 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,524 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,527 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,529 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,532 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,535 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,537 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,540 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,542 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,545 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,547 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,550 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:24,552 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,556 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,559 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,561 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,564 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,567 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,570 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,573 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,576 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,580 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,583 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,586 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,589 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,592 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,594 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,597 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,600 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,603 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,607 copying 3rdparty/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,609 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,612 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,616 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,619 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,622 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,624 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,628 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,630 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,633 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,636 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,639 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,643 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,645 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,648 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,651 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,654 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,657 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,661 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,663 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,666 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,669 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,673 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,676 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,680 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,683 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,685 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,689 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,692 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,695 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:24,699 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:24,700 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:24,702 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:24,705 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:24,708 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:24,711 copying 3rdparty/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:24,716 copying 3rdparty/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:24,718 copying 3rdparty/cutlass/include/cutlass/device_kernel.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:24,721 copying 3rdparty/cutlass/include/cutlass/numeric_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:24,723 copying 3rdparty/cutlass/include/cutlass/quaternion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:24,726 copying 3rdparty/cutlass/include/cutlass/relatively_equal.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:24,729 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,731 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,734 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,738 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,741 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,744 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,746 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,749 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,752 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,755 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,757 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,760 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,763 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,766 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,769 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,771 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:24,774 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,775 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,779 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,781 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,785 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,787 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,790 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,793 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,796 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,799 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,802 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,805 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,809 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,813 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,817 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,821 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,824 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,828 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,832 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,836 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,839 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,842 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,846 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,849 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,852 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,856 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,860 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,863 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,867 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,871 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,874 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,878 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,881 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,886 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,889 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,893 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,897 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,901 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,904 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:24,906 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:24,908 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:24,912 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:24,914 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:24,917 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:24,921 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,923 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,926 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,929 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,932 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,935 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,938 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,941 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,944 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,947 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,950 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,953 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:24,956 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,958 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,961 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,964 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,965 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,968 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,971 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,975 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,978 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,981 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:24,984 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,987 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,990 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,993 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,996 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:24,999 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,001 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,004 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,007 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,010 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,012 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,015 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,018 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,022 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,025 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,027 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:25,031 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,032 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,035 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,038 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,040 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,042 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,045 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,047 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,050 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,052 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,054 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,056 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,059 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,062 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,064 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,067 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,069 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,072 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,075 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,078 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,080 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,083 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,086 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,088 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,091 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,093 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/activation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:25,097 copying 3rdparty/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-16T04:56:25,099 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,100 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,104 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,107 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,110 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,112 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,115 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,120 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,124 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,127 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,131 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,135 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,139 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,142 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:25,146 copying 3rdparty/cutlass/include/cutlass/array_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,149 copying 3rdparty/cutlass/include/cutlass/exmy_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,152 copying 3rdparty/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,155 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,156 copying 3rdparty/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,159 copying 3rdparty/cutlass/include/cutlass/detail/helper_macros.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,162 copying 3rdparty/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,164 copying 3rdparty/cutlass/include/cutlass/detail/dependent_false.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,167 copying 3rdparty/cutlass/include/cutlass/detail/collective.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,169 copying 3rdparty/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,171 copying 3rdparty/cutlass/include/cutlass/detail/cluster.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,174 copying 3rdparty/cutlass/include/cutlass/detail/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,176 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:25,177 copying 3rdparty/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:25,180 copying 3rdparty/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:25,184 copying 3rdparty/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:25,186 copying 3rdparty/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,188 copying 3rdparty/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,191 copying 3rdparty/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,193 copying 3rdparty/cutlass/include/cutlass/detail/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:25,195 copying 3rdparty/cutlass/include/cutlass/floating_point_nvrtc.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,198 copying 3rdparty/cutlass/include/cutlass/array_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,200 creating build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:25,201 copying 3rdparty/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:25,205 copying 3rdparty/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:25,209 copying 3rdparty/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:25,212 copying 3rdparty/cutlass/include/cutlass/numeric_conversion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,220 copying 3rdparty/cutlass/include/cutlass/real.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,223 copying 3rdparty/cutlass/include/cutlass/float8.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,226 copying 3rdparty/cutlass/include/cutlass/coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,229 copying 3rdparty/cutlass/include/cutlass/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,237 copying 3rdparty/cutlass/include/cutlass/uint128.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,240 copying 3rdparty/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,242 copying 3rdparty/cutlass/include/cutlass/functional.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,245 copying 3rdparty/cutlass/include/cutlass/block_striped.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,248 copying 3rdparty/cutlass/include/cutlass/trace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,250 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-16T04:56:25,251 copying 3rdparty/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-16T04:56:25,254 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,255 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,258 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,261 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,263 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,266 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,269 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,271 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,274 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,277 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,280 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,283 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,286 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,290 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,293 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,296 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,299 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,302 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,305 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,308 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,311 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,314 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,317 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,320 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,324 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,327 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:25,329 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:25,330 copying 3rdparty/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:25,333 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:25,336 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:25,339 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-16T04:56:25,340 copying 3rdparty/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-16T04:56:25,343 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-16T04:56:25,344 copying 3rdparty/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-16T04:56:25,347 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-16T04:56:25,348 copying 3rdparty/cutlass/include/cutlass/transform/thread/unary_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-16T04:56:25,351 copying 3rdparty/cutlass/include/cutlass/transform/thread/transpose.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-16T04:56:25,354 copying 3rdparty/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform 2026-04-16T04:56:25,357 copying 3rdparty/cutlass/include/cutlass/blas3_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,359 copying 3rdparty/cutlass/include/cutlass/matrix_shape.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,362 copying 3rdparty/cutlass/include/cutlass/kernel_launch.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,364 creating build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-16T04:56:25,365 copying 3rdparty/cutlass/include/cutlass/thread/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-16T04:56:25,368 copying 3rdparty/cutlass/include/cutlass/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,370 copying 3rdparty/cutlass/include/cutlass/aligned_buffer.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,373 copying 3rdparty/cutlass/include/cutlass/half.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,375 copying 3rdparty/cutlass/include/cutlass/cluster_launch.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,378 creating build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,379 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm100.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,381 copying 3rdparty/cutlass/include/cutlass/arch/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,384 copying 3rdparty/cutlass/include/cutlass/arch/cache_operation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,386 copying 3rdparty/cutlass/include/cutlass/arch/memory.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,389 copying 3rdparty/cutlass/include/cutlass/arch/config.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,392 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,395 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,398 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,400 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,403 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,406 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,409 copying 3rdparty/cutlass/include/cutlass/arch/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,412 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,415 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,418 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,420 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm72.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,423 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,425 copying 3rdparty/cutlass/include/cutlass/arch/reg_reconfig.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,428 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,430 copying 3rdparty/cutlass/include/cutlass/arch/arch.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,432 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm90.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,435 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,438 copying 3rdparty/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,440 copying 3rdparty/cutlass/include/cutlass/arch/wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,442 copying 3rdparty/cutlass/include/cutlass/arch/simd.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,445 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,448 copying 3rdparty/cutlass/include/cutlass/arch/synclog.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,451 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,454 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:25,456 copying 3rdparty/cutlass/include/cutlass/uint256.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,458 copying 3rdparty/cutlass/include/cutlass/integer_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,461 copying 3rdparty/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,464 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:25,465 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:25,469 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:25,471 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:25,474 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:25,476 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:25,477 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:25,480 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:25,483 copying 3rdparty/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:25,486 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:25,488 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-16T04:56:25,490 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-16T04:56:25,492 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-16T04:56:25,495 copying 3rdparty/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-16T04:56:25,498 copying 3rdparty/cutlass/include/cutlass/subbyte_reference.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,501 copying 3rdparty/cutlass/include/cutlass/core_io.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,504 copying 3rdparty/cutlass/include/cutlass/float_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,507 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,509 copying 3rdparty/cutlass/include/cutlass/tensor_view.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,511 copying 3rdparty/cutlass/include/cutlass/semaphore.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,514 copying 3rdparty/cutlass/include/cutlass/constants.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,517 copying 3rdparty/cutlass/include/cutlass/wmma_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,519 creating build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,520 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,524 copying 3rdparty/cutlass/include/cutlass/layout/layout.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,526 copying 3rdparty/cutlass/include/cutlass/layout/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,529 copying 3rdparty/cutlass/include/cutlass/layout/permute.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,532 copying 3rdparty/cutlass/include/cutlass/layout/vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,534 copying 3rdparty/cutlass/include/cutlass/layout/pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,536 copying 3rdparty/cutlass/include/cutlass/layout/tensor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,539 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,542 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:25,545 copying 3rdparty/cutlass/include/cutlass/numeric_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,548 copying 3rdparty/cutlass/include/cutlass/workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,550 copying 3rdparty/cutlass/include/cutlass/matrix_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,553 copying 3rdparty/cutlass/include/cutlass/array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,556 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:25,558 copying 3rdparty/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:25,561 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:25,564 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:25,567 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,568 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,571 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,574 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,577 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,580 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,583 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,585 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,588 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,590 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,593 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,595 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,598 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,601 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,603 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,606 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,609 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,612 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,615 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,617 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,620 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,623 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,626 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,629 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,632 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,635 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,638 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,640 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,643 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,645 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,648 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,651 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,654 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,656 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,659 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,662 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,665 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,668 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,670 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,674 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,677 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,679 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,682 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,685 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,689 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,691 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,694 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:25,697 copying 3rdparty/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:25,700 copying 3rdparty/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:25,703 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,704 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,707 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,710 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,713 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,716 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,718 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,721 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,724 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,727 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,730 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,733 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,735 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,739 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,741 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,743 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,746 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,750 copying 3rdparty/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,753 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,755 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,759 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,763 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,765 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,768 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,770 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,773 copying 3rdparty/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,775 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,778 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,781 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,783 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:25,786 copying 3rdparty/cutlass/include/cutlass/conv/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:25,788 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:25,789 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:25,792 copying 3rdparty/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:25,795 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:25,797 copying 3rdparty/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:25,800 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:25,801 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:25,805 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:25,807 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:25,810 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:25,813 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:25,816 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:25,819 copying 3rdparty/cutlass/include/cutlass/conv/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:25,822 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:25,825 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:25,827 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:25,829 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-16T04:56:25,830 copying 3rdparty/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-16T04:56:25,833 copying 3rdparty/cutlass/include/cutlass/conv/convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:25,836 copying 3rdparty/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:25,838 copying 3rdparty/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:25,841 copying 3rdparty/cutlass/include/cutlass/tensor_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,844 copying 3rdparty/cutlass/include/cutlass/gemm_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,846 copying 3rdparty/cutlass/include/cutlass/version.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,848 copying 3rdparty/cutlass/include/cutlass/tfloat32.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,851 creating build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-16T04:56:25,852 copying 3rdparty/cutlass/include/cutlass/platform/platform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-16T04:56:25,855 copying 3rdparty/cutlass/include/cutlass/pitch_linear_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,858 copying 3rdparty/cutlass/include/cutlass/complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,861 copying 3rdparty/cutlass/include/cutlass/cutlass.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:25,863 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,866 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,869 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,871 copying 3rdparty/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,874 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,877 copying 3rdparty/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,880 copying 3rdparty/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,883 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,885 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,888 copying 3rdparty/cutlass/tools/util/include/cutlass/util/command_line.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,891 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:25,894 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-16T04:56:25,896 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-16T04:56:25,898 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-16T04:56:25,900 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,901 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,904 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,907 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,909 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,912 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,914 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,918 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,921 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,923 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,927 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,929 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,932 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,935 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,939 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,941 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,944 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,948 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,950 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,953 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,955 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,958 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,960 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,963 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,966 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:25,968 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,969 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,972 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,975 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,979 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,982 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,984 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:25,985 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:25,988 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:25,991 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:25,994 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:25,997 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:26,000 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-16T04:56:26,001 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-16T04:56:26,004 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:26,008 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:26,010 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:26,013 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:26,015 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,018 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,020 copying 3rdparty/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,022 copying 3rdparty/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,025 copying 3rdparty/cutlass/tools/util/include/cutlass/util/debug.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,028 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,030 copying 3rdparty/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,033 copying 3rdparty/cutlass/tools/util/include/cutlass/util/distribution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,035 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,037 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,040 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,042 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,044 copying 3rdparty/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,046 copying 3rdparty/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,049 copying 3rdparty/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,051 copying 3rdparty/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,053 copying 3rdparty/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,055 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:26,058 creating build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,059 copying 3rdparty/spdlog/include/spdlog/common-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,062 copying 3rdparty/spdlog/include/spdlog/spdlog.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,064 copying 3rdparty/spdlog/include/spdlog/common.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,067 copying 3rdparty/spdlog/include/spdlog/stopwatch.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,069 creating build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,070 copying 3rdparty/spdlog/include/spdlog/details/tcp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,072 copying 3rdparty/spdlog/include/spdlog/details/thread_pool.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,074 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,076 copying 3rdparty/spdlog/include/spdlog/details/file_helper-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,078 copying 3rdparty/spdlog/include/spdlog/details/file_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,080 copying 3rdparty/spdlog/include/spdlog/details/udp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,081 copying 3rdparty/spdlog/include/spdlog/details/log_msg-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,083 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,085 copying 3rdparty/spdlog/include/spdlog/details/os-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,088 copying 3rdparty/spdlog/include/spdlog/details/fmt_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,090 copying 3rdparty/spdlog/include/spdlog/details/circular_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,092 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,093 copying 3rdparty/spdlog/include/spdlog/details/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,095 copying 3rdparty/spdlog/include/spdlog/details/log_msg.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,097 copying 3rdparty/spdlog/include/spdlog/details/windows_include.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,099 copying 3rdparty/spdlog/include/spdlog/details/registry.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,101 copying 3rdparty/spdlog/include/spdlog/details/synchronous_factory.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,103 copying 3rdparty/spdlog/include/spdlog/details/backtracer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,105 copying 3rdparty/spdlog/include/spdlog/details/console_globals.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,107 copying 3rdparty/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,109 copying 3rdparty/spdlog/include/spdlog/details/backtracer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,111 copying 3rdparty/spdlog/include/spdlog/details/udp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,113 copying 3rdparty/spdlog/include/spdlog/details/registry-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,116 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,118 copying 3rdparty/spdlog/include/spdlog/details/tcp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,120 copying 3rdparty/spdlog/include/spdlog/details/null_mutex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,122 copying 3rdparty/spdlog/include/spdlog/details/thread_pool-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:26,124 copying 3rdparty/spdlog/include/spdlog/tweakme.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,126 copying 3rdparty/spdlog/include/spdlog/async.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,128 copying 3rdparty/spdlog/include/spdlog/async_logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,130 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,131 copying 3rdparty/spdlog/include/spdlog/fmt/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,133 copying 3rdparty/spdlog/include/spdlog/fmt/ostr.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,135 copying 3rdparty/spdlog/include/spdlog/fmt/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,137 copying 3rdparty/spdlog/include/spdlog/fmt/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,138 copying 3rdparty/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,141 copying 3rdparty/spdlog/include/spdlog/fmt/fmt.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,142 copying 3rdparty/spdlog/include/spdlog/fmt/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,144 copying 3rdparty/spdlog/include/spdlog/fmt/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:26,146 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,147 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/args.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,149 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,152 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/printf.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,154 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,157 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,160 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/locale.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,161 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,164 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,168 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/core.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,171 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,174 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/color.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,177 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,182 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,184 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,188 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:26,190 creating build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:26,191 copying 3rdparty/spdlog/include/spdlog/cfg/helpers.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:26,193 copying 3rdparty/spdlog/include/spdlog/cfg/helpers-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:26,195 copying 3rdparty/spdlog/include/spdlog/cfg/env.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:26,197 copying 3rdparty/spdlog/include/spdlog/cfg/argv.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:26,200 copying 3rdparty/spdlog/include/spdlog/pattern_formatter-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,203 copying 3rdparty/spdlog/include/spdlog/spdlog-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,205 copying 3rdparty/spdlog/include/spdlog/mdc.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,207 copying 3rdparty/spdlog/include/spdlog/logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,209 copying 3rdparty/spdlog/include/spdlog/logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,212 creating build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,212 copying 3rdparty/spdlog/include/spdlog/sinks/callback_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,215 copying 3rdparty/spdlog/include/spdlog/sinks/dist_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,217 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,219 copying 3rdparty/spdlog/include/spdlog/sinks/systemd_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,221 copying 3rdparty/spdlog/include/spdlog/sinks/kafka_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,224 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,226 copying 3rdparty/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,228 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,230 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,232 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,236 copying 3rdparty/spdlog/include/spdlog/sinks/udp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,238 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,240 copying 3rdparty/spdlog/include/spdlog/sinks/msvc_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,242 copying 3rdparty/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,245 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,247 copying 3rdparty/spdlog/include/spdlog/sinks/qt_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,250 copying 3rdparty/spdlog/include/spdlog/sinks/sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,252 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,254 copying 3rdparty/spdlog/include/spdlog/sinks/null_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,256 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,259 copying 3rdparty/spdlog/include/spdlog/sinks/android_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,261 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,263 copying 3rdparty/spdlog/include/spdlog/sinks/syslog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,265 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,268 copying 3rdparty/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,270 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,273 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,275 copying 3rdparty/spdlog/include/spdlog/sinks/sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,277 copying 3rdparty/spdlog/include/spdlog/sinks/ostream_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,279 copying 3rdparty/spdlog/include/spdlog/sinks/mongo_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,281 copying 3rdparty/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,284 copying 3rdparty/spdlog/include/spdlog/sinks/tcp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,286 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,288 copying 3rdparty/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:26,290 copying 3rdparty/spdlog/include/spdlog/async_logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,292 copying 3rdparty/spdlog/include/spdlog/pattern_formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,294 copying 3rdparty/spdlog/include/spdlog/version.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,296 copying 3rdparty/spdlog/include/spdlog/fwd.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,298 copying 3rdparty/spdlog/include/spdlog/formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:26,455 installing to build/bdist.linux-armv7l/wheel 2026-04-16T04:56:26,455 running install 2026-04-16T04:56:26,478 running install_lib 2026-04-16T04:56:26,485 creating build/bdist.linux-armv7l/wheel 2026-04-16T04:56:26,486 copying build/lib/build_utils.py -> build/bdist.linux-armv7l/wheel/. 2026-04-16T04:56:26,489 copying build/lib/build_backend.py -> build/bdist.linux-armv7l/wheel/. 2026-04-16T04:56:26,492 creating build/bdist.linux-armv7l/wheel/flashinfer 2026-04-16T04:56:26,494 creating build/bdist.linux-armv7l/wheel/flashinfer/mamba 2026-04-16T04:56:26,495 copying build/lib/flashinfer/mamba/ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-16T04:56:26,497 copying build/lib/flashinfer/mamba/ssd_combined.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-16T04:56:26,500 copying build/lib/flashinfer/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-16T04:56:26,502 copying build/lib/flashinfer/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-16T04:56:26,504 copying build/lib/flashinfer/mamba/ssd_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-16T04:56:26,510 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization 2026-04-16T04:56:26,511 copying build/lib/flashinfer/quantization/packbits.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-16T04:56:26,514 copying build/lib/flashinfer/quantization/quantization_cute_dsl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-16T04:56:26,517 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization/kernels 2026-04-16T04:56:26,518 copying build/lib/flashinfer/quantization/kernels/nvfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-16T04:56:26,521 copying build/lib/flashinfer/quantization/kernels/mxfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-16T04:56:26,524 copying build/lib/flashinfer/quantization/kernels/mxfp8_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-16T04:56:26,527 copying build/lib/flashinfer/quantization/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-16T04:56:26,529 copying build/lib/flashinfer/quantization/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-16T04:56:26,531 copying build/lib/flashinfer/quantization/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-16T04:56:26,533 copying build/lib/flashinfer/quantization/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-16T04:56:26,536 copying build/lib/flashinfer/pod.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,540 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm 2026-04-16T04:56:26,541 copying build/lib/flashinfer/gemm/routergemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-16T04:56:26,544 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm/kernels 2026-04-16T04:56:26,545 copying build/lib/flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-16T04:56:26,550 copying build/lib/flashinfer/gemm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-16T04:56:26,552 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-16T04:56:26,555 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-16T04:56:26,559 copying build/lib/flashinfer/gemm/gemm_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-16T04:56:26,565 copying build/lib/flashinfer/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-16T04:56:26,567 creating build/bdist.linux-armv7l/wheel/flashinfer/tuning_configs 2026-04-16T04:56:26,569 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-16T04:56:26,571 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-16T04:56:26,573 copying build/lib/flashinfer/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,576 copying build/lib/flashinfer/concat_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,579 creating build/bdist.linux-armv7l/wheel/flashinfer/cudnn 2026-04-16T04:56:26,580 copying build/lib/flashinfer/cudnn/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-16T04:56:26,582 copying build/lib/flashinfer/cudnn/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-16T04:56:26,584 copying build/lib/flashinfer/cudnn/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-16T04:56:26,586 copying build/lib/flashinfer/cudnn/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-16T04:56:26,588 copying build/lib/flashinfer/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,590 copying build/lib/flashinfer/green_ctx.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,592 copying build/lib/flashinfer/autotuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,595 copying build/lib/flashinfer/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,597 copying build/lib/flashinfer/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,599 copying build/lib/flashinfer/cuda_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,602 creating build/bdist.linux-armv7l/wheel/flashinfer/comm 2026-04-16T04:56:26,603 copying build/lib/flashinfer/comm/trtllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,606 copying build/lib/flashinfer/comm/trtllm_mnnvl_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,608 copying build/lib/flashinfer/comm/trtllm_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,610 copying build/lib/flashinfer/comm/mapping.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,613 copying build/lib/flashinfer/comm/vllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,615 copying build/lib/flashinfer/comm/trtllm_moe_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,617 copying build/lib/flashinfer/comm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,619 copying build/lib/flashinfer/comm/allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,622 copying build/lib/flashinfer/comm/workspace_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,624 copying build/lib/flashinfer/comm/mnnvl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,626 copying build/lib/flashinfer/comm/nvshmem_allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,629 copying build/lib/flashinfer/comm/nvshmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,630 copying build/lib/flashinfer/comm/dlpack_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,633 copying build/lib/flashinfer/comm/cuda_ipc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-16T04:56:26,635 creating build/bdist.linux-armv7l/wheel/flashinfer/profiler 2026-04-16T04:56:26,636 copying build/lib/flashinfer/profiler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/profiler 2026-04-16T04:56:26,639 creating build/bdist.linux-armv7l/wheel/flashinfer/norm 2026-04-16T04:56:26,640 creating build/bdist.linux-armv7l/wheel/flashinfer/norm/kernels 2026-04-16T04:56:26,641 copying build/lib/flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-16T04:56:26,644 copying build/lib/flashinfer/norm/kernels/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-16T04:56:26,647 copying build/lib/flashinfer/norm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-16T04:56:26,649 copying build/lib/flashinfer/norm/kernels/layernorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-16T04:56:26,651 copying build/lib/flashinfer/norm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-16T04:56:26,654 copying build/lib/flashinfer/norm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-16T04:56:26,657 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels 2026-04-16T04:56:26,658 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:26,660 copying build/lib/flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:26,662 copying build/lib/flashinfer/gdn_kernels/blackwell/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:26,665 copying build/lib/flashinfer/gdn_kernels/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:26,666 copying build/lib/flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-16T04:56:26,671 copying build/lib/flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-16T04:56:26,674 copying build/lib/flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-16T04:56:26,677 copying build/lib/flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-16T04:56:26,680 copying build/lib/flashinfer/gdn_kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-16T04:56:26,682 copying build/lib/flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-16T04:56:26,686 creating build/bdist.linux-armv7l/wheel/flashinfer/dsv3_ops 2026-04-16T04:56:26,688 copying build/lib/flashinfer/dsv3_ops/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/dsv3_ops 2026-04-16T04:56:26,690 copying build/lib/flashinfer/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,692 copying build/lib/flashinfer/attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,695 creating build/bdist.linux-armv7l/wheel/flashinfer/logits_processor 2026-04-16T04:56:26,696 copying build/lib/flashinfer/logits_processor/fusion_rules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,699 copying build/lib/flashinfer/logits_processor/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,701 copying build/lib/flashinfer/logits_processor/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,703 copying build/lib/flashinfer/logits_processor/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,705 copying build/lib/flashinfer/logits_processor/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,707 copying build/lib/flashinfer/logits_processor/operators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,710 copying build/lib/flashinfer/logits_processor/validators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,712 copying build/lib/flashinfer/logits_processor/legalization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,715 copying build/lib/flashinfer/logits_processor/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,717 copying build/lib/flashinfer/logits_processor/processors.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-16T04:56:26,720 copying build/lib/flashinfer/api_logging.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,724 creating build/bdist.linux-armv7l/wheel/flashinfer/jit 2026-04-16T04:56:26,726 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention 2026-04-16T04:56:26,728 copying build/lib/flashinfer/jit/attention/variants.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-16T04:56:26,731 copying build/lib/flashinfer/jit/attention/modules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-16T04:56:26,735 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:26,736 copying build/lib/flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:26,744 copying build/lib/flashinfer/jit/attention/fmha_v2/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:26,748 copying build/lib/flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:26,751 copying build/lib/flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-16T04:56:26,754 copying build/lib/flashinfer/jit/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-16T04:56:26,756 copying build/lib/flashinfer/jit/attention/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-16T04:56:26,759 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/mamba 2026-04-16T04:56:26,761 copying build/lib/flashinfer/jit/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-16T04:56:26,763 copying build/lib/flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-16T04:56:26,766 copying build/lib/flashinfer/jit/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-16T04:56:26,768 copying build/lib/flashinfer/jit/spdlog.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,771 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm 2026-04-16T04:56:26,773 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm/cutlass 2026-04-16T04:56:26,774 copying build/lib/flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-16T04:56:26,778 copying build/lib/flashinfer/jit/gemm/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-16T04:56:26,780 copying build/lib/flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-16T04:56:26,783 copying build/lib/flashinfer/jit/gemm/fp8_blockscale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-16T04:56:26,786 copying build/lib/flashinfer/jit/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-16T04:56:26,788 copying build/lib/flashinfer/jit/gemm/deepgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-16T04:56:26,790 copying build/lib/flashinfer/jit/gemm/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-16T04:56:26,793 copying build/lib/flashinfer/jit/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,796 copying build/lib/flashinfer/jit/comm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,798 copying build/lib/flashinfer/jit/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,800 copying build/lib/flashinfer/jit/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,802 copying build/lib/flashinfer/jit/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,805 copying build/lib/flashinfer/jit/dsv3_optimizations.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,808 copying build/lib/flashinfer/jit/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,810 copying build/lib/flashinfer/jit/env.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,813 copying build/lib/flashinfer/jit/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,815 copying build/lib/flashinfer/jit/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,817 copying build/lib/flashinfer/jit/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,819 copying build/lib/flashinfer/jit/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,821 copying build/lib/flashinfer/jit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,824 copying build/lib/flashinfer/jit/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,826 copying build/lib/flashinfer/jit/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,829 copying build/lib/flashinfer/jit/fp4_kv_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,831 copying build/lib/flashinfer/jit/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,833 copying build/lib/flashinfer/jit/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,835 copying build/lib/flashinfer/jit/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,837 copying build/lib/flashinfer/jit/rmsnorm_silu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,840 copying build/lib/flashinfer/jit/fp4_kv_dequantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,842 copying build/lib/flashinfer/jit/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,845 copying build/lib/flashinfer/jit/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,847 copying build/lib/flashinfer/jit/cubin_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,850 copying build/lib/flashinfer/jit/tinygemm2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,852 copying build/lib/flashinfer/jit/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,855 copying build/lib/flashinfer/jit/cpp_ext.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,858 copying build/lib/flashinfer/jit/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-16T04:56:26,861 creating build/bdist.linux-armv7l/wheel/flashinfer/mla 2026-04-16T04:56:26,862 copying build/lib/flashinfer/mla/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mla 2026-04-16T04:56:26,864 copying build/lib/flashinfer/mla/_core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mla 2026-04-16T04:56:26,868 copying build/lib/flashinfer/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,873 copying build/lib/flashinfer/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,876 copying build/lib/flashinfer/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,879 copying build/lib/flashinfer/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,881 copying build/lib/flashinfer/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,884 copying build/lib/flashinfer/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,888 creating build/bdist.linux-armv7l/wheel/flashinfer/triton 2026-04-16T04:56:26,889 copying build/lib/flashinfer/triton/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,891 copying build/lib/flashinfer/triton/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,894 creating build/bdist.linux-armv7l/wheel/flashinfer/triton/kernels 2026-04-16T04:56:26,896 copying build/lib/flashinfer/triton/kernels/ssd_chunk_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,898 copying build/lib/flashinfer/triton/kernels/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,901 copying build/lib/flashinfer/triton/kernels/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,903 copying build/lib/flashinfer/triton/kernels/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,905 copying build/lib/flashinfer/triton/kernels/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,908 copying build/lib/flashinfer/triton/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,910 copying build/lib/flashinfer/triton/kernels/quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-16T04:56:26,912 copying build/lib/flashinfer/triton/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,914 copying build/lib/flashinfer/triton/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,916 copying build/lib/flashinfer/triton/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,918 copying build/lib/flashinfer/triton/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,921 copying build/lib/flashinfer/triton/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,922 copying build/lib/flashinfer/triton/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-16T04:56:26,924 copying build/lib/flashinfer/gdn_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,927 copying build/lib/flashinfer/aot.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:26,931 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl 2026-04-16T04:56:26,933 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention 2026-04-16T04:56:26,935 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:26,936 copying build/lib/flashinfer/cute_dsl/attention/scheduler/mla_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:26,939 copying build/lib/flashinfer/cute_dsl/attention/scheduler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:26,941 copying build/lib/flashinfer/cute_dsl/attention/scheduler/persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-16T04:56:26,944 copying build/lib/flashinfer/cute_dsl/attention/config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,946 copying build/lib/flashinfer/cute_dsl/attention/tmem_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,949 copying build/lib/flashinfer/cute_dsl/attention/pipeline_topology.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,952 copying build/lib/flashinfer/cute_dsl/attention/collective_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,955 copying build/lib/flashinfer/cute_dsl/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,957 copying build/lib/flashinfer/cute_dsl/attention/mainloop_spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,959 copying build/lib/flashinfer/cute_dsl/attention/mla_warp_schedule.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,962 copying build/lib/flashinfer/cute_dsl/attention/compat.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,964 copying build/lib/flashinfer/cute_dsl/attention/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,967 copying build/lib/flashinfer/cute_dsl/attention/mla_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:26,970 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,971 copying build/lib/flashinfer/cute_dsl/attention/roles/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,974 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,977 copying build/lib/flashinfer/cute_dsl/attention/roles/loader_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,980 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,983 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_compute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,986 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_pt_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,988 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,991 copying build/lib/flashinfer/cute_dsl/attention/roles/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,993 copying build/lib/flashinfer/cute_dsl/attention/roles/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,995 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_correction.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:26,998 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:27,000 copying build/lib/flashinfer/cute_dsl/attention/roles/softmax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:27,003 copying build/lib/flashinfer/cute_dsl/attention/roles/correction.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:27,006 copying build/lib/flashinfer/cute_dsl/attention/roles/softmax_math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-16T04:56:27,008 copying build/lib/flashinfer/cute_dsl/attention/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:27,011 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:27,012 copying build/lib/flashinfer/cute_dsl/attention/fusion/mask.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:27,015 copying build/lib/flashinfer/cute_dsl/attention/fusion/variant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:27,018 copying build/lib/flashinfer/cute_dsl/attention/fusion/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-16T04:56:27,020 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:27,021 copying build/lib/flashinfer/cute_dsl/attention/wrappers/batch_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:27,024 copying build/lib/flashinfer/cute_dsl/attention/wrappers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:27,026 copying build/lib/flashinfer/cute_dsl/attention/wrappers/batch_mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-16T04:56:27,029 copying build/lib/flashinfer/cute_dsl/attention/warp_schedule.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:27,031 copying build/lib/flashinfer/cute_dsl/attention/mla_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-16T04:56:27,034 copying build/lib/flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,038 copying build/lib/flashinfer/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,040 copying build/lib/flashinfer/cute_dsl/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,043 copying build/lib/flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,046 copying build/lib/flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,049 copying build/lib/flashinfer/cute_dsl/fp4_common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,052 copying build/lib/flashinfer/cute_dsl/blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-16T04:56:27,054 copying build/lib/flashinfer/deep_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:27,058 creating build/bdist.linux-armv7l/wheel/flashinfer/data 2026-04-16T04:56:27,060 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass 2026-04-16T04:56:27,062 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test 2026-04-16T04:56:27,064 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/utils 2026-04-16T04:56:27,065 copying build/lib/flashinfer/data/cutlass/test/utils/test_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/utils 2026-04-16T04:56:27,068 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python 2026-04-16T04:56:27,070 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass 2026-04-16T04:56:27,072 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/installation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass 2026-04-16T04:56:27,075 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,076 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,079 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,081 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,084 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,086 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,089 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,091 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,094 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,097 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,100 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,103 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,106 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,108 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-16T04:56:27,112 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:27,113 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:27,116 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:27,119 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:27,121 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-16T04:56:27,126 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:27,128 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:27,131 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:27,133 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:27,136 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-16T04:56:27,140 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,141 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,145 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-16T04:56:27,146 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-16T04:56:27,149 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,152 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,155 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,158 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,161 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-16T04:56:27,164 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-16T04:56:27,166 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-16T04:56:27,170 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,171 copying build/lib/flashinfer/data/cutlass/test/python/pycute/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,174 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_coalesce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,177 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_complement.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,179 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,182 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,184 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_composition.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,187 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,189 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-16T04:56:27,192 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit 2026-04-16T04:56:27,194 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm 2026-04-16T04:56:27,197 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-16T04:56:27,198 copying build/lib/flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/unit/gemm/device 2026-04-16T04:56:27,202 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples 2026-04-16T04:56:27,204 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-16T04:56:27,206 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:27,208 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:27,211 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:27,214 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:27,217 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:27,220 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-16T04:56:27,222 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-16T04:56:27,225 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python 2026-04-16T04:56:27,227 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:27,229 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:27,230 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:27,232 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:27,234 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:27,236 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:27,239 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-16T04:56:27,242 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:27,243 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:27,246 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:27,249 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:27,251 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:27,253 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-16T04:56:27,256 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/shape.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:27,259 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:27,260 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:27,262 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:27,264 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-16T04:56:27,266 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:27,270 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:27,273 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,274 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-16T04:56:27,275 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-16T04:56:27,278 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-16T04:56:27,280 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,282 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,285 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,287 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,289 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,291 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,294 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,296 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,299 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,301 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,304 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-16T04:56:27,305 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-16T04:56:27,309 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,310 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,312 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,315 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,317 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,320 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,322 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,324 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,327 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,329 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-16T04:56:27,332 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-16T04:56:27,335 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:27,336 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:27,338 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:27,340 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-16T04:56:27,344 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,345 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,347 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,350 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,353 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,355 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,357 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,360 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,362 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-16T04:56:27,366 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,367 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,370 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,372 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,375 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,378 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,380 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,383 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,386 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,388 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,390 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,393 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,396 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,398 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-16T04:56:27,400 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,405 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,408 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-16T04:56:27,411 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-16T04:56:27,414 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:27,415 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:27,418 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:27,421 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-16T04:56:27,425 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src 2026-04-16T04:56:27,427 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src/source 2026-04-16T04:56:27,429 copying build/lib/flashinfer/data/cutlass/python/docs_src/source/conf.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/docs_src/source 2026-04-16T04:56:27,431 copying build/lib/flashinfer/data/cutlass/python/setup_cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-16T04:56:27,434 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,436 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,439 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,442 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,445 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/manifest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,448 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,451 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,453 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,457 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,460 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,463 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,466 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,480 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,483 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,487 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,490 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,494 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,498 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,501 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,504 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/symm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-16T04:56:27,507 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:27,509 copying build/lib/flashinfer/data/cutlass/python/pycute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:27,511 copying build/lib/flashinfer/data/cutlass/python/pycute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:27,514 copying build/lib/flashinfer/data/cutlass/python/pycute/int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:27,516 copying build/lib/flashinfer/data/cutlass/python/pycute/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:27,519 copying build/lib/flashinfer/data/cutlass/python/pycute/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-16T04:56:27,522 copying build/lib/flashinfer/data/cutlass/python/setup_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-16T04:56:27,524 copying build/lib/flashinfer/data/cutlass/python/setup_pycute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-16T04:56:27,527 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL 2026-04-16T04:56:27,529 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:27,531 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,532 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,537 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,540 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,543 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,545 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,547 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-16T04:56:27,551 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,553 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,555 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,558 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,561 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,564 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,566 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-16T04:56:27,570 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,571 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,574 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,578 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-16T04:56:27,580 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-16T04:56:27,582 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-16T04:56:27,585 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,589 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,591 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,594 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,597 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,600 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,603 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,606 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,609 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,612 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,615 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,617 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,620 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,623 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,626 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-16T04:56:27,630 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,631 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,636 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,637 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,640 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,641 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,644 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,646 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,648 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,650 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-16T04:56:27,653 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,656 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,658 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,661 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,665 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,668 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,670 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,673 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,676 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:27,684 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:27,687 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:27,688 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:27,690 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:27,693 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:27,695 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-16T04:56:27,699 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:27,700 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:27,702 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:27,704 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-16T04:56:27,708 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:27,709 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:27,711 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:27,713 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-16T04:56:27,715 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:27,718 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-16T04:56:27,720 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:27,721 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:27,724 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:27,726 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-16T04:56:27,730 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,731 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,733 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,736 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,740 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,742 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,744 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,746 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,748 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-16T04:56:27,750 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,752 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,757 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-16T04:56:27,760 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:27,761 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:27,764 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:27,766 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:27,768 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:27,770 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-16T04:56:27,773 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:27,774 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:27,777 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:27,780 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:27,782 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-16T04:56:27,785 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:27,788 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:27,791 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,792 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,794 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,796 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,798 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,800 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,802 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,804 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-16T04:56:27,806 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,808 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,810 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,813 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,816 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,819 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,822 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,825 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,830 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:27,831 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:27,834 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:27,837 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:27,841 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:27,843 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-16T04:56:27,846 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,848 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,852 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:27,853 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:27,856 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:27,859 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:27,861 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:27,864 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-16T04:56:27,866 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,868 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,871 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,873 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,876 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,879 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,881 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,883 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-16T04:56:27,886 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,889 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-16T04:56:27,893 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:27,894 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:27,897 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:27,900 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:27,902 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-16T04:56:27,904 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-16T04:56:27,906 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL 2026-04-16T04:56:27,910 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples 2026-04-16T04:56:27,912 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python 2026-04-16T04:56:27,914 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL 2026-04-16T04:56:27,916 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:27,917 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:27,920 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:27,923 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:27,925 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-16T04:56:27,929 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:27,930 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:27,934 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:27,936 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:27,938 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-16T04:56:27,942 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental 2026-04-16T04:56:27,944 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:27,946 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:27,950 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:27,953 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:27,957 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:27,961 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-16T04:56:27,966 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-16T04:56:27,967 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-16T04:56:27,971 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,972 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,976 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,982 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,985 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,990 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,994 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:27,997 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-16T04:56:28,003 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,005 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,009 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,015 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,025 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:28,026 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:28,030 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:28,033 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:28,037 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:28,042 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-16T04:56:28,046 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:28,047 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:28,050 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:28,053 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:28,057 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:28,060 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-16T04:56:28,063 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:28,064 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:28,071 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:28,076 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-16T04:56:28,082 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,088 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,093 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,098 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,101 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,106 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:28,107 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:28,111 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:28,115 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:28,118 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-16T04:56:28,121 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,128 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:28,129 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:28,137 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:28,144 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-16T04:56:28,147 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,153 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,162 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,167 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:28,168 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:28,172 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:28,177 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:28,180 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-16T04:56:28,186 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:28,187 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:28,194 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:28,197 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-16T04:56:28,200 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,202 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,208 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,210 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,215 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,219 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-16T04:56:28,225 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-16T04:56:28,226 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-16T04:56:28,229 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,231 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,233 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,235 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,237 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,240 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,242 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,245 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,246 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-16T04:56:28,249 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-16T04:56:28,250 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-16T04:56:28,253 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-16T04:56:28,256 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-16T04:56:28,257 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-16T04:56:28,259 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-16T04:56:28,262 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-16T04:56:28,263 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-16T04:56:28,267 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,268 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,271 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,274 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,277 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,279 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,282 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,285 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,287 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,291 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,294 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,297 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,299 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,301 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-16T04:56:28,305 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:28,306 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:28,310 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:28,314 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:28,318 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-16T04:56:28,321 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-16T04:56:28,322 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-16T04:56:28,324 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-16T04:56:28,328 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen 2026-04-16T04:56:28,329 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,331 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,334 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,336 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,338 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,341 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,343 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,346 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,349 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,351 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,353 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,356 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,358 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-16T04:56:28,361 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:28,362 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:28,365 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:28,366 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:28,369 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:28,372 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-16T04:56:28,375 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:28,377 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-16T04:56:28,380 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-16T04:56:28,381 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-16T04:56:28,384 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-16T04:56:28,387 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools 2026-04-16T04:56:28,388 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util 2026-04-16T04:56:28,390 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/scripts 2026-04-16T04:56:28,392 copying build/lib/flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/scripts 2026-04-16T04:56:28,395 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include 2026-04-16T04:56:28,397 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass 2026-04-16T04:56:28,399 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,400 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,403 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,405 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,408 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,410 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,413 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,415 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,418 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,420 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,422 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,426 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference 2026-04-16T04:56:28,427 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-16T04:56:28,429 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-16T04:56:28,431 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-16T04:56:28,434 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,435 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,438 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,440 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,443 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,445 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,448 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,451 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,453 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,456 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,459 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,461 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,464 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,466 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,469 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,472 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,474 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,477 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,480 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,482 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,484 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,486 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,489 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,491 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,494 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-16T04:56:28,497 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,498 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,500 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,503 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,507 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,509 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,512 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:28,513 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:28,515 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:28,518 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-16T04:56:28,520 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,523 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,526 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-16T04:56:28,527 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-16T04:56:28,530 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,533 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,535 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,538 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-16T04:56:28,540 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,543 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,545 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,547 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,550 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,553 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,555 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,558 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,561 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,563 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,566 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,568 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,571 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,573 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,575 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,578 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,580 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,583 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-16T04:56:28,586 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include 2026-04-16T04:56:28,588 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,590 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,591 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,593 copying build/lib/flashinfer/data/cutlass/include/cute/util/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,596 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_svg.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,598 copying build/lib/flashinfer/data/cutlass/include/cute/util/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,601 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_latex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,603 copying build/lib/flashinfer/data/cutlass/include/cute/util/print.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-16T04:56:28,605 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,608 copying build/lib/flashinfer/data/cutlass/include/cute/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,611 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,613 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_base.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,615 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,618 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,621 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_flagged.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,623 copying build/lib/flashinfer/data/cutlass/include/cute/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,626 copying build/lib/flashinfer/data/cutlass/include/cute/tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,628 copying build/lib/flashinfer/data/cutlass/include/cute/underscore.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:28,631 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,633 copying build/lib/flashinfer/data/cutlass/include/cute/container/tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,635 copying build/lib/flashinfer/data/cutlass/include/cute/container/bit_field.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,638 copying build/lib/flashinfer/data/cutlass/include/cute/container/alignment.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,640 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_aligned.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,642 copying build/lib/flashinfer/data/cutlass/include/cute/container/type_list.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,644 copying build/lib/flashinfer/data/cutlass/include/cute/container/array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,646 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,649 copying build/lib/flashinfer/data/cutlass/include/cute/container/cuda_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-16T04:56:28,652 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,653 copying build/lib/flashinfer/data/cutlass/include/cute/atom/partitioner.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,655 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,657 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,661 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,663 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,665 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,670 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,673 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,681 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,683 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,700 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,705 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,708 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,710 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,713 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,715 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,721 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,733 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,736 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,739 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,742 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,746 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,749 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,752 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,755 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,758 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,767 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,770 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-16T04:56:28,774 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,776 copying build/lib/flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,779 copying build/lib/flashinfer/data/cutlass/include/cute/arch/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,782 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,785 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,797 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,799 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,804 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,807 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,812 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,835 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,839 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,881 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,884 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,886 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,889 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,901 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,904 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,908 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,910 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,912 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,929 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,932 copying build/lib/flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,934 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,936 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,939 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,941 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,943 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,946 copying build/lib/flashinfer/data/cutlass/include/cute/arch/util.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,948 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,950 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,994 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,996 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-16T04:56:28,998 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_impl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:29,001 copying build/lib/flashinfer/data/cutlass/include/cute/int_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:29,004 copying build/lib/flashinfer/data/cutlass/include/cute/stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:29,006 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_zip.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:29,009 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,011 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,014 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,016 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/clear.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,018 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,021 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,023 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,026 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,029 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,031 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,032 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,034 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,037 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/functional.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,039 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-16T04:56:29,041 copying build/lib/flashinfer/data/cutlass/include/cute/pointer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:29,044 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,045 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,047 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,049 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,052 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,054 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/real.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,056 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,058 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,060 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/int.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,062 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-16T04:56:29,064 copying build/lib/flashinfer/data/cutlass/include/cute/layout_composed.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-16T04:56:29,068 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,069 copying build/lib/flashinfer/data/cutlass/include/cutlass/predicate_vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,072 copying build/lib/flashinfer/data/cutlass/include/cutlass/fast_math.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,075 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental 2026-04-16T04:56:29,077 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed 2026-04-16T04:56:29,079 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-16T04:56:29,080 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-16T04:56:29,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-16T04:56:29,086 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:29,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:29,089 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:29,091 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-16T04:56:29,094 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:29,095 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:29,097 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:29,099 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-16T04:56:29,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,104 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,106 copying build/lib/flashinfer/data/cutlass/include/cutlass/bfloat16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,110 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:29,113 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:29,115 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,116 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,120 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,122 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,125 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,130 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,134 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,136 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,138 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,141 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,144 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,146 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,149 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,151 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,156 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,158 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,160 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,163 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,165 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,168 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,171 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,173 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,175 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,178 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,183 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,186 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,188 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,193 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,195 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,197 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,200 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,202 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-16T04:56:29,204 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:29,208 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,209 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,211 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,213 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,215 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,218 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,220 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,223 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,225 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,231 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,234 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,237 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,240 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,242 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,245 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,247 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,252 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,256 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,258 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,261 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,263 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,268 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,271 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,273 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,276 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,278 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,280 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,283 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,285 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,288 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,290 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,294 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,296 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,299 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,302 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,307 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,310 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,314 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,316 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,319 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-16T04:56:29,323 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,325 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,328 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,331 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,333 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,336 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,339 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,342 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,345 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,348 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,351 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,354 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,357 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,359 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,362 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,366 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,368 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,375 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,377 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,380 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,386 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,389 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,396 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,401 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,404 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,407 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,410 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,413 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,419 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,422 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,426 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,428 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,431 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,441 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,447 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,450 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,453 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,456 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,460 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,462 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,465 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,470 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,472 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,475 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,477 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,480 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,482 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,487 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,489 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,491 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,494 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,496 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,499 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,501 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,503 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,506 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,508 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,511 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,514 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,516 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,519 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,523 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,527 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,529 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,532 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,542 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,545 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,549 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,551 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,554 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,557 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,560 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,563 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,566 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,569 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,576 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,579 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,582 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,586 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,589 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,592 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,595 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,598 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,601 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,604 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,607 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,610 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,613 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,615 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,618 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,621 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,624 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,636 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,639 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,642 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,645 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,647 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,651 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,656 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-16T04:56:29,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:29,662 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,663 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,666 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,669 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,671 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,674 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,676 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,678 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,681 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,686 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,689 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,693 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,698 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,701 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,703 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,705 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,708 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,711 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,714 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,716 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,719 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,722 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,724 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,733 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,736 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,738 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-16T04:56:29,743 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,747 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,752 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,756 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,759 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,760 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,763 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,766 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,768 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,771 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,777 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,781 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,784 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,787 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,790 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,792 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,798 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,803 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,806 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,811 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,814 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,817 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,828 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,833 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,836 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-16T04:56:29,839 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,841 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,848 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,852 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,855 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,858 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,865 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,872 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,875 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,879 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,884 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,887 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,891 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,894 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,897 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,899 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,902 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,906 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,909 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,912 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,915 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,924 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,931 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,934 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,937 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,940 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,947 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,950 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,953 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,964 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,967 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,970 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,973 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,975 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,978 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,981 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,984 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-16T04:56:29,987 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:29,989 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:29,991 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:29,995 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:29,997 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-16T04:56:30,000 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:30,004 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-16T04:56:30,006 copying build/lib/flashinfer/data/cutlass/include/cutlass/device_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,009 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,012 copying build/lib/flashinfer/data/cutlass/include/cutlass/quaternion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,014 copying build/lib/flashinfer/data/cutlass/include/cutlass/relatively_equal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,018 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-16T04:56:30,020 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,025 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,028 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,031 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,034 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,036 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,043 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,045 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,048 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,053 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,055 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,058 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-16T04:56:30,061 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,062 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,065 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,068 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,070 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,073 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,075 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,080 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,085 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,090 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,093 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,095 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,098 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,101 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,103 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,106 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,108 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,111 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,113 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,116 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,120 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,123 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,126 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,130 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,133 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,136 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,138 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,146 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,148 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,151 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,157 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:30,158 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:30,160 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:30,163 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:30,165 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:30,168 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-16T04:56:30,171 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,172 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,175 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,177 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,180 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,183 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,185 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,187 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,190 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,193 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,195 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,198 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-16T04:56:30,201 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,202 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,206 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,209 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,210 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,213 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,215 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,218 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,220 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,224 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-16T04:56:30,226 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,230 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,232 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,235 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,237 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,240 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,243 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,245 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,248 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,251 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,253 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,256 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,259 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,262 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,264 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,267 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-16T04:56:30,271 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,272 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,275 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,278 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,280 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,282 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,285 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,286 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,289 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,291 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,293 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,295 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,297 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,300 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,302 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,307 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,309 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,312 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,315 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,317 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,320 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,322 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,325 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,328 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,330 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-16T04:56:30,332 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-16T04:56:30,336 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,337 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,340 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,342 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,346 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,349 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,353 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,359 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,363 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,367 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,375 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,379 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-16T04:56:30,388 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,391 copying build/lib/flashinfer/data/cutlass/include/cutlass/exmy_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,397 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,401 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,403 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,407 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,410 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,414 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,417 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,432 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:30,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:30,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:30,443 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-16T04:56:30,446 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,448 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,451 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,454 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-16T04:56:30,456 copying build/lib/flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,458 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,461 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:30,463 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:30,467 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:30,470 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-16T04:56:30,473 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,479 copying build/lib/flashinfer/data/cutlass/include/cutlass/real.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,482 copying build/lib/flashinfer/data/cutlass/include/cutlass/float8.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,488 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,496 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint128.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,498 copying build/lib/flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,501 copying build/lib/flashinfer/data/cutlass/include/cutlass/functional.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,504 copying build/lib/flashinfer/data/cutlass/include/cutlass/block_striped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,507 copying build/lib/flashinfer/data/cutlass/include/cutlass/trace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,510 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform 2026-04-16T04:56:30,512 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-16T04:56:30,513 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-16T04:56:30,517 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,518 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,527 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,529 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,533 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,541 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,544 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,547 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,550 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,554 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,557 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,560 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,562 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,565 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,568 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,571 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,574 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,583 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,587 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,590 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-16T04:56:30,593 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:30,594 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:30,597 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:30,599 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-16T04:56:30,603 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-16T04:56:30,604 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-16T04:56:30,607 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-16T04:56:30,609 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-16T04:56:30,613 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-16T04:56:30,614 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-16T04:56:30,616 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-16T04:56:30,619 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform 2026-04-16T04:56:30,622 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,624 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_shape.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_launch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,630 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/thread 2026-04-16T04:56:30,631 copying build/lib/flashinfer/data/cutlass/include/cutlass/thread/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/thread 2026-04-16T04:56:30,634 copying build/lib/flashinfer/data/cutlass/include/cutlass/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,636 copying build/lib/flashinfer/data/cutlass/include/cutlass/aligned_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,639 copying build/lib/flashinfer/data/cutlass/include/cutlass/half.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,642 copying build/lib/flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,645 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,647 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,649 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,652 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,654 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,662 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,667 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,670 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,673 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,675 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,678 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,681 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,687 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,690 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,693 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/arch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,700 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,702 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,705 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,708 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,710 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,713 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,716 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,719 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,722 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-16T04:56:30,725 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint256.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/integer_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,733 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-16T04:56:30,735 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:30,737 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:30,740 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:30,742 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:30,745 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-16T04:56:30,748 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:30,750 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:30,753 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:30,755 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:30,758 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-16T04:56:30,762 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-16T04:56:30,763 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-16T04:56:30,766 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-16T04:56:30,768 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction 2026-04-16T04:56:30,771 copying build/lib/flashinfer/data/cutlass/include/cutlass/subbyte_reference.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/core_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/float_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,782 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,784 copying build/lib/flashinfer/data/cutlass/include/cutlass/semaphore.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,787 copying build/lib/flashinfer/data/cutlass/include/cutlass/constants.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,791 copying build/lib/flashinfer/data/cutlass/include/cutlass/wmma_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,794 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,795 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,799 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,804 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/permute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,807 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,815 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,818 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-16T04:56:30,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,824 copying build/lib/flashinfer/data/cutlass/include/cutlass/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,827 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:30,834 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:30,836 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:30,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:30,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:30,843 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-16T04:56:30,847 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,849 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,852 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,854 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,857 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,860 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,863 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,866 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,871 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,874 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,877 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,880 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,885 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,887 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,891 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,899 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,902 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,905 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,908 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,911 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,914 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,916 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,922 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,925 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,930 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,933 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,936 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,939 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,942 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,945 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,948 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,951 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,954 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,957 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,960 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,963 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,965 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,969 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,972 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,975 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,978 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-16T04:56:30,980 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:30,984 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:30,987 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:30,989 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:30,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:30,995 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:30,998 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,001 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,004 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,007 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,010 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,014 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,017 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,020 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,023 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,032 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,048 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,051 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,053 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,056 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,059 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,062 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,064 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,068 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,071 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,073 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-16T04:56:31,076 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:31,080 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:31,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:31,084 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:31,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:31,090 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-16T04:56:31,094 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:31,096 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:31,100 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:31,101 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:31,104 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:31,107 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:31,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-16T04:56:31,112 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:31,115 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:31,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:31,121 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-16T04:56:31,124 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-16T04:56:31,125 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-16T04:56:31,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:31,131 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:31,134 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-16T04:56:31,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/tfloat32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,149 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/platform 2026-04-16T04:56:31,150 copying build/lib/flashinfer/data/cutlass/include/cutlass/platform/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/platform 2026-04-16T04:56:31,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,156 copying build/lib/flashinfer/data/cutlass/include/cutlass/complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,160 copying build/lib/flashinfer/data/cutlass/include/cutlass/cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-16T04:56:31,163 copying build/lib/flashinfer/data/build_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-16T04:56:31,165 copying build/lib/flashinfer/data/build_backend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-16T04:56:31,169 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog 2026-04-16T04:56:31,171 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/scripts 2026-04-16T04:56:31,172 copying build/lib/flashinfer/data/spdlog/scripts/extract_version.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/scripts 2026-04-16T04:56:31,175 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include 2026-04-16T04:56:31,178 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,179 copying build/lib/flashinfer/data/spdlog/include/spdlog/common-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,182 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,185 copying build/lib/flashinfer/data/spdlog/include/spdlog/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,188 copying build/lib/flashinfer/data/spdlog/include/spdlog/stopwatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,191 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,193 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,196 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,198 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,200 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,203 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,205 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,208 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,210 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,212 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,215 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,218 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/circular_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,220 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,223 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,225 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,227 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/windows_include.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,229 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,232 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,234 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,236 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/console_globals.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,239 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,241 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,244 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,246 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,249 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,251 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,254 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/null_mutex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,256 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-16T04:56:31,259 copying build/lib/flashinfer/data/spdlog/include/spdlog/tweakme.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,261 copying build/lib/flashinfer/data/spdlog/include/spdlog/async.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,264 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,267 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,268 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,271 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ostr.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,273 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,275 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,277 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,280 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/fmt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,282 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,284 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-16T04:56:31,287 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,288 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,291 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,294 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,297 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,300 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,302 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,304 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,307 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,311 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,315 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,318 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,321 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,328 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,330 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,334 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-16T04:56:31,338 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:31,339 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:31,342 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:31,344 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/env.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:31,346 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/argv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-16T04:56:31,349 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,352 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,355 copying build/lib/flashinfer/data/spdlog/include/spdlog/mdc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,357 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,360 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,364 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,365 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,368 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,370 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,373 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,375 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,378 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,380 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,383 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,385 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,388 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,390 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,392 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,395 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,397 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,400 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,402 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,405 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,408 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,410 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,412 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,414 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,416 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,418 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,420 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,423 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,425 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,427 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,430 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,432 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,434 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,436 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,438 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,440 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,443 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-16T04:56:31,445 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,447 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,449 copying build/lib/flashinfer/data/spdlog/include/spdlog/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,451 copying build/lib/flashinfer/data/spdlog/include/spdlog/fwd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,452 copying build/lib/flashinfer/data/spdlog/include/spdlog/formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-16T04:56:31,455 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include 2026-04-16T04:56:31,457 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer 2026-04-16T04:56:31,459 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,460 copying build/lib/flashinfer/data/include/flashinfer/attention/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,463 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-16T04:56:31,464 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-16T04:56:31,466 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-16T04:56:31,468 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-16T04:56:31,471 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,472 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,474 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,476 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,479 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,481 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,485 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,487 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,490 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-16T04:56:31,493 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-16T04:56:31,496 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-16T04:56:31,497 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-16T04:56:31,499 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-16T04:56:31,502 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,503 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,506 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,509 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,511 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,514 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,516 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,519 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,522 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-16T04:56:31,525 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,528 copying build/lib/flashinfer/data/include/flashinfer/attention/mask.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,530 copying build/lib/flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,532 copying build/lib/flashinfer/data/include/flashinfer/attention/heap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,534 copying build/lib/flashinfer/data/include/flashinfer/attention/pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,537 copying build/lib/flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,540 copying build/lib/flashinfer/data/include/flashinfer/attention/default_decode_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,542 copying build/lib/flashinfer/data/include/flashinfer/attention/decode.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,545 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent_template.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,547 copying build/lib/flashinfer/data/include/flashinfer/attention/batch_pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,550 copying build/lib/flashinfer/data/include/flashinfer/attention/prefill.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,555 copying build/lib/flashinfer/data/include/flashinfer/attention/cascade.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,558 copying build/lib/flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,561 copying build/lib/flashinfer/data/include/flashinfer/attention/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,565 copying build/lib/flashinfer/data/include/flashinfer/attention/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,568 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,570 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,573 copying build/lib/flashinfer/data/include/flashinfer/attention/state.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,576 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,577 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,579 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,582 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,583 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,586 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,589 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,591 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,593 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,596 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-16T04:56:31,599 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,601 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,603 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,605 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,608 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,610 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,612 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,615 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,617 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,619 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,622 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-16T04:56:31,625 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,627 copying build/lib/flashinfer/data/include/flashinfer/attention/mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-16T04:56:31,631 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,632 copying build/lib/flashinfer/data/include/flashinfer/mamba/common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,635 copying build/lib/flashinfer/data/include/flashinfer/mamba/conversion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,637 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,640 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,643 copying build/lib/flashinfer/data/include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,646 copying build/lib/flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,648 copying build/lib/flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,650 copying build/lib/flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,652 copying build/lib/flashinfer/data/include/flashinfer/mamba/ssu_mtp_common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,654 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,657 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,660 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-16T04:56:31,663 copying build/lib/flashinfer/data/include/flashinfer/fp4_layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,666 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,667 copying build/lib/flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,670 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,672 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,674 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,677 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,679 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,681 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,684 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,686 copying build/lib/flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,689 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,690 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,694 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,696 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,698 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,701 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,703 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,706 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,708 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,710 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,713 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,715 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,717 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,719 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,722 copying build/lib/flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,724 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,727 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,729 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,731 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,733 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,735 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,737 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,739 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,741 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,744 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,746 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-16T04:56:31,749 copying build/lib/flashinfer/data/include/flashinfer/fp16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,751 copying build/lib/flashinfer/data/include/flashinfer/page.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,754 copying build/lib/flashinfer/data/include/flashinfer/allocator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,756 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,757 copying build/lib/flashinfer/data/include/flashinfer/flat/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,759 copying build/lib/flashinfer/data/include/flashinfer/flat/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,761 copying build/lib/flashinfer/data/include/flashinfer/flat/cute_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,763 copying build/lib/flashinfer/data/include/flashinfer/flat/unused.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,765 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere 2026-04-16T04:56:31,766 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-16T04:56:31,768 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-16T04:56:31,770 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-16T04:56:31,773 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper 2026-04-16T04:56:31,775 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:31,776 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:31,778 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:31,780 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:31,782 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-16T04:56:31,785 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-16T04:56:31,786 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-16T04:56:31,789 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:31,790 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:31,793 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:31,795 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:31,798 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:31,800 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-16T04:56:31,803 copying build/lib/flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,805 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/prefill 2026-04-16T04:56:31,806 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-16T04:56:31,808 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-16T04:56:31,810 copying build/lib/flashinfer/data/include/flashinfer/flat/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,812 copying build/lib/flashinfer/data/include/flashinfer/flat/common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-16T04:56:31,814 copying build/lib/flashinfer/data/include/flashinfer/layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,816 copying build/lib/flashinfer/data/include/flashinfer/topk.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,820 copying build/lib/flashinfer/data/include/flashinfer/permuted_smem.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,822 copying build/lib/flashinfer/data/include/flashinfer/pos_enc.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,825 copying build/lib/flashinfer/data/include/flashinfer/math.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,828 copying build/lib/flashinfer/data/include/flashinfer/profiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,830 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,831 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,835 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,838 copying build/lib/flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,841 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,843 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,846 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,850 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-16T04:56:31,853 copying build/lib/flashinfer/data/include/flashinfer/cubin_loader.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,856 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/norm 2026-04-16T04:56:31,858 copying build/lib/flashinfer/data/include/flashinfer/norm/ln_fwd_silu_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/norm 2026-04-16T04:56:31,862 copying build/lib/flashinfer/data/include/flashinfer/norm/ln_silu_headers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/norm 2026-04-16T04:56:31,868 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm 2026-04-16T04:56:31,870 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,872 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,875 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,878 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,881 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,885 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,887 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,890 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,893 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,896 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-16T04:56:31,900 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm 2026-04-16T04:56:31,904 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,905 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,908 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,911 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,913 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,916 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,919 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-16T04:56:31,923 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-16T04:56:31,924 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-16T04:56:31,928 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,930 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,933 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,937 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,941 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,945 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,947 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,950 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,953 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,956 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-16T04:56:31,958 copying build/lib/flashinfer/data/include/flashinfer/fastdiv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,960 copying build/lib/flashinfer/data/include/flashinfer/exception.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,962 copying build/lib/flashinfer/data/include/flashinfer/attention_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,964 copying build/lib/flashinfer/data/include/flashinfer/concat_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,966 copying build/lib/flashinfer/data/include/flashinfer/cutlass_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,968 copying build/lib/flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,970 copying build/lib/flashinfer/data/include/flashinfer/activation.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,973 copying build/lib/flashinfer/data/include/flashinfer/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,975 copying build/lib/flashinfer/data/include/flashinfer/cp_async.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,977 copying build/lib/flashinfer/data/include/flashinfer/air_top_p.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,980 copying build/lib/flashinfer/data/include/flashinfer/norm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,984 copying build/lib/flashinfer/data/include/flashinfer/logging.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,986 copying build/lib/flashinfer/data/include/flashinfer/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,988 copying build/lib/flashinfer/data/include/flashinfer/arch_condition.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,990 copying build/lib/flashinfer/data/include/flashinfer/sampling.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,994 copying build/lib/flashinfer/data/include/flashinfer/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:31,996 copying build/lib/flashinfer/data/include/flashinfer/vec_dtypes.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-16T04:56:32,003 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc 2026-04-16T04:56:32,004 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,006 copying build/lib/flashinfer/data/csrc/trtllm_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,009 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,011 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,013 copying build/lib/flashinfer/data/csrc/trtllm_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,015 copying build/lib/flashinfer/data/csrc/selective_state_update_dtype_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,017 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,019 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,021 copying build/lib/flashinfer/data/csrc/flashinfer_rope_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,022 copying build/lib/flashinfer/data/csrc/batch_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,024 copying build/lib/flashinfer/data/csrc/fmha_v2_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,026 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,030 copying build/lib/flashinfer/data/csrc/fmha_v2_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,032 copying build/lib/flashinfer/data/csrc/page.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,035 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,037 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,038 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,041 copying build/lib/flashinfer/data/csrc/batch_decode_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,043 copying build/lib/flashinfer/data/csrc/batch_decode_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,044 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,047 copying build/lib/flashinfer/data/csrc/pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,049 copying build/lib/flashinfer/data/csrc/flashinfer_page_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,050 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,052 copying build/lib/flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,054 copying build/lib/flashinfer/data/csrc/gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,056 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,058 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,060 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,062 copying build/lib/flashinfer/data/csrc/trtllm_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,065 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal 2026-04-16T04:56:32,067 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp 2026-04-16T04:56:32,068 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:32,070 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:32,071 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:32,073 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:32,076 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:32,078 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-16T04:56:32,081 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-16T04:56:32,082 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-16T04:56:32,086 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm 2026-04-16T04:56:32,088 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,089 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,091 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,093 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,096 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,098 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,100 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,103 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,106 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,108 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-16T04:56:32,111 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,112 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,114 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,117 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,120 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,123 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,126 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,129 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,132 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,134 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,137 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,139 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-16T04:56:32,141 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,142 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,145 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,147 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,149 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,151 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-16T04:56:32,153 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-16T04:56:32,156 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-16T04:56:32,159 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-16T04:56:32,160 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-16T04:56:32,162 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-16T04:56:32,165 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,167 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:32,169 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,170 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,172 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,175 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,177 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,179 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,181 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,183 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,185 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,187 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,189 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,190 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,193 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,198 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,200 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,202 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,204 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-16T04:56:32,206 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,208 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,210 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,212 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,213 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,215 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,217 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,221 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,222 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,225 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,227 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-16T04:56:32,229 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:32,231 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:32,234 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:32,235 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:32,238 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-16T04:56:32,239 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-16T04:56:32,242 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-16T04:56:32,244 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:32,247 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:32,249 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:32,253 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-16T04:56:32,255 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-16T04:56:32,259 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,260 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,262 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,263 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,265 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,267 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,269 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,270 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,272 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,274 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,276 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,277 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,279 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,282 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,284 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-16T04:56:32,285 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-16T04:56:32,288 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-16T04:56:32,290 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,292 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,293 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,296 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,297 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,299 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,301 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-16T04:56:32,304 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:32,305 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:32,307 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:32,310 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:32,312 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:32,314 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-16T04:56:32,317 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,321 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-16T04:56:32,322 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-16T04:56:32,324 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-16T04:56:32,326 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-16T04:56:32,329 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,330 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,332 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,334 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,336 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,338 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,340 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,343 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-16T04:56:32,345 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions 2026-04-16T04:56:32,346 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include 2026-04-16T04:56:32,348 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,350 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-16T04:56:32,351 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-16T04:56:32,354 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm 2026-04-16T04:56:32,356 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:32,357 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:32,359 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:32,361 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-16T04:56:32,364 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,365 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,368 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,370 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,373 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,375 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,377 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,380 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,383 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,385 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,387 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,390 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,392 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-16T04:56:32,396 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,397 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,400 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,403 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,405 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,407 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,409 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,412 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,414 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,417 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,419 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,421 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,423 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-16T04:56:32,427 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,429 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:32,430 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:32,433 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:32,435 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-16T04:56:32,438 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,441 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,444 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,446 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,448 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,450 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,453 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,456 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,458 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,460 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-16T04:56:32,462 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,465 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue 2026-04-16T04:56:32,466 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-16T04:56:32,467 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-16T04:56:32,471 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-16T04:56:32,471 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-16T04:56:32,474 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-16T04:56:32,475 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-16T04:56:32,478 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-16T04:56:32,481 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,483 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail 2026-04-16T04:56:32,485 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-16T04:56:32,486 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-16T04:56:32,489 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,492 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication 2026-04-16T04:56:32,493 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-16T04:56:32,494 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-16T04:56:32,497 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,499 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform 2026-04-16T04:56:32,501 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-16T04:56:32,502 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-16T04:56:32,504 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,507 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:32,508 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:32,510 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:32,512 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:32,514 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:32,516 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-16T04:56:32,518 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,521 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-16T04:56:32,524 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include 2026-04-16T04:56:32,525 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm 2026-04-16T04:56:32,527 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,528 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,531 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,533 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,535 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,537 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,539 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,541 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,543 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,545 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,547 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,550 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-16T04:56:32,552 copying build/lib/flashinfer/data/csrc/flashinfer_cascade_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,553 copying build/lib/flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,555 copying build/lib/flashinfer/data/csrc/flashinfer_mamba_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,557 copying build/lib/flashinfer/data/csrc/batch_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,559 copying build/lib/flashinfer/data/csrc/batch_pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,561 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,563 copying build/lib/flashinfer/data/csrc/batch_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,565 copying build/lib/flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,567 copying build/lib/flashinfer/data/csrc/batch_decode_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,569 copying build/lib/flashinfer/data/csrc/trtllm_alltoall_prepare.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,571 copying build/lib/flashinfer/data/csrc/norm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,573 copying build/lib/flashinfer/data/csrc/selective_state_update.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,576 copying build/lib/flashinfer/data/csrc/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,578 copying build/lib/flashinfer/data/csrc/batch_attention_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,580 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,582 copying build/lib/flashinfer/data/csrc/single_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,584 copying build/lib/flashinfer/data/csrc/flashinfer_rmsnorm_silu_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,586 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,588 copying build/lib/flashinfer/data/csrc/group_gemm_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,590 copying build/lib/flashinfer/data/csrc/flashinfer_quantization_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,592 copying build/lib/flashinfer/data/csrc/bmm_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,595 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,596 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,597 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,599 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,604 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,607 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,609 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,612 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,616 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,619 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,620 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,623 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,625 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/mask.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,628 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,632 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,635 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,638 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:32,640 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:32,642 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:32,645 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:32,648 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:32,651 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-16T04:56:32,653 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,656 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,660 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,661 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,663 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,665 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,667 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,670 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,673 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,676 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,678 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,681 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,685 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,688 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,691 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,694 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,698 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,701 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,703 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,706 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,708 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-16T04:56:32,710 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,712 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,715 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-16T04:56:32,717 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,720 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,723 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:32,724 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:32,726 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:32,729 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:32,731 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-16T04:56:32,733 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,736 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,738 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,741 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,743 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,746 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,748 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,751 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,754 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,756 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,759 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,762 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,764 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,766 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,769 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,771 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-16T04:56:32,773 copying build/lib/flashinfer/data/csrc/fp4_kv_quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,776 copying build/lib/flashinfer/data/csrc/flashinfer_topk_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,778 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,779 copying build/lib/flashinfer/data/csrc/single_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,782 copying build/lib/flashinfer/data/csrc/trtllm_batched_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,784 copying build/lib/flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,786 copying build/lib/flashinfer/data/csrc/blackwell_fmha_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,788 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,790 copying build/lib/flashinfer/data/csrc/batch_pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,792 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,794 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,796 copying build/lib/flashinfer/data/csrc/single_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,798 copying build/lib/flashinfer/data/csrc/logging.cc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,800 copying build/lib/flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,803 copying build/lib/flashinfer/data/csrc/flashinfer_xqa_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,805 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,806 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,808 copying build/lib/flashinfer/data/csrc/cutlass_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,810 copying build/lib/flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,812 copying build/lib/flashinfer/data/csrc/batch_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,814 copying build/lib/flashinfer/data/csrc/single_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,816 copying build/lib/flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,818 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,820 copying build/lib/flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,822 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,823 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,825 copying build/lib/flashinfer/data/csrc/tvm_ffi_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,828 copying build/lib/flashinfer/data/csrc/renorm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,830 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,833 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,835 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,836 copying build/lib/flashinfer/data/csrc/sampling_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,838 copying build/lib/flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,840 copying build/lib/flashinfer/data/csrc/runtime_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,842 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,844 copying build/lib/flashinfer/data/csrc/single_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,846 copying build/lib/flashinfer/data/csrc/batch_pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,848 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,850 copying build/lib/flashinfer/data/csrc/cascade.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,853 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,855 copying build/lib/flashinfer/data/csrc/fmhaReduction.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,858 copying build/lib/flashinfer/data/csrc/moe_utils_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,860 copying build/lib/flashinfer/data/csrc/fp4_kv_dequantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,863 copying build/lib/flashinfer/data/csrc/group_gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,865 copying build/lib/flashinfer/data/csrc/batch_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,866 copying build/lib/flashinfer/data/csrc/batch_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,868 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,871 copying build/lib/flashinfer/data/csrc/batch_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,873 copying build/lib/flashinfer/data/csrc/selective_state_update_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,875 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,877 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,879 copying build/lib/flashinfer/data/csrc/trtllm_moe_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,882 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/xqa 2026-04-16T04:56:32,883 copying build/lib/flashinfer/data/csrc/xqa/tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,886 copying build/lib/flashinfer/data/csrc/xqa/barriers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,888 copying build/lib/flashinfer/data/csrc/xqa/mha.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,891 copying build/lib/flashinfer/data/csrc/xqa/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,893 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,897 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,899 copying build/lib/flashinfer/data/csrc/xqa/specDec.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,900 copying build/lib/flashinfer/data/csrc/xqa/mha.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,904 copying build/lib/flashinfer/data/csrc/xqa/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,906 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,908 copying build/lib/flashinfer/data/csrc/xqa/gmma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,910 copying build/lib/flashinfer/data/csrc/xqa/gmma_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,919 copying build/lib/flashinfer/data/csrc/xqa/mhaUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,922 copying build/lib/flashinfer/data/csrc/xqa/hostUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,924 copying build/lib/flashinfer/data/csrc/xqa/mha_components.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,926 copying build/lib/flashinfer/data/csrc/xqa/ldgsts.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,928 copying build/lib/flashinfer/data/csrc/xqa/mha_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,933 copying build/lib/flashinfer/data/csrc/xqa/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,936 copying build/lib/flashinfer/data/csrc/xqa/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,939 copying build/lib/flashinfer/data/csrc/xqa/mha_stdheaders.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,942 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,944 copying build/lib/flashinfer/data/csrc/xqa/defines.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,946 copying build/lib/flashinfer/data/csrc/xqa/cuda_hint.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,948 copying build/lib/flashinfer/data/csrc/xqa/xqa_wrapper.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-16T04:56:32,950 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,952 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,955 copying build/lib/flashinfer/data/csrc/vllm_custom_all_reduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,957 copying build/lib/flashinfer/data/csrc/batch_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,960 copying build/lib/flashinfer/data/csrc/single_prefill_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,962 copying build/lib/flashinfer/data/csrc/group_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,964 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,966 copying build/lib/flashinfer/data/csrc/single_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,968 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,970 copying build/lib/flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,973 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,975 copying build/lib/flashinfer/data/csrc/flashinfer_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,977 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,978 copying build/lib/flashinfer/data/csrc/batch_pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,980 copying build/lib/flashinfer/data/csrc/batch_attention.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,983 copying build/lib/flashinfer/data/csrc/batch_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,984 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,988 copying build/lib/flashinfer/data/csrc/group_gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,990 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,992 copying build/lib/flashinfer/data/csrc/sampling.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,994 copying build/lib/flashinfer/data/csrc/single_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,996 copying build/lib/flashinfer/data/csrc/gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:32,998 copying build/lib/flashinfer/data/csrc/batch_attention_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,000 copying build/lib/flashinfer/data/csrc/trtllm_fmha_v2_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,002 copying build/lib/flashinfer/data/csrc/selective_state_update_kernel_inst.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,004 copying build/lib/flashinfer/data/csrc/flashinfer_sampling_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,007 copying build/lib/flashinfer/data/csrc/rope.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,009 copying build/lib/flashinfer/data/csrc/pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,011 copying build/lib/flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,014 copying build/lib/flashinfer/data/csrc/rmsnorm_silu.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,016 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe 2026-04-16T04:56:33,017 copying build/lib/flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-16T04:56:33,020 copying build/lib/flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-16T04:56:33,023 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:33,024 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:33,030 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:33,032 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:33,036 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-16T04:56:33,039 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:33,040 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:33,043 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:33,047 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:33,050 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:33,053 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-16T04:56:33,056 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,058 copying build/lib/flashinfer/data/csrc/trtllm_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,062 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,064 copying build/lib/flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,067 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,070 copying build/lib/flashinfer/data/csrc/concat_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,072 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,074 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,077 copying build/lib/flashinfer/data/csrc/flashinfer_norm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,079 copying build/lib/flashinfer/data/csrc/single_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,081 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,084 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,087 copying build/lib/flashinfer/data/csrc/tgv_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,090 copying build/lib/flashinfer/data/csrc/batch_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,092 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,095 copying build/lib/flashinfer/data/csrc/pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,098 copying build/lib/flashinfer/data/csrc/pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,100 copying build/lib/flashinfer/data/csrc/tgv_gemm.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,102 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,105 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,107 copying build/lib/flashinfer/data/csrc/gdn_prefill_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,109 copying build/lib/flashinfer/data/csrc/tinygemm2.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,112 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,115 copying build/lib/flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,117 copying build/lib/flashinfer/data/csrc/topk.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,120 copying build/lib/flashinfer/data/csrc/batch_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,122 copying build/lib/flashinfer/data/csrc/dsv3_router_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,125 copying build/lib/flashinfer/data/csrc/batch_decode_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-16T04:56:33,128 copying build/lib/flashinfer/_build_meta.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,131 copying build/lib/flashinfer/trtllm_low_latency_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,133 copying build/lib/flashinfer/tllm_enums.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,136 copying build/lib/flashinfer/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,139 copying build/lib/flashinfer/compilation_context.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,142 copying build/lib/flashinfer/version.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,144 copying build/lib/flashinfer/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,151 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe 2026-04-16T04:56:33,153 copying build/lib/flashinfer/fused_moe/fused_routing_dsv3.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-16T04:56:33,156 copying build/lib/flashinfer/fused_moe/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-16T04:56:33,158 copying build/lib/flashinfer/fused_moe/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-16T04:56:33,161 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,163 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:33,165 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:33,171 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:33,176 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:33,179 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:33,182 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-16T04:56:33,184 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,188 copying build/lib/flashinfer/fused_moe/cute_dsl/tuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,190 copying build/lib/flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,193 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,196 copying build/lib/flashinfer/fused_moe/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,197 copying build/lib/flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-16T04:56:33,200 copying build/lib/flashinfer/fused_moe/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-16T04:56:33,204 copying build/lib/flashinfer/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,207 copying build/lib/flashinfer/sparse.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,210 copying build/lib/flashinfer/__main__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,213 copying build/lib/flashinfer/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,215 copying build/lib/flashinfer/py.typed -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,216 copying build/lib/flashinfer/artifacts.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-16T04:56:33,219 creating build/bdist.linux-armv7l/wheel/flashinfer/testing 2026-04-16T04:56:33,220 copying build/lib/flashinfer/testing/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-16T04:56:33,222 copying build/lib/flashinfer/testing/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-16T04:56:33,226 running install_egg_info 2026-04-16T04:56:33,238 running egg_info 2026-04-16T04:56:33,244 writing flashinfer_python.egg-info/PKG-INFO 2026-04-16T04:56:33,248 writing dependency_links to flashinfer_python.egg-info/dependency_links.txt 2026-04-16T04:56:33,250 writing entry points to flashinfer_python.egg-info/entry_points.txt 2026-04-16T04:56:33,252 writing requirements to flashinfer_python.egg-info/requires.txt 2026-04-16T04:56:33,254 writing top-level names to flashinfer_python.egg-info/top_level.txt 2026-04-16T04:56:34,082 reading manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-16T04:56:34,201 adding license file 'LICENSE' 2026-04-16T04:56:34,326 writing manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-16T04:56:34,331 Copying flashinfer_python.egg-info to build/bdist.linux-armv7l/wheel/./flashinfer_python-0.6.8-py3.11.egg-info 2026-04-16T04:56:34,347 running install_scripts 2026-04-16T04:56:34,360 creating build/bdist.linux-armv7l/wheel/flashinfer_python-0.6.8.dist-info/WHEEL 2026-04-16T04:56:34,363 creating '/tmp/pip-wheel-asfseyvc/.tmp-9xjn5chg/flashinfer_python-0.6.8-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-16T04:56:34,365 adding 'build_backend.py' 2026-04-16T04:56:34,367 adding 'build_utils.py' 2026-04-16T04:56:34,370 adding 'flashinfer/__init__.py' 2026-04-16T04:56:34,372 adding 'flashinfer/__main__.py' 2026-04-16T04:56:34,374 adding 'flashinfer/_build_meta.py' 2026-04-16T04:56:34,375 adding 'flashinfer/activation.py' 2026-04-16T04:56:34,379 adding 'flashinfer/aot.py' 2026-04-16T04:56:34,386 adding 'flashinfer/api_logging.py' 2026-04-16T04:56:34,388 adding 'flashinfer/artifacts.py' 2026-04-16T04:56:34,390 adding 'flashinfer/attention.py' 2026-04-16T04:56:34,397 adding 'flashinfer/autotuner.py' 2026-04-16T04:56:34,401 adding 'flashinfer/cascade.py' 2026-04-16T04:56:34,403 adding 'flashinfer/compilation_context.py' 2026-04-16T04:56:34,404 adding 'flashinfer/concat_ops.py' 2026-04-16T04:56:34,406 adding 'flashinfer/cuda_utils.py' 2026-04-16T04:56:34,417 adding 'flashinfer/decode.py' 2026-04-16T04:56:34,422 adding 'flashinfer/deep_gemm.py' 2026-04-16T04:56:34,424 adding 'flashinfer/fp4_quantization.py' 2026-04-16T04:56:34,425 adding 'flashinfer/fp8_quantization.py' 2026-04-16T04:56:34,428 adding 'flashinfer/gdn_decode.py' 2026-04-16T04:56:34,430 adding 'flashinfer/gdn_prefill.py' 2026-04-16T04:56:34,432 adding 'flashinfer/green_ctx.py' 2026-04-16T04:56:34,434 adding 'flashinfer/page.py' 2026-04-16T04:56:34,438 adding 'flashinfer/pod.py' 2026-04-16T04:56:34,455 adding 'flashinfer/prefill.py' 2026-04-16T04:56:34,458 adding 'flashinfer/py.typed' 2026-04-16T04:56:34,463 adding 'flashinfer/rope.py' 2026-04-16T04:56:34,469 adding 'flashinfer/sampling.py' 2026-04-16T04:56:34,473 adding 'flashinfer/sparse.py' 2026-04-16T04:56:34,475 adding 'flashinfer/tllm_enums.py' 2026-04-16T04:56:34,476 adding 'flashinfer/tllm_utils.py' 2026-04-16T04:56:34,479 adding 'flashinfer/topk.py' 2026-04-16T04:56:34,480 adding 'flashinfer/trtllm_low_latency_gemm.py' 2026-04-16T04:56:34,486 adding 'flashinfer/utils.py' 2026-04-16T04:56:34,487 adding 'flashinfer/version.py' 2026-04-16T04:56:34,490 adding 'flashinfer/xqa.py' 2026-04-16T04:56:34,492 adding 'flashinfer/comm/__init__.py' 2026-04-16T04:56:34,495 adding 'flashinfer/comm/allreduce.py' 2026-04-16T04:56:34,497 adding 'flashinfer/comm/cuda_ipc.py' 2026-04-16T04:56:34,499 adding 'flashinfer/comm/dlpack_utils.py' 2026-04-16T04:56:34,501 adding 'flashinfer/comm/mapping.py' 2026-04-16T04:56:34,507 adding 'flashinfer/comm/mnnvl.py' 2026-04-16T04:56:34,509 adding 'flashinfer/comm/nvshmem.py' 2026-04-16T04:56:34,511 adding 'flashinfer/comm/nvshmem_allreduce.py' 2026-04-16T04:56:34,513 adding 'flashinfer/comm/trtllm_alltoall.py' 2026-04-16T04:56:34,517 adding 'flashinfer/comm/trtllm_ar.py' 2026-04-16T04:56:34,521 adding 'flashinfer/comm/trtllm_mnnvl_ar.py' 2026-04-16T04:56:34,524 adding 'flashinfer/comm/trtllm_moe_alltoall.py' 2026-04-16T04:56:34,525 adding 'flashinfer/comm/vllm_ar.py' 2026-04-16T04:56:34,527 adding 'flashinfer/comm/workspace_base.py' 2026-04-16T04:56:34,529 adding 'flashinfer/cudnn/__init__.py' 2026-04-16T04:56:34,531 adding 'flashinfer/cudnn/decode.py' 2026-04-16T04:56:34,534 adding 'flashinfer/cudnn/prefill.py' 2026-04-16T04:56:34,535 adding 'flashinfer/cudnn/utils.py' 2026-04-16T04:56:34,537 adding 'flashinfer/cute_dsl/__init__.py' 2026-04-16T04:56:34,542 adding 'flashinfer/cute_dsl/add_rmsnorm_fp4quant.py' 2026-04-16T04:56:34,544 adding 'flashinfer/cute_dsl/blockscaled_gemm.py' 2026-04-16T04:56:34,547 adding 'flashinfer/cute_dsl/fp4_common.py' 2026-04-16T04:56:34,555 adding 'flashinfer/cute_dsl/gemm_allreduce_two_shot.py' 2026-04-16T04:56:34,561 adding 'flashinfer/cute_dsl/rmsnorm_fp4quant.py' 2026-04-16T04:56:34,563 adding 'flashinfer/cute_dsl/utils.py' 2026-04-16T04:56:34,565 adding 'flashinfer/cute_dsl/attention/__init__.py' 2026-04-16T04:56:34,568 adding 'flashinfer/cute_dsl/attention/collective_builder.py' 2026-04-16T04:56:34,570 adding 'flashinfer/cute_dsl/attention/compat.py' 2026-04-16T04:56:34,572 adding 'flashinfer/cute_dsl/attention/config.py' 2026-04-16T04:56:34,573 adding 'flashinfer/cute_dsl/attention/mainloop_spec.py' 2026-04-16T04:56:34,575 adding 'flashinfer/cute_dsl/attention/mla_config.py' 2026-04-16T04:56:34,578 adding 'flashinfer/cute_dsl/attention/mla_decode.py' 2026-04-16T04:56:34,581 adding 'flashinfer/cute_dsl/attention/mla_decode_fp8.py' 2026-04-16T04:56:34,582 adding 'flashinfer/cute_dsl/attention/mla_warp_schedule.py' 2026-04-16T04:56:34,585 adding 'flashinfer/cute_dsl/attention/pipeline_topology.py' 2026-04-16T04:56:34,587 adding 'flashinfer/cute_dsl/attention/prefill.py' 2026-04-16T04:56:34,589 adding 'flashinfer/cute_dsl/attention/tmem_layout.py' 2026-04-16T04:56:34,590 adding 'flashinfer/cute_dsl/attention/warp_schedule.py' 2026-04-16T04:56:34,592 adding 'flashinfer/cute_dsl/attention/fusion/__init__.py' 2026-04-16T04:56:34,593 adding 'flashinfer/cute_dsl/attention/fusion/mask.py' 2026-04-16T04:56:34,596 adding 'flashinfer/cute_dsl/attention/fusion/variant.py' 2026-04-16T04:56:34,598 adding 'flashinfer/cute_dsl/attention/roles/__init__.py' 2026-04-16T04:56:34,601 adding 'flashinfer/cute_dsl/attention/roles/correction.py' 2026-04-16T04:56:34,602 adding 'flashinfer/cute_dsl/attention/roles/epilogue.py' 2026-04-16T04:56:34,604 adding 'flashinfer/cute_dsl/attention/roles/loader_tma.py' 2026-04-16T04:56:34,607 adding 'flashinfer/cute_dsl/attention/roles/mla_compute.py' 2026-04-16T04:56:34,610 adding 'flashinfer/cute_dsl/attention/roles/mla_correction.py' 2026-04-16T04:56:34,613 adding 'flashinfer/cute_dsl/attention/roles/mla_loader.py' 2026-04-16T04:56:34,615 adding 'flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py' 2026-04-16T04:56:34,617 adding 'flashinfer/cute_dsl/attention/roles/mla_mma.py' 2026-04-16T04:56:34,619 adding 'flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py' 2026-04-16T04:56:34,621 adding 'flashinfer/cute_dsl/attention/roles/mla_pt_loader.py' 2026-04-16T04:56:34,623 adding 'flashinfer/cute_dsl/attention/roles/mma.py' 2026-04-16T04:56:34,626 adding 'flashinfer/cute_dsl/attention/roles/softmax.py' 2026-04-16T04:56:34,627 adding 'flashinfer/cute_dsl/attention/roles/softmax_math.py' 2026-04-16T04:56:34,629 adding 'flashinfer/cute_dsl/attention/scheduler/__init__.py' 2026-04-16T04:56:34,631 adding 'flashinfer/cute_dsl/attention/scheduler/mla_persistent.py' 2026-04-16T04:56:34,633 adding 'flashinfer/cute_dsl/attention/scheduler/persistent.py' 2026-04-16T04:56:34,634 adding 'flashinfer/cute_dsl/attention/wrappers/__init__.py' 2026-04-16T04:56:34,637 adding 'flashinfer/cute_dsl/attention/wrappers/batch_mla.py' 2026-04-16T04:56:34,640 adding 'flashinfer/cute_dsl/attention/wrappers/batch_prefill.py' 2026-04-16T04:56:34,642 adding 'flashinfer/data/build_backend.py' 2026-04-16T04:56:34,643 adding 'flashinfer/data/build_utils.py' 2026-04-16T04:56:34,648 adding 'flashinfer/data/csrc/batch_attention.cu' 2026-04-16T04:56:34,650 adding 'flashinfer/data/csrc/batch_attention_customize_config.jinja' 2026-04-16T04:56:34,652 adding 'flashinfer/data/csrc/batch_attention_jit_binding.cu' 2026-04-16T04:56:34,653 adding 'flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja' 2026-04-16T04:56:34,655 adding 'flashinfer/data/csrc/batch_decode.cu' 2026-04-16T04:56:34,656 adding 'flashinfer/data/csrc/batch_decode_customize_config.jinja' 2026-04-16T04:56:34,657 adding 'flashinfer/data/csrc/batch_decode_jit_binding.cu' 2026-04-16T04:56:34,659 adding 'flashinfer/data/csrc/batch_decode_kernel_inst.jinja' 2026-04-16T04:56:34,660 adding 'flashinfer/data/csrc/batch_decode_mla_binding.cu' 2026-04-16T04:56:34,661 adding 'flashinfer/data/csrc/batch_decode_mla_config.jinja' 2026-04-16T04:56:34,663 adding 'flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu' 2026-04-16T04:56:34,664 adding 'flashinfer/data/csrc/batch_decode_mla_plan.cu' 2026-04-16T04:56:34,666 adding 'flashinfer/data/csrc/batch_decode_mla_run.cu' 2026-04-16T04:56:34,667 adding 'flashinfer/data/csrc/batch_mla_binding.cu' 2026-04-16T04:56:34,668 adding 'flashinfer/data/csrc/batch_mla_config.jinja' 2026-04-16T04:56:34,670 adding 'flashinfer/data/csrc/batch_mla_plan.cu' 2026-04-16T04:56:34,671 adding 'flashinfer/data/csrc/batch_mla_run.cu' 2026-04-16T04:56:34,672 adding 'flashinfer/data/csrc/batch_mla_sm90_binding.cu' 2026-04-16T04:56:34,674 adding 'flashinfer/data/csrc/batch_mla_sm90_plan.cu' 2026-04-16T04:56:34,675 adding 'flashinfer/data/csrc/batch_mla_sm90_run.cu' 2026-04-16T04:56:34,677 adding 'flashinfer/data/csrc/batch_pod.cu' 2026-04-16T04:56:34,679 adding 'flashinfer/data/csrc/batch_pod_customize_config.jinja' 2026-04-16T04:56:34,680 adding 'flashinfer/data/csrc/batch_pod_jit_binding.cu' 2026-04-16T04:56:34,681 adding 'flashinfer/data/csrc/batch_pod_kernel_inst.jinja' 2026-04-16T04:56:34,683 adding 'flashinfer/data/csrc/batch_prefill.cu' 2026-04-16T04:56:34,684 adding 'flashinfer/data/csrc/batch_prefill_customize_config.jinja' 2026-04-16T04:56:34,685 adding 'flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,687 adding 'flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,689 adding 'flashinfer/data/csrc/batch_prefill_fp8_sm90.cu' 2026-04-16T04:56:34,690 adding 'flashinfer/data/csrc/batch_prefill_jit_binding.cu' 2026-04-16T04:56:34,691 adding 'flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja' 2026-04-16T04:56:34,692 adding 'flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,694 adding 'flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja' 2026-04-16T04:56:34,695 adding 'flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,697 adding 'flashinfer/data/csrc/batch_prefill_sm90.cu' 2026-04-16T04:56:34,698 adding 'flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja' 2026-04-16T04:56:34,699 adding 'flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu' 2026-04-16T04:56:34,701 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.cu' 2026-04-16T04:56:34,702 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.jinja' 2026-04-16T04:56:34,704 adding 'flashinfer/data/csrc/blackwell_fmha_plan.cu' 2026-04-16T04:56:34,705 adding 'flashinfer/data/csrc/bmm_fp8.cu' 2026-04-16T04:56:34,706 adding 'flashinfer/data/csrc/cascade.cu' 2026-04-16T04:56:34,708 adding 'flashinfer/data/csrc/concat_mla.cu' 2026-04-16T04:56:34,713 adding 'flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu' 2026-04-16T04:56:34,715 adding 'flashinfer/data/csrc/cudnn_sdpa_utils.h' 2026-04-16T04:56:34,716 adding 'flashinfer/data/csrc/cutlass_mla.cu' 2026-04-16T04:56:34,718 adding 'flashinfer/data/csrc/dsv3_router_gemm.cu' 2026-04-16T04:56:34,719 adding 'flashinfer/data/csrc/flashinfer_cascade_binding.cu' 2026-04-16T04:56:34,721 adding 'flashinfer/data/csrc/flashinfer_gemm_binding.cu' 2026-04-16T04:56:34,722 adding 'flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu' 2026-04-16T04:56:34,723 adding 'flashinfer/data/csrc/flashinfer_mamba_binding.cu' 2026-04-16T04:56:34,725 adding 'flashinfer/data/csrc/flashinfer_mla_binding.cu' 2026-04-16T04:56:34,726 adding 'flashinfer/data/csrc/flashinfer_norm_binding.cu' 2026-04-16T04:56:34,727 adding 'flashinfer/data/csrc/flashinfer_page_binding.cu' 2026-04-16T04:56:34,728 adding 'flashinfer/data/csrc/flashinfer_quantization_binding.cu' 2026-04-16T04:56:34,730 adding 'flashinfer/data/csrc/flashinfer_rmsnorm_silu_binding.cu' 2026-04-16T04:56:34,731 adding 'flashinfer/data/csrc/flashinfer_rope_binding.cu' 2026-04-16T04:56:34,733 adding 'flashinfer/data/csrc/flashinfer_sampling_binding.cu' 2026-04-16T04:56:34,734 adding 'flashinfer/data/csrc/flashinfer_topk_binding.cu' 2026-04-16T04:56:34,735 adding 'flashinfer/data/csrc/flashinfer_xqa_binding.cu' 2026-04-16T04:56:34,737 adding 'flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc' 2026-04-16T04:56:34,739 adding 'flashinfer/data/csrc/fmhaReduction.cu' 2026-04-16T04:56:34,741 adding 'flashinfer/data/csrc/fmha_cutlass_sm100.cu' 2026-04-16T04:56:34,742 adding 'flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu' 2026-04-16T04:56:34,744 adding 'flashinfer/data/csrc/fmha_v2_jit_binding.cu' 2026-04-16T04:56:34,748 adding 'flashinfer/data/csrc/fmha_v2_run.cu' 2026-04-16T04:56:34,749 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.cu' 2026-04-16T04:56:34,751 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.jinja' 2026-04-16T04:56:34,752 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu' 2026-04-16T04:56:34,754 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja' 2026-04-16T04:56:34,755 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu' 2026-04-16T04:56:34,756 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja' 2026-04-16T04:56:34,758 adding 'flashinfer/data/csrc/fp4_kv_dequantization.cu' 2026-04-16T04:56:34,760 adding 'flashinfer/data/csrc/fp4_kv_quantization.cu' 2026-04-16T04:56:34,762 adding 'flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu' 2026-04-16T04:56:34,763 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.cu' 2026-04-16T04:56:34,765 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.jinja' 2026-04-16T04:56:34,766 adding 'flashinfer/data/csrc/gdn_prefill_launcher.cu' 2026-04-16T04:56:34,768 adding 'flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,770 adding 'flashinfer/data/csrc/gemm_groupwise_sm100.cu' 2026-04-16T04:56:34,771 adding 'flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja' 2026-04-16T04:56:34,772 adding 'flashinfer/data/csrc/gemm_groupwise_sm120.cu' 2026-04-16T04:56:34,774 adding 'flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja' 2026-04-16T04:56:34,775 adding 'flashinfer/data/csrc/gemm_sm100_binding.cu' 2026-04-16T04:56:34,776 adding 'flashinfer/data/csrc/gemm_sm120_binding.cu' 2026-04-16T04:56:34,777 adding 'flashinfer/data/csrc/group_gemm.cu' 2026-04-16T04:56:34,779 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu' 2026-04-16T04:56:34,780 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja' 2026-04-16T04:56:34,782 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu' 2026-04-16T04:56:34,783 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja' 2026-04-16T04:56:34,785 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu' 2026-04-16T04:56:34,786 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja' 2026-04-16T04:56:34,788 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120.cu' 2026-04-16T04:56:34,789 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja' 2026-04-16T04:56:34,791 adding 'flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120.cu' 2026-04-16T04:56:34,792 adding 'flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja' 2026-04-16T04:56:34,793 adding 'flashinfer/data/csrc/group_gemm_sm100_binding.cu' 2026-04-16T04:56:34,795 adding 'flashinfer/data/csrc/group_gemm_sm120_binding.cu' 2026-04-16T04:56:34,796 adding 'flashinfer/data/csrc/group_gemm_sm90.cu' 2026-04-16T04:56:34,798 adding 'flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,799 adding 'flashinfer/data/csrc/logging.cc' 2026-04-16T04:56:34,801 adding 'flashinfer/data/csrc/moe_utils_binding.cu' 2026-04-16T04:56:34,803 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.cu' 2026-04-16T04:56:34,805 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja' 2026-04-16T04:56:34,807 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.cu' 2026-04-16T04:56:34,808 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.jinja' 2026-04-16T04:56:34,810 adding 'flashinfer/data/csrc/norm.cu' 2026-04-16T04:56:34,811 adding 'flashinfer/data/csrc/page.cu' 2026-04-16T04:56:34,813 adding 'flashinfer/data/csrc/pod.cu' 2026-04-16T04:56:34,815 adding 'flashinfer/data/csrc/pod_customize_config.jinja' 2026-04-16T04:56:34,816 adding 'flashinfer/data/csrc/pod_jit_binding.cu' 2026-04-16T04:56:34,818 adding 'flashinfer/data/csrc/pod_kernel_inst.jinja' 2026-04-16T04:56:34,819 adding 'flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu' 2026-04-16T04:56:34,821 adding 'flashinfer/data/csrc/quantization.cu' 2026-04-16T04:56:34,822 adding 'flashinfer/data/csrc/renorm.cu' 2026-04-16T04:56:34,823 adding 'flashinfer/data/csrc/rmsnorm_silu.cu' 2026-04-16T04:56:34,826 adding 'flashinfer/data/csrc/rope.cu' 2026-04-16T04:56:34,827 adding 'flashinfer/data/csrc/runtime_utils.h' 2026-04-16T04:56:34,829 adding 'flashinfer/data/csrc/sampling.cu' 2026-04-16T04:56:34,830 adding 'flashinfer/data/csrc/sampling_utils.h' 2026-04-16T04:56:34,833 adding 'flashinfer/data/csrc/selective_state_update.cu' 2026-04-16T04:56:34,835 adding 'flashinfer/data/csrc/selective_state_update_customize_config.jinja' 2026-04-16T04:56:34,836 adding 'flashinfer/data/csrc/selective_state_update_dtype_inst.jinja' 2026-04-16T04:56:34,837 adding 'flashinfer/data/csrc/selective_state_update_kernel_inst.cu' 2026-04-16T04:56:34,838 adding 'flashinfer/data/csrc/seq_chunk_cumsum.cu' 2026-04-16T04:56:34,840 adding 'flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu' 2026-04-16T04:56:34,841 adding 'flashinfer/data/csrc/single_decode.cu' 2026-04-16T04:56:34,843 adding 'flashinfer/data/csrc/single_decode_customize_config.jinja' 2026-04-16T04:56:34,844 adding 'flashinfer/data/csrc/single_decode_jit_binding.cu' 2026-04-16T04:56:34,845 adding 'flashinfer/data/csrc/single_decode_kernel_inst.jinja' 2026-04-16T04:56:34,847 adding 'flashinfer/data/csrc/single_prefill.cu' 2026-04-16T04:56:34,848 adding 'flashinfer/data/csrc/single_prefill_customize_config.jinja' 2026-04-16T04:56:34,849 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90.cu' 2026-04-16T04:56:34,851 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,852 adding 'flashinfer/data/csrc/single_prefill_jit_binding.cu' 2026-04-16T04:56:34,853 adding 'flashinfer/data/csrc/single_prefill_kernel_inst.jinja' 2026-04-16T04:56:34,854 adding 'flashinfer/data/csrc/single_prefill_sm90.cu' 2026-04-16T04:56:34,856 adding 'flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja' 2026-04-16T04:56:34,857 adding 'flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu' 2026-04-16T04:56:34,858 adding 'flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja' 2026-04-16T04:56:34,860 adding 'flashinfer/data/csrc/tgv_gemm.cu' 2026-04-16T04:56:34,861 adding 'flashinfer/data/csrc/tgv_gemm.jinja' 2026-04-16T04:56:34,864 adding 'flashinfer/data/csrc/tinygemm2.cu' 2026-04-16T04:56:34,866 adding 'flashinfer/data/csrc/topk.cu' 2026-04-16T04:56:34,867 adding 'flashinfer/data/csrc/trtllm_allreduce.cu' 2026-04-16T04:56:34,869 adding 'flashinfer/data/csrc/trtllm_allreduce_fusion.cu' 2026-04-16T04:56:34,871 adding 'flashinfer/data/csrc/trtllm_alltoall.cu' 2026-04-16T04:56:34,874 adding 'flashinfer/data/csrc/trtllm_alltoall_prepare.cu' 2026-04-16T04:56:34,877 adding 'flashinfer/data/csrc/trtllm_batched_gemm_runner.cu' 2026-04-16T04:56:34,880 adding 'flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu' 2026-04-16T04:56:34,883 adding 'flashinfer/data/csrc/trtllm_fmha_v2_binding.cu' 2026-04-16T04:56:34,892 adding 'flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu' 2026-04-16T04:56:34,896 adding 'flashinfer/data/csrc/trtllm_fused_moe_runner.cu' 2026-04-16T04:56:34,899 adding 'flashinfer/data/csrc/trtllm_gemm_runner.cu' 2026-04-16T04:56:34,901 adding 'flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu' 2026-04-16T04:56:34,902 adding 'flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu' 2026-04-16T04:56:34,904 adding 'flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu' 2026-04-16T04:56:34,906 adding 'flashinfer/data/csrc/trtllm_moe_alltoall.cu' 2026-04-16T04:56:34,909 adding 'flashinfer/data/csrc/tvm_ffi_utils.h' 2026-04-16T04:56:34,911 adding 'flashinfer/data/csrc/vllm_custom_all_reduce.cu' 2026-04-16T04:56:34,913 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h' 2026-04-16T04:56:34,915 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h' 2026-04-16T04:56:34,917 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h' 2026-04-16T04:56:34,919 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h' 2026-04-16T04:56:34,921 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h' 2026-04-16T04:56:34,923 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h' 2026-04-16T04:56:34,925 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h' 2026-04-16T04:56:34,927 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h' 2026-04-16T04:56:34,930 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h' 2026-04-16T04:56:34,932 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h' 2026-04-16T04:56:34,934 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h' 2026-04-16T04:56:34,939 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h' 2026-04-16T04:56:34,941 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h' 2026-04-16T04:56:34,943 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h' 2026-04-16T04:56:34,945 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h' 2026-04-16T04:56:34,947 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h' 2026-04-16T04:56:34,950 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h' 2026-04-16T04:56:34,953 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h' 2026-04-16T04:56:34,955 adding 'flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h' 2026-04-16T04:56:34,960 adding 'flashinfer/data/csrc/fmha_v2/fmha/fragment.h' 2026-04-16T04:56:34,961 adding 'flashinfer/data/csrc/fmha_v2/fmha/gemm.h' 2026-04-16T04:56:34,963 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h' 2026-04-16T04:56:34,967 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h' 2026-04-16T04:56:34,970 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h' 2026-04-16T04:56:34,972 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h' 2026-04-16T04:56:34,976 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h' 2026-04-16T04:56:34,979 adding 'flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h' 2026-04-16T04:56:34,982 adding 'flashinfer/data/csrc/fmha_v2/fmha/mask.h' 2026-04-16T04:56:34,984 adding 'flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h' 2026-04-16T04:56:34,985 adding 'flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h' 2026-04-16T04:56:34,990 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h' 2026-04-16T04:56:34,995 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h' 2026-04-16T04:56:34,997 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h' 2026-04-16T04:56:35,000 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h' 2026-04-16T04:56:35,012 adding 'flashinfer/data/csrc/fmha_v2/fmha/softmax.h' 2026-04-16T04:56:35,015 adding 'flashinfer/data/csrc/fmha_v2/fmha/traits.h' 2026-04-16T04:56:35,021 adding 'flashinfer/data/csrc/fmha_v2/fmha/utils.h' 2026-04-16T04:56:35,024 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h' 2026-04-16T04:56:35,026 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h' 2026-04-16T04:56:35,029 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h' 2026-04-16T04:56:35,033 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h' 2026-04-16T04:56:35,035 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h' 2026-04-16T04:56:35,037 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h' 2026-04-16T04:56:35,040 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h' 2026-04-16T04:56:35,047 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h' 2026-04-16T04:56:35,049 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h' 2026-04-16T04:56:35,051 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h' 2026-04-16T04:56:35,053 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h' 2026-04-16T04:56:35,054 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h' 2026-04-16T04:56:35,056 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h' 2026-04-16T04:56:35,058 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h' 2026-04-16T04:56:35,060 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h' 2026-04-16T04:56:35,065 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h' 2026-04-16T04:56:35,067 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h' 2026-04-16T04:56:35,068 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h' 2026-04-16T04:56:35,070 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h' 2026-04-16T04:56:35,073 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h' 2026-04-16T04:56:35,078 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h' 2026-04-16T04:56:35,082 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h' 2026-04-16T04:56:35,085 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h' 2026-04-16T04:56:35,087 adding 'flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja' 2026-04-16T04:56:35,089 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel.jinja' 2026-04-16T04:56:35,091 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja' 2026-04-16T04:56:35,093 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja' 2026-04-16T04:56:35,096 adding 'flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh' 2026-04-16T04:56:35,098 adding 'flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu' 2026-04-16T04:56:35,100 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu' 2026-04-16T04:56:35,123 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh' 2026-04-16T04:56:35,126 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu' 2026-04-16T04:56:35,132 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu' 2026-04-16T04:56:35,137 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu' 2026-04-16T04:56:35,139 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu' 2026-04-16T04:56:35,144 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu' 2026-04-16T04:56:35,147 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu' 2026-04-16T04:56:35,151 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu' 2026-04-16T04:56:35,155 adding 'flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp' 2026-04-16T04:56:35,156 adding 'flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp' 2026-04-16T04:56:35,160 adding 'flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu' 2026-04-16T04:56:35,162 adding 'flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp' 2026-04-16T04:56:35,163 adding 'flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp' 2026-04-16T04:56:35,166 adding 'flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu' 2026-04-16T04:56:35,170 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h' 2026-04-16T04:56:35,171 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h' 2026-04-16T04:56:35,173 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h' 2026-04-16T04:56:35,174 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h' 2026-04-16T04:56:35,176 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h' 2026-04-16T04:56:35,180 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h' 2026-04-16T04:56:35,182 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h' 2026-04-16T04:56:35,183 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h' 2026-04-16T04:56:35,185 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h' 2026-04-16T04:56:35,187 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h' 2026-04-16T04:56:35,188 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h' 2026-04-16T04:56:35,190 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h' 2026-04-16T04:56:35,192 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh' 2026-04-16T04:56:35,194 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h' 2026-04-16T04:56:35,195 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh' 2026-04-16T04:56:35,197 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h' 2026-04-16T04:56:35,199 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h' 2026-04-16T04:56:35,200 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh' 2026-04-16T04:56:35,202 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh' 2026-04-16T04:56:35,204 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h' 2026-04-16T04:56:35,207 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h' 2026-04-16T04:56:35,208 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h' 2026-04-16T04:56:35,210 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h' 2026-04-16T04:56:35,212 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h' 2026-04-16T04:56:35,214 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h' 2026-04-16T04:56:35,215 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h' 2026-04-16T04:56:35,217 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h' 2026-04-16T04:56:35,219 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp' 2026-04-16T04:56:35,221 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp' 2026-04-16T04:56:35,222 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp' 2026-04-16T04:56:35,224 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h' 2026-04-16T04:56:35,225 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h' 2026-04-16T04:56:35,228 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp' 2026-04-16T04:56:35,232 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp' 2026-04-16T04:56:35,236 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp' 2026-04-16T04:56:35,239 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp' 2026-04-16T04:56:35,243 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp' 2026-04-16T04:56:35,245 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h' 2026-04-16T04:56:35,247 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp' 2026-04-16T04:56:35,248 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp' 2026-04-16T04:56:35,250 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp' 2026-04-16T04:56:35,251 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp' 2026-04-16T04:56:35,252 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp' 2026-04-16T04:56:35,254 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp' 2026-04-16T04:56:35,261 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp' 2026-04-16T04:56:35,265 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp' 2026-04-16T04:56:35,268 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-16T04:56:35,275 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-16T04:56:35,279 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl' 2026-04-16T04:56:35,280 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl' 2026-04-16T04:56:35,282 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl' 2026-04-16T04:56:35,284 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h' 2026-04-16T04:56:35,286 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh' 2026-04-16T04:56:35,289 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh' 2026-04-16T04:56:35,291 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh' 2026-04-16T04:56:35,293 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h' 2026-04-16T04:56:35,294 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp' 2026-04-16T04:56:35,295 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h' 2026-04-16T04:56:35,297 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh' 2026-04-16T04:56:35,300 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h' 2026-04-16T04:56:35,302 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h' 2026-04-16T04:56:35,305 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp' 2026-04-16T04:56:35,309 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp' 2026-04-16T04:56:35,311 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h' 2026-04-16T04:56:35,313 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h' 2026-04-16T04:56:35,314 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h' 2026-04-16T04:56:35,316 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h' 2026-04-16T04:56:35,318 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h' 2026-04-16T04:56:35,320 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h' 2026-04-16T04:56:35,321 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h' 2026-04-16T04:56:35,324 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h' 2026-04-16T04:56:35,327 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h' 2026-04-16T04:56:35,328 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h' 2026-04-16T04:56:35,331 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h' 2026-04-16T04:56:35,333 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h' 2026-04-16T04:56:35,335 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h' 2026-04-16T04:56:35,337 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h' 2026-04-16T04:56:35,339 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h' 2026-04-16T04:56:35,342 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h' 2026-04-16T04:56:35,344 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp' 2026-04-16T04:56:35,347 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh' 2026-04-16T04:56:35,349 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh' 2026-04-16T04:56:35,353 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh' 2026-04-16T04:56:35,355 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh' 2026-04-16T04:56:35,358 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh' 2026-04-16T04:56:35,364 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh' 2026-04-16T04:56:35,366 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh' 2026-04-16T04:56:35,368 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh' 2026-04-16T04:56:35,370 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh' 2026-04-16T04:56:35,372 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh' 2026-04-16T04:56:35,373 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh' 2026-04-16T04:56:35,375 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu' 2026-04-16T04:56:35,376 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h' 2026-04-16T04:56:35,378 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu' 2026-04-16T04:56:35,379 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h' 2026-04-16T04:56:35,382 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh' 2026-04-16T04:56:35,383 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h' 2026-04-16T04:56:35,387 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh' 2026-04-16T04:56:35,392 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu' 2026-04-16T04:56:35,394 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h' 2026-04-16T04:56:35,396 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu' 2026-04-16T04:56:35,398 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h' 2026-04-16T04:56:35,402 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp' 2026-04-16T04:56:35,404 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h' 2026-04-16T04:56:35,405 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h' 2026-04-16T04:56:35,408 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu' 2026-04-16T04:56:35,410 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h' 2026-04-16T04:56:35,416 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh' 2026-04-16T04:56:35,419 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh' 2026-04-16T04:56:35,421 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh' 2026-04-16T04:56:35,424 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh' 2026-04-16T04:56:35,426 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh' 2026-04-16T04:56:35,428 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu' 2026-04-16T04:56:35,430 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu' 2026-04-16T04:56:35,431 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu' 2026-04-16T04:56:35,432 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu' 2026-04-16T04:56:35,434 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu' 2026-04-16T04:56:35,435 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu' 2026-04-16T04:56:35,436 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu' 2026-04-16T04:56:35,438 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu' 2026-04-16T04:56:35,439 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu' 2026-04-16T04:56:35,440 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu' 2026-04-16T04:56:35,441 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu' 2026-04-16T04:56:35,442 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu' 2026-04-16T04:56:35,444 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu' 2026-04-16T04:56:35,445 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu' 2026-04-16T04:56:35,446 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu' 2026-04-16T04:56:35,448 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu' 2026-04-16T04:56:35,449 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu' 2026-04-16T04:56:35,450 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h' 2026-04-16T04:56:35,453 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h' 2026-04-16T04:56:35,455 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h' 2026-04-16T04:56:35,457 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h' 2026-04-16T04:56:35,459 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl' 2026-04-16T04:56:35,461 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h' 2026-04-16T04:56:35,463 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h' 2026-04-16T04:56:35,465 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h' 2026-04-16T04:56:35,470 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h' 2026-04-16T04:56:35,472 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h' 2026-04-16T04:56:35,474 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu' 2026-04-16T04:56:35,475 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu' 2026-04-16T04:56:35,476 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu' 2026-04-16T04:56:35,478 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu' 2026-04-16T04:56:35,479 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu' 2026-04-16T04:56:35,480 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu' 2026-04-16T04:56:35,481 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu' 2026-04-16T04:56:35,483 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu' 2026-04-16T04:56:35,484 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu' 2026-04-16T04:56:35,485 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu' 2026-04-16T04:56:35,486 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu' 2026-04-16T04:56:35,487 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu' 2026-04-16T04:56:35,489 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu' 2026-04-16T04:56:35,490 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu' 2026-04-16T04:56:35,494 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h' 2026-04-16T04:56:35,497 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h' 2026-04-16T04:56:35,499 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h' 2026-04-16T04:56:35,501 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu' 2026-04-16T04:56:35,502 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh' 2026-04-16T04:56:35,504 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h' 2026-04-16T04:56:35,506 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h' 2026-04-16T04:56:35,507 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl' 2026-04-16T04:56:35,509 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h' 2026-04-16T04:56:35,516 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl' 2026-04-16T04:56:35,518 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h' 2026-04-16T04:56:35,520 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl' 2026-04-16T04:56:35,522 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp' 2026-04-16T04:56:35,524 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h' 2026-04-16T04:56:35,527 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp' 2026-04-16T04:56:35,529 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp' 2026-04-16T04:56:35,530 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h' 2026-04-16T04:56:35,532 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp' 2026-04-16T04:56:35,533 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h' 2026-04-16T04:56:35,535 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h' 2026-04-16T04:56:35,536 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h' 2026-04-16T04:56:35,539 adding 'flashinfer/data/csrc/xqa/barriers.cuh' 2026-04-16T04:56:35,540 adding 'flashinfer/data/csrc/xqa/cuda_hint.cuh' 2026-04-16T04:56:35,541 adding 'flashinfer/data/csrc/xqa/defines.h' 2026-04-16T04:56:35,543 adding 'flashinfer/data/csrc/xqa/gmma.cuh' 2026-04-16T04:56:35,553 adding 'flashinfer/data/csrc/xqa/gmma_impl.cuh' 2026-04-16T04:56:35,556 adding 'flashinfer/data/csrc/xqa/hostUtils.h' 2026-04-16T04:56:35,558 adding 'flashinfer/data/csrc/xqa/ldgsts.cuh' 2026-04-16T04:56:35,571 adding 'flashinfer/data/csrc/xqa/mha.cu' 2026-04-16T04:56:35,574 adding 'flashinfer/data/csrc/xqa/mha.h' 2026-04-16T04:56:35,576 adding 'flashinfer/data/csrc/xqa/mhaUtils.cuh' 2026-04-16T04:56:35,578 adding 'flashinfer/data/csrc/xqa/mha_components.cuh' 2026-04-16T04:56:35,591 adding 'flashinfer/data/csrc/xqa/mha_sm90.cu' 2026-04-16T04:56:35,595 adding 'flashinfer/data/csrc/xqa/mha_stdheaders.cuh' 2026-04-16T04:56:35,603 adding 'flashinfer/data/csrc/xqa/mla_sm120.cu' 2026-04-16T04:56:35,604 adding 'flashinfer/data/csrc/xqa/mla_sm120.cuh' 2026-04-16T04:56:35,606 adding 'flashinfer/data/csrc/xqa/mma.cuh' 2026-04-16T04:56:35,607 adding 'flashinfer/data/csrc/xqa/platform.h' 2026-04-16T04:56:35,608 adding 'flashinfer/data/csrc/xqa/specDec.h' 2026-04-16T04:56:35,610 adding 'flashinfer/data/csrc/xqa/tensorMap.cpp' 2026-04-16T04:56:35,611 adding 'flashinfer/data/csrc/xqa/tensorMap.h' 2026-04-16T04:56:35,613 adding 'flashinfer/data/csrc/xqa/tma.h' 2026-04-16T04:56:35,617 adding 'flashinfer/data/csrc/xqa/utils.cuh' 2026-04-16T04:56:35,619 adding 'flashinfer/data/csrc/xqa/utils.h' 2026-04-16T04:56:35,620 adding 'flashinfer/data/csrc/xqa/xqa_wrapper.cu' 2026-04-16T04:56:35,624 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py' 2026-04-16T04:56:35,625 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py' 2026-04-16T04:56:35,627 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py' 2026-04-16T04:56:35,630 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py' 2026-04-16T04:56:35,632 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py' 2026-04-16T04:56:35,634 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py' 2026-04-16T04:56:35,636 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py' 2026-04-16T04:56:35,638 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py' 2026-04-16T04:56:35,640 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py' 2026-04-16T04:56:35,642 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py' 2026-04-16T04:56:35,643 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py' 2026-04-16T04:56:35,646 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py' 2026-04-16T04:56:35,647 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py' 2026-04-16T04:56:35,650 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py' 2026-04-16T04:56:35,652 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py' 2026-04-16T04:56:35,656 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py' 2026-04-16T04:56:35,659 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py' 2026-04-16T04:56:35,660 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py' 2026-04-16T04:56:35,662 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py' 2026-04-16T04:56:35,663 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py' 2026-04-16T04:56:35,667 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py' 2026-04-16T04:56:35,669 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py' 2026-04-16T04:56:35,672 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py' 2026-04-16T04:56:35,673 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py' 2026-04-16T04:56:35,676 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py' 2026-04-16T04:56:35,678 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py' 2026-04-16T04:56:35,681 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py' 2026-04-16T04:56:35,686 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py' 2026-04-16T04:56:35,690 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py' 2026-04-16T04:56:35,692 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py' 2026-04-16T04:56:35,696 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py' 2026-04-16T04:56:35,698 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py' 2026-04-16T04:56:35,702 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py' 2026-04-16T04:56:35,714 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py' 2026-04-16T04:56:35,725 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py' 2026-04-16T04:56:35,735 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-16T04:56:35,743 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py' 2026-04-16T04:56:35,751 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py' 2026-04-16T04:56:35,759 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py' 2026-04-16T04:56:35,767 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py' 2026-04-16T04:56:35,775 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py' 2026-04-16T04:56:35,782 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py' 2026-04-16T04:56:35,794 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py' 2026-04-16T04:56:35,806 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py' 2026-04-16T04:56:35,818 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py' 2026-04-16T04:56:35,828 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py' 2026-04-16T04:56:35,845 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py' 2026-04-16T04:56:35,849 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py' 2026-04-16T04:56:35,852 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py' 2026-04-16T04:56:35,855 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py' 2026-04-16T04:56:35,866 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py' 2026-04-16T04:56:35,877 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py' 2026-04-16T04:56:35,888 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py' 2026-04-16T04:56:35,899 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py' 2026-04-16T04:56:35,903 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py' 2026-04-16T04:56:35,911 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py' 2026-04-16T04:56:35,918 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py' 2026-04-16T04:56:35,921 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py' 2026-04-16T04:56:35,923 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py' 2026-04-16T04:56:35,936 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py' 2026-04-16T04:56:35,939 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py' 2026-04-16T04:56:35,941 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py' 2026-04-16T04:56:35,949 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py' 2026-04-16T04:56:35,957 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py' 2026-04-16T04:56:35,965 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py' 2026-04-16T04:56:35,967 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py' 2026-04-16T04:56:35,977 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py' 2026-04-16T04:56:35,987 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py' 2026-04-16T04:56:35,997 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py' 2026-04-16T04:56:36,000 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py' 2026-04-16T04:56:36,015 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py' 2026-04-16T04:56:36,030 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py' 2026-04-16T04:56:36,033 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py' 2026-04-16T04:56:36,036 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py' 2026-04-16T04:56:36,039 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py' 2026-04-16T04:56:36,042 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py' 2026-04-16T04:56:36,046 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py' 2026-04-16T04:56:36,048 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py' 2026-04-16T04:56:36,054 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py' 2026-04-16T04:56:36,056 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py' 2026-04-16T04:56:36,057 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py' 2026-04-16T04:56:36,059 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py' 2026-04-16T04:56:36,061 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py' 2026-04-16T04:56:36,063 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py' 2026-04-16T04:56:36,066 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py' 2026-04-16T04:56:36,067 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py' 2026-04-16T04:56:36,068 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py' 2026-04-16T04:56:36,070 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py' 2026-04-16T04:56:36,071 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py' 2026-04-16T04:56:36,073 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py' 2026-04-16T04:56:36,074 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py' 2026-04-16T04:56:36,075 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py' 2026-04-16T04:56:36,079 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py' 2026-04-16T04:56:36,081 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py' 2026-04-16T04:56:36,084 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py' 2026-04-16T04:56:36,086 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py' 2026-04-16T04:56:36,095 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py' 2026-04-16T04:56:36,105 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py' 2026-04-16T04:56:36,116 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py' 2026-04-16T04:56:36,119 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py' 2026-04-16T04:56:36,124 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py' 2026-04-16T04:56:36,131 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py' 2026-04-16T04:56:36,135 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py' 2026-04-16T04:56:36,142 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py' 2026-04-16T04:56:36,146 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py' 2026-04-16T04:56:36,148 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py' 2026-04-16T04:56:36,151 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py' 2026-04-16T04:56:36,154 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py' 2026-04-16T04:56:36,160 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py' 2026-04-16T04:56:36,166 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py' 2026-04-16T04:56:36,176 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py' 2026-04-16T04:56:36,178 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py' 2026-04-16T04:56:36,180 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py' 2026-04-16T04:56:36,182 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py' 2026-04-16T04:56:36,184 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py' 2026-04-16T04:56:36,186 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py' 2026-04-16T04:56:36,189 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py' 2026-04-16T04:56:36,191 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py' 2026-04-16T04:56:36,193 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py' 2026-04-16T04:56:36,196 adding 'flashinfer/data/cutlass/include/cute/config.hpp' 2026-04-16T04:56:36,199 adding 'flashinfer/data/cutlass/include/cute/int_tuple.hpp' 2026-04-16T04:56:36,206 adding 'flashinfer/data/cutlass/include/cute/layout.hpp' 2026-04-16T04:56:36,209 adding 'flashinfer/data/cutlass/include/cute/layout_composed.hpp' 2026-04-16T04:56:36,211 adding 'flashinfer/data/cutlass/include/cute/pointer.hpp' 2026-04-16T04:56:36,213 adding 'flashinfer/data/cutlass/include/cute/pointer_base.hpp' 2026-04-16T04:56:36,214 adding 'flashinfer/data/cutlass/include/cute/pointer_flagged.hpp' 2026-04-16T04:56:36,216 adding 'flashinfer/data/cutlass/include/cute/pointer_sparse.hpp' 2026-04-16T04:56:36,218 adding 'flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp' 2026-04-16T04:56:36,220 adding 'flashinfer/data/cutlass/include/cute/stride.hpp' 2026-04-16T04:56:36,223 adding 'flashinfer/data/cutlass/include/cute/swizzle.hpp' 2026-04-16T04:56:36,226 adding 'flashinfer/data/cutlass/include/cute/swizzle_layout.hpp' 2026-04-16T04:56:36,227 adding 'flashinfer/data/cutlass/include/cute/tensor.hpp' 2026-04-16T04:56:36,231 adding 'flashinfer/data/cutlass/include/cute/tensor_impl.hpp' 2026-04-16T04:56:36,233 adding 'flashinfer/data/cutlass/include/cute/tensor_zip.hpp' 2026-04-16T04:56:36,234 adding 'flashinfer/data/cutlass/include/cute/underscore.hpp' 2026-04-16T04:56:36,236 adding 'flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp' 2026-04-16T04:56:36,238 adding 'flashinfer/data/cutlass/include/cute/algorithm/clear.hpp' 2026-04-16T04:56:36,240 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp' 2026-04-16T04:56:36,242 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp' 2026-04-16T04:56:36,245 adding 'flashinfer/data/cutlass/include/cute/algorithm/copy.hpp' 2026-04-16T04:56:36,246 adding 'flashinfer/data/cutlass/include/cute/algorithm/fill.hpp' 2026-04-16T04:56:36,248 adding 'flashinfer/data/cutlass/include/cute/algorithm/functional.hpp' 2026-04-16T04:56:36,250 adding 'flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp' 2026-04-16T04:56:36,251 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp' 2026-04-16T04:56:36,253 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp' 2026-04-16T04:56:36,255 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp' 2026-04-16T04:56:36,256 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp' 2026-04-16T04:56:36,259 adding 'flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp' 2026-04-16T04:56:36,261 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp' 2026-04-16T04:56:36,263 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp' 2026-04-16T04:56:36,264 adding 'flashinfer/data/cutlass/include/cute/arch/config.hpp' 2026-04-16T04:56:36,266 adding 'flashinfer/data/cutlass/include/cute/arch/copy.hpp' 2026-04-16T04:56:36,277 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp' 2026-04-16T04:56:36,282 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp' 2026-04-16T04:56:36,283 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp' 2026-04-16T04:56:36,285 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp' 2026-04-16T04:56:36,286 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp' 2026-04-16T04:56:36,288 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp' 2026-04-16T04:56:36,290 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp' 2026-04-16T04:56:36,293 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp' 2026-04-16T04:56:36,295 adding 'flashinfer/data/cutlass/include/cute/arch/mma.hpp' 2026-04-16T04:56:36,296 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp' 2026-04-16T04:56:36,299 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp' 2026-04-16T04:56:36,303 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp' 2026-04-16T04:56:36,308 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp' 2026-04-16T04:56:36,313 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp' 2026-04-16T04:56:36,315 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp' 2026-04-16T04:56:36,317 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp' 2026-04-16T04:56:36,318 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp' 2026-04-16T04:56:36,321 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp' 2026-04-16T04:56:36,323 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp' 2026-04-16T04:56:36,336 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp' 2026-04-16T04:56:36,340 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp' 2026-04-16T04:56:36,375 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp' 2026-04-16T04:56:36,465 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp' 2026-04-16T04:56:36,518 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp' 2026-04-16T04:56:36,614 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp' 2026-04-16T04:56:36,634 adding 'flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp' 2026-04-16T04:56:36,635 adding 'flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp' 2026-04-16T04:56:36,637 adding 'flashinfer/data/cutlass/include/cute/arch/util.hpp' 2026-04-16T04:56:36,641 adding 'flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp' 2026-04-16T04:56:36,642 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp' 2026-04-16T04:56:36,650 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp' 2026-04-16T04:56:36,653 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp' 2026-04-16T04:56:36,655 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp' 2026-04-16T04:56:36,656 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp' 2026-04-16T04:56:36,658 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp' 2026-04-16T04:56:36,660 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp' 2026-04-16T04:56:36,661 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp' 2026-04-16T04:56:36,665 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp' 2026-04-16T04:56:36,672 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp' 2026-04-16T04:56:36,674 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp' 2026-04-16T04:56:36,677 adding 'flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp' 2026-04-16T04:56:36,678 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp' 2026-04-16T04:56:36,689 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp' 2026-04-16T04:56:36,692 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp' 2026-04-16T04:56:36,694 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp' 2026-04-16T04:56:36,696 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp' 2026-04-16T04:56:36,697 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp' 2026-04-16T04:56:36,698 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp' 2026-04-16T04:56:36,700 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp' 2026-04-16T04:56:36,702 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp' 2026-04-16T04:56:36,703 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp' 2026-04-16T04:56:36,714 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp' 2026-04-16T04:56:36,736 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp' 2026-04-16T04:56:36,749 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp' 2026-04-16T04:56:36,768 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp' 2026-04-16T04:56:36,773 adding 'flashinfer/data/cutlass/include/cute/atom/partitioner.hpp' 2026-04-16T04:56:36,775 adding 'flashinfer/data/cutlass/include/cute/container/alignment.hpp' 2026-04-16T04:56:36,777 adding 'flashinfer/data/cutlass/include/cute/container/array.hpp' 2026-04-16T04:56:36,778 adding 'flashinfer/data/cutlass/include/cute/container/array_aligned.hpp' 2026-04-16T04:56:36,781 adding 'flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp' 2026-04-16T04:56:36,782 adding 'flashinfer/data/cutlass/include/cute/container/bit_field.hpp' 2026-04-16T04:56:36,784 adding 'flashinfer/data/cutlass/include/cute/container/cuda_types.hpp' 2026-04-16T04:56:36,786 adding 'flashinfer/data/cutlass/include/cute/container/tuple.hpp' 2026-04-16T04:56:36,788 adding 'flashinfer/data/cutlass/include/cute/container/type_list.hpp' 2026-04-16T04:56:36,791 adding 'flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp' 2026-04-16T04:56:36,792 adding 'flashinfer/data/cutlass/include/cute/numeric/complex.hpp' 2026-04-16T04:56:36,793 adding 'flashinfer/data/cutlass/include/cute/numeric/int.hpp' 2026-04-16T04:56:36,795 adding 'flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp' 2026-04-16T04:56:36,797 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp' 2026-04-16T04:56:36,798 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp' 2026-04-16T04:56:36,800 adding 'flashinfer/data/cutlass/include/cute/numeric/math.hpp' 2026-04-16T04:56:36,802 adding 'flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp' 2026-04-16T04:56:36,803 adding 'flashinfer/data/cutlass/include/cute/numeric/real.hpp' 2026-04-16T04:56:36,805 adding 'flashinfer/data/cutlass/include/cute/util/debug.hpp' 2026-04-16T04:56:36,807 adding 'flashinfer/data/cutlass/include/cute/util/print.hpp' 2026-04-16T04:56:36,809 adding 'flashinfer/data/cutlass/include/cute/util/print_latex.hpp' 2026-04-16T04:56:36,810 adding 'flashinfer/data/cutlass/include/cute/util/print_svg.hpp' 2026-04-16T04:56:36,812 adding 'flashinfer/data/cutlass/include/cute/util/print_tensor.hpp' 2026-04-16T04:56:36,814 adding 'flashinfer/data/cutlass/include/cute/util/type_traits.hpp' 2026-04-16T04:56:36,817 adding 'flashinfer/data/cutlass/include/cutlass/aligned_buffer.h' 2026-04-16T04:56:36,821 adding 'flashinfer/data/cutlass/include/cutlass/array.h' 2026-04-16T04:56:36,823 adding 'flashinfer/data/cutlass/include/cutlass/array_planar_complex.h' 2026-04-16T04:56:36,825 adding 'flashinfer/data/cutlass/include/cutlass/array_subbyte.h' 2026-04-16T04:56:36,826 adding 'flashinfer/data/cutlass/include/cutlass/barrier.h' 2026-04-16T04:56:36,829 adding 'flashinfer/data/cutlass/include/cutlass/bfloat16.h' 2026-04-16T04:56:36,830 adding 'flashinfer/data/cutlass/include/cutlass/blas3.h' 2026-04-16T04:56:36,831 adding 'flashinfer/data/cutlass/include/cutlass/blas3_types.h' 2026-04-16T04:56:36,833 adding 'flashinfer/data/cutlass/include/cutlass/block_striped.h' 2026-04-16T04:56:36,835 adding 'flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp' 2026-04-16T04:56:36,838 adding 'flashinfer/data/cutlass/include/cutlass/complex.h' 2026-04-16T04:56:36,841 adding 'flashinfer/data/cutlass/include/cutlass/constants.h' 2026-04-16T04:56:36,843 adding 'flashinfer/data/cutlass/include/cutlass/coord.h' 2026-04-16T04:56:36,845 adding 'flashinfer/data/cutlass/include/cutlass/core_io.h' 2026-04-16T04:56:36,848 adding 'flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp' 2026-04-16T04:56:36,850 adding 'flashinfer/data/cutlass/include/cutlass/cutlass.h' 2026-04-16T04:56:36,851 adding 'flashinfer/data/cutlass/include/cutlass/device_kernel.h' 2026-04-16T04:56:36,856 adding 'flashinfer/data/cutlass/include/cutlass/exmy_base.h' 2026-04-16T04:56:36,859 adding 'flashinfer/data/cutlass/include/cutlass/fast_math.h' 2026-04-16T04:56:36,863 adding 'flashinfer/data/cutlass/include/cutlass/float8.h' 2026-04-16T04:56:36,866 adding 'flashinfer/data/cutlass/include/cutlass/float_subbyte.h' 2026-04-16T04:56:36,867 adding 'flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h' 2026-04-16T04:56:36,871 adding 'flashinfer/data/cutlass/include/cutlass/functional.h' 2026-04-16T04:56:36,873 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.h' 2026-04-16T04:56:36,874 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp' 2026-04-16T04:56:36,877 adding 'flashinfer/data/cutlass/include/cutlass/half.h' 2026-04-16T04:56:36,879 adding 'flashinfer/data/cutlass/include/cutlass/integer_subbyte.h' 2026-04-16T04:56:36,880 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h' 2026-04-16T04:56:36,882 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp' 2026-04-16T04:56:36,884 adding 'flashinfer/data/cutlass/include/cutlass/kernel_launch.h' 2026-04-16T04:56:36,905 adding 'flashinfer/data/cutlass/include/cutlass/matrix.h' 2026-04-16T04:56:36,909 adding 'flashinfer/data/cutlass/include/cutlass/matrix_coord.h' 2026-04-16T04:56:36,911 adding 'flashinfer/data/cutlass/include/cutlass/matrix_shape.h' 2026-04-16T04:56:36,926 adding 'flashinfer/data/cutlass/include/cutlass/numeric_conversion.h' 2026-04-16T04:56:36,930 adding 'flashinfer/data/cutlass/include/cutlass/numeric_size.h' 2026-04-16T04:56:36,931 adding 'flashinfer/data/cutlass/include/cutlass/numeric_types.h' 2026-04-16T04:56:36,932 adding 'flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h' 2026-04-16T04:56:36,935 adding 'flashinfer/data/cutlass/include/cutlass/predicate_vector.h' 2026-04-16T04:56:36,937 adding 'flashinfer/data/cutlass/include/cutlass/quaternion.h' 2026-04-16T04:56:36,938 adding 'flashinfer/data/cutlass/include/cutlass/real.h' 2026-04-16T04:56:36,940 adding 'flashinfer/data/cutlass/include/cutlass/relatively_equal.h' 2026-04-16T04:56:36,941 adding 'flashinfer/data/cutlass/include/cutlass/semaphore.h' 2026-04-16T04:56:36,945 adding 'flashinfer/data/cutlass/include/cutlass/subbyte_reference.h' 2026-04-16T04:56:36,947 adding 'flashinfer/data/cutlass/include/cutlass/tensor_coord.h' 2026-04-16T04:56:36,949 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref.h' 2026-04-16T04:56:36,951 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h' 2026-04-16T04:56:36,953 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view.h' 2026-04-16T04:56:36,954 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h' 2026-04-16T04:56:36,956 adding 'flashinfer/data/cutlass/include/cutlass/tfloat32.h' 2026-04-16T04:56:36,958 adding 'flashinfer/data/cutlass/include/cutlass/trace.h' 2026-04-16T04:56:36,959 adding 'flashinfer/data/cutlass/include/cutlass/uint128.h' 2026-04-16T04:56:36,961 adding 'flashinfer/data/cutlass/include/cutlass/uint256.h' 2026-04-16T04:56:36,962 adding 'flashinfer/data/cutlass/include/cutlass/version.h' 2026-04-16T04:56:36,964 adding 'flashinfer/data/cutlass/include/cutlass/wmma_array.h' 2026-04-16T04:56:36,965 adding 'flashinfer/data/cutlass/include/cutlass/workspace.h' 2026-04-16T04:56:36,968 adding 'flashinfer/data/cutlass/include/cutlass/arch/arch.h' 2026-04-16T04:56:36,971 adding 'flashinfer/data/cutlass/include/cutlass/arch/barrier.h' 2026-04-16T04:56:36,972 adding 'flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h' 2026-04-16T04:56:36,974 adding 'flashinfer/data/cutlass/include/cutlass/arch/config.h' 2026-04-16T04:56:36,976 adding 'flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h' 2026-04-16T04:56:36,978 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory.h' 2026-04-16T04:56:36,980 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h' 2026-04-16T04:56:36,981 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h' 2026-04-16T04:56:36,983 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma.h' 2026-04-16T04:56:36,985 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h' 2026-04-16T04:56:36,986 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h' 2026-04-16T04:56:36,988 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h' 2026-04-16T04:56:36,989 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h' 2026-04-16T04:56:36,991 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h' 2026-04-16T04:56:36,993 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h' 2026-04-16T04:56:36,996 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h' 2026-04-16T04:56:36,998 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h' 2026-04-16T04:56:36,999 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h' 2026-04-16T04:56:37,002 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h' 2026-04-16T04:56:37,003 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h' 2026-04-16T04:56:37,005 adding 'flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h' 2026-04-16T04:56:37,006 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd.h' 2026-04-16T04:56:37,007 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h' 2026-04-16T04:56:37,009 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h' 2026-04-16T04:56:37,012 adding 'flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp' 2026-04-16T04:56:37,014 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma.h' 2026-04-16T04:56:37,015 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h' 2026-04-16T04:56:37,017 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h' 2026-04-16T04:56:37,018 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h' 2026-04-16T04:56:37,021 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h' 2026-04-16T04:56:37,024 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h' 2026-04-16T04:56:37,026 adding 'flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp' 2026-04-16T04:56:37,028 adding 'flashinfer/data/cutlass/include/cutlass/conv/convolution.h' 2026-04-16T04:56:37,029 adding 'flashinfer/data/cutlass/include/cutlass/conv/detail.hpp' 2026-04-16T04:56:37,031 adding 'flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp' 2026-04-16T04:56:37,033 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp' 2026-04-16T04:56:37,034 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp' 2026-04-16T04:56:37,036 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp' 2026-04-16T04:56:37,040 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp' 2026-04-16T04:56:37,044 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp' 2026-04-16T04:56:37,046 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl' 2026-04-16T04:56:37,048 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl' 2026-04-16T04:56:37,050 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl' 2026-04-16T04:56:37,051 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl' 2026-04-16T04:56:37,054 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp' 2026-04-16T04:56:37,056 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h' 2026-04-16T04:56:37,058 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h' 2026-04-16T04:56:37,060 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h' 2026-04-16T04:56:37,062 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp' 2026-04-16T04:56:37,064 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h' 2026-04-16T04:56:37,067 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h' 2026-04-16T04:56:37,070 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h' 2026-04-16T04:56:37,072 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h' 2026-04-16T04:56:37,073 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h' 2026-04-16T04:56:37,075 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h' 2026-04-16T04:56:37,076 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h' 2026-04-16T04:56:37,078 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h' 2026-04-16T04:56:37,080 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h' 2026-04-16T04:56:37,082 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h' 2026-04-16T04:56:37,084 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h' 2026-04-16T04:56:37,086 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h' 2026-04-16T04:56:37,088 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h' 2026-04-16T04:56:37,090 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h' 2026-04-16T04:56:37,092 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h' 2026-04-16T04:56:37,094 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h' 2026-04-16T04:56:37,095 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h' 2026-04-16T04:56:37,097 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h' 2026-04-16T04:56:37,099 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h' 2026-04-16T04:56:37,101 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h' 2026-04-16T04:56:37,103 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h' 2026-04-16T04:56:37,106 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h' 2026-04-16T04:56:37,108 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h' 2026-04-16T04:56:37,110 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h' 2026-04-16T04:56:37,113 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h' 2026-04-16T04:56:37,115 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h' 2026-04-16T04:56:37,119 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp' 2026-04-16T04:56:37,121 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp' 2026-04-16T04:56:37,123 adding 'flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h' 2026-04-16T04:56:37,126 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,129 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,131 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,134 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,136 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,138 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h' 2026-04-16T04:56:37,140 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h' 2026-04-16T04:56:37,142 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,144 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,145 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h' 2026-04-16T04:56:37,147 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h' 2026-04-16T04:56:37,149 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,152 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h' 2026-04-16T04:56:37,154 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h' 2026-04-16T04:56:37,155 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,157 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,159 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,161 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,163 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,165 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,166 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,169 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,171 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,173 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,174 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,176 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,178 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h' 2026-04-16T04:56:37,180 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,182 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,184 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-16T04:56:37,186 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-16T04:56:37,187 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h' 2026-04-16T04:56:37,189 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h' 2026-04-16T04:56:37,191 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h' 2026-04-16T04:56:37,193 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h' 2026-04-16T04:56:37,195 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h' 2026-04-16T04:56:37,197 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h' 2026-04-16T04:56:37,199 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h' 2026-04-16T04:56:37,202 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h' 2026-04-16T04:56:37,206 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h' 2026-04-16T04:56:37,208 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h' 2026-04-16T04:56:37,211 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h' 2026-04-16T04:56:37,213 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h' 2026-04-16T04:56:37,215 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-16T04:56:37,217 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-16T04:56:37,219 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h' 2026-04-16T04:56:37,221 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h' 2026-04-16T04:56:37,224 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h' 2026-04-16T04:56:37,226 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h' 2026-04-16T04:56:37,228 adding 'flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp' 2026-04-16T04:56:37,230 adding 'flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp' 2026-04-16T04:56:37,232 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective.hpp' 2026-04-16T04:56:37,233 adding 'flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp' 2026-04-16T04:56:37,234 adding 'flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp' 2026-04-16T04:56:37,237 adding 'flashinfer/data/cutlass/include/cutlass/detail/layout.hpp' 2026-04-16T04:56:37,238 adding 'flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp' 2026-04-16T04:56:37,239 adding 'flashinfer/data/cutlass/include/cutlass/detail/mma.hpp' 2026-04-16T04:56:37,241 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp' 2026-04-16T04:56:37,243 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp' 2026-04-16T04:56:37,244 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp' 2026-04-16T04:56:37,246 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp' 2026-04-16T04:56:37,251 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp' 2026-04-16T04:56:37,252 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp' 2026-04-16T04:56:37,254 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp' 2026-04-16T04:56:37,256 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp' 2026-04-16T04:56:37,258 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp' 2026-04-16T04:56:37,260 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp' 2026-04-16T04:56:37,262 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp' 2026-04-16T04:56:37,263 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp' 2026-04-16T04:56:37,266 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp' 2026-04-16T04:56:37,268 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp' 2026-04-16T04:56:37,272 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp' 2026-04-16T04:56:37,274 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp' 2026-04-16T04:56:37,280 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp' 2026-04-16T04:56:37,288 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp' 2026-04-16T04:56:37,292 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp' 2026-04-16T04:56:37,297 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp' 2026-04-16T04:56:37,303 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp' 2026-04-16T04:56:37,306 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp' 2026-04-16T04:56:37,308 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp' 2026-04-16T04:56:37,314 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp' 2026-04-16T04:56:37,319 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp' 2026-04-16T04:56:37,321 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp' 2026-04-16T04:56:37,328 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl' 2026-04-16T04:56:37,330 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl' 2026-04-16T04:56:37,333 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl' 2026-04-16T04:56:37,334 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl' 2026-04-16T04:56:37,337 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl' 2026-04-16T04:56:37,339 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl' 2026-04-16T04:56:37,341 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp' 2026-04-16T04:56:37,343 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp' 2026-04-16T04:56:37,346 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp' 2026-04-16T04:56:37,349 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp' 2026-04-16T04:56:37,352 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp' 2026-04-16T04:56:37,355 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp' 2026-04-16T04:56:37,359 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp' 2026-04-16T04:56:37,365 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp' 2026-04-16T04:56:37,369 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp' 2026-04-16T04:56:37,374 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp' 2026-04-16T04:56:37,380 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp' 2026-04-16T04:56:37,384 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp' 2026-04-16T04:56:37,388 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp' 2026-04-16T04:56:37,391 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h' 2026-04-16T04:56:37,393 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h' 2026-04-16T04:56:37,395 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp' 2026-04-16T04:56:37,397 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h' 2026-04-16T04:56:37,399 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h' 2026-04-16T04:56:37,402 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h' 2026-04-16T04:56:37,404 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h' 2026-04-16T04:56:37,406 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h' 2026-04-16T04:56:37,408 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h' 2026-04-16T04:56:37,410 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h' 2026-04-16T04:56:37,411 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h' 2026-04-16T04:56:37,413 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h' 2026-04-16T04:56:37,415 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h' 2026-04-16T04:56:37,416 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h' 2026-04-16T04:56:37,418 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h' 2026-04-16T04:56:37,420 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h' 2026-04-16T04:56:37,422 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h' 2026-04-16T04:56:37,424 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h' 2026-04-16T04:56:37,426 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h' 2026-04-16T04:56:37,427 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h' 2026-04-16T04:56:37,429 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h' 2026-04-16T04:56:37,430 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp' 2026-04-16T04:56:37,432 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h' 2026-04-16T04:56:37,433 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h' 2026-04-16T04:56:37,435 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h' 2026-04-16T04:56:37,438 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h' 2026-04-16T04:56:37,440 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h' 2026-04-16T04:56:37,441 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h' 2026-04-16T04:56:37,443 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h' 2026-04-16T04:56:37,444 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h' 2026-04-16T04:56:37,447 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h' 2026-04-16T04:56:37,449 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h' 2026-04-16T04:56:37,451 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h' 2026-04-16T04:56:37,452 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h' 2026-04-16T04:56:37,454 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h' 2026-04-16T04:56:37,455 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h' 2026-04-16T04:56:37,457 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h' 2026-04-16T04:56:37,458 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h' 2026-04-16T04:56:37,460 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h' 2026-04-16T04:56:37,462 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h' 2026-04-16T04:56:37,463 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h' 2026-04-16T04:56:37,465 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h' 2026-04-16T04:56:37,467 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h' 2026-04-16T04:56:37,469 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h' 2026-04-16T04:56:37,470 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h' 2026-04-16T04:56:37,472 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h' 2026-04-16T04:56:37,474 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h' 2026-04-16T04:56:37,476 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h' 2026-04-16T04:56:37,478 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h' 2026-04-16T04:56:37,480 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h' 2026-04-16T04:56:37,482 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h' 2026-04-16T04:56:37,484 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h' 2026-04-16T04:56:37,488 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h' 2026-04-16T04:56:37,492 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h' 2026-04-16T04:56:37,495 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h' 2026-04-16T04:56:37,497 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h' 2026-04-16T04:56:37,499 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h' 2026-04-16T04:56:37,502 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h' 2026-04-16T04:56:37,503 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h' 2026-04-16T04:56:37,505 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h' 2026-04-16T04:56:37,507 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h' 2026-04-16T04:56:37,509 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h' 2026-04-16T04:56:37,513 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h' 2026-04-16T04:56:37,516 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h' 2026-04-16T04:56:37,517 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h' 2026-04-16T04:56:37,520 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h' 2026-04-16T04:56:37,522 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h' 2026-04-16T04:56:37,525 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h' 2026-04-16T04:56:37,527 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h' 2026-04-16T04:56:37,529 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h' 2026-04-16T04:56:37,531 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h' 2026-04-16T04:56:37,533 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h' 2026-04-16T04:56:37,535 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h' 2026-04-16T04:56:37,537 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h' 2026-04-16T04:56:37,540 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp' 2026-04-16T04:56:37,541 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp' 2026-04-16T04:56:37,544 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp' 2026-04-16T04:56:37,546 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp' 2026-04-16T04:56:37,548 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp' 2026-04-16T04:56:37,550 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h' 2026-04-16T04:56:37,552 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h' 2026-04-16T04:56:37,554 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h' 2026-04-16T04:56:37,556 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h' 2026-04-16T04:56:37,557 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h' 2026-04-16T04:56:37,559 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h' 2026-04-16T04:56:37,560 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h' 2026-04-16T04:56:37,562 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h' 2026-04-16T04:56:37,564 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h' 2026-04-16T04:56:37,567 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h' 2026-04-16T04:56:37,569 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h' 2026-04-16T04:56:37,571 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h' 2026-04-16T04:56:37,573 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h' 2026-04-16T04:56:37,574 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h' 2026-04-16T04:56:37,576 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h' 2026-04-16T04:56:37,579 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp' 2026-04-16T04:56:37,582 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp' 2026-04-16T04:56:37,583 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp' 2026-04-16T04:56:37,585 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp' 2026-04-16T04:56:37,587 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp' 2026-04-16T04:56:37,589 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp' 2026-04-16T04:56:37,591 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp' 2026-04-16T04:56:37,594 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp' 2026-04-16T04:56:37,599 adding 'flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp' 2026-04-16T04:56:37,601 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm.h' 2026-04-16T04:56:37,603 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h' 2026-04-16T04:56:37,604 adding 'flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp' 2026-04-16T04:56:37,607 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp' 2026-04-16T04:56:37,609 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp' 2026-04-16T04:56:37,610 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp' 2026-04-16T04:56:37,611 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp' 2026-04-16T04:56:37,613 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp' 2026-04-16T04:56:37,620 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp' 2026-04-16T04:56:37,627 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp' 2026-04-16T04:56:37,632 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-16T04:56:37,638 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp' 2026-04-16T04:56:37,644 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp' 2026-04-16T04:56:37,649 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp' 2026-04-16T04:56:37,656 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp' 2026-04-16T04:56:37,662 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp' 2026-04-16T04:56:37,669 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-16T04:56:37,674 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-16T04:56:37,679 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp' 2026-04-16T04:56:37,684 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp' 2026-04-16T04:56:37,687 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp' 2026-04-16T04:56:37,691 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-16T04:56:37,695 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp' 2026-04-16T04:56:37,700 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp' 2026-04-16T04:56:37,706 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp' 2026-04-16T04:56:37,712 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-16T04:56:37,717 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-16T04:56:37,723 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp' 2026-04-16T04:56:37,727 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp' 2026-04-16T04:56:37,732 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp' 2026-04-16T04:56:37,740 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp' 2026-04-16T04:56:37,747 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp' 2026-04-16T04:56:37,753 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp' 2026-04-16T04:56:37,757 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp' 2026-04-16T04:56:37,764 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp' 2026-04-16T04:56:37,769 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp' 2026-04-16T04:56:37,772 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp' 2026-04-16T04:56:37,776 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp' 2026-04-16T04:56:37,781 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp' 2026-04-16T04:56:37,784 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp' 2026-04-16T04:56:37,786 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp' 2026-04-16T04:56:37,789 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp' 2026-04-16T04:56:37,795 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-16T04:56:37,800 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp' 2026-04-16T04:56:37,804 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-16T04:56:37,809 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-16T04:56:37,813 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp' 2026-04-16T04:56:37,815 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp' 2026-04-16T04:56:37,819 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp' 2026-04-16T04:56:37,824 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-16T04:56:37,828 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp' 2026-04-16T04:56:37,831 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-16T04:56:37,834 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-16T04:56:37,839 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-16T04:56:37,843 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-16T04:56:37,847 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-16T04:56:37,851 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl' 2026-04-16T04:56:37,853 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl' 2026-04-16T04:56:37,855 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl' 2026-04-16T04:56:37,857 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl' 2026-04-16T04:56:37,859 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl' 2026-04-16T04:56:37,862 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl' 2026-04-16T04:56:37,866 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl' 2026-04-16T04:56:37,868 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl' 2026-04-16T04:56:37,869 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl' 2026-04-16T04:56:37,872 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl' 2026-04-16T04:56:37,874 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl' 2026-04-16T04:56:37,875 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl' 2026-04-16T04:56:37,877 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl' 2026-04-16T04:56:37,879 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl' 2026-04-16T04:56:37,881 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl' 2026-04-16T04:56:37,883 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl' 2026-04-16T04:56:37,886 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl' 2026-04-16T04:56:37,888 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl' 2026-04-16T04:56:37,890 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl' 2026-04-16T04:56:37,892 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl' 2026-04-16T04:56:37,894 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl' 2026-04-16T04:56:37,896 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl' 2026-04-16T04:56:37,898 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl' 2026-04-16T04:56:37,901 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl' 2026-04-16T04:56:37,903 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl' 2026-04-16T04:56:37,906 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl' 2026-04-16T04:56:37,910 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl' 2026-04-16T04:56:37,912 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl' 2026-04-16T04:56:37,914 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl' 2026-04-16T04:56:37,917 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h' 2026-04-16T04:56:37,919 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h' 2026-04-16T04:56:37,922 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h' 2026-04-16T04:56:37,925 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h' 2026-04-16T04:56:37,928 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h' 2026-04-16T04:56:37,930 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h' 2026-04-16T04:56:37,933 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h' 2026-04-16T04:56:37,936 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h' 2026-04-16T04:56:37,938 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h' 2026-04-16T04:56:37,940 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h' 2026-04-16T04:56:37,942 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h' 2026-04-16T04:56:37,944 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h' 2026-04-16T04:56:37,945 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h' 2026-04-16T04:56:37,947 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h' 2026-04-16T04:56:37,949 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h' 2026-04-16T04:56:37,952 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h' 2026-04-16T04:56:37,954 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h' 2026-04-16T04:56:37,957 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h' 2026-04-16T04:56:37,959 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h' 2026-04-16T04:56:37,961 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h' 2026-04-16T04:56:37,963 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h' 2026-04-16T04:56:37,965 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h' 2026-04-16T04:56:37,967 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h' 2026-04-16T04:56:37,969 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h' 2026-04-16T04:56:37,970 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h' 2026-04-16T04:56:37,973 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h' 2026-04-16T04:56:37,974 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h' 2026-04-16T04:56:37,976 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h' 2026-04-16T04:56:37,979 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h' 2026-04-16T04:56:37,982 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h' 2026-04-16T04:56:37,987 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h' 2026-04-16T04:56:37,990 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h' 2026-04-16T04:56:37,993 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h' 2026-04-16T04:56:37,994 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h' 2026-04-16T04:56:37,996 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h' 2026-04-16T04:56:37,998 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h' 2026-04-16T04:56:37,999 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h' 2026-04-16T04:56:38,001 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h' 2026-04-16T04:56:38,003 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h' 2026-04-16T04:56:38,004 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h' 2026-04-16T04:56:38,006 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h' 2026-04-16T04:56:38,007 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h' 2026-04-16T04:56:38,009 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h' 2026-04-16T04:56:38,011 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h' 2026-04-16T04:56:38,012 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h' 2026-04-16T04:56:38,014 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h' 2026-04-16T04:56:38,016 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h' 2026-04-16T04:56:38,017 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h' 2026-04-16T04:56:38,019 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h' 2026-04-16T04:56:38,020 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h' 2026-04-16T04:56:38,022 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h' 2026-04-16T04:56:38,024 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h' 2026-04-16T04:56:38,025 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h' 2026-04-16T04:56:38,027 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h' 2026-04-16T04:56:38,029 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h' 2026-04-16T04:56:38,031 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h' 2026-04-16T04:56:38,032 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h' 2026-04-16T04:56:38,034 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h' 2026-04-16T04:56:38,036 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h' 2026-04-16T04:56:38,038 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h' 2026-04-16T04:56:38,040 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h' 2026-04-16T04:56:38,042 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h' 2026-04-16T04:56:38,043 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h' 2026-04-16T04:56:38,045 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h' 2026-04-16T04:56:38,047 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h' 2026-04-16T04:56:38,049 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h' 2026-04-16T04:56:38,051 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h' 2026-04-16T04:56:38,053 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h' 2026-04-16T04:56:38,055 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h' 2026-04-16T04:56:38,057 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h' 2026-04-16T04:56:38,059 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h' 2026-04-16T04:56:38,061 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h' 2026-04-16T04:56:38,062 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h' 2026-04-16T04:56:38,064 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h' 2026-04-16T04:56:38,067 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h' 2026-04-16T04:56:38,069 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h' 2026-04-16T04:56:38,071 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h' 2026-04-16T04:56:38,074 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h' 2026-04-16T04:56:38,076 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h' 2026-04-16T04:56:38,079 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h' 2026-04-16T04:56:38,082 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h' 2026-04-16T04:56:38,084 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h' 2026-04-16T04:56:38,092 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h' 2026-04-16T04:56:38,094 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h' 2026-04-16T04:56:38,097 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h' 2026-04-16T04:56:38,098 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp' 2026-04-16T04:56:38,100 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h' 2026-04-16T04:56:38,102 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h' 2026-04-16T04:56:38,106 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h' 2026-04-16T04:56:38,108 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h' 2026-04-16T04:56:38,112 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h' 2026-04-16T04:56:38,114 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h' 2026-04-16T04:56:38,118 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h' 2026-04-16T04:56:38,121 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h' 2026-04-16T04:56:38,123 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h' 2026-04-16T04:56:38,125 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h' 2026-04-16T04:56:38,129 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h' 2026-04-16T04:56:38,131 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h' 2026-04-16T04:56:38,132 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h' 2026-04-16T04:56:38,134 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h' 2026-04-16T04:56:38,137 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h' 2026-04-16T04:56:38,139 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h' 2026-04-16T04:56:38,141 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h' 2026-04-16T04:56:38,143 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h' 2026-04-16T04:56:38,146 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h' 2026-04-16T04:56:38,153 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp' 2026-04-16T04:56:38,159 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp' 2026-04-16T04:56:38,166 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp' 2026-04-16T04:56:38,169 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp' 2026-04-16T04:56:38,174 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-16T04:56:38,179 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp' 2026-04-16T04:56:38,184 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp' 2026-04-16T04:56:38,189 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp' 2026-04-16T04:56:38,194 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp' 2026-04-16T04:56:38,199 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp' 2026-04-16T04:56:38,201 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp' 2026-04-16T04:56:38,204 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp' 2026-04-16T04:56:38,206 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp' 2026-04-16T04:56:38,211 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp' 2026-04-16T04:56:38,216 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp' 2026-04-16T04:56:38,222 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp' 2026-04-16T04:56:38,226 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp' 2026-04-16T04:56:38,229 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp' 2026-04-16T04:56:38,230 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp' 2026-04-16T04:56:38,235 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp' 2026-04-16T04:56:38,241 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp' 2026-04-16T04:56:38,243 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp' 2026-04-16T04:56:38,246 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp' 2026-04-16T04:56:38,250 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp' 2026-04-16T04:56:38,255 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp' 2026-04-16T04:56:38,257 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp' 2026-04-16T04:56:38,260 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp' 2026-04-16T04:56:38,263 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp' 2026-04-16T04:56:38,265 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp' 2026-04-16T04:56:38,268 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp' 2026-04-16T04:56:38,273 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp' 2026-04-16T04:56:38,276 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h' 2026-04-16T04:56:38,278 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h' 2026-04-16T04:56:38,280 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h' 2026-04-16T04:56:38,282 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp' 2026-04-16T04:56:38,285 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h' 2026-04-16T04:56:38,287 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp' 2026-04-16T04:56:38,288 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp' 2026-04-16T04:56:38,297 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h' 2026-04-16T04:56:38,300 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h' 2026-04-16T04:56:38,302 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h' 2026-04-16T04:56:38,304 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h' 2026-04-16T04:56:38,307 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h' 2026-04-16T04:56:38,309 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h' 2026-04-16T04:56:38,312 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h' 2026-04-16T04:56:38,314 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h' 2026-04-16T04:56:38,316 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h' 2026-04-16T04:56:38,318 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h' 2026-04-16T04:56:38,321 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h' 2026-04-16T04:56:38,323 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h' 2026-04-16T04:56:38,326 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h' 2026-04-16T04:56:38,331 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h' 2026-04-16T04:56:38,334 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h' 2026-04-16T04:56:38,336 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h' 2026-04-16T04:56:38,338 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h' 2026-04-16T04:56:38,340 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h' 2026-04-16T04:56:38,342 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h' 2026-04-16T04:56:38,343 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h' 2026-04-16T04:56:38,345 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h' 2026-04-16T04:56:38,346 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h' 2026-04-16T04:56:38,348 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h' 2026-04-16T04:56:38,350 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h' 2026-04-16T04:56:38,351 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h' 2026-04-16T04:56:38,353 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h' 2026-04-16T04:56:38,356 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h' 2026-04-16T04:56:38,358 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h' 2026-04-16T04:56:38,360 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h' 2026-04-16T04:56:38,362 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h' 2026-04-16T04:56:38,365 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h' 2026-04-16T04:56:38,367 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h' 2026-04-16T04:56:38,369 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h' 2026-04-16T04:56:38,370 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h' 2026-04-16T04:56:38,372 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h' 2026-04-16T04:56:38,375 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h' 2026-04-16T04:56:38,378 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h' 2026-04-16T04:56:38,382 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h' 2026-04-16T04:56:38,384 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h' 2026-04-16T04:56:38,387 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h' 2026-04-16T04:56:38,389 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h' 2026-04-16T04:56:38,391 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h' 2026-04-16T04:56:38,394 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h' 2026-04-16T04:56:38,396 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h' 2026-04-16T04:56:38,399 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h' 2026-04-16T04:56:38,401 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h' 2026-04-16T04:56:38,404 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h' 2026-04-16T04:56:38,406 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h' 2026-04-16T04:56:38,409 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h' 2026-04-16T04:56:38,412 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h' 2026-04-16T04:56:38,415 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h' 2026-04-16T04:56:38,417 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h' 2026-04-16T04:56:38,418 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h' 2026-04-16T04:56:38,420 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h' 2026-04-16T04:56:38,421 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h' 2026-04-16T04:56:38,423 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h' 2026-04-16T04:56:38,424 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h' 2026-04-16T04:56:38,426 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h' 2026-04-16T04:56:38,429 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h' 2026-04-16T04:56:38,432 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h' 2026-04-16T04:56:38,437 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h' 2026-04-16T04:56:38,440 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h' 2026-04-16T04:56:38,442 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h' 2026-04-16T04:56:38,444 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h' 2026-04-16T04:56:38,446 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h' 2026-04-16T04:56:38,448 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h' 2026-04-16T04:56:38,449 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h' 2026-04-16T04:56:38,453 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h' 2026-04-16T04:56:38,455 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h' 2026-04-16T04:56:38,457 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h' 2026-04-16T04:56:38,459 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h' 2026-04-16T04:56:38,462 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h' 2026-04-16T04:56:38,463 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h' 2026-04-16T04:56:38,465 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h' 2026-04-16T04:56:38,467 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h' 2026-04-16T04:56:38,475 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h' 2026-04-16T04:56:38,483 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h' 2026-04-16T04:56:38,487 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h' 2026-04-16T04:56:38,490 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h' 2026-04-16T04:56:38,492 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h' 2026-04-16T04:56:38,494 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h' 2026-04-16T04:56:38,496 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h' 2026-04-16T04:56:38,499 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h' 2026-04-16T04:56:38,500 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h' 2026-04-16T04:56:38,502 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h' 2026-04-16T04:56:38,504 adding 'flashinfer/data/cutlass/include/cutlass/layout/layout.h' 2026-04-16T04:56:38,507 adding 'flashinfer/data/cutlass/include/cutlass/layout/matrix.h' 2026-04-16T04:56:38,510 adding 'flashinfer/data/cutlass/include/cutlass/layout/permute.h' 2026-04-16T04:56:38,511 adding 'flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h' 2026-04-16T04:56:38,513 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor.h' 2026-04-16T04:56:38,515 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h' 2026-04-16T04:56:38,518 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h' 2026-04-16T04:56:38,520 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h' 2026-04-16T04:56:38,522 adding 'flashinfer/data/cutlass/include/cutlass/layout/vector.h' 2026-04-16T04:56:38,524 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp' 2026-04-16T04:56:38,527 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp' 2026-04-16T04:56:38,532 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp' 2026-04-16T04:56:38,536 adding 'flashinfer/data/cutlass/include/cutlass/platform/platform.h' 2026-04-16T04:56:38,538 adding 'flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h' 2026-04-16T04:56:38,540 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h' 2026-04-16T04:56:38,541 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h' 2026-04-16T04:56:38,543 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h' 2026-04-16T04:56:38,545 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h' 2026-04-16T04:56:38,548 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h' 2026-04-16T04:56:38,549 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h' 2026-04-16T04:56:38,552 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h' 2026-04-16T04:56:38,554 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h' 2026-04-16T04:56:38,557 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h' 2026-04-16T04:56:38,558 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h' 2026-04-16T04:56:38,560 adding 'flashinfer/data/cutlass/include/cutlass/thread/matrix.h' 2026-04-16T04:56:38,564 adding 'flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h' 2026-04-16T04:56:38,568 adding 'flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp' 2026-04-16T04:56:38,571 adding 'flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp' 2026-04-16T04:56:38,573 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp' 2026-04-16T04:56:38,576 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp' 2026-04-16T04:56:38,578 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp' 2026-04-16T04:56:38,580 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h' 2026-04-16T04:56:38,582 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h' 2026-04-16T04:56:38,584 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h' 2026-04-16T04:56:38,588 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h' 2026-04-16T04:56:38,591 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h' 2026-04-16T04:56:38,593 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-16T04:56:38,595 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-16T04:56:38,600 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h' 2026-04-16T04:56:38,603 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h' 2026-04-16T04:56:38,605 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h' 2026-04-16T04:56:38,607 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h' 2026-04-16T04:56:38,611 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h' 2026-04-16T04:56:38,614 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h' 2026-04-16T04:56:38,617 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h' 2026-04-16T04:56:38,619 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h' 2026-04-16T04:56:38,620 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h' 2026-04-16T04:56:38,622 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h' 2026-04-16T04:56:38,624 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h' 2026-04-16T04:56:38,626 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h' 2026-04-16T04:56:38,628 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h' 2026-04-16T04:56:38,631 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h' 2026-04-16T04:56:38,633 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h' 2026-04-16T04:56:38,635 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h' 2026-04-16T04:56:38,637 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h' 2026-04-16T04:56:38,639 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h' 2026-04-16T04:56:38,642 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h' 2026-04-16T04:56:38,644 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h' 2026-04-16T04:56:38,646 adding 'flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h' 2026-04-16T04:56:38,648 adding 'flashinfer/data/cutlass/python/setup_cutlass.py' 2026-04-16T04:56:38,649 adding 'flashinfer/data/cutlass/python/setup_library.py' 2026-04-16T04:56:38,651 adding 'flashinfer/data/cutlass/python/setup_pycute.py' 2026-04-16T04:56:38,654 adding 'flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py' 2026-04-16T04:56:38,656 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py' 2026-04-16T04:56:38,657 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py' 2026-04-16T04:56:38,659 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py' 2026-04-16T04:56:38,661 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py' 2026-04-16T04:56:38,663 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py' 2026-04-16T04:56:38,666 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py' 2026-04-16T04:56:38,677 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py' 2026-04-16T04:56:38,680 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py' 2026-04-16T04:56:38,682 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py' 2026-04-16T04:56:38,685 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py' 2026-04-16T04:56:38,694 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py' 2026-04-16T04:56:38,696 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py' 2026-04-16T04:56:38,702 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py' 2026-04-16T04:56:38,709 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py' 2026-04-16T04:56:38,711 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py' 2026-04-16T04:56:38,713 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py' 2026-04-16T04:56:38,715 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py' 2026-04-16T04:56:38,717 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py' 2026-04-16T04:56:38,718 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py' 2026-04-16T04:56:38,720 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py' 2026-04-16T04:56:38,722 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py' 2026-04-16T04:56:38,724 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py' 2026-04-16T04:56:38,725 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py' 2026-04-16T04:56:38,727 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py' 2026-04-16T04:56:38,729 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py' 2026-04-16T04:56:38,732 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py' 2026-04-16T04:56:38,733 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py' 2026-04-16T04:56:38,735 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py' 2026-04-16T04:56:38,736 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py' 2026-04-16T04:56:38,738 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py' 2026-04-16T04:56:38,739 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py' 2026-04-16T04:56:38,741 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py' 2026-04-16T04:56:38,743 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py' 2026-04-16T04:56:38,746 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py' 2026-04-16T04:56:38,749 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py' 2026-04-16T04:56:38,756 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py' 2026-04-16T04:56:38,758 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py' 2026-04-16T04:56:38,760 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py' 2026-04-16T04:56:38,761 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py' 2026-04-16T04:56:38,763 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py' 2026-04-16T04:56:38,764 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py' 2026-04-16T04:56:38,767 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py' 2026-04-16T04:56:38,770 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py' 2026-04-16T04:56:38,772 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py' 2026-04-16T04:56:38,775 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py' 2026-04-16T04:56:38,780 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py' 2026-04-16T04:56:38,799 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py' 2026-04-16T04:56:38,802 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py' 2026-04-16T04:56:38,804 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py' 2026-04-16T04:56:38,808 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py' 2026-04-16T04:56:38,817 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py' 2026-04-16T04:56:38,823 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py' 2026-04-16T04:56:38,825 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py' 2026-04-16T04:56:38,827 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py' 2026-04-16T04:56:38,830 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py' 2026-04-16T04:56:38,831 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py' 2026-04-16T04:56:38,832 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py' 2026-04-16T04:56:38,834 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py' 2026-04-16T04:56:38,836 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py' 2026-04-16T04:56:38,843 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py' 2026-04-16T04:56:38,845 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py' 2026-04-16T04:56:38,846 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py' 2026-04-16T04:56:38,848 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py' 2026-04-16T04:56:38,850 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py' 2026-04-16T04:56:38,851 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py' 2026-04-16T04:56:38,853 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py' 2026-04-16T04:56:38,854 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py' 2026-04-16T04:56:38,857 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py' 2026-04-16T04:56:38,859 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py' 2026-04-16T04:56:38,861 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py' 2026-04-16T04:56:38,862 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py' 2026-04-16T04:56:38,864 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py' 2026-04-16T04:56:38,865 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py' 2026-04-16T04:56:38,867 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py' 2026-04-16T04:56:38,869 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py' 2026-04-16T04:56:38,870 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py' 2026-04-16T04:56:38,872 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py' 2026-04-16T04:56:38,874 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py' 2026-04-16T04:56:38,877 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py' 2026-04-16T04:56:38,879 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py' 2026-04-16T04:56:38,881 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py' 2026-04-16T04:56:38,883 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py' 2026-04-16T04:56:38,886 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py' 2026-04-16T04:56:38,888 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py' 2026-04-16T04:56:38,890 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py' 2026-04-16T04:56:38,892 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py' 2026-04-16T04:56:38,894 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py' 2026-04-16T04:56:38,896 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py' 2026-04-16T04:56:38,897 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py' 2026-04-16T04:56:38,899 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py' 2026-04-16T04:56:38,901 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py' 2026-04-16T04:56:38,903 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py' 2026-04-16T04:56:38,904 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py' 2026-04-16T04:56:38,913 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py' 2026-04-16T04:56:38,917 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py' 2026-04-16T04:56:38,920 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py' 2026-04-16T04:56:38,922 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py' 2026-04-16T04:56:38,923 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py' 2026-04-16T04:56:38,925 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py' 2026-04-16T04:56:38,927 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py' 2026-04-16T04:56:38,928 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py' 2026-04-16T04:56:38,931 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py' 2026-04-16T04:56:38,932 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py' 2026-04-16T04:56:38,935 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py' 2026-04-16T04:56:38,938 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py' 2026-04-16T04:56:38,943 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py' 2026-04-16T04:56:38,945 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py' 2026-04-16T04:56:38,949 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py' 2026-04-16T04:56:38,951 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py' 2026-04-16T04:56:38,953 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py' 2026-04-16T04:56:38,955 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py' 2026-04-16T04:56:38,958 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py' 2026-04-16T04:56:38,961 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py' 2026-04-16T04:56:38,963 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py' 2026-04-16T04:56:38,965 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py' 2026-04-16T04:56:38,966 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py' 2026-04-16T04:56:38,971 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py' 2026-04-16T04:56:38,972 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py' 2026-04-16T04:56:38,975 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py' 2026-04-16T04:56:38,978 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py' 2026-04-16T04:56:38,979 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py' 2026-04-16T04:56:38,981 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py' 2026-04-16T04:56:38,983 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py' 2026-04-16T04:56:38,985 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py' 2026-04-16T04:56:38,988 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py' 2026-04-16T04:56:38,990 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py' 2026-04-16T04:56:38,993 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py' 2026-04-16T04:56:38,995 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/shape.py' 2026-04-16T04:56:38,996 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py' 2026-04-16T04:56:38,998 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py' 2026-04-16T04:56:38,999 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py' 2026-04-16T04:56:39,002 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py' 2026-04-16T04:56:39,004 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py' 2026-04-16T04:56:39,008 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py' 2026-04-16T04:56:39,010 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py' 2026-04-16T04:56:39,011 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py' 2026-04-16T04:56:39,018 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py' 2026-04-16T04:56:39,021 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py' 2026-04-16T04:56:39,023 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py' 2026-04-16T04:56:39,024 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py' 2026-04-16T04:56:39,027 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py' 2026-04-16T04:56:39,028 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py' 2026-04-16T04:56:39,030 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py' 2026-04-16T04:56:39,032 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py' 2026-04-16T04:56:39,034 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py' 2026-04-16T04:56:39,035 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py' 2026-04-16T04:56:39,037 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py' 2026-04-16T04:56:39,038 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py' 2026-04-16T04:56:39,040 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py' 2026-04-16T04:56:39,042 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py' 2026-04-16T04:56:39,043 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py' 2026-04-16T04:56:39,045 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py' 2026-04-16T04:56:39,047 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py' 2026-04-16T04:56:39,049 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py' 2026-04-16T04:56:39,050 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py' 2026-04-16T04:56:39,052 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py' 2026-04-16T04:56:39,054 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py' 2026-04-16T04:56:39,055 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py' 2026-04-16T04:56:39,057 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py' 2026-04-16T04:56:39,060 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py' 2026-04-16T04:56:39,061 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py' 2026-04-16T04:56:39,063 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py' 2026-04-16T04:56:39,065 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py' 2026-04-16T04:56:39,066 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py' 2026-04-16T04:56:39,068 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py' 2026-04-16T04:56:39,070 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py' 2026-04-16T04:56:39,071 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py' 2026-04-16T04:56:39,073 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py' 2026-04-16T04:56:39,074 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py' 2026-04-16T04:56:39,076 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py' 2026-04-16T04:56:39,078 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py' 2026-04-16T04:56:39,079 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py' 2026-04-16T04:56:39,080 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py' 2026-04-16T04:56:39,082 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py' 2026-04-16T04:56:39,083 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py' 2026-04-16T04:56:39,085 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py' 2026-04-16T04:56:39,086 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py' 2026-04-16T04:56:39,088 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py' 2026-04-16T04:56:39,090 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py' 2026-04-16T04:56:39,092 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py' 2026-04-16T04:56:39,094 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py' 2026-04-16T04:56:39,097 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py' 2026-04-16T04:56:39,099 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py' 2026-04-16T04:56:39,101 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py' 2026-04-16T04:56:39,102 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py' 2026-04-16T04:56:39,104 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py' 2026-04-16T04:56:39,109 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py' 2026-04-16T04:56:39,113 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py' 2026-04-16T04:56:39,116 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py' 2026-04-16T04:56:39,118 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py' 2026-04-16T04:56:39,120 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py' 2026-04-16T04:56:39,122 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py' 2026-04-16T04:56:39,124 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py' 2026-04-16T04:56:39,126 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py' 2026-04-16T04:56:39,127 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py' 2026-04-16T04:56:39,130 adding 'flashinfer/data/cutlass/python/cutlass_library/__init__.py' 2026-04-16T04:56:39,133 adding 'flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py' 2026-04-16T04:56:39,135 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py' 2026-04-16T04:56:39,138 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py' 2026-04-16T04:56:39,142 adding 'flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py' 2026-04-16T04:56:39,148 adding 'flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py' 2026-04-16T04:56:39,175 adding 'flashinfer/data/cutlass/python/cutlass_library/generator.py' 2026-04-16T04:56:39,181 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics.py' 2026-04-16T04:56:39,182 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py' 2026-04-16T04:56:39,188 adding 'flashinfer/data/cutlass/python/cutlass_library/library.py' 2026-04-16T04:56:39,192 adding 'flashinfer/data/cutlass/python/cutlass_library/manifest.py' 2026-04-16T04:56:39,195 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py' 2026-04-16T04:56:39,197 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py' 2026-04-16T04:56:39,199 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py' 2026-04-16T04:56:39,201 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py' 2026-04-16T04:56:39,203 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py' 2026-04-16T04:56:39,206 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py' 2026-04-16T04:56:39,208 adding 'flashinfer/data/cutlass/python/cutlass_library/symm_operation.py' 2026-04-16T04:56:39,210 adding 'flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py' 2026-04-16T04:56:39,213 adding 'flashinfer/data/cutlass/python/docs_src/source/conf.py' 2026-04-16T04:56:39,215 adding 'flashinfer/data/cutlass/python/pycute/__init__.py' 2026-04-16T04:56:39,217 adding 'flashinfer/data/cutlass/python/pycute/int_tuple.py' 2026-04-16T04:56:39,219 adding 'flashinfer/data/cutlass/python/pycute/layout.py' 2026-04-16T04:56:39,220 adding 'flashinfer/data/cutlass/python/pycute/swizzle.py' 2026-04-16T04:56:39,222 adding 'flashinfer/data/cutlass/python/pycute/typing.py' 2026-04-16T04:56:39,225 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py' 2026-04-16T04:56:39,227 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py' 2026-04-16T04:56:39,228 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-16T04:56:39,230 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py' 2026-04-16T04:56:39,232 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py' 2026-04-16T04:56:39,233 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py' 2026-04-16T04:56:39,236 adding 'flashinfer/data/cutlass/test/python/cutlass/installation.py' 2026-04-16T04:56:39,238 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py' 2026-04-16T04:56:39,240 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py' 2026-04-16T04:56:39,242 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py' 2026-04-16T04:56:39,244 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py' 2026-04-16T04:56:39,246 adding 'flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py' 2026-04-16T04:56:39,248 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py' 2026-04-16T04:56:39,250 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py' 2026-04-16T04:56:39,251 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py' 2026-04-16T04:56:39,253 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py' 2026-04-16T04:56:39,255 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py' 2026-04-16T04:56:39,256 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py' 2026-04-16T04:56:39,259 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py' 2026-04-16T04:56:39,261 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py' 2026-04-16T04:56:39,262 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py' 2026-04-16T04:56:39,264 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py' 2026-04-16T04:56:39,265 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py' 2026-04-16T04:56:39,267 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py' 2026-04-16T04:56:39,268 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py' 2026-04-16T04:56:39,270 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py' 2026-04-16T04:56:39,271 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py' 2026-04-16T04:56:39,273 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py' 2026-04-16T04:56:39,274 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py' 2026-04-16T04:56:39,276 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py' 2026-04-16T04:56:39,278 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py' 2026-04-16T04:56:39,280 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py' 2026-04-16T04:56:39,282 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py' 2026-04-16T04:56:39,284 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py' 2026-04-16T04:56:39,286 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py' 2026-04-16T04:56:39,288 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/utils.py' 2026-04-16T04:56:39,290 adding 'flashinfer/data/cutlass/test/python/pycute/run_all_tests.py' 2026-04-16T04:56:39,291 adding 'flashinfer/data/cutlass/test/python/pycute/test_coalesce.py' 2026-04-16T04:56:39,293 adding 'flashinfer/data/cutlass/test/python/pycute/test_complement.py' 2026-04-16T04:56:39,294 adding 'flashinfer/data/cutlass/test/python/pycute/test_composition.py' 2026-04-16T04:56:39,295 adding 'flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py' 2026-04-16T04:56:39,297 adding 'flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py' 2026-04-16T04:56:39,298 adding 'flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py' 2026-04-16T04:56:39,300 adding 'flashinfer/data/cutlass/test/python/pycute/test_typing.py' 2026-04-16T04:56:39,303 adding 'flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py' 2026-04-16T04:56:39,307 adding 'flashinfer/data/cutlass/test/utils/test_sharding.py' 2026-04-16T04:56:39,311 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp' 2026-04-16T04:56:39,313 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h' 2026-04-16T04:56:39,315 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp' 2026-04-16T04:56:39,317 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h' 2026-04-16T04:56:39,318 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h' 2026-04-16T04:56:39,321 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h' 2026-04-16T04:56:39,323 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h' 2026-04-16T04:56:39,325 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h' 2026-04-16T04:56:39,327 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h' 2026-04-16T04:56:39,329 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h' 2026-04-16T04:56:39,331 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h' 2026-04-16T04:56:39,333 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h' 2026-04-16T04:56:39,334 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h' 2026-04-16T04:56:39,336 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h' 2026-04-16T04:56:39,337 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h' 2026-04-16T04:56:39,338 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h' 2026-04-16T04:56:39,340 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp' 2026-04-16T04:56:39,342 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp' 2026-04-16T04:56:39,343 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h' 2026-04-16T04:56:39,345 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h' 2026-04-16T04:56:39,348 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h' 2026-04-16T04:56:39,349 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h' 2026-04-16T04:56:39,351 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h' 2026-04-16T04:56:39,354 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp' 2026-04-16T04:56:39,356 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp' 2026-04-16T04:56:39,358 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp' 2026-04-16T04:56:39,360 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h' 2026-04-16T04:56:39,361 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h' 2026-04-16T04:56:39,364 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h' 2026-04-16T04:56:39,365 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h' 2026-04-16T04:56:39,369 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h' 2026-04-16T04:56:39,371 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h' 2026-04-16T04:56:39,373 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h' 2026-04-16T04:56:39,375 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h' 2026-04-16T04:56:39,377 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp' 2026-04-16T04:56:39,379 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h' 2026-04-16T04:56:39,380 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h' 2026-04-16T04:56:39,384 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h' 2026-04-16T04:56:39,386 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h' 2026-04-16T04:56:39,388 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h' 2026-04-16T04:56:39,390 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h' 2026-04-16T04:56:39,392 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h' 2026-04-16T04:56:39,393 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h' 2026-04-16T04:56:39,395 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h' 2026-04-16T04:56:39,397 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h' 2026-04-16T04:56:39,401 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp' 2026-04-16T04:56:39,403 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h' 2026-04-16T04:56:39,405 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h' 2026-04-16T04:56:39,407 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h' 2026-04-16T04:56:39,409 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h' 2026-04-16T04:56:39,410 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h' 2026-04-16T04:56:39,414 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp' 2026-04-16T04:56:39,416 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h' 2026-04-16T04:56:39,418 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h' 2026-04-16T04:56:39,419 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h' 2026-04-16T04:56:39,421 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h' 2026-04-16T04:56:39,423 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h' 2026-04-16T04:56:39,425 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h' 2026-04-16T04:56:39,427 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp' 2026-04-16T04:56:39,428 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h' 2026-04-16T04:56:39,430 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h' 2026-04-16T04:56:39,434 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h' 2026-04-16T04:56:39,436 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp' 2026-04-16T04:56:39,437 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h' 2026-04-16T04:56:39,439 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h' 2026-04-16T04:56:39,440 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h' 2026-04-16T04:56:39,442 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp' 2026-04-16T04:56:39,443 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h' 2026-04-16T04:56:39,445 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h' 2026-04-16T04:56:39,448 adding 'flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py' 2026-04-16T04:56:39,451 adding 'flashinfer/data/include/flashinfer/activation.cuh' 2026-04-16T04:56:39,454 adding 'flashinfer/data/include/flashinfer/air_top_p.cuh' 2026-04-16T04:56:39,455 adding 'flashinfer/data/include/flashinfer/allocator.h' 2026-04-16T04:56:39,457 adding 'flashinfer/data/include/flashinfer/arch_condition.h' 2026-04-16T04:56:39,458 adding 'flashinfer/data/include/flashinfer/attention_impl.cuh' 2026-04-16T04:56:39,460 adding 'flashinfer/data/include/flashinfer/concat_mla.cuh' 2026-04-16T04:56:39,462 adding 'flashinfer/data/include/flashinfer/cp_async.cuh' 2026-04-16T04:56:39,463 adding 'flashinfer/data/include/flashinfer/cubin_loader.h' 2026-04-16T04:56:39,464 adding 'flashinfer/data/include/flashinfer/cutlass_utils.cuh' 2026-04-16T04:56:39,466 adding 'flashinfer/data/include/flashinfer/exception.h' 2026-04-16T04:56:39,467 adding 'flashinfer/data/include/flashinfer/fastdiv.cuh' 2026-04-16T04:56:39,469 adding 'flashinfer/data/include/flashinfer/fp16.h' 2026-04-16T04:56:39,470 adding 'flashinfer/data/include/flashinfer/fp4_layout.cuh' 2026-04-16T04:56:39,472 adding 'flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh' 2026-04-16T04:56:39,473 adding 'flashinfer/data/include/flashinfer/layout.cuh' 2026-04-16T04:56:39,474 adding 'flashinfer/data/include/flashinfer/logging.h' 2026-04-16T04:56:39,476 adding 'flashinfer/data/include/flashinfer/math.cuh' 2026-04-16T04:56:39,478 adding 'flashinfer/data/include/flashinfer/mma.cuh' 2026-04-16T04:56:39,481 adding 'flashinfer/data/include/flashinfer/norm.cuh' 2026-04-16T04:56:39,484 adding 'flashinfer/data/include/flashinfer/page.cuh' 2026-04-16T04:56:39,486 adding 'flashinfer/data/include/flashinfer/permuted_smem.cuh' 2026-04-16T04:56:39,491 adding 'flashinfer/data/include/flashinfer/pos_enc.cuh' 2026-04-16T04:56:39,493 adding 'flashinfer/data/include/flashinfer/profiler.cuh' 2026-04-16T04:56:39,495 adding 'flashinfer/data/include/flashinfer/quantization.cuh' 2026-04-16T04:56:39,501 adding 'flashinfer/data/include/flashinfer/sampling.cuh' 2026-04-16T04:56:39,514 adding 'flashinfer/data/include/flashinfer/topk.cuh' 2026-04-16T04:56:39,517 adding 'flashinfer/data/include/flashinfer/utils.cuh' 2026-04-16T04:56:39,521 adding 'flashinfer/data/include/flashinfer/vec_dtypes.cuh' 2026-04-16T04:56:39,525 adding 'flashinfer/data/include/flashinfer/attention/batch_pod.cuh' 2026-04-16T04:56:39,528 adding 'flashinfer/data/include/flashinfer/attention/cascade.cuh' 2026-04-16T04:56:39,529 adding 'flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh' 2026-04-16T04:56:39,534 adding 'flashinfer/data/include/flashinfer/attention/decode.cuh' 2026-04-16T04:56:39,537 adding 'flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh' 2026-04-16T04:56:39,539 adding 'flashinfer/data/include/flashinfer/attention/default_decode_params.cuh' 2026-04-16T04:56:39,540 adding 'flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh' 2026-04-16T04:56:39,542 adding 'flashinfer/data/include/flashinfer/attention/heap.h' 2026-04-16T04:56:39,543 adding 'flashinfer/data/include/flashinfer/attention/hopper.cuh' 2026-04-16T04:56:39,545 adding 'flashinfer/data/include/flashinfer/attention/mask.cuh' 2026-04-16T04:56:39,549 adding 'flashinfer/data/include/flashinfer/attention/mla.cuh' 2026-04-16T04:56:39,553 adding 'flashinfer/data/include/flashinfer/attention/mla_hopper.cuh' 2026-04-16T04:56:39,555 adding 'flashinfer/data/include/flashinfer/attention/mla_params.cuh' 2026-04-16T04:56:39,558 adding 'flashinfer/data/include/flashinfer/attention/persistent.cuh' 2026-04-16T04:56:39,560 adding 'flashinfer/data/include/flashinfer/attention/persistent_template.cuh' 2026-04-16T04:56:39,562 adding 'flashinfer/data/include/flashinfer/attention/pod.cuh' 2026-04-16T04:56:39,571 adding 'flashinfer/data/include/flashinfer/attention/prefill.cuh' 2026-04-16T04:56:39,579 adding 'flashinfer/data/include/flashinfer/attention/scheduler.cuh' 2026-04-16T04:56:39,581 adding 'flashinfer/data/include/flashinfer/attention/state.cuh' 2026-04-16T04:56:39,583 adding 'flashinfer/data/include/flashinfer/attention/variant_helper.cuh' 2026-04-16T04:56:39,584 adding 'flashinfer/data/include/flashinfer/attention/variants.cuh' 2026-04-16T04:56:39,586 adding 'flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh' 2026-04-16T04:56:39,588 adding 'flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh' 2026-04-16T04:56:39,590 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp' 2026-04-16T04:56:39,592 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp' 2026-04-16T04:56:39,594 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp' 2026-04-16T04:56:39,598 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp' 2026-04-16T04:56:39,600 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp' 2026-04-16T04:56:39,604 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp' 2026-04-16T04:56:39,607 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp' 2026-04-16T04:56:39,609 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp' 2026-04-16T04:56:39,611 adding 'flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp' 2026-04-16T04:56:39,613 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp' 2026-04-16T04:56:39,615 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp' 2026-04-16T04:56:39,617 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp' 2026-04-16T04:56:39,618 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp' 2026-04-16T04:56:39,620 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp' 2026-04-16T04:56:39,622 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp' 2026-04-16T04:56:39,625 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp' 2026-04-16T04:56:39,626 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp' 2026-04-16T04:56:39,634 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp' 2026-04-16T04:56:39,636 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp' 2026-04-16T04:56:39,638 adding 'flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh' 2026-04-16T04:56:39,639 adding 'flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh' 2026-04-16T04:56:39,641 adding 'flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh' 2026-04-16T04:56:39,643 adding 'flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh' 2026-04-16T04:56:39,645 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh' 2026-04-16T04:56:39,647 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh' 2026-04-16T04:56:39,648 adding 'flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh' 2026-04-16T04:56:39,651 adding 'flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh' 2026-04-16T04:56:39,654 adding 'flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh' 2026-04-16T04:56:39,655 adding 'flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh' 2026-04-16T04:56:39,657 adding 'flashinfer/data/include/flashinfer/attention/hopper/utils.cuh' 2026-04-16T04:56:39,658 adding 'flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh' 2026-04-16T04:56:39,660 adding 'flashinfer/data/include/flashinfer/attention/hopper/variants.cuh' 2026-04-16T04:56:39,662 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh' 2026-04-16T04:56:39,664 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh' 2026-04-16T04:56:39,666 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh' 2026-04-16T04:56:39,668 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh' 2026-04-16T04:56:39,671 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh' 2026-04-16T04:56:39,674 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh' 2026-04-16T04:56:39,681 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh' 2026-04-16T04:56:39,687 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh' 2026-04-16T04:56:39,691 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh' 2026-04-16T04:56:39,693 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh' 2026-04-16T04:56:39,697 adding 'flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh' 2026-04-16T04:56:39,703 adding 'flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh' 2026-04-16T04:56:39,706 adding 'flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh' 2026-04-16T04:56:39,708 adding 'flashinfer/data/include/flashinfer/flat/common.hpp' 2026-04-16T04:56:39,709 adding 'flashinfer/data/include/flashinfer/flat/cute_ext.hpp' 2026-04-16T04:56:39,711 adding 'flashinfer/data/include/flashinfer/flat/debug.hpp' 2026-04-16T04:56:39,712 adding 'flashinfer/data/include/flashinfer/flat/math.hpp' 2026-04-16T04:56:39,713 adding 'flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp' 2026-04-16T04:56:39,714 adding 'flashinfer/data/include/flashinfer/flat/type_traits.hpp' 2026-04-16T04:56:39,716 adding 'flashinfer/data/include/flashinfer/flat/unused.hpp' 2026-04-16T04:56:39,719 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp' 2026-04-16T04:56:39,720 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp' 2026-04-16T04:56:39,723 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp' 2026-04-16T04:56:39,725 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp' 2026-04-16T04:56:39,731 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp' 2026-04-16T04:56:39,732 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp' 2026-04-16T04:56:39,734 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp' 2026-04-16T04:56:39,736 adding 'flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp' 2026-04-16T04:56:39,738 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp' 2026-04-16T04:56:39,740 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp' 2026-04-16T04:56:39,742 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp' 2026-04-16T04:56:39,743 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp' 2026-04-16T04:56:39,745 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp' 2026-04-16T04:56:39,747 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh' 2026-04-16T04:56:39,749 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h' 2026-04-16T04:56:39,751 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h' 2026-04-16T04:56:39,753 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h' 2026-04-16T04:56:39,754 adding 'flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh' 2026-04-16T04:56:39,757 adding 'flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h' 2026-04-16T04:56:39,758 adding 'flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh' 2026-04-16T04:56:39,760 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h' 2026-04-16T04:56:39,762 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h' 2026-04-16T04:56:39,764 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h' 2026-04-16T04:56:39,766 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h' 2026-04-16T04:56:39,768 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h' 2026-04-16T04:56:39,771 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h' 2026-04-16T04:56:39,773 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h' 2026-04-16T04:56:39,775 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h' 2026-04-16T04:56:39,777 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h' 2026-04-16T04:56:39,778 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h' 2026-04-16T04:56:39,780 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh' 2026-04-16T04:56:39,782 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh' 2026-04-16T04:56:39,784 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm.cuh' 2026-04-16T04:56:39,785 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh' 2026-04-16T04:56:39,787 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh' 2026-04-16T04:56:39,789 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh' 2026-04-16T04:56:39,791 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh' 2026-04-16T04:56:39,793 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh' 2026-04-16T04:56:39,796 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh' 2026-04-16T04:56:39,798 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh' 2026-04-16T04:56:39,799 adding 'flashinfer/data/include/flashinfer/gemm/group_gemv.cuh' 2026-04-16T04:56:39,801 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h' 2026-04-16T04:56:39,803 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h' 2026-04-16T04:56:39,804 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h' 2026-04-16T04:56:39,807 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h' 2026-04-16T04:56:39,809 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm120.h' 2026-04-16T04:56:39,818 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh' 2026-04-16T04:56:39,820 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h' 2026-04-16T04:56:39,822 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h' 2026-04-16T04:56:39,824 adding 'flashinfer/data/include/flashinfer/mamba/common.cuh' 2026-04-16T04:56:39,826 adding 'flashinfer/data/include/flashinfer/mamba/conversion.cuh' 2026-04-16T04:56:39,827 adding 'flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh' 2026-04-16T04:56:39,829 adding 'flashinfer/data/include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh' 2026-04-16T04:56:39,832 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh' 2026-04-16T04:56:39,835 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh' 2026-04-16T04:56:39,838 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh' 2026-04-16T04:56:39,841 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh' 2026-04-16T04:56:39,846 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh' 2026-04-16T04:56:39,848 adding 'flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh' 2026-04-16T04:56:39,850 adding 'flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh' 2026-04-16T04:56:39,851 adding 'flashinfer/data/include/flashinfer/mamba/ssu_mtp_common.cuh' 2026-04-16T04:56:39,854 adding 'flashinfer/data/include/flashinfer/norm/ln_fwd_silu_kernel.cuh' 2026-04-16T04:56:39,862 adding 'flashinfer/data/include/flashinfer/norm/ln_silu_headers.cuh' 2026-04-16T04:56:39,865 adding 'flashinfer/data/include/flashinfer/trtllm/common.h' 2026-04-16T04:56:39,867 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h' 2026-04-16T04:56:39,869 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh' 2026-04-16T04:56:39,870 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h' 2026-04-16T04:56:39,872 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h' 2026-04-16T04:56:39,874 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh' 2026-04-16T04:56:39,875 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h' 2026-04-16T04:56:39,877 adding 'flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh' 2026-04-16T04:56:39,880 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h' 2026-04-16T04:56:39,881 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h' 2026-04-16T04:56:39,886 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh' 2026-04-16T04:56:39,887 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h' 2026-04-16T04:56:39,889 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh' 2026-04-16T04:56:39,891 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h' 2026-04-16T04:56:39,896 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h' 2026-04-16T04:56:39,897 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h' 2026-04-16T04:56:39,899 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh' 2026-04-16T04:56:39,901 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h' 2026-04-16T04:56:39,902 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h' 2026-04-16T04:56:39,906 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh' 2026-04-16T04:56:39,908 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h' 2026-04-16T04:56:39,912 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh' 2026-04-16T04:56:39,914 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h' 2026-04-16T04:56:39,916 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh' 2026-04-16T04:56:39,917 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h' 2026-04-16T04:56:39,920 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h' 2026-04-16T04:56:39,923 adding 'flashinfer/data/spdlog/include/spdlog/async.h' 2026-04-16T04:56:39,924 adding 'flashinfer/data/spdlog/include/spdlog/async_logger-inl.h' 2026-04-16T04:56:39,925 adding 'flashinfer/data/spdlog/include/spdlog/async_logger.h' 2026-04-16T04:56:39,927 adding 'flashinfer/data/spdlog/include/spdlog/common-inl.h' 2026-04-16T04:56:39,929 adding 'flashinfer/data/spdlog/include/spdlog/common.h' 2026-04-16T04:56:39,930 adding 'flashinfer/data/spdlog/include/spdlog/formatter.h' 2026-04-16T04:56:39,931 adding 'flashinfer/data/spdlog/include/spdlog/fwd.h' 2026-04-16T04:56:39,933 adding 'flashinfer/data/spdlog/include/spdlog/logger-inl.h' 2026-04-16T04:56:39,935 adding 'flashinfer/data/spdlog/include/spdlog/logger.h' 2026-04-16T04:56:39,936 adding 'flashinfer/data/spdlog/include/spdlog/mdc.h' 2026-04-16T04:56:39,940 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h' 2026-04-16T04:56:39,942 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter.h' 2026-04-16T04:56:39,943 adding 'flashinfer/data/spdlog/include/spdlog/spdlog-inl.h' 2026-04-16T04:56:39,945 adding 'flashinfer/data/spdlog/include/spdlog/spdlog.h' 2026-04-16T04:56:39,946 adding 'flashinfer/data/spdlog/include/spdlog/stopwatch.h' 2026-04-16T04:56:39,948 adding 'flashinfer/data/spdlog/include/spdlog/tweakme.h' 2026-04-16T04:56:39,949 adding 'flashinfer/data/spdlog/include/spdlog/version.h' 2026-04-16T04:56:39,951 adding 'flashinfer/data/spdlog/include/spdlog/cfg/argv.h' 2026-04-16T04:56:39,952 adding 'flashinfer/data/spdlog/include/spdlog/cfg/env.h' 2026-04-16T04:56:39,953 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h' 2026-04-16T04:56:39,955 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers.h' 2026-04-16T04:56:39,957 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h' 2026-04-16T04:56:39,959 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer.h' 2026-04-16T04:56:39,960 adding 'flashinfer/data/spdlog/include/spdlog/details/circular_q.h' 2026-04-16T04:56:39,961 adding 'flashinfer/data/spdlog/include/spdlog/details/console_globals.h' 2026-04-16T04:56:39,963 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h' 2026-04-16T04:56:39,964 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper.h' 2026-04-16T04:56:39,966 adding 'flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h' 2026-04-16T04:56:39,967 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h' 2026-04-16T04:56:39,968 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg.h' 2026-04-16T04:56:39,969 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h' 2026-04-16T04:56:39,970 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h' 2026-04-16T04:56:39,972 adding 'flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h' 2026-04-16T04:56:39,973 adding 'flashinfer/data/spdlog/include/spdlog/details/null_mutex.h' 2026-04-16T04:56:39,975 adding 'flashinfer/data/spdlog/include/spdlog/details/os-inl.h' 2026-04-16T04:56:39,977 adding 'flashinfer/data/spdlog/include/spdlog/details/os.h' 2026-04-16T04:56:39,978 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h' 2026-04-16T04:56:39,980 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h' 2026-04-16T04:56:39,981 adding 'flashinfer/data/spdlog/include/spdlog/details/registry-inl.h' 2026-04-16T04:56:39,983 adding 'flashinfer/data/spdlog/include/spdlog/details/registry.h' 2026-04-16T04:56:39,984 adding 'flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h' 2026-04-16T04:56:39,985 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h' 2026-04-16T04:56:39,986 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client.h' 2026-04-16T04:56:39,988 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h' 2026-04-16T04:56:39,989 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool.h' 2026-04-16T04:56:39,991 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h' 2026-04-16T04:56:39,992 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client.h' 2026-04-16T04:56:39,993 adding 'flashinfer/data/spdlog/include/spdlog/details/windows_include.h' 2026-04-16T04:56:39,995 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h' 2026-04-16T04:56:39,997 adding 'flashinfer/data/spdlog/include/spdlog/fmt/chrono.h' 2026-04-16T04:56:39,998 adding 'flashinfer/data/spdlog/include/spdlog/fmt/compile.h' 2026-04-16T04:56:39,999 adding 'flashinfer/data/spdlog/include/spdlog/fmt/fmt.h' 2026-04-16T04:56:40,000 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ostr.h' 2026-04-16T04:56:40,002 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ranges.h' 2026-04-16T04:56:40,003 adding 'flashinfer/data/spdlog/include/spdlog/fmt/std.h' 2026-04-16T04:56:40,004 adding 'flashinfer/data/spdlog/include/spdlog/fmt/xchar.h' 2026-04-16T04:56:40,007 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h' 2026-04-16T04:56:40,014 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h' 2026-04-16T04:56:40,018 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h' 2026-04-16T04:56:40,021 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h' 2026-04-16T04:56:40,033 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h' 2026-04-16T04:56:40,035 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst' 2026-04-16T04:56:40,044 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h' 2026-04-16T04:56:40,065 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h' 2026-04-16T04:56:40,067 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h' 2026-04-16T04:56:40,069 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h' 2026-04-16T04:56:40,071 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h' 2026-04-16T04:56:40,074 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h' 2026-04-16T04:56:40,077 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h' 2026-04-16T04:56:40,079 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h' 2026-04-16T04:56:40,081 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h' 2026-04-16T04:56:40,084 adding 'flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h' 2026-04-16T04:56:40,085 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h' 2026-04-16T04:56:40,086 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h' 2026-04-16T04:56:40,088 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h' 2026-04-16T04:56:40,089 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h' 2026-04-16T04:56:40,090 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h' 2026-04-16T04:56:40,091 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h' 2026-04-16T04:56:40,092 adding 'flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h' 2026-04-16T04:56:40,094 adding 'flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h' 2026-04-16T04:56:40,095 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h' 2026-04-16T04:56:40,097 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h' 2026-04-16T04:56:40,099 adding 'flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h' 2026-04-16T04:56:40,100 adding 'flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h' 2026-04-16T04:56:40,101 adding 'flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h' 2026-04-16T04:56:40,103 adding 'flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h' 2026-04-16T04:56:40,104 adding 'flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h' 2026-04-16T04:56:40,105 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h' 2026-04-16T04:56:40,107 adding 'flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h' 2026-04-16T04:56:40,108 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h' 2026-04-16T04:56:40,110 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h' 2026-04-16T04:56:40,111 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h' 2026-04-16T04:56:40,113 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h' 2026-04-16T04:56:40,114 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink.h' 2026-04-16T04:56:40,115 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h' 2026-04-16T04:56:40,116 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h' 2026-04-16T04:56:40,118 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h' 2026-04-16T04:56:40,119 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h' 2026-04-16T04:56:40,120 adding 'flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h' 2026-04-16T04:56:40,122 adding 'flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h' 2026-04-16T04:56:40,123 adding 'flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h' 2026-04-16T04:56:40,125 adding 'flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h' 2026-04-16T04:56:40,126 adding 'flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h' 2026-04-16T04:56:40,128 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h' 2026-04-16T04:56:40,129 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h' 2026-04-16T04:56:40,131 adding 'flashinfer/data/spdlog/scripts/extract_version.py' 2026-04-16T04:56:40,133 adding 'flashinfer/dsv3_ops/__init__.py' 2026-04-16T04:56:40,135 adding 'flashinfer/fused_moe/__init__.py' 2026-04-16T04:56:40,143 adding 'flashinfer/fused_moe/core.py' 2026-04-16T04:56:40,146 adding 'flashinfer/fused_moe/fused_routing_dsv3.py' 2026-04-16T04:56:40,148 adding 'flashinfer/fused_moe/utils.py' 2026-04-16T04:56:40,150 adding 'flashinfer/fused_moe/cute_dsl/__init__.py' 2026-04-16T04:56:40,153 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-16T04:56:40,156 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-16T04:56:40,159 adding 'flashinfer/fused_moe/cute_dsl/fused_moe.py' 2026-04-16T04:56:40,162 adding 'flashinfer/fused_moe/cute_dsl/moe_utils.py' 2026-04-16T04:56:40,165 adding 'flashinfer/fused_moe/cute_dsl/tuner.py' 2026-04-16T04:56:40,167 adding 'flashinfer/fused_moe/cute_dsl/blackwell/__init__.py' 2026-04-16T04:56:40,180 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-16T04:56:40,190 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-16T04:56:40,193 adding 'flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py' 2026-04-16T04:56:40,195 adding 'flashinfer/fused_moe/cute_dsl/blackwell/utils.py' 2026-04-16T04:56:40,197 adding 'flashinfer/gdn_kernels/__init__.py' 2026-04-16T04:56:40,205 adding 'flashinfer/gdn_kernels/gdn_decode_bf16_state.py' 2026-04-16T04:56:40,213 adding 'flashinfer/gdn_kernels/gdn_decode_mtp.py' 2026-04-16T04:56:40,217 adding 'flashinfer/gdn_kernels/gdn_decode_nontranspose.py' 2026-04-16T04:56:40,220 adding 'flashinfer/gdn_kernels/gdn_decode_pretranspose.py' 2026-04-16T04:56:40,223 adding 'flashinfer/gdn_kernels/blackwell/__init__.py' 2026-04-16T04:56:40,236 adding 'flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py' 2026-04-16T04:56:40,239 adding 'flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py' 2026-04-16T04:56:40,241 adding 'flashinfer/gdn_kernels/blackwell/gdn_prefill.py' 2026-04-16T04:56:40,243 adding 'flashinfer/gemm/__init__.py' 2026-04-16T04:56:40,265 adding 'flashinfer/gemm/gemm_base.py' 2026-04-16T04:56:40,268 adding 'flashinfer/gemm/routergemm.py' 2026-04-16T04:56:40,270 adding 'flashinfer/gemm/kernels/__init__.py' 2026-04-16T04:56:40,278 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py' 2026-04-16T04:56:40,288 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py' 2026-04-16T04:56:40,299 adding 'flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py' 2026-04-16T04:56:40,302 adding 'flashinfer/jit/__init__.py' 2026-04-16T04:56:40,303 adding 'flashinfer/jit/activation.py' 2026-04-16T04:56:40,305 adding 'flashinfer/jit/cascade.py' 2026-04-16T04:56:40,306 adding 'flashinfer/jit/comm.py' 2026-04-16T04:56:40,308 adding 'flashinfer/jit/core.py' 2026-04-16T04:56:40,310 adding 'flashinfer/jit/cpp_ext.py' 2026-04-16T04:56:40,313 adding 'flashinfer/jit/cubin_loader.py' 2026-04-16T04:56:40,314 adding 'flashinfer/jit/dsv3_optimizations.py' 2026-04-16T04:56:40,315 adding 'flashinfer/jit/env.py' 2026-04-16T04:56:40,317 adding 'flashinfer/jit/fp4_kv_dequantization.py' 2026-04-16T04:56:40,318 adding 'flashinfer/jit/fp4_kv_quantization.py' 2026-04-16T04:56:40,319 adding 'flashinfer/jit/fp4_quantization.py' 2026-04-16T04:56:40,321 adding 'flashinfer/jit/fp8_quantization.py' 2026-04-16T04:56:40,323 adding 'flashinfer/jit/fused_moe.py' 2026-04-16T04:56:40,324 adding 'flashinfer/jit/gdn.py' 2026-04-16T04:56:40,325 adding 'flashinfer/jit/mla.py' 2026-04-16T04:56:40,327 adding 'flashinfer/jit/moe_utils.py' 2026-04-16T04:56:40,328 adding 'flashinfer/jit/norm.py' 2026-04-16T04:56:40,329 adding 'flashinfer/jit/page.py' 2026-04-16T04:56:40,331 adding 'flashinfer/jit/quantization.py' 2026-04-16T04:56:40,333 adding 'flashinfer/jit/rmsnorm_silu.py' 2026-04-16T04:56:40,334 adding 'flashinfer/jit/rope.py' 2026-04-16T04:56:40,335 adding 'flashinfer/jit/sampling.py' 2026-04-16T04:56:40,337 adding 'flashinfer/jit/spdlog.py' 2026-04-16T04:56:40,338 adding 'flashinfer/jit/tinygemm2.py' 2026-04-16T04:56:40,339 adding 'flashinfer/jit/tllm_utils.py' 2026-04-16T04:56:40,340 adding 'flashinfer/jit/topk.py' 2026-04-16T04:56:40,342 adding 'flashinfer/jit/utils.py' 2026-04-16T04:56:40,343 adding 'flashinfer/jit/xqa.py' 2026-04-16T04:56:40,345 adding 'flashinfer/jit/attention/__init__.py' 2026-04-16T04:56:40,350 adding 'flashinfer/jit/attention/modules.py' 2026-04-16T04:56:40,352 adding 'flashinfer/jit/attention/utils.py' 2026-04-16T04:56:40,353 adding 'flashinfer/jit/attention/variants.py' 2026-04-16T04:56:40,359 adding 'flashinfer/jit/attention/fmha_v2/fmha_library.py' 2026-04-16T04:56:40,360 adding 'flashinfer/jit/attention/fmha_v2/generate_kernels.py' 2026-04-16T04:56:40,376 adding 'flashinfer/jit/attention/fmha_v2/generator_utils.py' 2026-04-16T04:56:40,381 adding 'flashinfer/jit/attention/fmha_v2/utils.py' 2026-04-16T04:56:40,383 adding 'flashinfer/jit/gemm/__init__.py' 2026-04-16T04:56:40,385 adding 'flashinfer/jit/gemm/core.py' 2026-04-16T04:56:40,387 adding 'flashinfer/jit/gemm/deepgemm.py' 2026-04-16T04:56:40,388 adding 'flashinfer/jit/gemm/fp8_blockscale.py' 2026-04-16T04:56:40,390 adding 'flashinfer/jit/gemm/cutlass/__init__.py' 2026-04-16T04:56:40,394 adding 'flashinfer/jit/gemm/cutlass/cutlass_library.py' 2026-04-16T04:56:40,398 adding 'flashinfer/jit/gemm/cutlass/generate_kernels.py' 2026-04-16T04:56:40,400 adding 'flashinfer/jit/mamba/__init__.py' 2026-04-16T04:56:40,402 adding 'flashinfer/jit/mamba/selective_state_update.py' 2026-04-16T04:56:40,403 adding 'flashinfer/jit/mamba/seq_chunk_cumsum.py' 2026-04-16T04:56:40,405 adding 'flashinfer/logits_processor/__init__.py' 2026-04-16T04:56:40,406 adding 'flashinfer/logits_processor/compiler.py' 2026-04-16T04:56:40,407 adding 'flashinfer/logits_processor/fusion_rules.py' 2026-04-16T04:56:40,409 adding 'flashinfer/logits_processor/legalization.py' 2026-04-16T04:56:40,410 adding 'flashinfer/logits_processor/op.py' 2026-04-16T04:56:40,412 adding 'flashinfer/logits_processor/operators.py' 2026-04-16T04:56:40,414 adding 'flashinfer/logits_processor/pipeline.py' 2026-04-16T04:56:40,416 adding 'flashinfer/logits_processor/processors.py' 2026-04-16T04:56:40,417 adding 'flashinfer/logits_processor/types.py' 2026-04-16T04:56:40,418 adding 'flashinfer/logits_processor/validators.py' 2026-04-16T04:56:40,420 adding 'flashinfer/mamba/__init__.py' 2026-04-16T04:56:40,422 adding 'flashinfer/mamba/selective_state_update.py' 2026-04-16T04:56:40,425 adding 'flashinfer/mamba/ssd_combined.py' 2026-04-16T04:56:40,439 adding 'flashinfer/mamba/ssd_kernel.py' 2026-04-16T04:56:40,442 adding 'flashinfer/mamba/ssd_tile_scheduler.py' 2026-04-16T04:56:40,444 adding 'flashinfer/mla/__init__.py' 2026-04-16T04:56:40,448 adding 'flashinfer/mla/_core.py' 2026-04-16T04:56:40,451 adding 'flashinfer/norm/__init__.py' 2026-04-16T04:56:40,454 adding 'flashinfer/norm/utils.py' 2026-04-16T04:56:40,456 adding 'flashinfer/norm/kernels/__init__.py' 2026-04-16T04:56:40,459 adding 'flashinfer/norm/kernels/fused_add_rmsnorm.py' 2026-04-16T04:56:40,462 adding 'flashinfer/norm/kernels/layernorm.py' 2026-04-16T04:56:40,465 adding 'flashinfer/norm/kernels/rmsnorm.py' 2026-04-16T04:56:40,468 adding 'flashinfer/profiler/__init__.py' 2026-04-16T04:56:40,470 adding 'flashinfer/quantization/__init__.py' 2026-04-16T04:56:40,475 adding 'flashinfer/quantization/fp4_quantization.py' 2026-04-16T04:56:40,477 adding 'flashinfer/quantization/fp8_quantization.py' 2026-04-16T04:56:40,478 adding 'flashinfer/quantization/packbits.py' 2026-04-16T04:56:40,482 adding 'flashinfer/quantization/quantization_cute_dsl_utils.py' 2026-04-16T04:56:40,485 adding 'flashinfer/quantization/kernels/__init__.py' 2026-04-16T04:56:40,488 adding 'flashinfer/quantization/kernels/mxfp4_quantize.py' 2026-04-16T04:56:40,492 adding 'flashinfer/quantization/kernels/mxfp8_quantize.py' 2026-04-16T04:56:40,498 adding 'flashinfer/quantization/kernels/nvfp4_quantize.py' 2026-04-16T04:56:40,500 adding 'flashinfer/testing/__init__.py' 2026-04-16T04:56:40,506 adding 'flashinfer/testing/utils.py' 2026-04-16T04:56:40,508 adding 'flashinfer/triton/__init__.py' 2026-04-16T04:56:40,510 adding 'flashinfer/triton/activation.py' 2026-04-16T04:56:40,511 adding 'flashinfer/triton/cascade.py' 2026-04-16T04:56:40,512 adding 'flashinfer/triton/gemm.py' 2026-04-16T04:56:40,514 adding 'flashinfer/triton/norm.py' 2026-04-16T04:56:40,515 adding 'flashinfer/triton/page.py' 2026-04-16T04:56:40,516 adding 'flashinfer/triton/sm_constraint_gemm.py' 2026-04-16T04:56:40,518 adding 'flashinfer/triton/utils.py' 2026-04-16T04:56:40,519 adding 'flashinfer/triton/kernels/__init__.py' 2026-04-16T04:56:40,521 adding 'flashinfer/triton/kernels/activation.py' 2026-04-16T04:56:40,522 adding 'flashinfer/triton/kernels/cascade.py' 2026-04-16T04:56:40,523 adding 'flashinfer/triton/kernels/norm.py' 2026-04-16T04:56:40,524 adding 'flashinfer/triton/kernels/quant.py' 2026-04-16T04:56:40,526 adding 'flashinfer/triton/kernels/sm_constraint_gemm.py' 2026-04-16T04:56:40,528 adding 'flashinfer/triton/kernels/ssd_chunk_state.py' 2026-04-16T04:56:40,530 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py' 2026-04-16T04:56:40,531 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py' 2026-04-16T04:56:40,535 adding 'flashinfer_python-0.6.8.dist-info/licenses/LICENSE' 2026-04-16T04:56:40,537 adding 'flashinfer_python-0.6.8.dist-info/METADATA' 2026-04-16T04:56:40,538 adding 'flashinfer_python-0.6.8.dist-info/WHEEL' 2026-04-16T04:56:40,539 adding 'flashinfer_python-0.6.8.dist-info/entry_points.txt' 2026-04-16T04:56:40,540 adding 'flashinfer_python-0.6.8.dist-info/top_level.txt' 2026-04-16T04:56:40,583 adding 'flashinfer_python-0.6.8.dist-info/RECORD' 2026-04-16T04:56:40,699 removing build/bdist.linux-armv7l/wheel 2026-04-16T04:56:41,447 Building wheel for flashinfer-python (pyproject.toml): finished with status 'done' 2026-04-16T04:56:41,626 Created wheel for flashinfer-python: filename=flashinfer_python-0.6.8-py3-none-any.whl size=9385227 sha256=98f06ee98dd03f9d20637980976634ad9f62e7072236284bf8ef0f6ea644d6f1 2026-04-16T04:56:41,627 Stored in directory: /tmp/pip-ephem-wheel-cache-q06utz8s/wheels/16/4e/11/f23fd2ae76d4a82e3e7fc575d49c430161bec3b4fca5dbc7b6 2026-04-16T04:56:41,710 Successfully built flashinfer-python 2026-04-16T04:56:41,938 Removed build tracker: '/tmp/pip-build-tracker-2hn1r72n'