2025-10-10T01:28:43,039 Created temporary directory: /tmp/pip-ephem-wheel-cache-2csiszy1 2025-10-10T01:28:43,041 Created temporary directory: /tmp/pip-build-tracker-7ss9dt55 2025-10-10T01:28:43,042 Initialized build tracking at /tmp/pip-build-tracker-7ss9dt55 2025-10-10T01:28:43,043 Created build tracker: /tmp/pip-build-tracker-7ss9dt55 2025-10-10T01:28:43,043 Entered build tracker: /tmp/pip-build-tracker-7ss9dt55 2025-10-10T01:28:43,044 Created temporary directory: /tmp/pip-wheel-5jw51xwk 2025-10-10T01:28:43,047 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-10-10T01:28:43,050 Created temporary directory: /tmp/pip-ephem-wheel-cache-4qmj4zzp 2025-10-10T01:28:43,073 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-10-10T01:28:43,077 2 location(s) to search for versions of flashinfer-python: 2025-10-10T01:28:43,077 * https://pypi.org/simple/flashinfer-python/ 2025-10-10T01:28:43,077 * https://www.piwheels.org/simple/flashinfer-python/ 2025-10-10T01:28:43,078 Fetching project page and analyzing links: https://pypi.org/simple/flashinfer-python/ 2025-10-10T01:28:43,079 Getting page https://pypi.org/simple/flashinfer-python/ 2025-10-10T01:28:43,080 Found index url https://pypi.org/simple 2025-10-10T01:28:43,232 Fetched page https://pypi.org/simple/flashinfer-python/ as application/vnd.pypi.simple.v1+json 2025-10-10T01:28:43,241 Found link https://files.pythonhosted.org/packages/6c/e9/5d6adcf888922a17c6fc52a0e5bed78785239af1219f41e1073b063a07ff/flashinfer_python-0.2.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post1 2025-10-10T01:28:43,242 Found link https://files.pythonhosted.org/packages/c8/39/bac839234a3beaab4292e489b4d8941cc97ba4f76474aff0407d7b05a84f/flashinfer_python-0.2.0.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post2 2025-10-10T01:28:43,244 Found link https://files.pythonhosted.org/packages/94/74/4dda2a7a7aa08bcfb8039faf2202bf0fea6b378d0d4968864737400fc329/flashinfer_python-0.2.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1 2025-10-10T01:28:43,245 Found link https://files.pythonhosted.org/packages/7f/3d/aab500609825108d3f6a4b440a7eeb6436d578d3e781e97ea015fd49a530/flashinfer_python-0.2.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post1 2025-10-10T01:28:43,246 Found link https://files.pythonhosted.org/packages/30/ac/afd1d2c472857be8f83389eb506e1413a2ac3a603889bea3cf24d5ab5be5/flashinfer_python-0.2.1.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post2 2025-10-10T01:28:43,248 Found link https://files.pythonhosted.org/packages/90/00/833dd50745bc15bb7a7451b77589d444ce963d48c0cb730b4760bfebffad/flashinfer_python-0.2.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2 2025-10-10T01:28:43,249 Found link https://files.pythonhosted.org/packages/02/cc/db9635c56653d3fa5a28f14ac858e0801de621aa33d3b528e4781aee906f/flashinfer_python-0.2.2.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2.post1 2025-10-10T01:28:43,250 Found link https://files.pythonhosted.org/packages/b6/10/2a63f1d09c5b337705236005dc9ccce513dcc08b7fd037cb40426f1695b1/flashinfer_python-0.2.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.3 2025-10-10T01:28:43,251 Found link https://files.pythonhosted.org/packages/a4/e5/8d193ccf65b92c009c4be50fdffa88fa0edc8fd6e6169bacaca6bab84d89/flashinfer_python-0.2.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.4 2025-10-10T01:28:43,252 Found link https://files.pythonhosted.org/packages/b2/c4/9ec0f79e2480fc5c93307c4a1ac903e5cf33c551c0eaeb648196234b55af/flashinfer_python-0.2.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.5 2025-10-10T01:28:43,254 Found link https://files.pythonhosted.org/packages/95/4a/a3109d57463d25a153b16c0d0f06495e4d18b727c81f8e08e42e97faaf45/flashinfer_python-0.2.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6 2025-10-10T01:28:43,255 Found link https://files.pythonhosted.org/packages/34/26/3c6f12ffaefbfa0c453030d6e15941269b3a4ffcd267daec32d1a10dda96/flashinfer_python-0.2.6.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6.post1 2025-10-10T01:28:43,256 Found link https://files.pythonhosted.org/packages/f9/a0/5e700751f2393a504bc5eb2879e77d783a5b70778a254289711323126abc/flashinfer_python-0.2.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7 2025-10-10T01:28:43,257 Found link https://files.pythonhosted.org/packages/c0/10/43cf1ea7a03ca8e75a185190708e48286e1583d781e93d1de130e5d450ca/flashinfer_python-0.2.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7.post1 2025-10-10T01:28:43,258 Found link https://files.pythonhosted.org/packages/f1/80/8dfae62d04af4597d7615b892f346ace68bcb07dfbef2a9e614219d96a8a/flashinfer_python-0.2.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8rc1 2025-10-10T01:28:43,259 Found link https://files.pythonhosted.org/packages/72/0e/827624993516e80f62ba88dd368ad5e180c41324f063c00d27fa638a430e/flashinfer_python-0.2.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8 2025-10-10T01:28:43,260 Found link https://files.pythonhosted.org/packages/17/50/42afc9a81031939140fcbfd93e5a3652dc4995e338b4e6d007b0dda04f93/flashinfer_python-0.2.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc1 2025-10-10T01:28:43,261 Found link https://files.pythonhosted.org/packages/ed/1a/9f30eda3178ed2f5f7e311ae0011d02c4542d087f84c9247e4b30668b767/flashinfer_python-0.2.9rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc2 2025-10-10T01:28:43,263 Found link https://files.pythonhosted.org/packages/45/fc/4deff13f1420cc6e5871b7505a6c0d9031eb49cd09571ae576aec59bed61/flashinfer_python-0.2.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9 2025-10-10T01:28:43,264 Found link https://files.pythonhosted.org/packages/74/e4/2c6d6a19d13ed13d4863f6900febe72b502334e43292d5fe9a1ac2f6c5be/flashinfer_python-0.2.10.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.10 2025-10-10T01:28:43,265 Found link https://files.pythonhosted.org/packages/72/8b/f315dda5993d1c018ca5ecfef0775c6a3c7a8f59ac426fabb7f3f6b93482/flashinfer_python-0.2.11.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11 2025-10-10T01:28:43,266 Found link https://files.pythonhosted.org/packages/37/e3/2e8e31f7f7ee26f39968264e4fcf74f9810d90e940859016d974106ed5c6/flashinfer_python-0.2.11.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post1 2025-10-10T01:28:43,268 Found link https://files.pythonhosted.org/packages/b6/01/fa069f076cfe5bed34ddc3b7f772aa09c70e03e572dd9d3569ff887f33b1/flashinfer_python-0.2.11.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post2 2025-10-10T01:28:43,269 Found link https://files.pythonhosted.org/packages/a3/09/5d89ef0bc2d19d3ebcf3b9fa621c945909f681818c9d55aa3181921db874/flashinfer_python-0.2.11.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post3 2025-10-10T01:28:43,270 Found link https://files.pythonhosted.org/packages/b9/5a/7a839afb07af313549b9d9f1057b02aaf067f020267d5a9d128e50596bf4/flashinfer_python-0.2.12.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.12 2025-10-10T01:28:43,272 Found link https://files.pythonhosted.org/packages/f2/20/e79142a9f26aab61b17e2c906a49e9a3d3c656d97608c8773785c3b13140/flashinfer_python-0.2.13.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.13 2025-10-10T01:28:43,273 Found link https://files.pythonhosted.org/packages/ed/26/d1eac56b37d225cb3f84495bd897829dece21f62463487f3c1d9cafe78a0/flashinfer_python-0.2.14.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14 2025-10-10T01:28:43,274 Found link https://files.pythonhosted.org/packages/94/d4/4a2bf3d49f84b2d975925c1c024790b4e4768bdefbc5e27529d68368355a/flashinfer_python-0.2.14.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14.post1 2025-10-10T01:28:43,275 Found link https://files.pythonhosted.org/packages/56/e3/7c0a4df2640a97ecfed45fe9110ecc6a67d4967278723abf8e6531b6bc1f/flashinfer_python-0.3.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0rc1 2025-10-10T01:28:43,276 Found link https://files.pythonhosted.org/packages/1f/b4/5c4cbb0f3cbc5e8d4c19b3f163c048eed959a0ac0c603cfb3939a3079c52/flashinfer_python-0.3.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0 2025-10-10T01:28:43,277 Found link https://files.pythonhosted.org/packages/59/1b/83a9c58432b4a5d6ff04b97d4873bedfb5e35d38972ca8946b3acdbffeb4/flashinfer_python-0.3.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0.post1 2025-10-10T01:28:43,279 Found link https://files.pythonhosted.org/packages/ba/71/dd3001b8be8174d90561764a5f3be4ca219517bde2841189ea6973a3873f/flashinfer_python-0.3.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1 2025-10-10T01:28:43,280 Found link https://files.pythonhosted.org/packages/49/a7/f5bd3878f94fc47e25ecc0828f910233022366f7e832dfa02f3617fad41f/flashinfer_python-0.3.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1.post1 2025-10-10T01:28:43,281 Found link https://files.pythonhosted.org/packages/df/b4/f113bb950e5244d1c72c3d73c03fac0db939f085670e3a45a41fe92ffde0/flashinfer_python-0.4.0rc0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc0 2025-10-10T01:28:43,282 Found link https://files.pythonhosted.org/packages/2e/a8/adceccda3aae01b7bdb5f99c68a2b401c58600f34a6386d9489ff736cdbc/flashinfer_python-0.4.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc1 2025-10-10T01:28:43,284 Found link https://files.pythonhosted.org/packages/15/c0/5fb88fc273fed23dbf3b0ef0bffa7db26e2df24e016202df1b4e98b95879/flashinfer_python-0.4.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc2 2025-10-10T01:28:43,285 Found link https://files.pythonhosted.org/packages/65/91/cf9e3a0a2626711bfab18ea4a4c739e0eb823e9513addc0e9e1b8f929538/flashinfer_python-0.4.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc3 2025-10-10T01:28:43,286 Found link https://files.pythonhosted.org/packages/94/ec/bdcc0ec502994d544cbe69763d999458ae2deda67e58c1cb2d85867677c4/flashinfer_python-0.4.0rc4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc4 2025-10-10T01:28:43,287 Found link https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0 2025-10-10T01:28:43,288 Fetching project page and analyzing links: https://www.piwheels.org/simple/flashinfer-python/ 2025-10-10T01:28:43,290 Getting page https://www.piwheels.org/simple/flashinfer-python/ 2025-10-10T01:28:43,291 Found index url https://www.piwheels.org/simple 2025-10-10T01:28:43,475 Fetched page https://www.piwheels.org/simple/flashinfer-python/ as text/html 2025-10-10T01:28:43,479 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.5-py3-none-any.whl#sha256=cb2a17c3ea5f47f8129f6410e2892f30051e15665f2ae54db540c8677c187d31 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,480 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.4-py3-none-any.whl#sha256=4a85bd6ac785f106f0ad9fe213abf42f96ab84ccd04aec3ab9acf76d47d2aa3f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,480 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.3-py3-none-any.whl#sha256=b8ead688a4857a2b360c992fb46ae2930fc4c43b50a092b7e42a13b40ee195da (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,481 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2.post1-py3-none-any.whl#sha256=0097a08376ae147084ea6bd0848fc2ea1764f524c510a48755aa8c63259b4466 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,481 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2-py3-none-any.whl#sha256=c109a340b7e60cb57d8c9ccec2c10e303a36b82a56ba8dcaaa0efbee2a48b97f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,482 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post2-py3-none-any.whl#sha256=dc91f387ba09e4df899238705ec37bbe3648395d828240b77db84378d1b91e9e (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,483 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post1-py3-none-any.whl#sha256=a44b9d872cf2ba6812d3c0750d98ad01b73e9ccbede933c7eade01b6c27b6232 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,483 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1-py3-none-any.whl#sha256=e07427d9eff1b8d091b5837c3ffc4fe7885dbf01d271d7225f7a89a2e3925f27 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,484 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post2-py3-none-any.whl#sha256=52c20b84ef1e848dd49c726ffc27801df8acccb4038aea61a2d73fa685bf75f8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,485 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post1-py3-none-any.whl#sha256=783c1039e0a7db0478a579d5cc54894def70ae601b1e5b90a3c3de2209334bf3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2025-10-10T01:28:43,485 Skipping link: not a file: https://www.piwheels.org/simple/flashinfer-python/ 2025-10-10T01:28:43,486 Skipping link: not a file: https://pypi.org/simple/flashinfer-python/ 2025-10-10T01:28:43,512 Given no hashes to check 1 links for project 'flashinfer-python': discarding no candidates 2025-10-10T01:28:43,531 Collecting flashinfer-python==0.4.0 2025-10-10T01:28:43,533 Created temporary directory: /tmp/pip-unpack-tsbt44sz 2025-10-10T01:28:43,684 Downloading flashinfer_python-0.4.0.tar.gz (4.5 MB) 2025-10-10T01:28:49,489 Added flashinfer-python==0.4.0 from https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz to build tracker '/tmp/pip-build-tracker-7ss9dt55' 2025-10-10T01:28:49,499 Created temporary directory: /tmp/pip-build-env-z_b0d3ik 2025-10-10T01:28:49,505 Installing build dependencies: started 2025-10-10T01:28:49,506 Running command pip subprocess to install build dependencies 2025-10-10T01:28:50,808 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2025-10-10T01:28:51,432 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2025-10-10T01:28:51,460 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2025-10-10T01:28:53,249 Collecting setuptools>=77 2025-10-10T01:28:53,362 Using cached https://www.piwheels.org/simple/setuptools/setuptools-80.9.0-py3-none-any.whl (1.2 MB) 2025-10-10T01:28:53,582 Collecting packaging>=24 2025-10-10T01:28:53,601 Using cached https://www.piwheels.org/simple/packaging/packaging-25.0-py3-none-any.whl (66 kB) 2025-10-10T01:28:53,898 Collecting apache-tvm-ffi==0.1.0b15 2025-10-10T01:28:53,914 Downloading https://www.piwheels.org/simple/apache-tvm-ffi/apache_tvm_ffi-0.1.0b15-cp311-cp311-linux_armv7l.whl (1.6 MB) 2025-10-10T01:28:54,066 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.6/1.6 MB 11.2 MB/s eta 0:00:00 2025-10-10T01:28:54,295 Collecting typing-extensions>=4.5 2025-10-10T01:28:54,311 Using cached https://www.piwheels.org/simple/typing-extensions/typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2025-10-10T01:28:57,633 Installing collected packages: typing-extensions, setuptools, packaging, apache-tvm-ffi 2025-10-10T01:29:02,417 Creating /tmp/pip-build-env-z_b0d3ik/overlay/local/bin 2025-10-10T01:29:02,419 changing mode of /tmp/pip-build-env-z_b0d3ik/overlay/local/bin/tvm-ffi-config to 755 2025-10-10T01:29:02,451 Successfully installed apache-tvm-ffi-0.1.0b15 packaging-25.0 setuptools-80.9.0 typing-extensions-4.15.0 2025-10-10T01:29:02,762 Installing build dependencies: finished with status 'done' 2025-10-10T01:29:02,770 Getting requirements to build wheel: started 2025-10-10T01:29:02,771 Running command Getting requirements to build wheel 2025-10-10T01:29:07,880 Build metadata file already exists (not in git repo), keeping it 2025-10-10T01:29:07,953 Getting requirements to build wheel: finished with status 'done' 2025-10-10T01:29:07,957 Created temporary directory: /tmp/pip-modern-metadata-6byr1qai 2025-10-10T01:29:07,959 Preparing metadata (pyproject.toml): started 2025-10-10T01:29:07,960 Running command Preparing metadata (pyproject.toml) 2025-10-10T01:29:13,365 Build metadata file already exists (not in git repo), keeping it 2025-10-10T01:29:13,365 running dist_info 2025-10-10T01:29:13,381 creating /tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info 2025-10-10T01:29:13,382 writing /tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/PKG-INFO 2025-10-10T01:29:13,387 writing dependency_links to /tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/dependency_links.txt 2025-10-10T01:29:13,389 writing entry points to /tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/entry_points.txt 2025-10-10T01:29:13,391 writing requirements to /tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/requires.txt 2025-10-10T01:29:13,392 writing top-level names to /tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/top_level.txt 2025-10-10T01:29:13,394 writing manifest file '/tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/SOURCES.txt' 2025-10-10T01:29:14,147 reading manifest file '/tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/SOURCES.txt' 2025-10-10T01:29:14,150 adding license file 'LICENSE' 2025-10-10T01:29:14,151 adding license file 'licenses/LICENSE.cutlass.txt' 2025-10-10T01:29:14,151 adding license file 'licenses/LICENSE.flashattention3.txt' 2025-10-10T01:29:14,152 adding license file 'licenses/LICENSE.fmt.txt' 2025-10-10T01:29:14,152 adding license file 'licenses/LICENSE.spdlog.txt' 2025-10-10T01:29:14,215 writing manifest file '/tmp/pip-modern-metadata-6byr1qai/flashinfer_python.egg-info/SOURCES.txt' 2025-10-10T01:29:14,218 creating '/tmp/pip-modern-metadata-6byr1qai/flashinfer_python-0.4.0.dist-info' 2025-10-10T01:29:14,371 Preparing metadata (pyproject.toml): finished with status 'done' 2025-10-10T01:29:14,377 Source in /tmp/pip-wheel-5jw51xwk/flashinfer-python_e3efc27bb5da44248808b79c22485ef3 has version 0.4.0, which satisfies requirement flashinfer-python==0.4.0 from https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz 2025-10-10T01:29:14,378 Removed flashinfer-python==0.4.0 from https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz from build tracker '/tmp/pip-build-tracker-7ss9dt55' 2025-10-10T01:29:14,385 Created temporary directory: /tmp/pip-unpack-4rpzc8kg 2025-10-10T01:29:14,386 Building wheels for collected packages: flashinfer-python 2025-10-10T01:29:14,391 Created temporary directory: /tmp/pip-wheel-9_ljcydi 2025-10-10T01:29:14,391 Destination directory: /tmp/pip-wheel-9_ljcydi 2025-10-10T01:29:14,394 Building wheel for flashinfer-python (pyproject.toml): started 2025-10-10T01:29:14,395 Running command Building wheel for flashinfer-python (pyproject.toml) 2025-10-10T01:29:19,673 Build metadata file already exists (not in git repo), keeping it 2025-10-10T01:29:19,673 running bdist_wheel 2025-10-10T01:29:19,700 running build 2025-10-10T01:29:19,701 running build_py 2025-10-10T01:29:19,708 creating build/lib 2025-10-10T01:29:19,710 copying build_backend.py -> build/lib 2025-10-10T01:29:19,713 copying build_utils.py -> build/lib 2025-10-10T01:29:19,716 creating build/lib/flashinfer 2025-10-10T01:29:19,717 copying flashinfer/green_ctx.py -> build/lib/flashinfer 2025-10-10T01:29:19,720 copying flashinfer/autotuner.py -> build/lib/flashinfer 2025-10-10T01:29:19,723 copying flashinfer/_build_meta.py -> build/lib/flashinfer 2025-10-10T01:29:19,725 copying flashinfer/xqa.py -> build/lib/flashinfer 2025-10-10T01:29:19,727 copying flashinfer/gemm.py -> build/lib/flashinfer 2025-10-10T01:29:19,731 copying flashinfer/cascade.py -> build/lib/flashinfer 2025-10-10T01:29:19,735 copying flashinfer/aot.py -> build/lib/flashinfer 2025-10-10T01:29:19,737 copying flashinfer/quantization.py -> build/lib/flashinfer 2025-10-10T01:29:19,740 copying flashinfer/page.py -> build/lib/flashinfer 2025-10-10T01:29:19,743 copying flashinfer/cuda_utils.py -> build/lib/flashinfer 2025-10-10T01:29:19,745 copying flashinfer/sparse.py -> build/lib/flashinfer 2025-10-10T01:29:19,748 copying flashinfer/tllm_utils.py -> build/lib/flashinfer 2025-10-10T01:29:19,750 copying flashinfer/fp4_quantization.py -> build/lib/flashinfer 2025-10-10T01:29:19,753 copying flashinfer/norm.py -> build/lib/flashinfer 2025-10-10T01:29:19,756 copying flashinfer/version.py -> build/lib/flashinfer 2025-10-10T01:29:19,758 copying flashinfer/rope.py -> build/lib/flashinfer 2025-10-10T01:29:19,761 copying flashinfer/pod.py -> build/lib/flashinfer 2025-10-10T01:29:19,764 copying flashinfer/activation.py -> build/lib/flashinfer 2025-10-10T01:29:19,766 copying flashinfer/__init__.py -> build/lib/flashinfer 2025-10-10T01:29:19,768 copying flashinfer/utils.py -> build/lib/flashinfer 2025-10-10T01:29:19,771 copying flashinfer/deep_gemm.py -> build/lib/flashinfer 2025-10-10T01:29:19,774 copying flashinfer/artifacts.py -> build/lib/flashinfer 2025-10-10T01:29:19,776 copying flashinfer/attention.py -> build/lib/flashinfer 2025-10-10T01:29:19,779 copying flashinfer/prefill.py -> build/lib/flashinfer 2025-10-10T01:29:19,783 copying flashinfer/fp8_quantization.py -> build/lib/flashinfer 2025-10-10T01:29:19,786 copying flashinfer/decode.py -> build/lib/flashinfer 2025-10-10T01:29:19,790 copying flashinfer/__main__.py -> build/lib/flashinfer 2025-10-10T01:29:19,793 copying flashinfer/compilation_context.py -> build/lib/flashinfer 2025-10-10T01:29:19,795 copying flashinfer/sampling.py -> build/lib/flashinfer 2025-10-10T01:29:19,798 copying flashinfer/mla.py -> build/lib/flashinfer 2025-10-10T01:29:19,802 creating build/lib/flashinfer/jit 2025-10-10T01:29:19,803 copying flashinfer/jit/fused_moe.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,806 copying flashinfer/jit/xqa.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,808 copying flashinfer/jit/cascade.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,811 copying flashinfer/jit/spdlog.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,813 copying flashinfer/jit/comm.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,815 copying flashinfer/jit/quantization.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,817 copying flashinfer/jit/page.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,820 copying flashinfer/jit/tllm_utils.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,822 copying flashinfer/jit/fp4_quantization.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,824 copying flashinfer/jit/norm.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,827 copying flashinfer/jit/rope.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,829 copying flashinfer/jit/activation.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,831 copying flashinfer/jit/__init__.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,833 copying flashinfer/jit/utils.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,836 copying flashinfer/jit/cpp_ext.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,838 copying flashinfer/jit/fp8_quantization.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,840 copying flashinfer/jit/env.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,843 copying flashinfer/jit/cubin_loader.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,846 copying flashinfer/jit/core.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,848 copying flashinfer/jit/sampling.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,850 copying flashinfer/jit/mla.py -> build/lib/flashinfer/jit 2025-10-10T01:29:19,853 creating build/lib/flashinfer/data 2025-10-10T01:29:19,854 copying ./build_utils.py -> build/lib/flashinfer/data 2025-10-10T01:29:19,856 copying ./build_backend.py -> build/lib/flashinfer/data 2025-10-10T01:29:19,859 creating build/lib/flashinfer/triton 2025-10-10T01:29:19,860 copying flashinfer/triton/gemm.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,863 copying flashinfer/triton/cascade.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,865 copying flashinfer/triton/page.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,867 copying flashinfer/triton/norm.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,869 copying flashinfer/triton/activation.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,871 copying flashinfer/triton/__init__.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,873 copying flashinfer/triton/utils.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,875 copying flashinfer/triton/sm_constraint_gemm.py -> build/lib/flashinfer/triton 2025-10-10T01:29:19,879 creating build/lib/flashinfer/profiler 2025-10-10T01:29:19,880 copying flashinfer/profiler/__init__.py -> build/lib/flashinfer/profiler 2025-10-10T01:29:19,883 creating build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,884 copying flashinfer/logits_processor/operators.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,887 copying flashinfer/logits_processor/fusion_rules.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,889 copying flashinfer/logits_processor/compiler.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,891 copying flashinfer/logits_processor/__init__.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,893 copying flashinfer/logits_processor/legalization.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,895 copying flashinfer/logits_processor/types.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,898 copying flashinfer/logits_processor/op.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,900 copying flashinfer/logits_processor/validators.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,902 copying flashinfer/logits_processor/pipeline.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,905 copying flashinfer/logits_processor/processors.py -> build/lib/flashinfer/logits_processor 2025-10-10T01:29:19,908 creating build/lib/flashinfer/cute_dsl 2025-10-10T01:29:19,909 copying flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/lib/flashinfer/cute_dsl 2025-10-10T01:29:19,913 copying flashinfer/cute_dsl/utils.py -> build/lib/flashinfer/cute_dsl 2025-10-10T01:29:19,916 copying flashinfer/cute_dsl/blockscaled_gemm.py -> build/lib/flashinfer/cute_dsl 2025-10-10T01:29:19,921 creating build/lib/flashinfer/cudnn 2025-10-10T01:29:19,922 copying flashinfer/cudnn/__init__.py -> build/lib/flashinfer/cudnn 2025-10-10T01:29:19,924 copying flashinfer/cudnn/utils.py -> build/lib/flashinfer/cudnn 2025-10-10T01:29:19,927 copying flashinfer/cudnn/prefill.py -> build/lib/flashinfer/cudnn 2025-10-10T01:29:19,930 copying flashinfer/cudnn/decode.py -> build/lib/flashinfer/cudnn 2025-10-10T01:29:19,933 creating build/lib/flashinfer/comm 2025-10-10T01:29:19,934 copying flashinfer/comm/vllm_ar.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,937 copying flashinfer/comm/nvshmem.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,940 copying flashinfer/comm/trtllm_ar.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,943 copying flashinfer/comm/mapping.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,945 copying flashinfer/comm/trtllm_alltoall.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,948 copying flashinfer/comm/__init__.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,950 copying flashinfer/comm/trtllm_mnnvl_ar.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,953 copying flashinfer/comm/cuda_ipc.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,956 copying flashinfer/comm/dlpack_utils.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,959 copying flashinfer/comm/mnnvl.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,962 copying flashinfer/comm/nvshmem_allreduce.py -> build/lib/flashinfer/comm 2025-10-10T01:29:19,965 creating build/lib/flashinfer/fused_moe 2025-10-10T01:29:19,966 copying flashinfer/fused_moe/__init__.py -> build/lib/flashinfer/fused_moe 2025-10-10T01:29:19,969 copying flashinfer/fused_moe/utils.py -> build/lib/flashinfer/fused_moe 2025-10-10T01:29:19,971 copying flashinfer/fused_moe/core.py -> build/lib/flashinfer/fused_moe 2025-10-10T01:29:19,976 creating build/lib/flashinfer/testing 2025-10-10T01:29:19,977 copying flashinfer/testing/__init__.py -> build/lib/flashinfer/testing 2025-10-10T01:29:19,979 copying flashinfer/testing/utils.py -> build/lib/flashinfer/testing 2025-10-10T01:29:19,983 creating build/lib/flashinfer/tuning_configs 2025-10-10T01:29:19,984 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/lib/flashinfer/tuning_configs 2025-10-10T01:29:19,987 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/lib/flashinfer/tuning_configs 2025-10-10T01:29:19,990 creating build/lib/flashinfer/jit/gemm 2025-10-10T01:29:19,991 copying flashinfer/jit/gemm/__init__.py -> build/lib/flashinfer/jit/gemm 2025-10-10T01:29:19,994 copying flashinfer/jit/gemm/deepgemm.py -> build/lib/flashinfer/jit/gemm 2025-10-10T01:29:19,996 copying flashinfer/jit/gemm/core.py -> build/lib/flashinfer/jit/gemm 2025-10-10T01:29:20,001 creating build/lib/flashinfer/jit/attention 2025-10-10T01:29:20,002 copying flashinfer/jit/attention/variants.py -> build/lib/flashinfer/jit/attention 2025-10-10T01:29:20,005 copying flashinfer/jit/attention/modules.py -> build/lib/flashinfer/jit/attention 2025-10-10T01:29:20,008 copying flashinfer/jit/attention/__init__.py -> build/lib/flashinfer/jit/attention 2025-10-10T01:29:20,010 copying flashinfer/jit/attention/utils.py -> build/lib/flashinfer/jit/attention 2025-10-10T01:29:20,013 creating build/lib/flashinfer/jit/gemm/cutlass 2025-10-10T01:29:20,014 copying flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/lib/flashinfer/jit/gemm/cutlass 2025-10-10T01:29:20,018 copying flashinfer/jit/gemm/cutlass/__init__.py -> build/lib/flashinfer/jit/gemm/cutlass 2025-10-10T01:29:20,020 copying flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/lib/flashinfer/jit/gemm/cutlass 2025-10-10T01:29:20,025 creating build/lib/flashinfer/data/cutlass/python 2025-10-10T01:29:20,027 copying 3rdparty/cutlass/python/setup_pycute.py -> build/lib/flashinfer/data/cutlass/python 2025-10-10T01:29:20,029 copying 3rdparty/cutlass/python/setup_library.py -> build/lib/flashinfer/data/cutlass/python 2025-10-10T01:29:20,031 copying 3rdparty/cutlass/python/setup_cutlass.py -> build/lib/flashinfer/data/cutlass/python 2025-10-10T01:29:20,035 creating build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,036 copying 3rdparty/cutlass/python/cutlass_library/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,039 copying 3rdparty/cutlass/python/cutlass_library/sm100_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,042 copying 3rdparty/cutlass/python/cutlass_library/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,045 copying 3rdparty/cutlass/python/cutlass_library/manifest.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,048 copying 3rdparty/cutlass/python/cutlass_library/generator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,062 copying 3rdparty/cutlass/python/cutlass_library/trmm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,064 copying 3rdparty/cutlass/python/cutlass_library/sm90_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,067 copying 3rdparty/cutlass/python/cutlass_library/rank_k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,070 copying 3rdparty/cutlass/python/cutlass_library/sm100_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,072 copying 3rdparty/cutlass/python/cutlass_library/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,074 copying 3rdparty/cutlass/python/cutlass_library/heuristics.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,076 copying 3rdparty/cutlass/python/cutlass_library/conv3d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,080 copying 3rdparty/cutlass/python/cutlass_library/rank_2k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,082 copying 3rdparty/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,085 copying 3rdparty/cutlass/python/cutlass_library/conv3x_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,087 copying 3rdparty/cutlass/python/cutlass_library/sm90_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,090 copying 3rdparty/cutlass/python/cutlass_library/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,094 copying 3rdparty/cutlass/python/cutlass_library/symm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,096 copying 3rdparty/cutlass/python/cutlass_library/heuristics_provider.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:20,099 creating build/lib/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:20,101 copying 3rdparty/cutlass/python/pycute/int_tuple.py -> build/lib/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:20,104 copying 3rdparty/cutlass/python/pycute/swizzle.py -> build/lib/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:20,106 copying 3rdparty/cutlass/python/pycute/__init__.py -> build/lib/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:20,108 copying 3rdparty/cutlass/python/pycute/typing.py -> build/lib/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:20,110 copying 3rdparty/cutlass/python/pycute/layout.py -> build/lib/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:20,112 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:20,113 copying 3rdparty/cutlass/python/cutlass_cppgen/shape.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:20,116 copying 3rdparty/cutlass/python/cutlass_cppgen/library_defaults.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:20,119 copying 3rdparty/cutlass/python/cutlass_cppgen/swizzle.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:20,120 copying 3rdparty/cutlass/python/cutlass_cppgen/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:20,124 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:20,125 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:20,127 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:20,129 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/torch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:20,132 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,133 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/dsl.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,137 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/compiler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,139 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,141 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/ast_preprocessor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,146 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,149 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/env_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,151 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/cache_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,153 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,156 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,159 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/ast_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:20,162 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:20,163 copying 3rdparty/cutlass/python/CuTeDSL/cutlass_dsl/cutlass.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:20,167 copying 3rdparty/cutlass/python/CuTeDSL/cutlass_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:20,168 copying 3rdparty/cutlass/python/CuTeDSL/cutlass_dsl/tree_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:20,171 copying 3rdparty/cutlass/python/CuTeDSL/cutlass_dsl/cutlass_ast_decorators.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:20,174 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,175 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,178 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,180 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,183 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,185 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,192 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:20,195 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:20,197 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:20,200 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:20,203 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:20,205 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:20,208 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,209 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,211 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/distributed_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,214 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,217 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,219 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,222 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,224 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,226 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,229 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,232 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,234 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,236 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_capacity.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,238 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/ampere_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:20,241 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:20,242 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:20,244 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:20,246 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:20,249 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,250 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,253 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,255 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,258 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,260 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,262 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:20,264 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:20,265 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:20,268 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:20,271 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:20,274 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:20,276 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:20,277 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:20,280 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:20,282 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:20,284 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:20,285 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:20,288 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:20,290 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:20,293 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:20,294 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:20,296 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:20,299 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:20,301 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,302 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/runtime/dlpack_types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,304 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/runtime/jit_arg_adapters.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,307 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/runtime/cuda.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,309 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/runtime/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,311 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/runtime/device_tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,313 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/runtime/tensor_descriptor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:20,316 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:20,317 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:20,319 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:20,321 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/op.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:20,323 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/gpu.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:20,325 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/arith.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:20,329 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:20,330 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/utils/stacktrace.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:20,332 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:20,334 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/utils/logger.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:20,336 copying 3rdparty/cutlass/python/CuTeDSL/base_dsl/utils/timer.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:20,339 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:20,340 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:20,343 copying 3rdparty/cutlass/python/cutlass_cppgen/op/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:20,345 copying 3rdparty/cutlass/python/cutlass_cppgen/op/op.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:20,348 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:20,350 copying 3rdparty/cutlass/python/cutlass_cppgen/op/conv.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:20,354 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,355 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,358 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,361 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,363 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,365 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,368 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,370 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,373 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,376 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,378 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,380 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,383 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,387 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:20,389 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:20,391 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:20,393 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/common.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:20,395 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:20,399 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:20,400 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:20,402 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:20,405 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:20,407 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:20,408 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/check.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:20,411 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:20,412 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:20,414 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:20,416 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:20,422 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2025-10-10T01:29:20,423 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2025-10-10T01:29:20,426 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2025-10-10T01:29:20,428 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2025-10-10T01:29:20,429 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2025-10-10T01:29:20,432 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2025-10-10T01:29:20,434 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,435 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,438 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,440 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,442 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,444 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,446 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,448 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,451 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:20,453 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,455 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,457 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,459 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,461 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,464 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,466 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,469 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,471 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,474 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:20,476 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:20,477 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:20,480 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:20,482 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:20,485 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,486 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,488 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,491 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,493 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,495 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,497 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,499 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,501 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,504 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,506 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,509 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,511 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,513 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:20,516 creating build/lib/flashinfer/data/cutlass/python/docs_src/source 2025-10-10T01:29:20,518 copying 3rdparty/cutlass/python/docs_src/source/conf.py -> build/lib/flashinfer/data/cutlass/python/docs_src/source 2025-10-10T01:29:20,522 creating build/lib/flashinfer/data/cutlass/tools/util/scripts 2025-10-10T01:29:20,524 copying 3rdparty/cutlass/tools/util/scripts/split_test_cmake.py -> build/lib/flashinfer/data/cutlass/tools/util/scripts 2025-10-10T01:29:20,530 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:20,532 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:20,534 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:20,537 copying 3rdparty/cutlass/examples/40_cutlass_py/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:20,540 creating build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2025-10-10T01:29:20,541 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2025-10-10T01:29:20,543 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2025-10-10T01:29:20,547 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:20,548 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:20,551 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:20,553 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:20,557 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,559 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,563 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,568 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,572 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,577 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,580 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:20,584 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2025-10-10T01:29:20,585 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2025-10-10T01:29:20,590 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,591 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,594 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,596 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,599 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,601 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,604 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,607 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,609 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,612 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:20,615 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:20,616 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:20,621 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:20,623 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:20,626 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2025-10-10T01:29:20,628 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2025-10-10T01:29:20,632 creating build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,633 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,636 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,639 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,641 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,643 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,645 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,648 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,651 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,654 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,656 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,659 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,662 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:20,665 creating build/lib/flashinfer/data/cutlass/test/python/cutlass 2025-10-10T01:29:20,667 copying 3rdparty/cutlass/test/python/cutlass/installation.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass 2025-10-10T01:29:20,670 creating build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,671 copying 3rdparty/cutlass/test/python/pycute/test_complement.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,673 copying 3rdparty/cutlass/test/python/pycute/test_left_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,675 copying 3rdparty/cutlass/test/python/pycute/test_int_tuple.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,677 copying 3rdparty/cutlass/test/python/pycute/test_right_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,679 copying 3rdparty/cutlass/test/python/pycute/test_coalesce.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,681 copying 3rdparty/cutlass/test/python/pycute/test_typing.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,683 copying 3rdparty/cutlass/test/python/pycute/test_composition.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,685 copying 3rdparty/cutlass/test/python/pycute/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:20,688 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,689 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,692 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,694 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,697 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,699 copying 3rdparty/cutlass/test/python/cutlass/gemm/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,701 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,704 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,707 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,709 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,711 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,713 copying 3rdparty/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,715 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,718 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:20,721 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2025-10-10T01:29:20,722 copying 3rdparty/cutlass/test/python/cutlass/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2025-10-10T01:29:20,725 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:20,726 copying 3rdparty/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:20,729 copying 3rdparty/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:20,731 copying 3rdparty/cutlass/test/python/cutlass/interface/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:20,733 copying 3rdparty/cutlass/test/python/cutlass/interface/evt_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:20,736 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,738 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,740 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,743 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,745 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,748 copying 3rdparty/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,750 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:20,753 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:20,754 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:20,757 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:20,759 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:20,762 copying 3rdparty/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:20,764 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2025-10-10T01:29:20,765 copying 3rdparty/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2025-10-10T01:29:20,769 creating build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2025-10-10T01:29:20,772 copying 3rdparty/cutlass/test/unit/gemm/device/simt_sm50.py -> build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2025-10-10T01:29:20,817 creating build/lib/flashinfer/data/spdlog/scripts 2025-10-10T01:29:20,819 copying 3rdparty/spdlog/scripts/extract_version.py -> build/lib/flashinfer/data/spdlog/scripts 2025-10-10T01:29:20,834 creating build/lib/flashinfer/triton/kernels 2025-10-10T01:29:20,835 copying flashinfer/triton/kernels/quant.py -> build/lib/flashinfer/triton/kernels 2025-10-10T01:29:20,837 copying flashinfer/triton/kernels/cascade.py -> build/lib/flashinfer/triton/kernels 2025-10-10T01:29:20,840 copying flashinfer/triton/kernels/norm.py -> build/lib/flashinfer/triton/kernels 2025-10-10T01:29:20,842 copying flashinfer/triton/kernels/activation.py -> build/lib/flashinfer/triton/kernels 2025-10-10T01:29:20,844 copying flashinfer/triton/kernels/__init__.py -> build/lib/flashinfer/triton/kernels 2025-10-10T01:29:20,846 copying flashinfer/triton/kernels/sm_constraint_gemm.py -> build/lib/flashinfer/triton/kernels 2025-10-10T01:29:21,338 copying flashinfer/py.typed -> build/lib/flashinfer 2025-10-10T01:29:21,341 creating build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,342 copying ./csrc/fmha_cutlass_sm100.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,344 copying ./csrc/rope.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,347 copying ./csrc/batch_mla_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,349 copying ./csrc/tgv_gemm.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,352 copying ./csrc/pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,354 copying ./csrc/pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,356 copying ./csrc/batch_mla_sm90_run.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,359 copying ./csrc/batch_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,362 copying ./csrc/trtllm_fused_moe_routing_llama4.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,364 copying ./csrc/quantization.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,367 copying ./csrc/gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,369 copying ./csrc/trtllm_fused_moe_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,374 copying ./csrc/norm.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,376 copying ./csrc/single_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,378 copying ./csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,381 copying ./csrc/group_gemm_fp8_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,384 copying ./csrc/sampling.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,386 copying ./csrc/single_decode.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,389 copying ./csrc/trtllm_alltoall.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,392 copying ./csrc/runtime_utils.h -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,394 copying ./csrc/group_gemm_fp8_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,396 copying ./csrc/single_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,399 copying ./csrc/batch_mla_plan.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,401 copying ./csrc/single_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,403 copying ./csrc/trtllm_allreduce.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,406 copying ./csrc/single_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,408 copying ./csrc/batch_attention_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,410 copying ./csrc/single_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,413 copying ./csrc/batch_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,416 copying ./csrc/trtllm_fused_moe_dev_kernel.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,419 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,421 copying ./csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,423 copying ./csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,426 copying ./csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,429 copying ./csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,431 copying ./csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,434 copying ./csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,436 copying ./csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,439 copying ./csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,442 copying ./csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:21,444 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,445 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,447 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,450 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,453 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,455 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,457 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,459 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,460 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,463 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,465 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,470 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,472 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,474 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:21,477 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,480 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,482 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,484 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,486 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,489 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,491 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,493 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,496 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,498 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,500 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,503 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,507 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,509 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,512 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,514 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,516 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:21,519 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:21,522 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2025-10-10T01:29:21,523 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2025-10-10T01:29:21,526 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_stub.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2025-10-10T01:29:21,528 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2025-10-10T01:29:21,530 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2025-10-10T01:29:21,533 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2025-10-10T01:29:21,536 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,538 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,540 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,542 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,544 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,547 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,550 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,552 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,554 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,557 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,559 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,563 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,565 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,567 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,570 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,572 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,574 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,577 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,579 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,581 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:21,584 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:21,586 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:21,587 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:21,589 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:21,591 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:21,594 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:21,598 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:21,600 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:21,603 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2025-10-10T01:29:21,604 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2025-10-10T01:29:21,607 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2025-10-10T01:29:21,609 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,611 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,613 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:21,616 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,617 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,619 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,622 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,624 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,627 copying ./csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,629 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:21,631 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,634 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,636 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2025-10-10T01:29:21,638 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2025-10-10T01:29:21,641 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,643 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,645 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:21,647 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:21,650 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:21,652 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:21,655 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,656 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,660 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,663 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,666 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,669 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,672 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,674 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,677 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,679 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,682 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,685 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,688 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:21,691 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,692 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,695 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,697 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,701 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,705 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:21,706 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:21,709 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:21,712 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:21,715 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,717 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,720 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,723 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,725 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,729 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:21,731 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,733 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,736 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,739 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,742 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,745 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,747 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,750 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,753 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,756 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,758 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,762 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,765 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:21,768 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2025-10-10T01:29:21,770 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2025-10-10T01:29:21,773 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2025-10-10T01:29:21,774 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2025-10-10T01:29:21,777 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,780 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,783 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,786 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:21,788 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2025-10-10T01:29:21,790 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2025-10-10T01:29:21,793 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2025-10-10T01:29:21,795 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2025-10-10T01:29:21,798 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2025-10-10T01:29:21,800 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2025-10-10T01:29:21,803 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2025-10-10T01:29:21,804 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2025-10-10T01:29:21,807 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:21,808 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:21,811 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:21,813 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:21,816 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:21,819 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:21,821 creating build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,824 copying ./csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,827 copying ./csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,829 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,832 copying ./csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,835 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,838 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,840 copying ./csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,843 copying ./csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,846 copying ./csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,849 copying ./csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:21,851 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:21,853 copying ./csrc/nv_internal/cpp/common/envUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:21,856 copying ./csrc/nv_internal/cpp/common/stringUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:21,859 copying ./csrc/nv_internal/cpp/common/tllmException.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:21,861 copying ./csrc/nv_internal/cpp/common/logger.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:21,864 copying ./csrc/nv_internal/cpp/common/memoryUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:21,868 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2025-10-10T01:29:21,869 copying ./csrc/nv_internal/cpp/kernels/quantization.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2025-10-10T01:29:21,872 copying ./csrc/cascade.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,875 copying ./csrc/batch_prefill.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,878 copying ./csrc/flashinfer_gemm_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,880 copying ./csrc/fmha_cutlass_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,882 copying ./csrc/single_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,885 copying ./csrc/batch_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,887 copying ./csrc/flashinfer_sampling_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,890 copying ./csrc/batch_decode_mla_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,892 copying ./csrc/group_gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,895 copying ./csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,897 copying ./csrc/tvm_ffi_utils.h -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,900 copying ./csrc/batch_mla_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,902 copying ./csrc/group_gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,905 copying ./csrc/page.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,907 copying ./csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,910 copying ./csrc/gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,912 copying ./csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,914 copying ./csrc/batch_mla_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,916 copying ./csrc/bmm_fp8.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,918 copying ./csrc/group_gemm_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,921 copying ./csrc/pod.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,924 copying ./csrc/trtllm_moe_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,926 copying ./csrc/gemm_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,929 copying ./csrc/single_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,931 copying ./csrc/group_gemm_sm90.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,934 copying ./csrc/trtllm_fmha_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,937 copying ./csrc/single_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,939 copying ./csrc/fp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,941 copying ./csrc/fp4_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,944 copying ./csrc/batch_decode_mla_plan.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,946 copying ./csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,948 copying ./csrc/single_prefill_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,951 copying ./csrc/vllm_custom_all_reduce.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,953 copying ./csrc/flashinfer_mla_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,956 copying ./csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,958 copying ./csrc/fmhaReduction.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,961 copying ./csrc/batch_attention_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,963 copying ./csrc/pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,965 copying ./csrc/flashinfer_norm_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,967 copying ./csrc/group_gemm.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,969 copying ./csrc/trtllm_fused_moe_routing_deepseek.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,972 copying ./csrc/flashinfer_cascade_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,974 copying ./csrc/trtllm_alltoall_prepare.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,977 copying ./csrc/batch_decode_mla_cute_sm80.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,980 copying ./csrc/batch_decode_mla_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,982 copying ./csrc/batch_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,984 copying ./csrc/batch_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,986 copying ./csrc/trtllm_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,989 copying ./csrc/batch_prefill_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,991 copying ./csrc/batch_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,993 copying ./csrc/logging.cc -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,995 copying ./csrc/fp4_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:21,998 copying ./csrc/blackwell_fmha_plan.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,000 copying ./csrc/batch_decode_mla_run.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,003 copying ./csrc/batch_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,005 copying ./csrc/fp4_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,007 copying ./csrc/fp4_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,010 copying ./csrc/flashinfer_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,012 copying ./csrc/trtllm_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,015 copying ./csrc/trtllm_mnnvl_allreduce.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,018 copying ./csrc/tgv_gemm.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,020 copying ./csrc/flashinfer_page_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,022 creating build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:22,024 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:22,026 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:22,034 copying ./csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_sm100_binding.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:22,037 copying ./csrc/trtllm_fused_moe_runner.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,040 copying ./csrc/renorm.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,043 copying ./csrc/cutlass_mla.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,045 copying ./csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,048 copying ./csrc/batch_prefill_ragged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,050 copying ./csrc/single_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,052 copying ./csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,054 copying ./csrc/flashinfer_quantization_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,056 copying ./csrc/cudnn_sdpa_utils.h -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,060 copying ./csrc/batch_mla_sm90_plan.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,062 copying ./csrc/batch_decode.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,065 copying ./csrc/flashinfer_xqa_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,067 copying ./csrc/nvshmem_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,070 copying ./csrc/batch_attention_customize_config.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,072 copying ./csrc/flashinfer_rope_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,074 copying ./csrc/cudnn_sdpa_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,078 copying ./csrc/fp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,081 copying ./csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,083 copying ./csrc/batch_attention.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,086 copying ./csrc/batch_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,088 copying ./csrc/batch_mla_run.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,091 copying ./csrc/trtllm_fused_moe_routing_renormalize.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,094 copying ./csrc/trtllm_batched_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,097 copying ./csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,099 copying ./csrc/single_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,102 copying ./csrc/batch_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,104 copying ./csrc/single_prefill.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,106 creating build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,107 copying ./csrc/xqa/platform.h -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,110 copying ./csrc/xqa/utils.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,113 copying ./csrc/xqa/cuda_hint.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,115 copying ./csrc/xqa/xqa_wrapper.cu -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,118 copying ./csrc/xqa/specDec.h -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,120 copying ./csrc/xqa/mma.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,123 copying ./csrc/xqa/mhaUtils.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,126 copying ./csrc/xqa/defines.h -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,128 copying ./csrc/xqa/mha.h -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,131 copying ./csrc/xqa/utils.h -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,134 copying ./csrc/xqa/hostUtils.h -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,136 copying ./csrc/xqa/barriers.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,139 copying ./csrc/xqa/ldgsts.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,141 copying ./csrc/xqa/mha_components.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,144 copying ./csrc/xqa/mha.cu -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,149 copying ./csrc/xqa/mha_stdheaders.cuh -> build/lib/flashinfer/data/csrc/xqa 2025-10-10T01:29:22,153 copying ./csrc/gemm_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,155 copying ./csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2025-10-10T01:29:22,157 creating build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,159 copying ./include/flashinfer/logging.h -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,162 copying ./include/flashinfer/cp_async.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,164 copying ./include/flashinfer/utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,167 copying ./include/flashinfer/arch_condition.h -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,170 copying ./include/flashinfer/frag_layout_swizzle.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,172 copying ./include/flashinfer/quantization.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,175 copying ./include/flashinfer/vec_dtypes.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,179 copying ./include/flashinfer/exception.h -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,181 copying ./include/flashinfer/allocator.h -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,183 copying ./include/flashinfer/mma.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,186 creating build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,188 copying ./include/flashinfer/gemm/group_gemm_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,191 copying ./include/flashinfer/gemm/tgv_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,193 copying ./include/flashinfer/gemm/bmm_fp8.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,196 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,199 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,202 copying ./include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,204 copying ./include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,207 copying ./include/flashinfer/gemm/cutlass_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,210 copying ./include/flashinfer/gemm/group_gemv.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,212 copying ./include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,214 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,217 copying ./include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,220 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,223 copying ./include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,226 copying ./include/flashinfer/gemm/tgv_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,230 copying ./include/flashinfer/gemm/group_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,232 copying ./include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,236 copying ./include/flashinfer/gemm/tgv_gemm_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,238 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,241 copying ./include/flashinfer/gemm/group_gemm_lora.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,243 copying ./include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,246 copying ./include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:22,249 copying ./include/flashinfer/pos_enc.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,252 copying ./include/flashinfer/activation.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,254 copying ./include/flashinfer/attention_impl.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,256 copying ./include/flashinfer/layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,259 creating build/lib/flashinfer/data/include/flashinfer/trtllm 2025-10-10T01:29:22,260 copying ./include/flashinfer/trtllm/common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm 2025-10-10T01:29:22,263 creating build/lib/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:22,264 copying ./include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:22,267 copying ./include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:22,269 copying ./include/flashinfer/trtllm/common/cudaUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:22,272 copying ./include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:22,274 copying ./include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:22,277 creating build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,278 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,282 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,286 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,289 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,291 creating build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:22,293 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:22,295 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:22,298 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:22,300 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:22,303 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:22,305 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,309 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,311 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:22,314 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,315 copying ./include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,318 copying ./include/flashinfer/trtllm/fmha/kernelUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,320 copying ./include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,323 copying ./include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,326 copying ./include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,329 copying ./include/flashinfer/trtllm/fmha/decoder_params.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,331 copying ./include/flashinfer/trtllm/fmha/lse.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,333 copying ./include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,336 copying ./include/flashinfer/trtllm/fmha/kernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:22,339 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,340 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,343 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,346 copying ./include/flashinfer/trtllm/fused_moe/runner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,349 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,352 copying ./include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,355 copying ./include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:22,358 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,359 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelTraits.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,363 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/GemmGatedActOptions.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,366 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmEnums.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,368 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/GemmOptions.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,372 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmInterface.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,375 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,379 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/Enums.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,381 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmOptions.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,384 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:22,386 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/SfLayoutDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:22,388 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/MmaDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:22,391 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/CudaKernelLauncher.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:22,393 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/DtypeDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:22,396 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/CommonUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:22,398 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelParamsDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,401 copying ./include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/TmaDescriptor.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:22,404 copying ./include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2025-10-10T01:29:22,407 creating build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,408 copying ./include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,411 copying ./include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,414 copying ./include/flashinfer/comm/trtllm_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,418 copying ./include/flashinfer/comm/trtllm_alltoall.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,421 copying ./include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,425 copying ./include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,429 copying ./include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:22,432 copying ./include/flashinfer/cutlass_utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,434 copying ./include/flashinfer/fp4_layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,436 copying ./include/flashinfer/fastdiv.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,439 copying ./include/flashinfer/cubin_loader.h -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,441 copying ./include/flashinfer/norm.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,444 copying ./include/flashinfer/sampling.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,449 copying ./include/flashinfer/permuted_smem.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,452 copying ./include/flashinfer/profiler.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,455 copying ./include/flashinfer/math.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,457 creating build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,458 copying ./include/flashinfer/attention/cutlass_mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,461 copying ./include/flashinfer/attention/cascade.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,464 copying ./include/flashinfer/attention/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,467 copying ./include/flashinfer/attention/scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,471 copying ./include/flashinfer/attention/persistent_template.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,473 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2025-10-10T01:29:22,474 copying ./include/flashinfer/attention/blackwell/plan.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2025-10-10T01:29:22,477 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2025-10-10T01:29:22,478 copying ./include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2025-10-10T01:29:22,481 copying ./include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2025-10-10T01:29:22,483 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,485 copying ./include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,487 copying ./include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,490 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,494 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,497 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,500 copying ./include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,503 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,506 copying ./include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:22,508 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,509 copying ./include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,512 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,516 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,519 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,522 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,524 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,527 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,531 copying ./include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:22,533 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2025-10-10T01:29:22,534 copying ./include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2025-10-10T01:29:22,537 copying ./include/flashinfer/attention/blackwell/device/fmha.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2025-10-10T01:29:22,540 copying ./include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,543 copying ./include/flashinfer/attention/pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,546 copying ./include/flashinfer/attention/mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,550 copying ./include/flashinfer/attention/hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,553 copying ./include/flashinfer/attention/mla_hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,557 copying ./include/flashinfer/attention/heap.h -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,559 copying ./include/flashinfer/attention/decode.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,563 copying ./include/flashinfer/attention/mask.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,566 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,567 copying ./include/flashinfer/attention/hopper/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,570 copying ./include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,573 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,574 copying ./include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,577 copying ./include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,580 copying ./include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,583 copying ./include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,586 copying ./include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,589 copying ./include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:22,592 copying ./include/flashinfer/attention/hopper/utils.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,594 copying ./include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,597 copying ./include/flashinfer/attention/hopper/named_barrier.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,600 copying ./include/flashinfer/attention/hopper/attention_updater.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,603 copying ./include/flashinfer/attention/hopper/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,606 copying ./include/flashinfer/attention/hopper/mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,609 copying ./include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,612 copying ./include/flashinfer/attention/hopper/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,615 copying ./include/flashinfer/attention/hopper/block_sparse_gather.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,618 copying ./include/flashinfer/attention/hopper/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,621 copying ./include/flashinfer/attention/hopper/default_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,623 copying ./include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:22,626 copying ./include/flashinfer/attention/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,628 copying ./include/flashinfer/attention/persistent.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,632 copying ./include/flashinfer/attention/mla_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,634 copying ./include/flashinfer/attention/state.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,636 copying ./include/flashinfer/attention/default_decode_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,639 copying ./include/flashinfer/attention/default_prefill_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,642 copying ./include/flashinfer/attention/prefill.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:22,648 copying ./include/flashinfer/fp16.h -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,651 copying ./include/flashinfer/page.cuh -> build/lib/flashinfer/data/include/flashinfer 2025-10-10T01:29:22,654 creating build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,656 copying 3rdparty/cutlass/include/cute/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,659 copying 3rdparty/cutlass/include/cute/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,663 creating build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,664 copying 3rdparty/cutlass/include/cute/atom/mma_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,667 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,670 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,673 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,676 copying 3rdparty/cutlass/include/cute/atom/copy_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,679 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,683 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,685 copying 3rdparty/cutlass/include/cute/atom/mma_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,689 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,692 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,694 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,700 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,703 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,719 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,721 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,724 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,734 copying 3rdparty/cutlass/include/cute/atom/copy_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,737 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,739 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,742 copying 3rdparty/cutlass/include/cute/atom/partitioner.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,744 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,748 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,761 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,768 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,771 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,780 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,783 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,786 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:22,789 copying 3rdparty/cutlass/include/cute/tensor_impl.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,792 creating build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,793 copying 3rdparty/cutlass/include/cute/container/array_subbyte.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,797 copying 3rdparty/cutlass/include/cute/container/bit_field.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,799 copying 3rdparty/cutlass/include/cute/container/array_aligned.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,802 copying 3rdparty/cutlass/include/cute/container/tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,805 copying 3rdparty/cutlass/include/cute/container/type_list.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,807 copying 3rdparty/cutlass/include/cute/container/alignment.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,810 copying 3rdparty/cutlass/include/cute/container/cuda_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,813 copying 3rdparty/cutlass/include/cute/container/array.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:22,815 copying 3rdparty/cutlass/include/cute/pointer.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,818 copying 3rdparty/cutlass/include/cute/layout_composed.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,821 creating build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,822 copying 3rdparty/cutlass/include/cute/numeric/numeric_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,825 copying 3rdparty/cutlass/include/cute/numeric/int.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,828 copying 3rdparty/cutlass/include/cute/numeric/integral_constant.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,831 copying 3rdparty/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,834 copying 3rdparty/cutlass/include/cute/numeric/math.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,837 copying 3rdparty/cutlass/include/cute/numeric/integral_ratio.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,839 copying 3rdparty/cutlass/include/cute/numeric/real.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,842 copying 3rdparty/cutlass/include/cute/numeric/complex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,844 copying 3rdparty/cutlass/include/cute/numeric/integer_sequence.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:22,846 copying 3rdparty/cutlass/include/cute/swizzle_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,850 copying 3rdparty/cutlass/include/cute/pointer_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,852 creating build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,853 copying 3rdparty/cutlass/include/cute/algorithm/prefer.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,856 copying 3rdparty/cutlass/include/cute/algorithm/fill.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,858 copying 3rdparty/cutlass/include/cute/algorithm/gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,861 copying 3rdparty/cutlass/include/cute/algorithm/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,864 copying 3rdparty/cutlass/include/cute/algorithm/axpby.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,867 copying 3rdparty/cutlass/include/cute/algorithm/functional.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,870 copying 3rdparty/cutlass/include/cute/algorithm/prefetch.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,872 copying 3rdparty/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,875 copying 3rdparty/cutlass/include/cute/algorithm/clear.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,878 copying 3rdparty/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,881 copying 3rdparty/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,884 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,887 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:22,890 creating build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,891 copying 3rdparty/cutlass/include/cute/util/type_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,894 copying 3rdparty/cutlass/include/cute/util/debug.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,897 copying 3rdparty/cutlass/include/cute/util/print_tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,899 copying 3rdparty/cutlass/include/cute/util/print_svg.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,902 copying 3rdparty/cutlass/include/cute/util/print_latex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,905 copying 3rdparty/cutlass/include/cute/util/print.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:22,908 copying 3rdparty/cutlass/include/cute/stride.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,911 copying 3rdparty/cutlass/include/cute/pointer_base.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,914 copying 3rdparty/cutlass/include/cute/int_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,917 copying 3rdparty/cutlass/include/cute/swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,920 copying 3rdparty/cutlass/include/cute/tensor_zip.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,923 copying 3rdparty/cutlass/include/cute/pointer_flagged.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,926 copying 3rdparty/cutlass/include/cute/tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,928 copying 3rdparty/cutlass/include/cute/underscore.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,931 copying 3rdparty/cutlass/include/cute/pointer_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:22,934 creating build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,935 copying 3rdparty/cutlass/include/cute/arch/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,938 copying 3rdparty/cutlass/include/cute/arch/util.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,941 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,945 copying 3rdparty/cutlass/include/cute/arch/copy_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,948 copying 3rdparty/cutlass/include/cute/arch/cluster_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,951 copying 3rdparty/cutlass/include/cute/arch/mma_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,953 copying 3rdparty/cutlass/include/cute/arch/mma_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,965 copying 3rdparty/cutlass/include/cute/arch/mma_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,971 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,974 copying 3rdparty/cutlass/include/cute/arch/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:22,977 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,002 copying 3rdparty/cutlass/include/cute/arch/mma_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,005 copying 3rdparty/cutlass/include/cute/arch/copy_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,013 copying 3rdparty/cutlass/include/cute/arch/cluster_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,016 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,021 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,070 copying 3rdparty/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,073 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,077 copying 3rdparty/cutlass/include/cute/arch/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,080 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,108 copying 3rdparty/cutlass/include/cute/arch/mma_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,111 copying 3rdparty/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,114 copying 3rdparty/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,120 copying 3rdparty/cutlass/include/cute/arch/copy_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,122 copying 3rdparty/cutlass/include/cute/arch/mma_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,125 copying 3rdparty/cutlass/include/cute/arch/mma_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,129 copying 3rdparty/cutlass/include/cute/arch/copy_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,132 copying 3rdparty/cutlass/include/cute/arch/copy_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,135 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,189 copying 3rdparty/cutlass/include/cute/arch/simd_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,192 copying 3rdparty/cutlass/include/cute/arch/mma_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,195 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:23,197 creating build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,198 copying 3rdparty/cutlass/include/cutlass/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,201 copying 3rdparty/cutlass/include/cutlass/coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,204 copying 3rdparty/cutlass/include/cutlass/array_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,207 copying 3rdparty/cutlass/include/cutlass/version.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,209 copying 3rdparty/cutlass/include/cutlass/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,219 copying 3rdparty/cutlass/include/cutlass/complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,222 copying 3rdparty/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,224 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,226 copying 3rdparty/cutlass/include/cutlass/conv/convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,228 copying 3rdparty/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,231 copying 3rdparty/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,234 copying 3rdparty/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,237 copying 3rdparty/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,240 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:23,241 copying 3rdparty/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:23,243 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:23,246 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:23,249 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,250 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,253 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,256 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,259 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,262 copying 3rdparty/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,264 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,267 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,269 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,272 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,275 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,278 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,280 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,283 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,287 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,290 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,293 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,296 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,299 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,302 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,305 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,307 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,310 copying 3rdparty/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,313 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,316 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,319 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,322 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,325 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,328 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,331 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:23,333 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:23,335 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:23,337 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:23,338 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:23,342 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:23,344 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:23,347 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:23,350 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:23,353 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:23,356 copying 3rdparty/cutlass/include/cutlass/conv/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:23,359 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:23,362 copying 3rdparty/cutlass/include/cutlass/conv/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:23,364 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2025-10-10T01:29:23,365 copying 3rdparty/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2025-10-10T01:29:23,368 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:23,369 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:23,372 copying 3rdparty/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:23,375 copying 3rdparty/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:23,378 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:23,381 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,382 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,385 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,388 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,391 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,393 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,396 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,399 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,401 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,404 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,407 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,410 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,413 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,415 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,418 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,421 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,424 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,426 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,429 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,432 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,435 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,438 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,441 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,443 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,446 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,448 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,451 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,454 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,456 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,460 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,463 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,466 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,469 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,471 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,474 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,478 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,481 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,483 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,486 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,489 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,492 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,495 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,498 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,501 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,504 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,507 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,510 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:23,512 copying 3rdparty/cutlass/include/cutlass/real.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,515 copying 3rdparty/cutlass/include/cutlass/subbyte_reference.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,518 copying 3rdparty/cutlass/include/cutlass/pitch_linear_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,520 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2025-10-10T01:29:23,522 copying 3rdparty/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2025-10-10T01:29:23,525 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:23,526 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:23,529 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:23,532 copying 3rdparty/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:23,535 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2025-10-10T01:29:23,536 copying 3rdparty/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2025-10-10T01:29:23,540 copying 3rdparty/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform 2025-10-10T01:29:23,543 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2025-10-10T01:29:23,544 copying 3rdparty/cutlass/include/cutlass/transform/thread/unary_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2025-10-10T01:29:23,547 copying 3rdparty/cutlass/include/cutlass/transform/thread/transpose.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2025-10-10T01:29:23,549 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2025-10-10T01:29:23,550 copying 3rdparty/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2025-10-10T01:29:23,553 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,554 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,557 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,560 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,562 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,565 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,568 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,571 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,573 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,576 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,578 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,581 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,585 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,587 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,590 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,593 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,596 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,599 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,602 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,606 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,609 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,612 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,616 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,619 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,622 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,625 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:23,629 copying 3rdparty/cutlass/include/cutlass/semaphore.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,632 copying 3rdparty/cutlass/include/cutlass/exmy_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,635 copying 3rdparty/cutlass/include/cutlass/gemm_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,638 copying 3rdparty/cutlass/include/cutlass/functional.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,641 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2025-10-10T01:29:23,643 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2025-10-10T01:29:23,646 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2025-10-10T01:29:23,649 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:23,651 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:23,653 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:23,656 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:23,658 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:23,659 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:23,662 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:23,665 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:23,667 creating build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:23,669 copying 3rdparty/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:23,671 copying 3rdparty/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:23,675 copying 3rdparty/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:23,678 copying 3rdparty/cutlass/include/cutlass/fast_math.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:23,681 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:23,682 copying 3rdparty/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:23,686 copying 3rdparty/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:23,688 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,689 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,693 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,696 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,699 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,702 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,704 copying 3rdparty/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,707 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,709 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,712 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,715 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,720 copying 3rdparty/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,723 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,726 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,728 copying 3rdparty/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,730 copying 3rdparty/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,733 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,736 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,738 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,741 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,744 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,747 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,749 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,752 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,755 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,758 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,762 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,765 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,769 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,772 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,774 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,777 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,780 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,783 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,785 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,790 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:23,793 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,794 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,798 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,800 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,804 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,807 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,809 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,812 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,815 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,817 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,820 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,822 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,825 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,828 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,832 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,835 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,838 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,841 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,844 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,846 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,850 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,853 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,856 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,858 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,861 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,864 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,867 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,871 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,874 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,876 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,880 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,882 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,885 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,888 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,892 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,894 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,897 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,901 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,904 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,907 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,910 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,913 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,916 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,920 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,923 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,926 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,929 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,931 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,935 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,937 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,940 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,943 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,946 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,949 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,952 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,955 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,958 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,962 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,965 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,968 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,971 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,974 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,976 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,979 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,982 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,985 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,988 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,991 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,995 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:23,997 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,000 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,003 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,006 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,009 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,012 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,015 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,018 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,021 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,025 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,028 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,031 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,033 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,035 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,038 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,041 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,044 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,047 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,049 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,054 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,057 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,060 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,062 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,066 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,068 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,071 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,074 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,078 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,081 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,083 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,086 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,089 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,092 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,095 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,098 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,101 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,104 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,108 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,110 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,113 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,116 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,119 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,123 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,126 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,129 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:24,132 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,133 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,136 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,140 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,143 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,146 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,151 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,154 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,157 copying 3rdparty/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,160 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,163 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,167 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,170 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,174 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,178 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,181 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,184 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,187 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,190 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,194 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,197 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,200 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,201 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,205 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,208 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,211 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,215 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,218 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,221 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,224 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,228 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,231 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,234 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,237 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,240 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,243 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,246 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,249 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,252 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,255 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,259 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,262 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,265 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,268 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,271 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,274 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,277 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,280 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:24,284 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,286 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,290 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,295 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,299 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,302 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,306 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,309 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,312 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,315 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,318 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,321 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,324 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,328 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,331 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,335 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,339 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,342 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,345 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,349 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,352 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,354 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,358 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,360 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,364 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:24,367 copying 3rdparty/cutlass/include/cutlass/gemm/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:24,370 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:24,371 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:24,374 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:24,376 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:24,379 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:24,382 copying 3rdparty/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:24,384 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,385 copying 3rdparty/cutlass/include/cutlass/gemm/device/symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,389 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,392 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,395 copying 3rdparty/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,398 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,401 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,403 copying 3rdparty/cutlass/include/cutlass/gemm/device/trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,406 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,409 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,412 copying 3rdparty/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,415 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,418 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,421 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,424 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,427 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,430 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,433 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,436 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,439 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,442 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,445 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,448 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,450 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,453 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,457 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,459 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,462 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,465 copying 3rdparty/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,468 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:24,470 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,471 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,475 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,477 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,480 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,482 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,485 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,488 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,491 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,495 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,498 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,501 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,504 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,507 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,510 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,512 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,515 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,518 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,521 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,524 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,527 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,529 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,532 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,535 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,538 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,541 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,544 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,548 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,551 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,554 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,558 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,560 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,563 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,566 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,569 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,572 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,575 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,579 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,583 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,585 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,588 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,590 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,593 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:24,596 copying 3rdparty/cutlass/include/cutlass/cutlass.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,599 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,600 copying 3rdparty/cutlass/include/cutlass/detail/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,603 copying 3rdparty/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,606 copying 3rdparty/cutlass/include/cutlass/detail/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,608 copying 3rdparty/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,611 copying 3rdparty/cutlass/include/cutlass/detail/cluster.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,614 copying 3rdparty/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,616 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2025-10-10T01:29:24,617 copying 3rdparty/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2025-10-10T01:29:24,620 copying 3rdparty/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2025-10-10T01:29:24,623 copying 3rdparty/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,626 copying 3rdparty/cutlass/include/cutlass/detail/collective.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,628 copying 3rdparty/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,631 copying 3rdparty/cutlass/include/cutlass/detail/helper_macros.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,633 copying 3rdparty/cutlass/include/cutlass/detail/dependent_false.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,636 copying 3rdparty/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:24,638 copying 3rdparty/cutlass/include/cutlass/numeric_conversion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,645 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:24,647 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:24,650 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:24,653 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:24,656 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:24,659 copying 3rdparty/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2025-10-10T01:29:24,661 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2025-10-10T01:29:24,662 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2025-10-10T01:29:24,665 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2025-10-10T01:29:24,667 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:24,668 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:24,671 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:24,674 copying 3rdparty/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:24,676 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:24,679 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,682 copying 3rdparty/cutlass/include/cutlass/tensor_ref.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,685 copying 3rdparty/cutlass/include/cutlass/float8.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,688 copying 3rdparty/cutlass/include/cutlass/array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,692 copying 3rdparty/cutlass/include/cutlass/uint128.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,695 copying 3rdparty/cutlass/include/cutlass/half.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,698 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,700 copying 3rdparty/cutlass/include/cutlass/quaternion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,703 copying 3rdparty/cutlass/include/cutlass/aligned_buffer.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,706 copying 3rdparty/cutlass/include/cutlass/tensor_view.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,708 copying 3rdparty/cutlass/include/cutlass/trace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,711 copying 3rdparty/cutlass/include/cutlass/device_kernel.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,713 copying 3rdparty/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,716 copying 3rdparty/cutlass/include/cutlass/float_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,719 copying 3rdparty/cutlass/include/cutlass/numeric_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,722 copying 3rdparty/cutlass/include/cutlass/blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,724 creating build/lib/flashinfer/data/cutlass/include/cutlass/platform 2025-10-10T01:29:24,725 copying 3rdparty/cutlass/include/cutlass/platform/platform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/platform 2025-10-10T01:29:24,728 copying 3rdparty/cutlass/include/cutlass/workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,731 copying 3rdparty/cutlass/include/cutlass/tensor_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,734 copying 3rdparty/cutlass/include/cutlass/integer_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,736 creating build/lib/flashinfer/data/cutlass/include/cutlass/thread 2025-10-10T01:29:24,737 copying 3rdparty/cutlass/include/cutlass/thread/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/thread 2025-10-10T01:29:24,739 copying 3rdparty/cutlass/include/cutlass/cluster_launch.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,742 copying 3rdparty/cutlass/include/cutlass/wmma_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,745 copying 3rdparty/cutlass/include/cutlass/uint256.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,747 copying 3rdparty/cutlass/include/cutlass/array_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,750 copying 3rdparty/cutlass/include/cutlass/constants.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,753 copying 3rdparty/cutlass/include/cutlass/bfloat16.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,756 copying 3rdparty/cutlass/include/cutlass/floating_point_nvrtc.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:24,759 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2025-10-10T01:29:24,760 copying 3rdparty/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2025-10-10T01:29:24,763 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,764 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,768 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,771 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,774 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,778 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,781 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,784 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,787 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,791 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,794 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,798 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,801 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,805 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:24,809 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,810 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,813 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,816 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,818 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,821 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,824 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,826 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,829 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,832 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,835 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,838 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,841 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,844 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,847 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,850 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:24,853 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,854 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,857 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,861 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,865 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,868 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,871 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,872 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,876 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,879 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,882 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,884 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,887 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:24,890 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,894 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,897 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,900 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,903 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,906 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,910 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,912 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,915 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,918 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:24,920 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,921 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,924 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,927 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,930 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,932 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,935 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,937 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,940 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,943 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,946 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,948 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,951 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,953 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,956 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,958 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,961 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,963 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,966 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,969 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,972 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,975 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,977 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,980 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,983 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/activation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,986 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:24,988 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:24,989 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:24,993 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:24,995 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:24,998 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,001 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,003 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,006 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,009 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,012 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,015 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,018 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,020 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,023 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,026 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:25,027 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:25,030 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:25,032 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:25,035 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:25,039 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:25,042 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,045 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,047 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,051 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,054 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,057 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,060 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,063 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,066 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,069 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,072 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,075 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,078 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,083 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,087 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,090 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,093 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,097 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,101 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,104 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,107 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,110 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,116 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,123 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,125 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,128 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,130 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,133 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,136 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,139 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,142 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,145 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,148 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,151 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,154 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,157 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:25,160 copying 3rdparty/cutlass/include/cutlass/blas3_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,162 copying 3rdparty/cutlass/include/cutlass/numeric_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,164 copying 3rdparty/cutlass/include/cutlass/kernel_launch.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,167 creating build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,168 copying 3rdparty/cutlass/include/cutlass/arch/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,171 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm100.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,174 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,176 copying 3rdparty/cutlass/include/cutlass/arch/arch.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,179 copying 3rdparty/cutlass/include/cutlass/arch/wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,183 copying 3rdparty/cutlass/include/cutlass/arch/memory.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,187 copying 3rdparty/cutlass/include/cutlass/arch/config.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,192 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,196 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,201 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,208 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,213 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,218 copying 3rdparty/cutlass/include/cutlass/arch/cache_operation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,222 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,227 copying 3rdparty/cutlass/include/cutlass/arch/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,231 copying 3rdparty/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,235 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm72.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,238 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,241 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,245 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,249 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,254 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,261 copying 3rdparty/cutlass/include/cutlass/arch/simd.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,266 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,272 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm90.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,276 copying 3rdparty/cutlass/include/cutlass/arch/synclog.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,282 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,289 copying 3rdparty/cutlass/include/cutlass/arch/reg_reconfig.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,295 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:25,303 copying 3rdparty/cutlass/include/cutlass/matrix_shape.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,309 copying 3rdparty/cutlass/include/cutlass/matrix_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,316 copying 3rdparty/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,324 copying 3rdparty/cutlass/include/cutlass/tfloat32.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,332 copying 3rdparty/cutlass/include/cutlass/predicate_vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,340 copying 3rdparty/cutlass/include/cutlass/relatively_equal.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,344 copying 3rdparty/cutlass/include/cutlass/gemm_coord.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,350 copying 3rdparty/cutlass/include/cutlass/block_striped.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,357 copying 3rdparty/cutlass/include/cutlass/core_io.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:25,364 creating build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,366 copying 3rdparty/cutlass/include/cutlass/layout/vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,371 copying 3rdparty/cutlass/include/cutlass/layout/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,374 copying 3rdparty/cutlass/include/cutlass/layout/permute.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,377 copying 3rdparty/cutlass/include/cutlass/layout/layout.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,379 copying 3rdparty/cutlass/include/cutlass/layout/pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,382 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,385 copying 3rdparty/cutlass/include/cutlass/layout/tensor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,388 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,391 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:25,394 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,396 copying 3rdparty/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,399 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,402 copying 3rdparty/cutlass/tools/util/include/cutlass/util/command_line.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,405 copying 3rdparty/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,407 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,410 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,413 copying 3rdparty/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,416 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,419 copying 3rdparty/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,421 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,424 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,427 copying 3rdparty/cutlass/tools/util/include/cutlass/util/distribution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,429 copying 3rdparty/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,432 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,434 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,437 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2025-10-10T01:29:25,438 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2025-10-10T01:29:25,441 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2025-10-10T01:29:25,443 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,444 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,448 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,452 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,455 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,458 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,461 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:25,462 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:25,465 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:25,468 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:25,471 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,473 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,477 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,479 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,482 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2025-10-10T01:29:25,483 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2025-10-10T01:29:25,486 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,488 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:25,491 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,492 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,496 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,499 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,502 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,505 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,507 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,510 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,512 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,515 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,517 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,520 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,522 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,525 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,528 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,531 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,533 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,536 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,539 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,541 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,544 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,547 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,549 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,552 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,554 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:25,557 copying 3rdparty/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,560 copying 3rdparty/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,563 copying 3rdparty/cutlass/tools/util/include/cutlass/util/debug.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,565 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,568 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,570 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,573 copying 3rdparty/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,576 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,578 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,581 copying 3rdparty/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,583 copying 3rdparty/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,585 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,588 copying 3rdparty/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:25,590 creating build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,592 copying 3rdparty/spdlog/include/spdlog/details/udp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,594 copying 3rdparty/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,597 copying 3rdparty/spdlog/include/spdlog/details/udp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,599 copying 3rdparty/spdlog/include/spdlog/details/file_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,601 copying 3rdparty/spdlog/include/spdlog/details/backtracer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,604 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,606 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,608 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,610 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,612 copying 3rdparty/spdlog/include/spdlog/details/log_msg.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,614 copying 3rdparty/spdlog/include/spdlog/details/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,616 copying 3rdparty/spdlog/include/spdlog/details/os-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,619 copying 3rdparty/spdlog/include/spdlog/details/synchronous_factory.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,621 copying 3rdparty/spdlog/include/spdlog/details/fmt_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,624 copying 3rdparty/spdlog/include/spdlog/details/null_mutex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,626 copying 3rdparty/spdlog/include/spdlog/details/tcp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,628 copying 3rdparty/spdlog/include/spdlog/details/tcp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,631 copying 3rdparty/spdlog/include/spdlog/details/windows_include.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,633 copying 3rdparty/spdlog/include/spdlog/details/thread_pool-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,635 copying 3rdparty/spdlog/include/spdlog/details/thread_pool.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,638 copying 3rdparty/spdlog/include/spdlog/details/console_globals.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,640 copying 3rdparty/spdlog/include/spdlog/details/log_msg-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,642 copying 3rdparty/spdlog/include/spdlog/details/file_helper-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,644 copying 3rdparty/spdlog/include/spdlog/details/circular_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,647 copying 3rdparty/spdlog/include/spdlog/details/backtracer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,649 copying 3rdparty/spdlog/include/spdlog/details/registry.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,652 copying 3rdparty/spdlog/include/spdlog/details/registry-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:25,655 copying 3rdparty/spdlog/include/spdlog/fwd.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,657 copying 3rdparty/spdlog/include/spdlog/pattern_formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,659 copying 3rdparty/spdlog/include/spdlog/common.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,662 copying 3rdparty/spdlog/include/spdlog/pattern_formatter-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,665 copying 3rdparty/spdlog/include/spdlog/version.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,668 copying 3rdparty/spdlog/include/spdlog/stopwatch.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,670 copying 3rdparty/spdlog/include/spdlog/logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,672 copying 3rdparty/spdlog/include/spdlog/spdlog.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,675 copying 3rdparty/spdlog/include/spdlog/common-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,677 copying 3rdparty/spdlog/include/spdlog/spdlog-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,679 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,680 copying 3rdparty/spdlog/include/spdlog/fmt/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,683 copying 3rdparty/spdlog/include/spdlog/fmt/fmt.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,685 copying 3rdparty/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,688 copying 3rdparty/spdlog/include/spdlog/fmt/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,690 copying 3rdparty/spdlog/include/spdlog/fmt/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,692 copying 3rdparty/spdlog/include/spdlog/fmt/ostr.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,694 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,695 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/locale.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,697 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,700 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,703 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,707 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/color.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,710 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,713 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,715 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,718 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/printf.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,721 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,724 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/args.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,727 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,731 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/core.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,736 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,739 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:25,781 copying 3rdparty/spdlog/include/spdlog/fmt/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,783 copying 3rdparty/spdlog/include/spdlog/fmt/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:25,785 copying 3rdparty/spdlog/include/spdlog/mdc.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,788 copying 3rdparty/spdlog/include/spdlog/formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,790 copying 3rdparty/spdlog/include/spdlog/async_logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,792 creating build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,793 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,795 copying 3rdparty/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,797 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,800 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,802 copying 3rdparty/spdlog/include/spdlog/sinks/qt_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,805 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,807 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,809 copying 3rdparty/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,812 copying 3rdparty/spdlog/include/spdlog/sinks/udp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,814 copying 3rdparty/spdlog/include/spdlog/sinks/dist_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,816 copying 3rdparty/spdlog/include/spdlog/sinks/null_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,818 copying 3rdparty/spdlog/include/spdlog/sinks/tcp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,821 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,823 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,825 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,827 copying 3rdparty/spdlog/include/spdlog/sinks/sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,829 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,832 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,834 copying 3rdparty/spdlog/include/spdlog/sinks/kafka_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,837 copying 3rdparty/spdlog/include/spdlog/sinks/systemd_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,839 copying 3rdparty/spdlog/include/spdlog/sinks/msvc_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,842 copying 3rdparty/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,844 copying 3rdparty/spdlog/include/spdlog/sinks/mongo_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,847 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,849 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,852 copying 3rdparty/spdlog/include/spdlog/sinks/android_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,854 copying 3rdparty/spdlog/include/spdlog/sinks/ostream_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,857 copying 3rdparty/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,859 copying 3rdparty/spdlog/include/spdlog/sinks/callback_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,861 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,864 copying 3rdparty/spdlog/include/spdlog/sinks/syslog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,867 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,869 copying 3rdparty/spdlog/include/spdlog/sinks/sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,871 copying 3rdparty/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:25,873 copying 3rdparty/spdlog/include/spdlog/async_logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,875 copying 3rdparty/spdlog/include/spdlog/async.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,878 copying 3rdparty/spdlog/include/spdlog/tweakme.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,880 copying 3rdparty/spdlog/include/spdlog/logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:25,883 creating build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:25,884 copying 3rdparty/spdlog/include/spdlog/cfg/env.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:25,886 copying 3rdparty/spdlog/include/spdlog/cfg/argv.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:25,888 copying 3rdparty/spdlog/include/spdlog/cfg/helpers-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:25,890 copying 3rdparty/spdlog/include/spdlog/cfg/helpers.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:25,999 installing to build/bdist.linux-armv7l/wheel 2025-10-10T01:29:26,000 running install 2025-10-10T01:29:26,024 running install_lib 2025-10-10T01:29:26,032 creating build/bdist.linux-armv7l/wheel 2025-10-10T01:29:26,035 creating build/bdist.linux-armv7l/wheel/flashinfer 2025-10-10T01:29:26,036 copying build/lib/flashinfer/green_ctx.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:26,039 copying build/lib/flashinfer/autotuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:26,042 creating build/bdist.linux-armv7l/wheel/flashinfer/jit 2025-10-10T01:29:26,043 copying build/lib/flashinfer/jit/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,046 copying build/lib/flashinfer/jit/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,048 copying build/lib/flashinfer/jit/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,050 copying build/lib/flashinfer/jit/spdlog.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,052 copying build/lib/flashinfer/jit/comm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,054 copying build/lib/flashinfer/jit/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,056 copying build/lib/flashinfer/jit/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,058 copying build/lib/flashinfer/jit/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,060 copying build/lib/flashinfer/jit/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,062 copying build/lib/flashinfer/jit/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,064 copying build/lib/flashinfer/jit/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,066 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm 2025-10-10T01:29:26,068 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm/cutlass 2025-10-10T01:29:26,069 copying build/lib/flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2025-10-10T01:29:26,072 copying build/lib/flashinfer/jit/gemm/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2025-10-10T01:29:26,074 copying build/lib/flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2025-10-10T01:29:26,076 copying build/lib/flashinfer/jit/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2025-10-10T01:29:26,078 copying build/lib/flashinfer/jit/gemm/deepgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2025-10-10T01:29:26,080 copying build/lib/flashinfer/jit/gemm/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2025-10-10T01:29:26,082 copying build/lib/flashinfer/jit/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,084 copying build/lib/flashinfer/jit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,087 copying build/lib/flashinfer/jit/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,088 copying build/lib/flashinfer/jit/cpp_ext.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,091 copying build/lib/flashinfer/jit/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,093 copying build/lib/flashinfer/jit/env.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,095 copying build/lib/flashinfer/jit/cubin_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,097 copying build/lib/flashinfer/jit/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,100 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention 2025-10-10T01:29:26,101 copying build/lib/flashinfer/jit/attention/variants.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2025-10-10T01:29:26,103 copying build/lib/flashinfer/jit/attention/modules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2025-10-10T01:29:26,106 copying build/lib/flashinfer/jit/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2025-10-10T01:29:26,108 copying build/lib/flashinfer/jit/attention/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2025-10-10T01:29:26,110 copying build/lib/flashinfer/jit/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,112 copying build/lib/flashinfer/jit/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2025-10-10T01:29:26,114 copying build/lib/flashinfer/_build_meta.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:26,116 copying build/lib/flashinfer/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:26,118 copying build/lib/flashinfer/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:26,122 copying build/lib/flashinfer/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:26,125 creating build/bdist.linux-armv7l/wheel/flashinfer/data 2025-10-10T01:29:26,127 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass 2025-10-10T01:29:26,129 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python 2025-10-10T01:29:26,130 copying build/lib/flashinfer/data/cutlass/python/setup_pycute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2025-10-10T01:29:26,132 copying build/lib/flashinfer/data/cutlass/python/setup_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2025-10-10T01:29:26,134 copying build/lib/flashinfer/data/cutlass/python/setup_cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2025-10-10T01:29:26,137 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,138 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,141 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,143 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,147 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/manifest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,150 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,178 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,181 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,184 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,186 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,189 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,191 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,194 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,196 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,199 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,202 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,205 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,207 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,210 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/symm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,213 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2025-10-10T01:29:26,216 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL 2025-10-10T01:29:26,218 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:26,220 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,221 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:26,223 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:26,226 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:26,227 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:26,230 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:26,233 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:26,235 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2025-10-10T01:29:26,237 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:26,239 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:26,241 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:26,243 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2025-10-10T01:29:26,245 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:26,247 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:26,249 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:26,251 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:26,254 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2025-10-10T01:29:26,256 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:26,257 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:26,260 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:26,263 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2025-10-10T01:29:26,265 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2025-10-10T01:29:26,268 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,270 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,273 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,275 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,278 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,279 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,282 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,284 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,286 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,288 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,290 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2025-10-10T01:29:26,293 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,300 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2025-10-10T01:29:26,302 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:26,305 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:26,306 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:26,308 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:26,311 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:26,314 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2025-10-10T01:29:26,317 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:26,319 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,320 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,324 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,327 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,330 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,332 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,335 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,337 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,340 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,343 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,346 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,350 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,352 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_capacity.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,353 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/ampere_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2025-10-10T01:29:26,355 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2025-10-10T01:29:26,359 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,360 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/dsl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,364 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,367 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,369 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/ast_preprocessor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,374 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,377 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/env_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,380 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/cache_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,383 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,385 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/dlpack_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,387 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/jit_arg_adapters.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,390 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/cuda.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,392 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,394 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/device_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,397 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/tensor_descriptor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime 2025-10-10T01:29:26,400 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,403 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:26,404 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:26,406 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:26,408 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:26,410 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/gpu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:26,412 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/arith.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers 2025-10-10T01:29:26,416 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:26,417 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/stacktrace.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:26,419 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:26,421 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/logger.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:26,424 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/timer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils 2025-10-10T01:29:26,426 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,431 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/base_dsl/ast_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/base_dsl 2025-10-10T01:29:26,434 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:26,436 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:26,439 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:26,441 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/tree_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:26,444 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/cutlass_ast_decorators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl 2025-10-10T01:29:26,448 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:26,450 copying build/lib/flashinfer/data/cutlass/python/pycute/int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:26,452 copying build/lib/flashinfer/data/cutlass/python/pycute/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:26,455 copying build/lib/flashinfer/data/cutlass/python/pycute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:26,457 copying build/lib/flashinfer/data/cutlass/python/pycute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:26,459 copying build/lib/flashinfer/data/cutlass/python/pycute/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2025-10-10T01:29:26,463 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:26,464 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/shape.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:26,467 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:26,470 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:26,471 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:26,475 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:26,477 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:26,480 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:26,482 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2025-10-10T01:29:26,486 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,488 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,491 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,493 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,496 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,498 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,501 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,504 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,507 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,510 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,512 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,514 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,518 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2025-10-10T01:29:26,519 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2025-10-10T01:29:26,522 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2025-10-10T01:29:26,524 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,532 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2025-10-10T01:29:26,534 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,535 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,538 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,540 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,543 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,546 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,548 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,551 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,553 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2025-10-10T01:29:26,556 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2025-10-10T01:29:26,559 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2025-10-10T01:29:26,562 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,563 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,566 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,568 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,571 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,574 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,578 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,581 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,583 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,586 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2025-10-10T01:29:26,590 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:26,591 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:26,594 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:26,597 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2025-10-10T01:29:26,600 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,601 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,605 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,607 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,609 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,612 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,614 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,617 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,619 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,622 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,624 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,628 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,631 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,633 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2025-10-10T01:29:26,636 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2025-10-10T01:29:26,638 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:26,641 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:26,643 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:26,645 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:26,648 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2025-10-10T01:29:26,651 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2025-10-10T01:29:26,655 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:26,657 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:26,660 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:26,662 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:26,664 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:26,666 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2025-10-10T01:29:26,669 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:26,670 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:26,672 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:26,674 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2025-10-10T01:29:26,677 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src 2025-10-10T01:29:26,678 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src/source 2025-10-10T01:29:26,680 copying build/lib/flashinfer/data/cutlass/python/docs_src/source/conf.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/docs_src/source 2025-10-10T01:29:26,683 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools 2025-10-10T01:29:26,684 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util 2025-10-10T01:29:26,686 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include 2025-10-10T01:29:26,688 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass 2025-10-10T01:29:26,690 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,692 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,695 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,698 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,700 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,703 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,707 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,709 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,712 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,715 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,717 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,719 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,722 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,725 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,727 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,730 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,733 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference 2025-10-10T01:29:26,734 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2025-10-10T01:29:26,735 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2025-10-10T01:29:26,739 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2025-10-10T01:29:26,742 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,743 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,746 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,749 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,752 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,755 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,758 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:26,760 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:26,763 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:26,765 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2025-10-10T01:29:26,768 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,771 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,774 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,777 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,781 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2025-10-10T01:29:26,782 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2025-10-10T01:29:26,785 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,787 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2025-10-10T01:29:26,791 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,792 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,796 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,799 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,802 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,805 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,807 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,809 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,812 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,814 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,817 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,819 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,822 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,825 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,829 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,832 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,835 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,838 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,841 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,843 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,846 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,848 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,850 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,853 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,855 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2025-10-10T01:29:26,858 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,861 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,863 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,865 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,868 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,870 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,873 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,875 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,877 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,879 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,882 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,884 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,886 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2025-10-10T01:29:26,889 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/scripts 2025-10-10T01:29:26,890 copying build/lib/flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/scripts 2025-10-10T01:29:26,893 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples 2025-10-10T01:29:26,895 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:26,896 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:26,898 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:26,900 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:26,903 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2025-10-10T01:29:26,907 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:26,909 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:26,911 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2025-10-10T01:29:26,914 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python 2025-10-10T01:29:26,915 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL 2025-10-10T01:29:26,917 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,918 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,922 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,926 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,930 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,934 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,938 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2025-10-10T01:29:26,942 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:26,943 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:26,948 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:26,950 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2025-10-10T01:29:26,953 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2025-10-10T01:29:26,955 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2025-10-10T01:29:26,956 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2025-10-10T01:29:26,959 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2025-10-10T01:29:26,960 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2025-10-10T01:29:26,964 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,966 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,969 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,971 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,974 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,976 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,979 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,981 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,983 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,986 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2025-10-10T01:29:26,989 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2025-10-10T01:29:26,990 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2025-10-10T01:29:26,992 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2025-10-10T01:29:26,995 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen 2025-10-10T01:29:26,997 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:26,998 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,001 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,003 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,006 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,008 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,010 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,013 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,016 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,019 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,021 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,023 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,026 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2025-10-10T01:29:27,028 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test 2025-10-10T01:29:27,030 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python 2025-10-10T01:29:27,032 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass 2025-10-10T01:29:27,033 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,034 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,037 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,040 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,042 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,044 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,046 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,049 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,051 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,053 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,056 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,058 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,060 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,062 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2025-10-10T01:29:27,065 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/emit 2025-10-10T01:29:27,066 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/emit 2025-10-10T01:29:27,069 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:27,070 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:27,073 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:27,076 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:27,077 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2025-10-10T01:29:27,080 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/installation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass 2025-10-10T01:29:27,082 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,083 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,086 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,088 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,092 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2025-10-10T01:29:27,093 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt/utils 2025-10-10T01:29:27,095 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,098 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,100 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2025-10-10T01:29:27,102 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:27,104 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:27,106 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:27,109 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:27,111 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2025-10-10T01:29:27,114 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,115 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_complement.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,118 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,119 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,121 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,123 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_coalesce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,125 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,127 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_composition.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,129 copying build/lib/flashinfer/data/cutlass/test/python/pycute/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2025-10-10T01:29:27,132 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit 2025-10-10T01:29:27,133 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm 2025-10-10T01:29:27,135 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm/device 2025-10-10T01:29:27,136 copying build/lib/flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/unit/gemm/device 2025-10-10T01:29:27,139 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include 2025-10-10T01:29:27,141 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,142 copying build/lib/flashinfer/data/cutlass/include/cute/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,145 copying build/lib/flashinfer/data/cutlass/include/cute/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,150 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,152 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,154 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,156 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,158 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,162 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,165 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,167 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,170 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,173 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,175 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,177 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,183 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,185 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,213 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,215 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,217 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,233 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,235 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,238 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,240 copying build/lib/flashinfer/data/cutlass/include/cute/atom/partitioner.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,242 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,246 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,276 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,285 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,288 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,306 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,308 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,311 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2025-10-10T01:29:27,313 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_impl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,318 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,319 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,322 copying build/lib/flashinfer/data/cutlass/include/cute/container/bit_field.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,324 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_aligned.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,326 copying build/lib/flashinfer/data/cutlass/include/cute/container/tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,328 copying build/lib/flashinfer/data/cutlass/include/cute/container/type_list.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,330 copying build/lib/flashinfer/data/cutlass/include/cute/container/alignment.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,332 copying build/lib/flashinfer/data/cutlass/include/cute/container/cuda_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,335 copying build/lib/flashinfer/data/cutlass/include/cute/container/array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2025-10-10T01:29:27,337 copying build/lib/flashinfer/data/cutlass/include/cute/pointer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,340 copying build/lib/flashinfer/data/cutlass/include/cute/layout_composed.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,343 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,344 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,346 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/int.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,349 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,351 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,354 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,356 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,359 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/real.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,361 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,363 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2025-10-10T01:29:27,366 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,370 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,373 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,374 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,376 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,378 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,381 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,383 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,385 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/functional.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,388 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,390 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,392 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/clear.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,394 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,397 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,399 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,402 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2025-10-10T01:29:27,406 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,407 copying build/lib/flashinfer/data/cutlass/include/cute/util/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,410 copying build/lib/flashinfer/data/cutlass/include/cute/util/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,412 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,414 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_svg.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,417 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_latex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,419 copying build/lib/flashinfer/data/cutlass/include/cute/util/print.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2025-10-10T01:29:27,422 copying build/lib/flashinfer/data/cutlass/include/cute/stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,425 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_base.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,428 copying build/lib/flashinfer/data/cutlass/include/cute/int_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,431 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,433 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_zip.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,435 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_flagged.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,437 copying build/lib/flashinfer/data/cutlass/include/cute/tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,439 copying build/lib/flashinfer/data/cutlass/include/cute/underscore.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,441 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2025-10-10T01:29:27,445 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,446 copying build/lib/flashinfer/data/cutlass/include/cute/arch/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,448 copying build/lib/flashinfer/data/cutlass/include/cute/arch/util.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,450 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,454 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,457 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,459 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,461 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,480 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,487 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,491 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,493 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,545 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,547 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,560 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,562 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,565 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,684 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,687 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,692 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,694 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,748 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,751 copying build/lib/flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,754 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,759 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,762 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,764 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,770 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,772 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,776 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,885 copying build/lib/flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,887 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,890 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2025-10-10T01:29:27,895 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,903 copying build/lib/flashinfer/data/cutlass/include/cutlass/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,905 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,918 copying build/lib/flashinfer/data/cutlass/include/cutlass/complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,920 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:27,923 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:27,925 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:27,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:27,929 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:27,932 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:27,934 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:27,938 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:27,939 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:27,941 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:27,944 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2025-10-10T01:29:27,948 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,949 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,952 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,963 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,965 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,968 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,970 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,973 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,975 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,977 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,980 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,983 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,986 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,989 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,995 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:27,998 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,000 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,003 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,005 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,008 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,010 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,014 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,016 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,019 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,022 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,024 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2025-10-10T01:29:28,028 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:28,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:28,032 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:28,033 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:28,036 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:28,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:28,040 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2025-10-10T01:29:28,043 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:28,046 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:28,049 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:28,052 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2025-10-10T01:29:28,054 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2025-10-10T01:29:28,057 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/thread 2025-10-10T01:29:28,058 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/thread 2025-10-10T01:29:28,061 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:28,062 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:28,065 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:28,068 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:28,071 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2025-10-10T01:29:28,074 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,076 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,079 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,081 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,084 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,089 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,092 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,094 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,097 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,099 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,104 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,107 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,112 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,115 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,121 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,123 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,126 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,131 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,133 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,136 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,138 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,141 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,146 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,149 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,152 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,157 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,159 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,162 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,164 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,167 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,169 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,172 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,174 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,176 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,179 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,181 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,184 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,187 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,190 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,192 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2025-10-10T01:29:28,194 copying build/lib/flashinfer/data/cutlass/include/cutlass/real.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,196 copying build/lib/flashinfer/data/cutlass/include/cutlass/subbyte_reference.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,200 copying build/lib/flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,203 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform 2025-10-10T01:29:28,204 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/warp 2025-10-10T01:29:28,206 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/warp 2025-10-10T01:29:28,209 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:28,210 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:28,213 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:28,215 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2025-10-10T01:29:28,218 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/collective 2025-10-10T01:29:28,219 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/collective 2025-10-10T01:29:28,223 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform 2025-10-10T01:29:28,226 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/thread 2025-10-10T01:29:28,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2025-10-10T01:29:28,230 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2025-10-10T01:29:28,232 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/device 2025-10-10T01:29:28,234 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/device 2025-10-10T01:29:28,237 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,239 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,242 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,244 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,246 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,252 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,255 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,257 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,260 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,262 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,265 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,268 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,271 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,274 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,276 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,279 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,281 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,284 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,288 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,291 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,294 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,298 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,301 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,306 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2025-10-10T01:29:28,311 copying build/lib/flashinfer/data/cutlass/include/cutlass/semaphore.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,313 copying build/lib/flashinfer/data/cutlass/include/cutlass/exmy_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,316 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,318 copying build/lib/flashinfer/data/cutlass/include/cutlass/functional.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,322 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental 2025-10-10T01:29:28,323 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed 2025-10-10T01:29:28,325 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2025-10-10T01:29:28,326 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2025-10-10T01:29:28,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2025-10-10T01:29:28,333 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:28,334 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:28,336 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:28,338 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2025-10-10T01:29:28,341 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:28,342 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:28,344 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:28,347 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2025-10-10T01:29:28,350 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:28,352 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:28,354 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:28,357 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2025-10-10T01:29:28,361 copying build/lib/flashinfer/data/cutlass/include/cutlass/fast_math.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:28,364 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:28,366 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:28,370 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:28,373 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,374 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,378 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,381 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,386 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,388 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,390 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,395 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,398 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,404 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,406 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,411 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,413 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,419 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,426 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,429 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,431 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,440 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,447 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,451 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,454 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,457 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,460 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,462 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,465 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,468 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,473 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2025-10-10T01:29:28,478 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,479 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,483 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,489 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,494 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,496 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,499 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,501 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,504 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,506 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,509 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,512 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,515 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,518 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,528 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,530 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,533 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,536 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,539 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,541 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,543 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,547 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,549 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,554 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,557 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,559 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,563 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,565 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,568 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,571 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,574 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,584 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,587 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,590 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,592 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,595 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,598 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,602 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,604 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,607 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,610 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,612 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,615 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,618 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,621 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,624 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,635 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,638 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,641 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,644 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,646 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,649 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,652 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,654 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,662 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,668 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,670 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,673 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,676 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,678 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,681 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,686 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,689 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,694 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,700 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,703 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,705 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,708 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,710 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,713 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,715 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,718 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,721 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,724 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,732 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,735 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,738 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,740 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,743 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,751 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,754 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,756 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,759 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,762 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,765 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,768 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,771 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,781 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,784 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,789 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,792 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,794 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2025-10-10T01:29:28,798 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,800 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,803 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,806 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,816 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,819 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,828 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,831 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,835 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,838 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,842 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,847 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,850 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,853 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,857 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,860 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,864 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,865 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,871 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,874 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,877 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,879 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,884 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,887 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,890 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,900 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,902 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,905 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,907 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,910 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,913 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,915 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,918 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,923 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,926 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,928 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,931 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,934 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2025-10-10T01:29:28,937 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,940 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,949 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,952 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,959 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,963 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,967 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,970 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,972 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,975 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,979 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,983 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,986 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:28,997 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,000 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,003 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,007 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,010 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,012 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,015 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,018 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2025-10-10T01:29:29,025 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:29,028 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:29,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:29,032 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:29,034 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:29,036 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2025-10-10T01:29:29,039 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2025-10-10T01:29:29,042 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,043 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,046 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,049 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,052 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,055 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,058 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,060 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,063 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,065 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,068 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,071 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,074 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,079 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,084 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,090 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,092 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,095 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,098 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,100 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,103 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,106 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,112 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,114 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,117 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,120 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2025-10-10T01:29:29,123 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,124 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,130 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,132 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,134 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,142 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,148 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,151 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,157 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,160 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,163 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,165 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,168 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,171 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,174 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,177 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,180 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,183 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,185 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,188 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,194 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,198 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,200 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,203 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,205 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,208 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,211 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,214 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,216 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,220 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,222 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,230 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,232 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,235 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,237 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,240 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2025-10-10T01:29:29,243 copying build/lib/flashinfer/data/cutlass/include/cutlass/cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,246 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,247 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,250 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,253 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,255 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,257 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,259 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,262 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail/collective 2025-10-10T01:29:29,264 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2025-10-10T01:29:29,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2025-10-10T01:29:29,269 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,271 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,274 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,276 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,278 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,280 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2025-10-10T01:29:29,283 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,292 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction 2025-10-10T01:29:29,293 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:29,295 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:29,298 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:29,301 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:29,303 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2025-10-10T01:29:29,306 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction 2025-10-10T01:29:29,309 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/thread 2025-10-10T01:29:29,310 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2025-10-10T01:29:29,313 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2025-10-10T01:29:29,316 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:29,317 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:29,320 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:29,322 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:29,325 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2025-10-10T01:29:29,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,331 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,334 copying build/lib/flashinfer/data/cutlass/include/cutlass/float8.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,338 copying build/lib/flashinfer/data/cutlass/include/cutlass/array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,342 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint128.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,344 copying build/lib/flashinfer/data/cutlass/include/cutlass/half.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,347 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,349 copying build/lib/flashinfer/data/cutlass/include/cutlass/quaternion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,352 copying build/lib/flashinfer/data/cutlass/include/cutlass/aligned_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,354 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,357 copying build/lib/flashinfer/data/cutlass/include/cutlass/trace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,359 copying build/lib/flashinfer/data/cutlass/include/cutlass/device_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,362 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,364 copying build/lib/flashinfer/data/cutlass/include/cutlass/float_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,368 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,370 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,373 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/platform 2025-10-10T01:29:29,374 copying build/lib/flashinfer/data/cutlass/include/cutlass/platform/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/platform 2025-10-10T01:29:29,377 copying build/lib/flashinfer/data/cutlass/include/cutlass/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,380 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,382 copying build/lib/flashinfer/data/cutlass/include/cutlass/integer_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,385 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/thread 2025-10-10T01:29:29,387 copying build/lib/flashinfer/data/cutlass/include/cutlass/thread/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/thread 2025-10-10T01:29:29,389 copying build/lib/flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/wmma_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint256.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,397 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/constants.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,403 copying build/lib/flashinfer/data/cutlass/include/cutlass/bfloat16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,406 copying build/lib/flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,408 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue 2025-10-10T01:29:29,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue 2025-10-10T01:29:29,413 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,414 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,418 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,423 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,430 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,441 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,447 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,450 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,455 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2025-10-10T01:29:29,460 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,461 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,463 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,466 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,469 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,471 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,474 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,476 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,480 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,483 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,488 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,490 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,493 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,496 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,499 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2025-10-10T01:29:29,502 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,503 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,506 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,510 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,514 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,517 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,520 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,526 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,529 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,531 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,533 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,537 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2025-10-10T01:29:29,540 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,543 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,546 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,548 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,551 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,555 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,558 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,560 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,563 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,565 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2025-10-10T01:29:29,569 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,570 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,575 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,582 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,584 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,587 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,590 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,593 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,595 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,598 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,600 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,603 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,605 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,607 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,610 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,613 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,616 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,618 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,621 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2025-10-10T01:29:29,636 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,637 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,640 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,643 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,645 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,648 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,650 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,655 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,658 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,661 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,667 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,669 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,673 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:29,674 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:29,677 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:29,679 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:29,682 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:29,685 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2025-10-10T01:29:29,687 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,690 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,692 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,696 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,698 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,701 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,704 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,706 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,709 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,712 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,715 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,717 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,720 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,723 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,725 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,728 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,733 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,735 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,737 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,740 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,743 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,751 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,753 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,756 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,758 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,761 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,763 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,767 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,769 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,777 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2025-10-10T01:29:29,782 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,784 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,786 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_launch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,789 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,790 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,794 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,798 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/arch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,804 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,806 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,811 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,814 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,818 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,823 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,828 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,833 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,835 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,843 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,849 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,851 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,854 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,856 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,860 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,865 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2025-10-10T01:29:29,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_shape.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,870 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,872 copying build/lib/flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,875 copying build/lib/flashinfer/data/cutlass/include/cutlass/tfloat32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,877 copying build/lib/flashinfer/data/cutlass/include/cutlass/predicate_vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,880 copying build/lib/flashinfer/data/cutlass/include/cutlass/relatively_equal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,883 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,885 copying build/lib/flashinfer/data/cutlass/include/cutlass/block_striped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,888 copying build/lib/flashinfer/data/cutlass/include/cutlass/core_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2025-10-10T01:29:29,891 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,892 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,895 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/permute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,903 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,906 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,909 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,912 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,915 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2025-10-10T01:29:29,919 copying build/lib/flashinfer/data/build_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2025-10-10T01:29:29,923 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc 2025-10-10T01:29:29,925 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,927 copying build/lib/flashinfer/data/csrc/rope.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,930 copying build/lib/flashinfer/data/csrc/batch_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,932 copying build/lib/flashinfer/data/csrc/tgv_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,934 copying build/lib/flashinfer/data/csrc/pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,936 copying build/lib/flashinfer/data/csrc/pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,939 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,941 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,944 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_routing_llama4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,947 copying build/lib/flashinfer/data/csrc/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,949 copying build/lib/flashinfer/data/csrc/gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,951 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,955 copying build/lib/flashinfer/data/csrc/norm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,957 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,959 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,962 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,964 copying build/lib/flashinfer/data/csrc/sampling.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,966 copying build/lib/flashinfer/data/csrc/single_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,968 copying build/lib/flashinfer/data/csrc/trtllm_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,971 copying build/lib/flashinfer/data/csrc/runtime_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,973 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,975 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,977 copying build/lib/flashinfer/data/csrc/batch_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,980 copying build/lib/flashinfer/data/csrc/single_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,982 copying build/lib/flashinfer/data/csrc/trtllm_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,984 copying build/lib/flashinfer/data/csrc/single_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,986 copying build/lib/flashinfer/data/csrc/batch_attention_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,988 copying build/lib/flashinfer/data/csrc/single_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,991 copying build/lib/flashinfer/data/csrc/batch_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,993 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_dev_kernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:29,996 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal 2025-10-10T01:29:29,998 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm 2025-10-10T01:29:30,000 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,001 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,003 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,006 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,008 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,010 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,013 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,016 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,018 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,021 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2025-10-10T01:29:30,023 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,024 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,027 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,029 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,033 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:30,035 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,037 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,039 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,041 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,043 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,045 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,047 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,053 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,055 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,057 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2025-10-10T01:29:30,060 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,062 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,064 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,066 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,068 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,070 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,072 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,074 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,077 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,079 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,081 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,084 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,089 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,091 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,093 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,095 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,097 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2025-10-10T01:29:30,100 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:30,103 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2025-10-10T01:29:30,104 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2025-10-10T01:29:30,107 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_stub.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2025-10-10T01:29:30,110 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,111 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2025-10-10T01:29:30,113 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2025-10-10T01:29:30,115 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2025-10-10T01:29:30,118 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,120 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,122 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,125 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,127 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,129 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,131 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,133 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,135 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,138 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,140 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,143 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,145 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,147 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,149 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,151 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,153 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,155 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,158 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,160 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2025-10-10T01:29:30,162 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:30,164 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:30,166 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:30,168 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:30,170 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:30,172 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:30,176 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2025-10-10T01:29:30,178 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2025-10-10T01:29:30,182 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2025-10-10T01:29:30,183 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2025-10-10T01:29:30,185 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2025-10-10T01:29:30,187 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,189 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,191 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2025-10-10T01:29:30,194 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,195 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,198 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,200 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,202 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,205 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,207 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2025-10-10T01:29:30,210 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions 2025-10-10T01:29:30,212 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include 2025-10-10T01:29:30,214 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,215 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,218 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform 2025-10-10T01:29:30,220 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2025-10-10T01:29:30,221 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2025-10-10T01:29:30,224 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,226 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,229 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm 2025-10-10T01:29:30,230 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:30,232 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:30,234 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:30,237 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2025-10-10T01:29:30,240 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,242 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,244 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,248 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,250 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,253 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,256 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,258 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,260 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,262 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,265 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,268 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,271 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2025-10-10T01:29:30,275 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,276 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,278 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,281 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,285 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,288 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:30,289 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:30,292 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:30,295 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2025-10-10T01:29:30,297 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,300 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,302 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,305 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,307 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,311 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2025-10-10T01:29:30,314 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,315 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,318 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,321 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,324 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,327 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,329 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,332 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,334 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,337 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,339 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,342 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,344 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2025-10-10T01:29:30,348 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail 2025-10-10T01:29:30,350 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2025-10-10T01:29:30,351 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2025-10-10T01:29:30,355 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2025-10-10T01:29:30,356 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2025-10-10T01:29:30,359 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,361 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,364 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,366 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2025-10-10T01:29:30,369 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication 2025-10-10T01:29:30,371 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2025-10-10T01:29:30,372 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2025-10-10T01:29:30,376 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue 2025-10-10T01:29:30,378 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2025-10-10T01:29:30,379 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2025-10-10T01:29:30,382 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2025-10-10T01:29:30,384 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2025-10-10T01:29:30,388 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2025-10-10T01:29:30,389 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2025-10-10T01:29:30,392 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:30,393 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:30,396 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:30,398 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:30,400 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:30,402 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2025-10-10T01:29:30,405 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include 2025-10-10T01:29:30,407 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm 2025-10-10T01:29:30,409 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,410 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,413 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,415 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,419 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,421 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,423 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,425 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,428 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,431 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,433 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2025-10-10T01:29:30,436 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp 2025-10-10T01:29:30,438 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:30,439 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:30,441 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:30,444 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:30,446 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:30,448 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2025-10-10T01:29:30,452 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/kernels 2025-10-10T01:29:30,453 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/kernels 2025-10-10T01:29:30,456 copying build/lib/flashinfer/data/csrc/cascade.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,459 copying build/lib/flashinfer/data/csrc/batch_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,462 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,464 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,465 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,468 copying build/lib/flashinfer/data/csrc/batch_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,470 copying build/lib/flashinfer/data/csrc/flashinfer_sampling_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,472 copying build/lib/flashinfer/data/csrc/batch_decode_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,474 copying build/lib/flashinfer/data/csrc/group_gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,476 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,478 copying build/lib/flashinfer/data/csrc/tvm_ffi_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,481 copying build/lib/flashinfer/data/csrc/batch_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,483 copying build/lib/flashinfer/data/csrc/group_gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,485 copying build/lib/flashinfer/data/csrc/page.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,487 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,489 copying build/lib/flashinfer/data/csrc/gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,492 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,494 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,496 copying build/lib/flashinfer/data/csrc/bmm_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,498 copying build/lib/flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,500 copying build/lib/flashinfer/data/csrc/pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,503 copying build/lib/flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,505 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,509 copying build/lib/flashinfer/data/csrc/single_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,511 copying build/lib/flashinfer/data/csrc/group_gemm_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,513 copying build/lib/flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,516 copying build/lib/flashinfer/data/csrc/single_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,518 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,520 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,523 copying build/lib/flashinfer/data/csrc/batch_decode_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,525 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,527 copying build/lib/flashinfer/data/csrc/single_prefill_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,529 copying build/lib/flashinfer/data/csrc/vllm_custom_all_reduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,532 copying build/lib/flashinfer/data/csrc/flashinfer_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,534 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,536 copying build/lib/flashinfer/data/csrc/fmhaReduction.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,539 copying build/lib/flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,541 copying build/lib/flashinfer/data/csrc/pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,543 copying build/lib/flashinfer/data/csrc/flashinfer_norm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,545 copying build/lib/flashinfer/data/csrc/group_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,548 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_routing_deepseek.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,551 copying build/lib/flashinfer/data/csrc/flashinfer_cascade_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,553 copying build/lib/flashinfer/data/csrc/trtllm_alltoall_prepare.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,556 copying build/lib/flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,559 copying build/lib/flashinfer/data/csrc/batch_decode_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,561 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,563 copying build/lib/flashinfer/data/csrc/batch_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,565 copying build/lib/flashinfer/data/csrc/trtllm_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,568 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,570 copying build/lib/flashinfer/data/csrc/batch_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,572 copying build/lib/flashinfer/data/csrc/logging.cc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,574 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,576 copying build/lib/flashinfer/data/csrc/blackwell_fmha_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,578 copying build/lib/flashinfer/data/csrc/batch_decode_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,580 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,583 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,585 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,588 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,590 copying build/lib/flashinfer/data/csrc/trtllm_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,592 copying build/lib/flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,595 copying build/lib/flashinfer/data/csrc/tgv_gemm.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,597 copying build/lib/flashinfer/data/csrc/flashinfer_page_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,599 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe 2025-10-10T01:29:30,601 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:30,602 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:30,605 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:30,613 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2025-10-10T01:29:30,617 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,620 copying build/lib/flashinfer/data/csrc/renorm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,622 copying build/lib/flashinfer/data/csrc/cutlass_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,624 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,626 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,628 copying build/lib/flashinfer/data/csrc/single_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,630 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,632 copying build/lib/flashinfer/data/csrc/flashinfer_quantization_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,634 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,637 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,639 copying build/lib/flashinfer/data/csrc/batch_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,641 copying build/lib/flashinfer/data/csrc/flashinfer_xqa_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,643 copying build/lib/flashinfer/data/csrc/nvshmem_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,646 copying build/lib/flashinfer/data/csrc/batch_attention_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,648 copying build/lib/flashinfer/data/csrc/flashinfer_rope_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,650 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,654 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,656 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,659 copying build/lib/flashinfer/data/csrc/batch_attention.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,661 copying build/lib/flashinfer/data/csrc/batch_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,663 copying build/lib/flashinfer/data/csrc/batch_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,666 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_routing_renormalize.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,669 copying build/lib/flashinfer/data/csrc/trtllm_batched_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,671 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,673 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,676 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,678 copying build/lib/flashinfer/data/csrc/single_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,681 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/xqa 2025-10-10T01:29:30,682 copying build/lib/flashinfer/data/csrc/xqa/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,684 copying build/lib/flashinfer/data/csrc/xqa/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,687 copying build/lib/flashinfer/data/csrc/xqa/cuda_hint.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,689 copying build/lib/flashinfer/data/csrc/xqa/xqa_wrapper.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,691 copying build/lib/flashinfer/data/csrc/xqa/specDec.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,693 copying build/lib/flashinfer/data/csrc/xqa/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,695 copying build/lib/flashinfer/data/csrc/xqa/mhaUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,698 copying build/lib/flashinfer/data/csrc/xqa/defines.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,700 copying build/lib/flashinfer/data/csrc/xqa/mha.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,703 copying build/lib/flashinfer/data/csrc/xqa/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,705 copying build/lib/flashinfer/data/csrc/xqa/hostUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,707 copying build/lib/flashinfer/data/csrc/xqa/barriers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,710 copying build/lib/flashinfer/data/csrc/xqa/ldgsts.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,712 copying build/lib/flashinfer/data/csrc/xqa/mha_components.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,714 copying build/lib/flashinfer/data/csrc/xqa/mha.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,719 copying build/lib/flashinfer/data/csrc/xqa/mha_stdheaders.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2025-10-10T01:29:30,722 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,724 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2025-10-10T01:29:30,728 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog 2025-10-10T01:29:30,730 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include 2025-10-10T01:29:30,731 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,733 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,735 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,737 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,739 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,742 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,744 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,746 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,748 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,750 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,752 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,754 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,756 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,758 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,761 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,763 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,765 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/null_mutex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,767 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,769 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,771 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/windows_include.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,773 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,775 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,777 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/console_globals.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,779 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,781 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,784 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/circular_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,786 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,788 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,790 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2025-10-10T01:29:30,792 copying build/lib/flashinfer/data/spdlog/include/spdlog/fwd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,795 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,797 copying build/lib/flashinfer/data/spdlog/include/spdlog/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,799 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,803 copying build/lib/flashinfer/data/spdlog/include/spdlog/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,804 copying build/lib/flashinfer/data/spdlog/include/spdlog/stopwatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,807 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,809 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,812 copying build/lib/flashinfer/data/spdlog/include/spdlog/common-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,814 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,817 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,818 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,820 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/fmt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,822 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,825 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,827 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,829 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ostr.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,831 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,833 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,835 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,838 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,840 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,846 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,849 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,852 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,854 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,857 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,860 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,863 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,866 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,870 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,875 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,877 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2025-10-10T01:29:30,883 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,885 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2025-10-10T01:29:30,887 copying build/lib/flashinfer/data/spdlog/include/spdlog/mdc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,889 copying build/lib/flashinfer/data/spdlog/include/spdlog/formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,891 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,894 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,895 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,898 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,900 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,902 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,904 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,907 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,909 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,911 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,913 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,916 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,918 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,920 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,922 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,924 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,926 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,928 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,930 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,932 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,935 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,937 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,939 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,941 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,944 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,946 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,948 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,951 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,953 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,955 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,958 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,960 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,962 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,965 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,967 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,969 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2025-10-10T01:29:30,971 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,973 copying build/lib/flashinfer/data/spdlog/include/spdlog/async.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,975 copying build/lib/flashinfer/data/spdlog/include/spdlog/tweakme.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,978 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2025-10-10T01:29:30,981 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:30,983 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/env.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:30,985 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/argv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:30,987 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:30,989 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2025-10-10T01:29:30,991 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/scripts 2025-10-10T01:29:30,993 copying build/lib/flashinfer/data/spdlog/scripts/extract_version.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/scripts 2025-10-10T01:29:30,996 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include 2025-10-10T01:29:30,998 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer 2025-10-10T01:29:30,999 copying build/lib/flashinfer/data/include/flashinfer/logging.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,001 copying build/lib/flashinfer/data/include/flashinfer/cp_async.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,003 copying build/lib/flashinfer/data/include/flashinfer/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,006 copying build/lib/flashinfer/data/include/flashinfer/arch_condition.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,008 copying build/lib/flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,011 copying build/lib/flashinfer/data/include/flashinfer/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,013 copying build/lib/flashinfer/data/include/flashinfer/vec_dtypes.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,017 copying build/lib/flashinfer/data/include/flashinfer/exception.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,020 copying build/lib/flashinfer/data/include/flashinfer/allocator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,022 copying build/lib/flashinfer/data/include/flashinfer/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,026 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,027 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,030 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,033 copying build/lib/flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,035 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,038 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,041 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,043 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,045 copying build/lib/flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,048 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,050 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,052 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,054 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,057 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,060 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,063 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,067 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,070 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,073 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,075 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,078 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,080 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,083 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2025-10-10T01:29:31,086 copying build/lib/flashinfer/data/include/flashinfer/pos_enc.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,090 copying build/lib/flashinfer/data/include/flashinfer/activation.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,092 copying build/lib/flashinfer/data/include/flashinfer/attention_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,094 copying build/lib/flashinfer/data/include/flashinfer/layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,097 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm 2025-10-10T01:29:31,098 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm 2025-10-10T01:29:31,102 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:31,103 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:31,106 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:31,109 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:31,111 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:31,114 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2025-10-10T01:29:31,116 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm 2025-10-10T01:29:31,118 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,120 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,123 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,126 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,129 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,133 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm 2025-10-10T01:29:31,134 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:31,136 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:31,138 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:31,141 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:31,143 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:31,146 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2025-10-10T01:29:31,148 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,151 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,154 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2025-10-10T01:29:31,157 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,159 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,161 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,164 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,166 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,170 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,172 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,174 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,177 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,179 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2025-10-10T01:29:31,182 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,184 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,186 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,189 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,192 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,195 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,198 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2025-10-10T01:29:31,201 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2025-10-10T01:29:31,203 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,205 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelTraits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,208 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/GemmGatedActOptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,210 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmEnums.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,212 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/GemmOptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,217 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmInterface.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,220 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,223 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/Enums.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,226 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmOptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,229 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm 2025-10-10T01:29:31,231 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:31,232 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/SfLayoutDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:31,235 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/MmaDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:31,237 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/CudaKernelLauncher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:31,239 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/DtypeDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:31,242 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/CommonUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen 2025-10-10T01:29:31,244 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelParamsDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,247 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/TmaDescriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export 2025-10-10T01:29:31,250 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm 2025-10-10T01:29:31,253 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,254 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,257 copying build/lib/flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,259 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,264 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,267 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,271 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,274 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2025-10-10T01:29:31,277 copying build/lib/flashinfer/data/include/flashinfer/cutlass_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,280 copying build/lib/flashinfer/data/include/flashinfer/fp4_layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,282 copying build/lib/flashinfer/data/include/flashinfer/fastdiv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,284 copying build/lib/flashinfer/data/include/flashinfer/cubin_loader.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,286 copying build/lib/flashinfer/data/include/flashinfer/norm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,289 copying build/lib/flashinfer/data/include/flashinfer/sampling.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,294 copying build/lib/flashinfer/data/include/flashinfer/permuted_smem.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,296 copying build/lib/flashinfer/data/include/flashinfer/profiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,298 copying build/lib/flashinfer/data/include/flashinfer/math.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,302 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,303 copying build/lib/flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,306 copying build/lib/flashinfer/data/include/flashinfer/attention/cascade.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,309 copying build/lib/flashinfer/data/include/flashinfer/attention/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,311 copying build/lib/flashinfer/data/include/flashinfer/attention/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,315 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent_template.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,318 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell 2025-10-10T01:29:31,319 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2025-10-10T01:29:31,322 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/common 2025-10-10T01:29:31,323 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/common 2025-10-10T01:29:31,325 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2025-10-10T01:29:31,329 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,330 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,332 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,334 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,339 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,342 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,345 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,347 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,350 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2025-10-10T01:29:31,353 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,354 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,356 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,360 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,363 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,365 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,367 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,370 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,373 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2025-10-10T01:29:31,376 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/device 2025-10-10T01:29:31,377 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2025-10-10T01:29:31,380 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2025-10-10T01:29:31,382 copying build/lib/flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,385 copying build/lib/flashinfer/data/include/flashinfer/attention/pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,388 copying build/lib/flashinfer/data/include/flashinfer/attention/mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,392 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,394 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,398 copying build/lib/flashinfer/data/include/flashinfer/attention/heap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,400 copying build/lib/flashinfer/data/include/flashinfer/attention/decode.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,403 copying build/lib/flashinfer/data/include/flashinfer/attention/mask.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,406 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,407 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,410 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,414 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,415 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,419 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,421 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,424 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,426 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,429 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2025-10-10T01:29:31,432 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,434 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,437 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,439 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,442 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,444 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,446 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,449 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,452 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/block_sparse_gather.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,454 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,457 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,459 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2025-10-10T01:29:31,462 copying build/lib/flashinfer/data/include/flashinfer/attention/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,464 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,468 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,470 copying build/lib/flashinfer/data/include/flashinfer/attention/state.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,472 copying build/lib/flashinfer/data/include/flashinfer/attention/default_decode_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,474 copying build/lib/flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,477 copying build/lib/flashinfer/data/include/flashinfer/attention/prefill.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2025-10-10T01:29:31,482 copying build/lib/flashinfer/data/include/flashinfer/fp16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,485 copying build/lib/flashinfer/data/include/flashinfer/page.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2025-10-10T01:29:31,488 copying build/lib/flashinfer/data/build_backend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2025-10-10T01:29:31,491 creating build/bdist.linux-armv7l/wheel/flashinfer/triton 2025-10-10T01:29:31,492 copying build/lib/flashinfer/triton/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,494 copying build/lib/flashinfer/triton/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,497 copying build/lib/flashinfer/triton/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,499 copying build/lib/flashinfer/triton/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,501 creating build/bdist.linux-armv7l/wheel/flashinfer/triton/kernels 2025-10-10T01:29:31,503 copying build/lib/flashinfer/triton/kernels/quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2025-10-10T01:29:31,505 copying build/lib/flashinfer/triton/kernels/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2025-10-10T01:29:31,507 copying build/lib/flashinfer/triton/kernels/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2025-10-10T01:29:31,510 copying build/lib/flashinfer/triton/kernels/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2025-10-10T01:29:31,512 copying build/lib/flashinfer/triton/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2025-10-10T01:29:31,513 copying build/lib/flashinfer/triton/kernels/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2025-10-10T01:29:31,516 copying build/lib/flashinfer/triton/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,518 copying build/lib/flashinfer/triton/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,520 copying build/lib/flashinfer/triton/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,522 copying build/lib/flashinfer/triton/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2025-10-10T01:29:31,525 copying build/lib/flashinfer/aot.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,528 copying build/lib/flashinfer/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,530 copying build/lib/flashinfer/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,533 copying build/lib/flashinfer/cuda_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,535 copying build/lib/flashinfer/sparse.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,539 copying build/lib/flashinfer/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,541 copying build/lib/flashinfer/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,545 creating build/bdist.linux-armv7l/wheel/flashinfer/profiler 2025-10-10T01:29:31,546 copying build/lib/flashinfer/profiler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/profiler 2025-10-10T01:29:31,548 copying build/lib/flashinfer/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,551 copying build/lib/flashinfer/version.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,553 copying build/lib/flashinfer/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,556 copying build/lib/flashinfer/pod.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,559 copying build/lib/flashinfer/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,562 creating build/bdist.linux-armv7l/wheel/flashinfer/logits_processor 2025-10-10T01:29:31,563 copying build/lib/flashinfer/logits_processor/operators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,566 copying build/lib/flashinfer/logits_processor/fusion_rules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,568 copying build/lib/flashinfer/logits_processor/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,570 copying build/lib/flashinfer/logits_processor/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,572 copying build/lib/flashinfer/logits_processor/legalization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,574 copying build/lib/flashinfer/logits_processor/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,576 copying build/lib/flashinfer/logits_processor/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,578 copying build/lib/flashinfer/logits_processor/validators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,580 copying build/lib/flashinfer/logits_processor/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,583 copying build/lib/flashinfer/logits_processor/processors.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2025-10-10T01:29:31,586 copying build/lib/flashinfer/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,588 copying build/lib/flashinfer/py.typed -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,590 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl 2025-10-10T01:29:31,591 copying build/lib/flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2025-10-10T01:29:31,596 copying build/lib/flashinfer/cute_dsl/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2025-10-10T01:29:31,598 copying build/lib/flashinfer/cute_dsl/blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2025-10-10T01:29:31,603 copying build/lib/flashinfer/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,607 creating build/bdist.linux-armv7l/wheel/flashinfer/cudnn 2025-10-10T01:29:31,608 copying build/lib/flashinfer/cudnn/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2025-10-10T01:29:31,610 copying build/lib/flashinfer/cudnn/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2025-10-10T01:29:31,612 copying build/lib/flashinfer/cudnn/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2025-10-10T01:29:31,615 copying build/lib/flashinfer/cudnn/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2025-10-10T01:29:31,618 copying build/lib/flashinfer/deep_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,621 copying build/lib/flashinfer/artifacts.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,625 creating build/bdist.linux-armv7l/wheel/flashinfer/comm 2025-10-10T01:29:31,626 copying build/lib/flashinfer/comm/vllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,628 copying build/lib/flashinfer/comm/nvshmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,630 copying build/lib/flashinfer/comm/trtllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,633 copying build/lib/flashinfer/comm/mapping.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,636 copying build/lib/flashinfer/comm/trtllm_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,639 copying build/lib/flashinfer/comm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,641 copying build/lib/flashinfer/comm/trtllm_mnnvl_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,643 copying build/lib/flashinfer/comm/cuda_ipc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,646 copying build/lib/flashinfer/comm/dlpack_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,648 copying build/lib/flashinfer/comm/mnnvl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,651 copying build/lib/flashinfer/comm/nvshmem_allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2025-10-10T01:29:31,654 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe 2025-10-10T01:29:31,656 copying build/lib/flashinfer/fused_moe/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2025-10-10T01:29:31,658 copying build/lib/flashinfer/fused_moe/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2025-10-10T01:29:31,660 copying build/lib/flashinfer/fused_moe/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2025-10-10T01:29:31,664 copying build/lib/flashinfer/attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,667 creating build/bdist.linux-armv7l/wheel/flashinfer/testing 2025-10-10T01:29:31,669 copying build/lib/flashinfer/testing/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2025-10-10T01:29:31,671 copying build/lib/flashinfer/testing/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2025-10-10T01:29:31,674 creating build/bdist.linux-armv7l/wheel/flashinfer/tuning_configs 2025-10-10T01:29:31,676 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2025-10-10T01:29:31,678 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2025-10-10T01:29:31,680 copying build/lib/flashinfer/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,686 copying build/lib/flashinfer/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,689 copying build/lib/flashinfer/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,693 copying build/lib/flashinfer/__main__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,696 copying build/lib/flashinfer/compilation_context.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,698 copying build/lib/flashinfer/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,702 copying build/lib/flashinfer/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2025-10-10T01:29:31,705 copying build/lib/build_utils.py -> build/bdist.linux-armv7l/wheel/. 2025-10-10T01:29:31,707 copying build/lib/build_backend.py -> build/bdist.linux-armv7l/wheel/. 2025-10-10T01:29:31,710 running install_egg_info 2025-10-10T01:29:31,724 running egg_info 2025-10-10T01:29:31,733 writing flashinfer_python.egg-info/PKG-INFO 2025-10-10T01:29:31,736 writing dependency_links to flashinfer_python.egg-info/dependency_links.txt 2025-10-10T01:29:31,738 writing entry points to flashinfer_python.egg-info/entry_points.txt 2025-10-10T01:29:31,740 writing requirements to flashinfer_python.egg-info/requires.txt 2025-10-10T01:29:31,741 writing top-level names to flashinfer_python.egg-info/top_level.txt 2025-10-10T01:29:32,400 reading manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2025-10-10T01:29:32,535 adding license file 'LICENSE' 2025-10-10T01:29:32,536 adding license file 'licenses/LICENSE.cutlass.txt' 2025-10-10T01:29:32,536 adding license file 'licenses/LICENSE.flashattention3.txt' 2025-10-10T01:29:32,537 adding license file 'licenses/LICENSE.fmt.txt' 2025-10-10T01:29:32,538 adding license file 'licenses/LICENSE.spdlog.txt' 2025-10-10T01:29:32,642 writing manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2025-10-10T01:29:32,648 Copying flashinfer_python.egg-info to build/bdist.linux-armv7l/wheel/./flashinfer_python-0.4.0-py3.11.egg-info 2025-10-10T01:29:32,665 running install_scripts 2025-10-10T01:29:32,686 creating build/bdist.linux-armv7l/wheel/flashinfer_python-0.4.0.dist-info/WHEEL 2025-10-10T01:29:32,689 creating '/tmp/pip-wheel-9_ljcydi/.tmp-s1o09azj/flashinfer_python-0.4.0-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2025-10-10T01:29:32,692 adding 'build_backend.py' 2025-10-10T01:29:32,693 adding 'build_utils.py' 2025-10-10T01:29:32,696 adding 'flashinfer/__init__.py' 2025-10-10T01:29:32,698 adding 'flashinfer/__main__.py' 2025-10-10T01:29:32,699 adding 'flashinfer/_build_meta.py' 2025-10-10T01:29:32,701 adding 'flashinfer/activation.py' 2025-10-10T01:29:32,704 adding 'flashinfer/aot.py' 2025-10-10T01:29:32,706 adding 'flashinfer/artifacts.py' 2025-10-10T01:29:32,708 adding 'flashinfer/attention.py' 2025-10-10T01:29:32,712 adding 'flashinfer/autotuner.py' 2025-10-10T01:29:32,716 adding 'flashinfer/cascade.py' 2025-10-10T01:29:32,717 adding 'flashinfer/compilation_context.py' 2025-10-10T01:29:32,719 adding 'flashinfer/cuda_utils.py' 2025-10-10T01:29:32,728 adding 'flashinfer/decode.py' 2025-10-10T01:29:32,733 adding 'flashinfer/deep_gemm.py' 2025-10-10T01:29:32,737 adding 'flashinfer/fp4_quantization.py' 2025-10-10T01:29:32,738 adding 'flashinfer/fp8_quantization.py' 2025-10-10T01:29:32,747 adding 'flashinfer/gemm.py' 2025-10-10T01:29:32,750 adding 'flashinfer/green_ctx.py' 2025-10-10T01:29:32,752 adding 'flashinfer/mla.py' 2025-10-10T01:29:32,754 adding 'flashinfer/norm.py' 2025-10-10T01:29:32,756 adding 'flashinfer/page.py' 2025-10-10T01:29:32,759 adding 'flashinfer/pod.py' 2025-10-10T01:29:32,772 adding 'flashinfer/prefill.py' 2025-10-10T01:29:32,774 adding 'flashinfer/py.typed' 2025-10-10T01:29:32,776 adding 'flashinfer/quantization.py' 2025-10-10T01:29:32,779 adding 'flashinfer/rope.py' 2025-10-10T01:29:32,784 adding 'flashinfer/sampling.py' 2025-10-10T01:29:32,788 adding 'flashinfer/sparse.py' 2025-10-10T01:29:32,790 adding 'flashinfer/tllm_utils.py' 2025-10-10T01:29:32,793 adding 'flashinfer/utils.py' 2025-10-10T01:29:32,794 adding 'flashinfer/version.py' 2025-10-10T01:29:32,796 adding 'flashinfer/xqa.py' 2025-10-10T01:29:32,798 adding 'flashinfer/comm/__init__.py' 2025-10-10T01:29:32,800 adding 'flashinfer/comm/cuda_ipc.py' 2025-10-10T01:29:32,802 adding 'flashinfer/comm/dlpack_utils.py' 2025-10-10T01:29:32,804 adding 'flashinfer/comm/mapping.py' 2025-10-10T01:29:32,808 adding 'flashinfer/comm/mnnvl.py' 2025-10-10T01:29:32,810 adding 'flashinfer/comm/nvshmem.py' 2025-10-10T01:29:32,812 adding 'flashinfer/comm/nvshmem_allreduce.py' 2025-10-10T01:29:32,814 adding 'flashinfer/comm/trtllm_alltoall.py' 2025-10-10T01:29:32,818 adding 'flashinfer/comm/trtllm_ar.py' 2025-10-10T01:29:32,820 adding 'flashinfer/comm/trtllm_mnnvl_ar.py' 2025-10-10T01:29:32,821 adding 'flashinfer/comm/vllm_ar.py' 2025-10-10T01:29:32,823 adding 'flashinfer/cudnn/__init__.py' 2025-10-10T01:29:32,825 adding 'flashinfer/cudnn/decode.py' 2025-10-10T01:29:32,828 adding 'flashinfer/cudnn/prefill.py' 2025-10-10T01:29:32,829 adding 'flashinfer/cudnn/utils.py' 2025-10-10T01:29:32,841 adding 'flashinfer/cute_dsl/blockscaled_gemm.py' 2025-10-10T01:29:32,849 adding 'flashinfer/cute_dsl/gemm_allreduce_two_shot.py' 2025-10-10T01:29:32,851 adding 'flashinfer/cute_dsl/utils.py' 2025-10-10T01:29:32,853 adding 'flashinfer/data/build_backend.py' 2025-10-10T01:29:32,855 adding 'flashinfer/data/build_utils.py' 2025-10-10T01:29:32,859 adding 'flashinfer/data/csrc/batch_attention.cu' 2025-10-10T01:29:32,861 adding 'flashinfer/data/csrc/batch_attention_customize_config.jinja' 2025-10-10T01:29:32,862 adding 'flashinfer/data/csrc/batch_attention_jit_binding.cu' 2025-10-10T01:29:32,863 adding 'flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja' 2025-10-10T01:29:32,865 adding 'flashinfer/data/csrc/batch_decode.cu' 2025-10-10T01:29:32,866 adding 'flashinfer/data/csrc/batch_decode_customize_config.jinja' 2025-10-10T01:29:32,868 adding 'flashinfer/data/csrc/batch_decode_jit_binding.cu' 2025-10-10T01:29:32,869 adding 'flashinfer/data/csrc/batch_decode_kernel_inst.jinja' 2025-10-10T01:29:32,870 adding 'flashinfer/data/csrc/batch_decode_mla_binding.cu' 2025-10-10T01:29:32,872 adding 'flashinfer/data/csrc/batch_decode_mla_config.jinja' 2025-10-10T01:29:32,873 adding 'flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu' 2025-10-10T01:29:32,875 adding 'flashinfer/data/csrc/batch_decode_mla_plan.cu' 2025-10-10T01:29:32,876 adding 'flashinfer/data/csrc/batch_decode_mla_run.cu' 2025-10-10T01:29:32,877 adding 'flashinfer/data/csrc/batch_mla_binding.cu' 2025-10-10T01:29:32,879 adding 'flashinfer/data/csrc/batch_mla_config.jinja' 2025-10-10T01:29:32,880 adding 'flashinfer/data/csrc/batch_mla_plan.cu' 2025-10-10T01:29:32,882 adding 'flashinfer/data/csrc/batch_mla_run.cu' 2025-10-10T01:29:32,883 adding 'flashinfer/data/csrc/batch_mla_sm90_binding.cu' 2025-10-10T01:29:32,884 adding 'flashinfer/data/csrc/batch_mla_sm90_plan.cu' 2025-10-10T01:29:32,886 adding 'flashinfer/data/csrc/batch_mla_sm90_run.cu' 2025-10-10T01:29:32,888 adding 'flashinfer/data/csrc/batch_prefill.cu' 2025-10-10T01:29:32,889 adding 'flashinfer/data/csrc/batch_prefill_customize_config.jinja' 2025-10-10T01:29:32,891 adding 'flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja' 2025-10-10T01:29:32,892 adding 'flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja' 2025-10-10T01:29:32,894 adding 'flashinfer/data/csrc/batch_prefill_fp8_sm90.cu' 2025-10-10T01:29:32,895 adding 'flashinfer/data/csrc/batch_prefill_jit_binding.cu' 2025-10-10T01:29:32,896 adding 'flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja' 2025-10-10T01:29:32,898 adding 'flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja' 2025-10-10T01:29:32,899 adding 'flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja' 2025-10-10T01:29:32,900 adding 'flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja' 2025-10-10T01:29:32,902 adding 'flashinfer/data/csrc/batch_prefill_sm90.cu' 2025-10-10T01:29:32,904 adding 'flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja' 2025-10-10T01:29:32,905 adding 'flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu' 2025-10-10T01:29:32,906 adding 'flashinfer/data/csrc/blackwell_fmha_plan.cu' 2025-10-10T01:29:32,908 adding 'flashinfer/data/csrc/bmm_fp8.cu' 2025-10-10T01:29:32,910 adding 'flashinfer/data/csrc/cascade.cu' 2025-10-10T01:29:32,914 adding 'flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu' 2025-10-10T01:29:32,917 adding 'flashinfer/data/csrc/cudnn_sdpa_utils.h' 2025-10-10T01:29:32,918 adding 'flashinfer/data/csrc/cutlass_mla.cu' 2025-10-10T01:29:32,920 adding 'flashinfer/data/csrc/flashinfer_cascade_binding.cu' 2025-10-10T01:29:32,921 adding 'flashinfer/data/csrc/flashinfer_gemm_binding.cu' 2025-10-10T01:29:32,923 adding 'flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu' 2025-10-10T01:29:32,924 adding 'flashinfer/data/csrc/flashinfer_mla_binding.cu' 2025-10-10T01:29:32,925 adding 'flashinfer/data/csrc/flashinfer_norm_binding.cu' 2025-10-10T01:29:32,927 adding 'flashinfer/data/csrc/flashinfer_page_binding.cu' 2025-10-10T01:29:32,928 adding 'flashinfer/data/csrc/flashinfer_quantization_binding.cu' 2025-10-10T01:29:32,929 adding 'flashinfer/data/csrc/flashinfer_rope_binding.cu' 2025-10-10T01:29:32,930 adding 'flashinfer/data/csrc/flashinfer_sampling_binding.cu' 2025-10-10T01:29:32,932 adding 'flashinfer/data/csrc/flashinfer_xqa_binding.cu' 2025-10-10T01:29:32,934 adding 'flashinfer/data/csrc/fmhaReduction.cu' 2025-10-10T01:29:32,936 adding 'flashinfer/data/csrc/fmha_cutlass_sm100.cu' 2025-10-10T01:29:32,937 adding 'flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu' 2025-10-10T01:29:32,939 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.cu' 2025-10-10T01:29:32,940 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.jinja' 2025-10-10T01:29:32,942 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu' 2025-10-10T01:29:32,943 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja' 2025-10-10T01:29:32,945 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.cu' 2025-10-10T01:29:32,946 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.jinja' 2025-10-10T01:29:32,948 adding 'flashinfer/data/csrc/gemm_groupwise_sm100.cu' 2025-10-10T01:29:32,949 adding 'flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja' 2025-10-10T01:29:32,951 adding 'flashinfer/data/csrc/gemm_groupwise_sm120.cu' 2025-10-10T01:29:32,952 adding 'flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja' 2025-10-10T01:29:32,954 adding 'flashinfer/data/csrc/gemm_sm100_binding.cu' 2025-10-10T01:29:32,955 adding 'flashinfer/data/csrc/gemm_sm120_binding.cu' 2025-10-10T01:29:32,956 adding 'flashinfer/data/csrc/group_gemm.cu' 2025-10-10T01:29:32,958 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu' 2025-10-10T01:29:32,959 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja' 2025-10-10T01:29:32,961 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu' 2025-10-10T01:29:32,962 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja' 2025-10-10T01:29:32,964 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu' 2025-10-10T01:29:32,966 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja' 2025-10-10T01:29:32,967 adding 'flashinfer/data/csrc/group_gemm_sm100_binding.cu' 2025-10-10T01:29:32,968 adding 'flashinfer/data/csrc/group_gemm_sm120_binding.cu' 2025-10-10T01:29:32,970 adding 'flashinfer/data/csrc/group_gemm_sm90.cu' 2025-10-10T01:29:32,971 adding 'flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja' 2025-10-10T01:29:32,973 adding 'flashinfer/data/csrc/logging.cc' 2025-10-10T01:29:32,974 adding 'flashinfer/data/csrc/norm.cu' 2025-10-10T01:29:32,976 adding 'flashinfer/data/csrc/nvshmem_binding.cu' 2025-10-10T01:29:32,978 adding 'flashinfer/data/csrc/page.cu' 2025-10-10T01:29:32,980 adding 'flashinfer/data/csrc/pod.cu' 2025-10-10T01:29:32,981 adding 'flashinfer/data/csrc/pod_customize_config.jinja' 2025-10-10T01:29:32,983 adding 'flashinfer/data/csrc/pod_jit_binding.cu' 2025-10-10T01:29:32,984 adding 'flashinfer/data/csrc/pod_kernel_inst.jinja' 2025-10-10T01:29:32,985 adding 'flashinfer/data/csrc/quantization.cu' 2025-10-10T01:29:32,987 adding 'flashinfer/data/csrc/renorm.cu' 2025-10-10T01:29:32,989 adding 'flashinfer/data/csrc/rope.cu' 2025-10-10T01:29:32,990 adding 'flashinfer/data/csrc/runtime_utils.h' 2025-10-10T01:29:32,992 adding 'flashinfer/data/csrc/sampling.cu' 2025-10-10T01:29:32,993 adding 'flashinfer/data/csrc/single_decode.cu' 2025-10-10T01:29:32,995 adding 'flashinfer/data/csrc/single_decode_customize_config.jinja' 2025-10-10T01:29:32,996 adding 'flashinfer/data/csrc/single_decode_jit_binding.cu' 2025-10-10T01:29:32,998 adding 'flashinfer/data/csrc/single_decode_kernel_inst.jinja' 2025-10-10T01:29:32,999 adding 'flashinfer/data/csrc/single_prefill.cu' 2025-10-10T01:29:33,001 adding 'flashinfer/data/csrc/single_prefill_customize_config.jinja' 2025-10-10T01:29:33,002 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90.cu' 2025-10-10T01:29:33,003 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja' 2025-10-10T01:29:33,005 adding 'flashinfer/data/csrc/single_prefill_jit_binding.cu' 2025-10-10T01:29:33,006 adding 'flashinfer/data/csrc/single_prefill_kernel_inst.jinja' 2025-10-10T01:29:33,007 adding 'flashinfer/data/csrc/single_prefill_sm90.cu' 2025-10-10T01:29:33,008 adding 'flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja' 2025-10-10T01:29:33,010 adding 'flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu' 2025-10-10T01:29:33,011 adding 'flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja' 2025-10-10T01:29:33,012 adding 'flashinfer/data/csrc/tgv_gemm.cu' 2025-10-10T01:29:33,014 adding 'flashinfer/data/csrc/tgv_gemm.jinja' 2025-10-10T01:29:33,015 adding 'flashinfer/data/csrc/trtllm_allreduce.cu' 2025-10-10T01:29:33,017 adding 'flashinfer/data/csrc/trtllm_allreduce_fusion.cu' 2025-10-10T01:29:33,019 adding 'flashinfer/data/csrc/trtllm_alltoall.cu' 2025-10-10T01:29:33,022 adding 'flashinfer/data/csrc/trtllm_alltoall_prepare.cu' 2025-10-10T01:29:33,024 adding 'flashinfer/data/csrc/trtllm_batched_gemm_runner.cu' 2025-10-10T01:29:33,027 adding 'flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu' 2025-10-10T01:29:33,030 adding 'flashinfer/data/csrc/trtllm_fused_moe_dev_kernel.cu' 2025-10-10T01:29:33,035 adding 'flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu' 2025-10-10T01:29:33,039 adding 'flashinfer/data/csrc/trtllm_fused_moe_routing_deepseek.cu' 2025-10-10T01:29:33,041 adding 'flashinfer/data/csrc/trtllm_fused_moe_routing_llama4.cu' 2025-10-10T01:29:33,044 adding 'flashinfer/data/csrc/trtllm_fused_moe_routing_renormalize.cu' 2025-10-10T01:29:33,047 adding 'flashinfer/data/csrc/trtllm_fused_moe_runner.cu' 2025-10-10T01:29:33,049 adding 'flashinfer/data/csrc/trtllm_gemm_runner.cu' 2025-10-10T01:29:33,051 adding 'flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu' 2025-10-10T01:29:33,052 adding 'flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu' 2025-10-10T01:29:33,054 adding 'flashinfer/data/csrc/tvm_ffi_utils.h' 2025-10-10T01:29:33,056 adding 'flashinfer/data/csrc/vllm_custom_all_reduce.cu' 2025-10-10T01:29:33,058 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu' 2025-10-10T01:29:33,082 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh' 2025-10-10T01:29:33,088 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_sm100_binding.cu' 2025-10-10T01:29:33,092 adding 'flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp' 2025-10-10T01:29:33,093 adding 'flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp' 2025-10-10T01:29:33,097 adding 'flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu' 2025-10-10T01:29:33,099 adding 'flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp' 2025-10-10T01:29:33,100 adding 'flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp' 2025-10-10T01:29:33,103 adding 'flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu' 2025-10-10T01:29:33,106 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h' 2025-10-10T01:29:33,108 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h' 2025-10-10T01:29:33,109 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h' 2025-10-10T01:29:33,111 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h' 2025-10-10T01:29:33,115 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h' 2025-10-10T01:29:33,117 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h' 2025-10-10T01:29:33,119 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h' 2025-10-10T01:29:33,120 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h' 2025-10-10T01:29:33,122 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h' 2025-10-10T01:29:33,123 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h' 2025-10-10T01:29:33,126 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h' 2025-10-10T01:29:33,128 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh' 2025-10-10T01:29:33,129 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h' 2025-10-10T01:29:33,131 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh' 2025-10-10T01:29:33,133 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h' 2025-10-10T01:29:33,135 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h' 2025-10-10T01:29:33,136 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh' 2025-10-10T01:29:33,138 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh' 2025-10-10T01:29:33,140 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h' 2025-10-10T01:29:33,143 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h' 2025-10-10T01:29:33,144 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h' 2025-10-10T01:29:33,146 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h' 2025-10-10T01:29:33,148 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h' 2025-10-10T01:29:33,150 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h' 2025-10-10T01:29:33,151 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h' 2025-10-10T01:29:33,153 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h' 2025-10-10T01:29:33,155 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp' 2025-10-10T01:29:33,157 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp' 2025-10-10T01:29:33,158 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp' 2025-10-10T01:29:33,160 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h' 2025-10-10T01:29:33,161 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h' 2025-10-10T01:29:33,165 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp' 2025-10-10T01:29:33,169 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp' 2025-10-10T01:29:33,173 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp' 2025-10-10T01:29:33,176 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp' 2025-10-10T01:29:33,178 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h' 2025-10-10T01:29:33,181 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp' 2025-10-10T01:29:33,182 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp' 2025-10-10T01:29:33,184 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp' 2025-10-10T01:29:33,185 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp' 2025-10-10T01:29:33,186 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp' 2025-10-10T01:29:33,187 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp' 2025-10-10T01:29:33,195 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp' 2025-10-10T01:29:33,198 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp' 2025-10-10T01:29:33,202 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp' 2025-10-10T01:29:33,209 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2025-10-10T01:29:33,212 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl' 2025-10-10T01:29:33,214 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl' 2025-10-10T01:29:33,216 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl' 2025-10-10T01:29:33,218 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h' 2025-10-10T01:29:33,220 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh' 2025-10-10T01:29:33,223 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh' 2025-10-10T01:29:33,225 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh' 2025-10-10T01:29:33,227 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h' 2025-10-10T01:29:33,228 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp' 2025-10-10T01:29:33,230 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h' 2025-10-10T01:29:33,232 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh' 2025-10-10T01:29:33,235 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h' 2025-10-10T01:29:33,237 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h' 2025-10-10T01:29:33,240 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp' 2025-10-10T01:29:33,244 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp' 2025-10-10T01:29:33,246 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h' 2025-10-10T01:29:33,248 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h' 2025-10-10T01:29:33,250 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h' 2025-10-10T01:29:33,252 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h' 2025-10-10T01:29:33,254 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h' 2025-10-10T01:29:33,255 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h' 2025-10-10T01:29:33,257 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h' 2025-10-10T01:29:33,260 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h' 2025-10-10T01:29:33,263 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h' 2025-10-10T01:29:33,265 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h' 2025-10-10T01:29:33,267 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h' 2025-10-10T01:29:33,270 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h' 2025-10-10T01:29:33,272 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h' 2025-10-10T01:29:33,274 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h' 2025-10-10T01:29:33,276 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h' 2025-10-10T01:29:33,279 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h' 2025-10-10T01:29:33,281 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp' 2025-10-10T01:29:33,283 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu' 2025-10-10T01:29:33,285 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h' 2025-10-10T01:29:33,286 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu' 2025-10-10T01:29:33,288 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h' 2025-10-10T01:29:33,292 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh' 2025-10-10T01:29:33,293 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h' 2025-10-10T01:29:33,297 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp' 2025-10-10T01:29:33,299 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h' 2025-10-10T01:29:33,300 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h' 2025-10-10T01:29:33,302 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h' 2025-10-10T01:29:33,304 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_stub.cu' 2025-10-10T01:29:33,306 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu' 2025-10-10T01:29:33,307 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu' 2025-10-10T01:29:33,309 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu' 2025-10-10T01:29:33,310 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu' 2025-10-10T01:29:33,311 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu' 2025-10-10T01:29:33,313 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu' 2025-10-10T01:29:33,314 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu' 2025-10-10T01:29:33,315 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu' 2025-10-10T01:29:33,317 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu' 2025-10-10T01:29:33,318 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu' 2025-10-10T01:29:33,319 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu' 2025-10-10T01:29:33,321 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu' 2025-10-10T01:29:33,322 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu' 2025-10-10T01:29:33,323 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu' 2025-10-10T01:29:33,325 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu' 2025-10-10T01:29:33,326 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu' 2025-10-10T01:29:33,327 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu' 2025-10-10T01:29:33,329 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h' 2025-10-10T01:29:33,332 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h' 2025-10-10T01:29:33,334 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h' 2025-10-10T01:29:33,336 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h' 2025-10-10T01:29:33,339 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl' 2025-10-10T01:29:33,341 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h' 2025-10-10T01:29:33,342 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h' 2025-10-10T01:29:33,344 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h' 2025-10-10T01:29:33,349 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h' 2025-10-10T01:29:33,351 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h' 2025-10-10T01:29:33,353 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu' 2025-10-10T01:29:33,355 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu' 2025-10-10T01:29:33,356 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu' 2025-10-10T01:29:33,357 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu' 2025-10-10T01:29:33,359 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu' 2025-10-10T01:29:33,360 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu' 2025-10-10T01:29:33,361 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu' 2025-10-10T01:29:33,362 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu' 2025-10-10T01:29:33,364 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu' 2025-10-10T01:29:33,365 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu' 2025-10-10T01:29:33,366 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu' 2025-10-10T01:29:33,368 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu' 2025-10-10T01:29:33,369 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu' 2025-10-10T01:29:33,370 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu' 2025-10-10T01:29:33,375 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h' 2025-10-10T01:29:33,377 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h' 2025-10-10T01:29:33,379 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h' 2025-10-10T01:29:33,381 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu' 2025-10-10T01:29:33,383 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h' 2025-10-10T01:29:33,385 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h' 2025-10-10T01:29:33,386 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl' 2025-10-10T01:29:33,388 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h' 2025-10-10T01:29:33,394 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl' 2025-10-10T01:29:33,396 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h' 2025-10-10T01:29:33,398 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl' 2025-10-10T01:29:33,400 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp' 2025-10-10T01:29:33,401 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h' 2025-10-10T01:29:33,404 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp' 2025-10-10T01:29:33,406 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp' 2025-10-10T01:29:33,408 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h' 2025-10-10T01:29:33,409 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp' 2025-10-10T01:29:33,413 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h' 2025-10-10T01:29:33,414 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h' 2025-10-10T01:29:33,417 adding 'flashinfer/data/csrc/xqa/barriers.cuh' 2025-10-10T01:29:33,418 adding 'flashinfer/data/csrc/xqa/cuda_hint.cuh' 2025-10-10T01:29:33,419 adding 'flashinfer/data/csrc/xqa/defines.h' 2025-10-10T01:29:33,421 adding 'flashinfer/data/csrc/xqa/hostUtils.h' 2025-10-10T01:29:33,422 adding 'flashinfer/data/csrc/xqa/ldgsts.cuh' 2025-10-10T01:29:33,435 adding 'flashinfer/data/csrc/xqa/mha.cu' 2025-10-10T01:29:33,437 adding 'flashinfer/data/csrc/xqa/mha.h' 2025-10-10T01:29:33,439 adding 'flashinfer/data/csrc/xqa/mhaUtils.cuh' 2025-10-10T01:29:33,441 adding 'flashinfer/data/csrc/xqa/mha_components.cuh' 2025-10-10T01:29:33,444 adding 'flashinfer/data/csrc/xqa/mha_stdheaders.cuh' 2025-10-10T01:29:33,446 adding 'flashinfer/data/csrc/xqa/mma.cuh' 2025-10-10T01:29:33,447 adding 'flashinfer/data/csrc/xqa/platform.h' 2025-10-10T01:29:33,448 adding 'flashinfer/data/csrc/xqa/specDec.h' 2025-10-10T01:29:33,451 adding 'flashinfer/data/csrc/xqa/utils.cuh' 2025-10-10T01:29:33,453 adding 'flashinfer/data/csrc/xqa/utils.h' 2025-10-10T01:29:33,455 adding 'flashinfer/data/csrc/xqa/xqa_wrapper.cu' 2025-10-10T01:29:33,458 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py' 2025-10-10T01:29:33,460 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py' 2025-10-10T01:29:33,461 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py' 2025-10-10T01:29:33,464 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py' 2025-10-10T01:29:33,466 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py' 2025-10-10T01:29:33,468 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py' 2025-10-10T01:29:33,471 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py' 2025-10-10T01:29:33,473 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py' 2025-10-10T01:29:33,475 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py' 2025-10-10T01:29:33,477 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py' 2025-10-10T01:29:33,479 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py' 2025-10-10T01:29:33,481 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py' 2025-10-10T01:29:33,483 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py' 2025-10-10T01:29:33,486 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py' 2025-10-10T01:29:33,488 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py' 2025-10-10T01:29:33,492 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py' 2025-10-10T01:29:33,495 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py' 2025-10-10T01:29:33,497 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py' 2025-10-10T01:29:33,498 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py' 2025-10-10T01:29:33,500 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py' 2025-10-10T01:29:33,503 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py' 2025-10-10T01:29:33,505 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py' 2025-10-10T01:29:33,507 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py' 2025-10-10T01:29:33,509 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py' 2025-10-10T01:29:33,511 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py' 2025-10-10T01:29:33,517 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py' 2025-10-10T01:29:33,521 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py' 2025-10-10T01:29:33,523 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py' 2025-10-10T01:29:33,528 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py' 2025-10-10T01:29:33,539 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py' 2025-10-10T01:29:33,547 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py' 2025-10-10T01:29:33,556 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py' 2025-10-10T01:29:33,563 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py' 2025-10-10T01:29:33,576 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py' 2025-10-10T01:29:33,587 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py' 2025-10-10T01:29:33,600 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py' 2025-10-10T01:29:33,603 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py' 2025-10-10T01:29:33,605 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py' 2025-10-10T01:29:33,608 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py' 2025-10-10T01:29:33,615 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py' 2025-10-10T01:29:33,618 adding 'flashinfer/data/cutlass/include/cute/config.hpp' 2025-10-10T01:29:33,621 adding 'flashinfer/data/cutlass/include/cute/int_tuple.hpp' 2025-10-10T01:29:33,628 adding 'flashinfer/data/cutlass/include/cute/layout.hpp' 2025-10-10T01:29:33,630 adding 'flashinfer/data/cutlass/include/cute/layout_composed.hpp' 2025-10-10T01:29:33,632 adding 'flashinfer/data/cutlass/include/cute/pointer.hpp' 2025-10-10T01:29:33,634 adding 'flashinfer/data/cutlass/include/cute/pointer_base.hpp' 2025-10-10T01:29:33,636 adding 'flashinfer/data/cutlass/include/cute/pointer_flagged.hpp' 2025-10-10T01:29:33,638 adding 'flashinfer/data/cutlass/include/cute/pointer_sparse.hpp' 2025-10-10T01:29:33,640 adding 'flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp' 2025-10-10T01:29:33,642 adding 'flashinfer/data/cutlass/include/cute/stride.hpp' 2025-10-10T01:29:33,645 adding 'flashinfer/data/cutlass/include/cute/swizzle.hpp' 2025-10-10T01:29:33,648 adding 'flashinfer/data/cutlass/include/cute/swizzle_layout.hpp' 2025-10-10T01:29:33,649 adding 'flashinfer/data/cutlass/include/cute/tensor.hpp' 2025-10-10T01:29:33,653 adding 'flashinfer/data/cutlass/include/cute/tensor_impl.hpp' 2025-10-10T01:29:33,655 adding 'flashinfer/data/cutlass/include/cute/tensor_zip.hpp' 2025-10-10T01:29:33,657 adding 'flashinfer/data/cutlass/include/cute/underscore.hpp' 2025-10-10T01:29:33,659 adding 'flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp' 2025-10-10T01:29:33,661 adding 'flashinfer/data/cutlass/include/cute/algorithm/clear.hpp' 2025-10-10T01:29:33,663 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp' 2025-10-10T01:29:33,665 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp' 2025-10-10T01:29:33,668 adding 'flashinfer/data/cutlass/include/cute/algorithm/copy.hpp' 2025-10-10T01:29:33,670 adding 'flashinfer/data/cutlass/include/cute/algorithm/fill.hpp' 2025-10-10T01:29:33,672 adding 'flashinfer/data/cutlass/include/cute/algorithm/functional.hpp' 2025-10-10T01:29:33,674 adding 'flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp' 2025-10-10T01:29:33,675 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp' 2025-10-10T01:29:33,677 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp' 2025-10-10T01:29:33,678 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp' 2025-10-10T01:29:33,680 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp' 2025-10-10T01:29:33,682 adding 'flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp' 2025-10-10T01:29:33,685 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp' 2025-10-10T01:29:33,687 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp' 2025-10-10T01:29:33,688 adding 'flashinfer/data/cutlass/include/cute/arch/config.hpp' 2025-10-10T01:29:33,690 adding 'flashinfer/data/cutlass/include/cute/arch/copy.hpp' 2025-10-10T01:29:33,706 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp' 2025-10-10T01:29:33,710 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp' 2025-10-10T01:29:33,712 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp' 2025-10-10T01:29:33,714 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp' 2025-10-10T01:29:33,715 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp' 2025-10-10T01:29:33,717 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp' 2025-10-10T01:29:33,719 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp' 2025-10-10T01:29:33,722 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp' 2025-10-10T01:29:33,724 adding 'flashinfer/data/cutlass/include/cute/arch/mma.hpp' 2025-10-10T01:29:33,725 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp' 2025-10-10T01:29:33,728 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp' 2025-10-10T01:29:33,732 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp' 2025-10-10T01:29:33,737 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp' 2025-10-10T01:29:33,742 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp' 2025-10-10T01:29:33,744 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp' 2025-10-10T01:29:33,746 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp' 2025-10-10T01:29:33,748 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp' 2025-10-10T01:29:33,751 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp' 2025-10-10T01:29:33,753 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp' 2025-10-10T01:29:33,768 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp' 2025-10-10T01:29:33,772 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp' 2025-10-10T01:29:33,819 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp' 2025-10-10T01:29:33,995 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp' 2025-10-10T01:29:34,073 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp' 2025-10-10T01:29:34,225 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp' 2025-10-10T01:29:34,244 adding 'flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp' 2025-10-10T01:29:34,246 adding 'flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp' 2025-10-10T01:29:34,248 adding 'flashinfer/data/cutlass/include/cute/arch/util.hpp' 2025-10-10T01:29:34,252 adding 'flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp' 2025-10-10T01:29:34,255 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp' 2025-10-10T01:29:34,263 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp' 2025-10-10T01:29:34,266 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp' 2025-10-10T01:29:34,268 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp' 2025-10-10T01:29:34,270 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp' 2025-10-10T01:29:34,272 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp' 2025-10-10T01:29:34,273 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp' 2025-10-10T01:29:34,274 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp' 2025-10-10T01:29:34,278 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp' 2025-10-10T01:29:34,286 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp' 2025-10-10T01:29:34,288 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp' 2025-10-10T01:29:34,290 adding 'flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp' 2025-10-10T01:29:34,292 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp' 2025-10-10T01:29:34,302 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp' 2025-10-10T01:29:34,305 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp' 2025-10-10T01:29:34,307 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp' 2025-10-10T01:29:34,308 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp' 2025-10-10T01:29:34,310 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp' 2025-10-10T01:29:34,311 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp' 2025-10-10T01:29:34,313 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp' 2025-10-10T01:29:34,315 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp' 2025-10-10T01:29:34,316 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp' 2025-10-10T01:29:34,330 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp' 2025-10-10T01:29:34,356 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp' 2025-10-10T01:29:34,371 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp' 2025-10-10T01:29:34,407 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp' 2025-10-10T01:29:34,413 adding 'flashinfer/data/cutlass/include/cute/atom/partitioner.hpp' 2025-10-10T01:29:34,415 adding 'flashinfer/data/cutlass/include/cute/container/alignment.hpp' 2025-10-10T01:29:34,417 adding 'flashinfer/data/cutlass/include/cute/container/array.hpp' 2025-10-10T01:29:34,418 adding 'flashinfer/data/cutlass/include/cute/container/array_aligned.hpp' 2025-10-10T01:29:34,421 adding 'flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp' 2025-10-10T01:29:34,423 adding 'flashinfer/data/cutlass/include/cute/container/bit_field.hpp' 2025-10-10T01:29:34,424 adding 'flashinfer/data/cutlass/include/cute/container/cuda_types.hpp' 2025-10-10T01:29:34,427 adding 'flashinfer/data/cutlass/include/cute/container/tuple.hpp' 2025-10-10T01:29:34,428 adding 'flashinfer/data/cutlass/include/cute/container/type_list.hpp' 2025-10-10T01:29:34,431 adding 'flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp' 2025-10-10T01:29:34,433 adding 'flashinfer/data/cutlass/include/cute/numeric/complex.hpp' 2025-10-10T01:29:34,434 adding 'flashinfer/data/cutlass/include/cute/numeric/int.hpp' 2025-10-10T01:29:34,436 adding 'flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp' 2025-10-10T01:29:34,439 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp' 2025-10-10T01:29:34,441 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp' 2025-10-10T01:29:34,443 adding 'flashinfer/data/cutlass/include/cute/numeric/math.hpp' 2025-10-10T01:29:34,445 adding 'flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp' 2025-10-10T01:29:34,446 adding 'flashinfer/data/cutlass/include/cute/numeric/real.hpp' 2025-10-10T01:29:34,449 adding 'flashinfer/data/cutlass/include/cute/util/debug.hpp' 2025-10-10T01:29:34,451 adding 'flashinfer/data/cutlass/include/cute/util/print.hpp' 2025-10-10T01:29:34,453 adding 'flashinfer/data/cutlass/include/cute/util/print_latex.hpp' 2025-10-10T01:29:34,455 adding 'flashinfer/data/cutlass/include/cute/util/print_svg.hpp' 2025-10-10T01:29:34,457 adding 'flashinfer/data/cutlass/include/cute/util/print_tensor.hpp' 2025-10-10T01:29:34,458 adding 'flashinfer/data/cutlass/include/cute/util/type_traits.hpp' 2025-10-10T01:29:34,462 adding 'flashinfer/data/cutlass/include/cutlass/aligned_buffer.h' 2025-10-10T01:29:34,466 adding 'flashinfer/data/cutlass/include/cutlass/array.h' 2025-10-10T01:29:34,468 adding 'flashinfer/data/cutlass/include/cutlass/array_planar_complex.h' 2025-10-10T01:29:34,470 adding 'flashinfer/data/cutlass/include/cutlass/array_subbyte.h' 2025-10-10T01:29:34,472 adding 'flashinfer/data/cutlass/include/cutlass/barrier.h' 2025-10-10T01:29:34,474 adding 'flashinfer/data/cutlass/include/cutlass/bfloat16.h' 2025-10-10T01:29:34,476 adding 'flashinfer/data/cutlass/include/cutlass/blas3.h' 2025-10-10T01:29:34,477 adding 'flashinfer/data/cutlass/include/cutlass/blas3_types.h' 2025-10-10T01:29:34,479 adding 'flashinfer/data/cutlass/include/cutlass/block_striped.h' 2025-10-10T01:29:34,482 adding 'flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp' 2025-10-10T01:29:34,485 adding 'flashinfer/data/cutlass/include/cutlass/complex.h' 2025-10-10T01:29:34,488 adding 'flashinfer/data/cutlass/include/cutlass/constants.h' 2025-10-10T01:29:34,491 adding 'flashinfer/data/cutlass/include/cutlass/coord.h' 2025-10-10T01:29:34,492 adding 'flashinfer/data/cutlass/include/cutlass/core_io.h' 2025-10-10T01:29:34,495 adding 'flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp' 2025-10-10T01:29:34,496 adding 'flashinfer/data/cutlass/include/cutlass/cutlass.h' 2025-10-10T01:29:34,498 adding 'flashinfer/data/cutlass/include/cutlass/device_kernel.h' 2025-10-10T01:29:34,503 adding 'flashinfer/data/cutlass/include/cutlass/exmy_base.h' 2025-10-10T01:29:34,506 adding 'flashinfer/data/cutlass/include/cutlass/fast_math.h' 2025-10-10T01:29:34,510 adding 'flashinfer/data/cutlass/include/cutlass/float8.h' 2025-10-10T01:29:34,513 adding 'flashinfer/data/cutlass/include/cutlass/float_subbyte.h' 2025-10-10T01:29:34,514 adding 'flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h' 2025-10-10T01:29:34,518 adding 'flashinfer/data/cutlass/include/cutlass/functional.h' 2025-10-10T01:29:34,520 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.h' 2025-10-10T01:29:34,521 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp' 2025-10-10T01:29:34,524 adding 'flashinfer/data/cutlass/include/cutlass/half.h' 2025-10-10T01:29:34,525 adding 'flashinfer/data/cutlass/include/cutlass/integer_subbyte.h' 2025-10-10T01:29:34,527 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h' 2025-10-10T01:29:34,528 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp' 2025-10-10T01:29:34,530 adding 'flashinfer/data/cutlass/include/cutlass/kernel_launch.h' 2025-10-10T01:29:34,552 adding 'flashinfer/data/cutlass/include/cutlass/matrix.h' 2025-10-10T01:29:34,556 adding 'flashinfer/data/cutlass/include/cutlass/matrix_coord.h' 2025-10-10T01:29:34,557 adding 'flashinfer/data/cutlass/include/cutlass/matrix_shape.h' 2025-10-10T01:29:34,571 adding 'flashinfer/data/cutlass/include/cutlass/numeric_conversion.h' 2025-10-10T01:29:34,574 adding 'flashinfer/data/cutlass/include/cutlass/numeric_size.h' 2025-10-10T01:29:34,576 adding 'flashinfer/data/cutlass/include/cutlass/numeric_types.h' 2025-10-10T01:29:34,578 adding 'flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h' 2025-10-10T01:29:34,580 adding 'flashinfer/data/cutlass/include/cutlass/predicate_vector.h' 2025-10-10T01:29:34,583 adding 'flashinfer/data/cutlass/include/cutlass/quaternion.h' 2025-10-10T01:29:34,584 adding 'flashinfer/data/cutlass/include/cutlass/real.h' 2025-10-10T01:29:34,586 adding 'flashinfer/data/cutlass/include/cutlass/relatively_equal.h' 2025-10-10T01:29:34,588 adding 'flashinfer/data/cutlass/include/cutlass/semaphore.h' 2025-10-10T01:29:34,591 adding 'flashinfer/data/cutlass/include/cutlass/subbyte_reference.h' 2025-10-10T01:29:34,593 adding 'flashinfer/data/cutlass/include/cutlass/tensor_coord.h' 2025-10-10T01:29:34,595 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref.h' 2025-10-10T01:29:34,597 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h' 2025-10-10T01:29:34,599 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view.h' 2025-10-10T01:29:34,601 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h' 2025-10-10T01:29:34,603 adding 'flashinfer/data/cutlass/include/cutlass/tfloat32.h' 2025-10-10T01:29:34,604 adding 'flashinfer/data/cutlass/include/cutlass/trace.h' 2025-10-10T01:29:34,606 adding 'flashinfer/data/cutlass/include/cutlass/uint128.h' 2025-10-10T01:29:34,608 adding 'flashinfer/data/cutlass/include/cutlass/uint256.h' 2025-10-10T01:29:34,609 adding 'flashinfer/data/cutlass/include/cutlass/version.h' 2025-10-10T01:29:34,611 adding 'flashinfer/data/cutlass/include/cutlass/wmma_array.h' 2025-10-10T01:29:34,612 adding 'flashinfer/data/cutlass/include/cutlass/workspace.h' 2025-10-10T01:29:34,615 adding 'flashinfer/data/cutlass/include/cutlass/arch/arch.h' 2025-10-10T01:29:34,618 adding 'flashinfer/data/cutlass/include/cutlass/arch/barrier.h' 2025-10-10T01:29:34,620 adding 'flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h' 2025-10-10T01:29:34,621 adding 'flashinfer/data/cutlass/include/cutlass/arch/config.h' 2025-10-10T01:29:34,623 adding 'flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h' 2025-10-10T01:29:34,625 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory.h' 2025-10-10T01:29:34,627 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h' 2025-10-10T01:29:34,629 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h' 2025-10-10T01:29:34,631 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma.h' 2025-10-10T01:29:34,632 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h' 2025-10-10T01:29:34,634 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h' 2025-10-10T01:29:34,636 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h' 2025-10-10T01:29:34,637 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h' 2025-10-10T01:29:34,639 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h' 2025-10-10T01:29:34,641 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h' 2025-10-10T01:29:34,644 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h' 2025-10-10T01:29:34,646 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h' 2025-10-10T01:29:34,647 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h' 2025-10-10T01:29:34,650 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h' 2025-10-10T01:29:34,652 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h' 2025-10-10T01:29:34,653 adding 'flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h' 2025-10-10T01:29:34,655 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd.h' 2025-10-10T01:29:34,656 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h' 2025-10-10T01:29:34,658 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h' 2025-10-10T01:29:34,661 adding 'flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp' 2025-10-10T01:29:34,663 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma.h' 2025-10-10T01:29:34,665 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h' 2025-10-10T01:29:34,667 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h' 2025-10-10T01:29:34,668 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h' 2025-10-10T01:29:34,672 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h' 2025-10-10T01:29:34,674 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h' 2025-10-10T01:29:34,677 adding 'flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp' 2025-10-10T01:29:34,679 adding 'flashinfer/data/cutlass/include/cutlass/conv/convolution.h' 2025-10-10T01:29:34,681 adding 'flashinfer/data/cutlass/include/cutlass/conv/detail.hpp' 2025-10-10T01:29:34,682 adding 'flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp' 2025-10-10T01:29:34,684 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp' 2025-10-10T01:29:34,686 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp' 2025-10-10T01:29:34,687 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp' 2025-10-10T01:29:34,692 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp' 2025-10-10T01:29:34,696 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp' 2025-10-10T01:29:34,699 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl' 2025-10-10T01:29:34,701 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl' 2025-10-10T01:29:34,702 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl' 2025-10-10T01:29:34,704 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl' 2025-10-10T01:29:34,707 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp' 2025-10-10T01:29:34,709 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h' 2025-10-10T01:29:34,711 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h' 2025-10-10T01:29:34,713 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h' 2025-10-10T01:29:34,716 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp' 2025-10-10T01:29:34,718 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h' 2025-10-10T01:29:34,720 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h' 2025-10-10T01:29:34,724 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h' 2025-10-10T01:29:34,726 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h' 2025-10-10T01:29:34,727 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h' 2025-10-10T01:29:34,729 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h' 2025-10-10T01:29:34,730 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h' 2025-10-10T01:29:34,733 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h' 2025-10-10T01:29:34,735 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h' 2025-10-10T01:29:34,737 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h' 2025-10-10T01:29:34,739 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h' 2025-10-10T01:29:34,741 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h' 2025-10-10T01:29:34,743 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h' 2025-10-10T01:29:34,745 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h' 2025-10-10T01:29:34,747 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h' 2025-10-10T01:29:34,750 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h' 2025-10-10T01:29:34,752 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h' 2025-10-10T01:29:34,754 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h' 2025-10-10T01:29:34,755 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h' 2025-10-10T01:29:34,757 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h' 2025-10-10T01:29:34,760 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h' 2025-10-10T01:29:34,762 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h' 2025-10-10T01:29:34,765 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h' 2025-10-10T01:29:34,767 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h' 2025-10-10T01:29:34,770 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h' 2025-10-10T01:29:34,772 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h' 2025-10-10T01:29:34,777 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp' 2025-10-10T01:29:34,778 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp' 2025-10-10T01:29:34,781 adding 'flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h' 2025-10-10T01:29:34,784 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,787 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,789 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,792 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,794 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,796 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h' 2025-10-10T01:29:34,798 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h' 2025-10-10T01:29:34,800 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,802 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,804 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h' 2025-10-10T01:29:34,806 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h' 2025-10-10T01:29:34,808 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,811 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h' 2025-10-10T01:29:34,813 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h' 2025-10-10T01:29:34,815 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,817 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,819 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,821 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,823 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,825 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,827 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,829 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,831 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,834 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,835 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,837 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,839 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h' 2025-10-10T01:29:34,841 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,844 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,846 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2025-10-10T01:29:34,848 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2025-10-10T01:29:34,849 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h' 2025-10-10T01:29:34,851 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h' 2025-10-10T01:29:34,853 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h' 2025-10-10T01:29:34,856 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h' 2025-10-10T01:29:34,858 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h' 2025-10-10T01:29:34,860 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h' 2025-10-10T01:29:34,862 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h' 2025-10-10T01:29:34,865 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h' 2025-10-10T01:29:34,869 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h' 2025-10-10T01:29:34,872 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h' 2025-10-10T01:29:34,874 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h' 2025-10-10T01:29:34,877 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h' 2025-10-10T01:29:34,879 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h' 2025-10-10T01:29:34,881 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h' 2025-10-10T01:29:34,883 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h' 2025-10-10T01:29:34,886 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h' 2025-10-10T01:29:34,889 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h' 2025-10-10T01:29:34,890 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h' 2025-10-10T01:29:34,893 adding 'flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp' 2025-10-10T01:29:34,895 adding 'flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp' 2025-10-10T01:29:34,896 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective.hpp' 2025-10-10T01:29:34,898 adding 'flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp' 2025-10-10T01:29:34,900 adding 'flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp' 2025-10-10T01:29:34,902 adding 'flashinfer/data/cutlass/include/cutlass/detail/layout.hpp' 2025-10-10T01:29:34,903 adding 'flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp' 2025-10-10T01:29:34,905 adding 'flashinfer/data/cutlass/include/cutlass/detail/mma.hpp' 2025-10-10T01:29:34,906 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp' 2025-10-10T01:29:34,908 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp' 2025-10-10T01:29:34,910 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp' 2025-10-10T01:29:34,911 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp' 2025-10-10T01:29:34,916 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp' 2025-10-10T01:29:34,918 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp' 2025-10-10T01:29:34,920 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp' 2025-10-10T01:29:34,923 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp' 2025-10-10T01:29:34,924 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp' 2025-10-10T01:29:34,926 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp' 2025-10-10T01:29:34,928 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp' 2025-10-10T01:29:34,931 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp' 2025-10-10T01:29:34,933 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp' 2025-10-10T01:29:34,937 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp' 2025-10-10T01:29:34,944 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp' 2025-10-10T01:29:34,949 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp' 2025-10-10T01:29:34,954 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp' 2025-10-10T01:29:34,957 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp' 2025-10-10T01:29:34,960 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp' 2025-10-10T01:29:34,966 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp' 2025-10-10T01:29:34,971 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp' 2025-10-10T01:29:34,973 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp' 2025-10-10T01:29:34,980 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl' 2025-10-10T01:29:34,982 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl' 2025-10-10T01:29:34,985 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl' 2025-10-10T01:29:34,987 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl' 2025-10-10T01:29:34,990 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl' 2025-10-10T01:29:34,992 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl' 2025-10-10T01:29:34,994 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp' 2025-10-10T01:29:34,996 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp' 2025-10-10T01:29:34,999 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp' 2025-10-10T01:29:35,002 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp' 2025-10-10T01:29:35,005 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp' 2025-10-10T01:29:35,009 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp' 2025-10-10T01:29:35,013 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp' 2025-10-10T01:29:35,019 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp' 2025-10-10T01:29:35,023 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp' 2025-10-10T01:29:35,028 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp' 2025-10-10T01:29:35,034 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp' 2025-10-10T01:29:35,038 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp' 2025-10-10T01:29:35,042 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp' 2025-10-10T01:29:35,046 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h' 2025-10-10T01:29:35,047 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h' 2025-10-10T01:29:35,049 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp' 2025-10-10T01:29:35,051 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h' 2025-10-10T01:29:35,054 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h' 2025-10-10T01:29:35,056 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h' 2025-10-10T01:29:35,059 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h' 2025-10-10T01:29:35,061 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h' 2025-10-10T01:29:35,063 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h' 2025-10-10T01:29:35,064 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h' 2025-10-10T01:29:35,066 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h' 2025-10-10T01:29:35,068 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h' 2025-10-10T01:29:35,070 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h' 2025-10-10T01:29:35,072 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h' 2025-10-10T01:29:35,073 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h' 2025-10-10T01:29:35,075 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h' 2025-10-10T01:29:35,077 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h' 2025-10-10T01:29:35,079 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h' 2025-10-10T01:29:35,082 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h' 2025-10-10T01:29:35,083 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h' 2025-10-10T01:29:35,085 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h' 2025-10-10T01:29:35,086 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp' 2025-10-10T01:29:35,088 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h' 2025-10-10T01:29:35,090 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h' 2025-10-10T01:29:35,091 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h' 2025-10-10T01:29:35,095 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h' 2025-10-10T01:29:35,096 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h' 2025-10-10T01:29:35,098 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h' 2025-10-10T01:29:35,099 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h' 2025-10-10T01:29:35,101 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h' 2025-10-10T01:29:35,104 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h' 2025-10-10T01:29:35,106 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h' 2025-10-10T01:29:35,108 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h' 2025-10-10T01:29:35,109 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h' 2025-10-10T01:29:35,111 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h' 2025-10-10T01:29:35,113 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h' 2025-10-10T01:29:35,114 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h' 2025-10-10T01:29:35,116 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h' 2025-10-10T01:29:35,118 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h' 2025-10-10T01:29:35,119 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h' 2025-10-10T01:29:35,121 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h' 2025-10-10T01:29:35,122 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h' 2025-10-10T01:29:35,125 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h' 2025-10-10T01:29:35,127 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h' 2025-10-10T01:29:35,129 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h' 2025-10-10T01:29:35,131 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h' 2025-10-10T01:29:35,133 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h' 2025-10-10T01:29:35,135 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h' 2025-10-10T01:29:35,137 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h' 2025-10-10T01:29:35,139 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h' 2025-10-10T01:29:35,141 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h' 2025-10-10T01:29:35,143 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h' 2025-10-10T01:29:35,147 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h' 2025-10-10T01:29:35,152 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h' 2025-10-10T01:29:35,155 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h' 2025-10-10T01:29:35,157 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h' 2025-10-10T01:29:35,159 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h' 2025-10-10T01:29:35,162 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h' 2025-10-10T01:29:35,163 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h' 2025-10-10T01:29:35,166 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h' 2025-10-10T01:29:35,167 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h' 2025-10-10T01:29:35,170 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h' 2025-10-10T01:29:35,174 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h' 2025-10-10T01:29:35,176 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h' 2025-10-10T01:29:35,178 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h' 2025-10-10T01:29:35,180 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h' 2025-10-10T01:29:35,183 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h' 2025-10-10T01:29:35,185 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h' 2025-10-10T01:29:35,187 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h' 2025-10-10T01:29:35,189 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h' 2025-10-10T01:29:35,191 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h' 2025-10-10T01:29:35,193 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h' 2025-10-10T01:29:35,195 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h' 2025-10-10T01:29:35,197 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h' 2025-10-10T01:29:35,200 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp' 2025-10-10T01:29:35,202 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp' 2025-10-10T01:29:35,204 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp' 2025-10-10T01:29:35,207 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp' 2025-10-10T01:29:35,209 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp' 2025-10-10T01:29:35,212 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h' 2025-10-10T01:29:35,213 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h' 2025-10-10T01:29:35,215 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h' 2025-10-10T01:29:35,217 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h' 2025-10-10T01:29:35,219 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h' 2025-10-10T01:29:35,221 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h' 2025-10-10T01:29:35,222 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h' 2025-10-10T01:29:35,224 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h' 2025-10-10T01:29:35,226 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h' 2025-10-10T01:29:35,229 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h' 2025-10-10T01:29:35,231 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h' 2025-10-10T01:29:35,233 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h' 2025-10-10T01:29:35,235 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h' 2025-10-10T01:29:35,237 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h' 2025-10-10T01:29:35,238 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h' 2025-10-10T01:29:35,242 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp' 2025-10-10T01:29:35,245 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp' 2025-10-10T01:29:35,246 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp' 2025-10-10T01:29:35,249 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp' 2025-10-10T01:29:35,250 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp' 2025-10-10T01:29:35,252 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp' 2025-10-10T01:29:35,255 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp' 2025-10-10T01:29:35,257 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp' 2025-10-10T01:29:35,262 adding 'flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp' 2025-10-10T01:29:35,264 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm.h' 2025-10-10T01:29:35,266 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h' 2025-10-10T01:29:35,268 adding 'flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp' 2025-10-10T01:29:35,271 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp' 2025-10-10T01:29:35,272 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp' 2025-10-10T01:29:35,274 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp' 2025-10-10T01:29:35,275 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp' 2025-10-10T01:29:35,277 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp' 2025-10-10T01:29:35,284 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp' 2025-10-10T01:29:35,290 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp' 2025-10-10T01:29:35,295 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp' 2025-10-10T01:29:35,302 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp' 2025-10-10T01:29:35,307 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp' 2025-10-10T01:29:35,314 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp' 2025-10-10T01:29:35,320 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp' 2025-10-10T01:29:35,324 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp' 2025-10-10T01:29:35,328 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp' 2025-10-10T01:29:35,332 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp' 2025-10-10T01:29:35,338 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp' 2025-10-10T01:29:35,343 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp' 2025-10-10T01:29:35,350 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp' 2025-10-10T01:29:35,355 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp' 2025-10-10T01:29:35,363 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp' 2025-10-10T01:29:35,370 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp' 2025-10-10T01:29:35,376 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp' 2025-10-10T01:29:35,381 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp' 2025-10-10T01:29:35,388 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp' 2025-10-10T01:29:35,393 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp' 2025-10-10T01:29:35,396 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp' 2025-10-10T01:29:35,400 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp' 2025-10-10T01:29:35,405 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp' 2025-10-10T01:29:35,408 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp' 2025-10-10T01:29:35,410 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp' 2025-10-10T01:29:35,413 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp' 2025-10-10T01:29:35,420 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2025-10-10T01:29:35,425 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp' 2025-10-10T01:29:35,429 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp' 2025-10-10T01:29:35,434 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2025-10-10T01:29:35,438 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp' 2025-10-10T01:29:35,441 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp' 2025-10-10T01:29:35,445 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp' 2025-10-10T01:29:35,450 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2025-10-10T01:29:35,454 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp' 2025-10-10T01:29:35,457 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp' 2025-10-10T01:29:35,460 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2025-10-10T01:29:35,465 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2025-10-10T01:29:35,470 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp' 2025-10-10T01:29:35,474 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2025-10-10T01:29:35,478 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl' 2025-10-10T01:29:35,480 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl' 2025-10-10T01:29:35,482 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl' 2025-10-10T01:29:35,484 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl' 2025-10-10T01:29:35,487 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl' 2025-10-10T01:29:35,490 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl' 2025-10-10T01:29:35,492 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl' 2025-10-10T01:29:35,495 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl' 2025-10-10T01:29:35,497 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl' 2025-10-10T01:29:35,498 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl' 2025-10-10T01:29:35,500 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl' 2025-10-10T01:29:35,502 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl' 2025-10-10T01:29:35,505 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl' 2025-10-10T01:29:35,508 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl' 2025-10-10T01:29:35,510 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl' 2025-10-10T01:29:35,513 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl' 2025-10-10T01:29:35,515 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl' 2025-10-10T01:29:35,517 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl' 2025-10-10T01:29:35,518 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl' 2025-10-10T01:29:35,521 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl' 2025-10-10T01:29:35,524 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl' 2025-10-10T01:29:35,527 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl' 2025-10-10T01:29:35,529 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl' 2025-10-10T01:29:35,533 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl' 2025-10-10T01:29:35,536 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl' 2025-10-10T01:29:35,538 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl' 2025-10-10T01:29:35,541 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h' 2025-10-10T01:29:35,544 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h' 2025-10-10T01:29:35,547 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h' 2025-10-10T01:29:35,550 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h' 2025-10-10T01:29:35,552 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h' 2025-10-10T01:29:35,555 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h' 2025-10-10T01:29:35,558 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h' 2025-10-10T01:29:35,560 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h' 2025-10-10T01:29:35,562 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h' 2025-10-10T01:29:35,564 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h' 2025-10-10T01:29:35,566 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h' 2025-10-10T01:29:35,568 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h' 2025-10-10T01:29:35,570 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h' 2025-10-10T01:29:35,572 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h' 2025-10-10T01:29:35,574 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h' 2025-10-10T01:29:35,576 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h' 2025-10-10T01:29:35,580 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h' 2025-10-10T01:29:35,582 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h' 2025-10-10T01:29:35,584 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h' 2025-10-10T01:29:35,586 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h' 2025-10-10T01:29:35,588 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h' 2025-10-10T01:29:35,590 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h' 2025-10-10T01:29:35,592 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h' 2025-10-10T01:29:35,594 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h' 2025-10-10T01:29:35,596 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h' 2025-10-10T01:29:35,598 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h' 2025-10-10T01:29:35,600 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h' 2025-10-10T01:29:35,603 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h' 2025-10-10T01:29:35,606 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h' 2025-10-10T01:29:35,611 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h' 2025-10-10T01:29:35,614 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h' 2025-10-10T01:29:35,617 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h' 2025-10-10T01:29:35,619 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h' 2025-10-10T01:29:35,621 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h' 2025-10-10T01:29:35,622 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h' 2025-10-10T01:29:35,624 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h' 2025-10-10T01:29:35,626 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h' 2025-10-10T01:29:35,628 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h' 2025-10-10T01:29:35,629 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h' 2025-10-10T01:29:35,631 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h' 2025-10-10T01:29:35,633 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h' 2025-10-10T01:29:35,635 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h' 2025-10-10T01:29:35,636 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h' 2025-10-10T01:29:35,638 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h' 2025-10-10T01:29:35,640 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h' 2025-10-10T01:29:35,641 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h' 2025-10-10T01:29:35,643 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h' 2025-10-10T01:29:35,644 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h' 2025-10-10T01:29:35,646 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h' 2025-10-10T01:29:35,648 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h' 2025-10-10T01:29:35,649 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h' 2025-10-10T01:29:35,651 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h' 2025-10-10T01:29:35,653 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h' 2025-10-10T01:29:35,655 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h' 2025-10-10T01:29:35,657 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h' 2025-10-10T01:29:35,659 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h' 2025-10-10T01:29:35,661 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h' 2025-10-10T01:29:35,662 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h' 2025-10-10T01:29:35,664 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h' 2025-10-10T01:29:35,667 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h' 2025-10-10T01:29:35,668 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h' 2025-10-10T01:29:35,670 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h' 2025-10-10T01:29:35,672 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h' 2025-10-10T01:29:35,674 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h' 2025-10-10T01:29:35,676 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h' 2025-10-10T01:29:35,679 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h' 2025-10-10T01:29:35,681 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h' 2025-10-10T01:29:35,683 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h' 2025-10-10T01:29:35,685 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h' 2025-10-10T01:29:35,687 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h' 2025-10-10T01:29:35,688 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h' 2025-10-10T01:29:35,691 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h' 2025-10-10T01:29:35,694 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h' 2025-10-10T01:29:35,696 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h' 2025-10-10T01:29:35,697 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h' 2025-10-10T01:29:35,700 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h' 2025-10-10T01:29:35,703 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h' 2025-10-10T01:29:35,707 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h' 2025-10-10T01:29:35,709 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h' 2025-10-10T01:29:35,711 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h' 2025-10-10T01:29:35,720 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h' 2025-10-10T01:29:35,722 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h' 2025-10-10T01:29:35,725 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h' 2025-10-10T01:29:35,726 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp' 2025-10-10T01:29:35,728 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h' 2025-10-10T01:29:35,732 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h' 2025-10-10T01:29:35,734 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h' 2025-10-10T01:29:35,738 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h' 2025-10-10T01:29:35,741 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h' 2025-10-10T01:29:35,745 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h' 2025-10-10T01:29:35,748 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h' 2025-10-10T01:29:35,750 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h' 2025-10-10T01:29:35,752 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h' 2025-10-10T01:29:35,756 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h' 2025-10-10T01:29:35,758 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h' 2025-10-10T01:29:35,760 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h' 2025-10-10T01:29:35,762 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h' 2025-10-10T01:29:35,765 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h' 2025-10-10T01:29:35,767 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h' 2025-10-10T01:29:35,769 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h' 2025-10-10T01:29:35,772 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h' 2025-10-10T01:29:35,774 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h' 2025-10-10T01:29:35,780 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp' 2025-10-10T01:29:35,786 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp' 2025-10-10T01:29:35,792 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp' 2025-10-10T01:29:35,796 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp' 2025-10-10T01:29:35,801 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp' 2025-10-10T01:29:35,806 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp' 2025-10-10T01:29:35,811 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp' 2025-10-10T01:29:35,817 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp' 2025-10-10T01:29:35,822 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp' 2025-10-10T01:29:35,827 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp' 2025-10-10T01:29:35,829 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp' 2025-10-10T01:29:35,832 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp' 2025-10-10T01:29:35,835 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp' 2025-10-10T01:29:35,839 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp' 2025-10-10T01:29:35,845 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp' 2025-10-10T01:29:35,850 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp' 2025-10-10T01:29:35,855 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp' 2025-10-10T01:29:35,857 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp' 2025-10-10T01:29:35,859 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp' 2025-10-10T01:29:35,864 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp' 2025-10-10T01:29:35,870 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp' 2025-10-10T01:29:35,872 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp' 2025-10-10T01:29:35,875 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp' 2025-10-10T01:29:35,880 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp' 2025-10-10T01:29:35,885 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp' 2025-10-10T01:29:35,888 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp' 2025-10-10T01:29:35,890 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp' 2025-10-10T01:29:35,894 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp' 2025-10-10T01:29:35,895 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp' 2025-10-10T01:29:35,898 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp' 2025-10-10T01:29:35,904 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp' 2025-10-10T01:29:35,906 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h' 2025-10-10T01:29:35,909 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h' 2025-10-10T01:29:35,911 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h' 2025-10-10T01:29:35,913 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp' 2025-10-10T01:29:35,916 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h' 2025-10-10T01:29:35,918 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp' 2025-10-10T01:29:35,920 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp' 2025-10-10T01:29:35,929 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h' 2025-10-10T01:29:35,932 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h' 2025-10-10T01:29:35,935 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h' 2025-10-10T01:29:35,937 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h' 2025-10-10T01:29:35,940 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h' 2025-10-10T01:29:35,942 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h' 2025-10-10T01:29:35,945 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h' 2025-10-10T01:29:35,947 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h' 2025-10-10T01:29:35,950 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h' 2025-10-10T01:29:35,952 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h' 2025-10-10T01:29:35,955 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h' 2025-10-10T01:29:35,957 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h' 2025-10-10T01:29:35,960 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h' 2025-10-10T01:29:35,965 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h' 2025-10-10T01:29:35,968 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h' 2025-10-10T01:29:35,971 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h' 2025-10-10T01:29:35,972 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h' 2025-10-10T01:29:35,975 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h' 2025-10-10T01:29:35,976 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h' 2025-10-10T01:29:35,978 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h' 2025-10-10T01:29:35,980 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h' 2025-10-10T01:29:35,981 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h' 2025-10-10T01:29:35,983 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h' 2025-10-10T01:29:35,985 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h' 2025-10-10T01:29:35,986 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h' 2025-10-10T01:29:35,990 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h' 2025-10-10T01:29:35,992 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h' 2025-10-10T01:29:35,994 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h' 2025-10-10T01:29:35,996 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h' 2025-10-10T01:29:35,999 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h' 2025-10-10T01:29:36,001 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h' 2025-10-10T01:29:36,002 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h' 2025-10-10T01:29:36,004 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h' 2025-10-10T01:29:36,006 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h' 2025-10-10T01:29:36,009 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h' 2025-10-10T01:29:36,012 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h' 2025-10-10T01:29:36,016 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h' 2025-10-10T01:29:36,019 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h' 2025-10-10T01:29:36,020 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h' 2025-10-10T01:29:36,023 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h' 2025-10-10T01:29:36,026 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h' 2025-10-10T01:29:36,028 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h' 2025-10-10T01:29:36,031 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h' 2025-10-10T01:29:36,033 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h' 2025-10-10T01:29:36,036 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h' 2025-10-10T01:29:36,039 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h' 2025-10-10T01:29:36,041 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h' 2025-10-10T01:29:36,044 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h' 2025-10-10T01:29:36,047 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h' 2025-10-10T01:29:36,049 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h' 2025-10-10T01:29:36,051 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h' 2025-10-10T01:29:36,053 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h' 2025-10-10T01:29:36,054 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h' 2025-10-10T01:29:36,056 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h' 2025-10-10T01:29:36,057 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h' 2025-10-10T01:29:36,059 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h' 2025-10-10T01:29:36,062 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h' 2025-10-10T01:29:36,065 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h' 2025-10-10T01:29:36,070 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h' 2025-10-10T01:29:36,073 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h' 2025-10-10T01:29:36,075 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h' 2025-10-10T01:29:36,077 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h' 2025-10-10T01:29:36,079 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h' 2025-10-10T01:29:36,081 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h' 2025-10-10T01:29:36,082 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h' 2025-10-10T01:29:36,086 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h' 2025-10-10T01:29:36,088 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h' 2025-10-10T01:29:36,091 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h' 2025-10-10T01:29:36,093 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h' 2025-10-10T01:29:36,095 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h' 2025-10-10T01:29:36,097 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h' 2025-10-10T01:29:36,099 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h' 2025-10-10T01:29:36,101 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h' 2025-10-10T01:29:36,110 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h' 2025-10-10T01:29:36,117 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h' 2025-10-10T01:29:36,122 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h' 2025-10-10T01:29:36,125 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h' 2025-10-10T01:29:36,127 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h' 2025-10-10T01:29:36,129 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h' 2025-10-10T01:29:36,132 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h' 2025-10-10T01:29:36,134 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h' 2025-10-10T01:29:36,136 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h' 2025-10-10T01:29:36,137 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h' 2025-10-10T01:29:36,140 adding 'flashinfer/data/cutlass/include/cutlass/layout/layout.h' 2025-10-10T01:29:36,143 adding 'flashinfer/data/cutlass/include/cutlass/layout/matrix.h' 2025-10-10T01:29:36,145 adding 'flashinfer/data/cutlass/include/cutlass/layout/permute.h' 2025-10-10T01:29:36,147 adding 'flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h' 2025-10-10T01:29:36,149 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor.h' 2025-10-10T01:29:36,151 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h' 2025-10-10T01:29:36,154 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h' 2025-10-10T01:29:36,156 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h' 2025-10-10T01:29:36,158 adding 'flashinfer/data/cutlass/include/cutlass/layout/vector.h' 2025-10-10T01:29:36,160 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp' 2025-10-10T01:29:36,164 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp' 2025-10-10T01:29:36,169 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp' 2025-10-10T01:29:36,172 adding 'flashinfer/data/cutlass/include/cutlass/platform/platform.h' 2025-10-10T01:29:36,175 adding 'flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h' 2025-10-10T01:29:36,177 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h' 2025-10-10T01:29:36,179 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h' 2025-10-10T01:29:36,181 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h' 2025-10-10T01:29:36,183 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h' 2025-10-10T01:29:36,185 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h' 2025-10-10T01:29:36,187 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h' 2025-10-10T01:29:36,190 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h' 2025-10-10T01:29:36,192 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h' 2025-10-10T01:29:36,195 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h' 2025-10-10T01:29:36,196 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h' 2025-10-10T01:29:36,199 adding 'flashinfer/data/cutlass/include/cutlass/thread/matrix.h' 2025-10-10T01:29:36,203 adding 'flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h' 2025-10-10T01:29:36,206 adding 'flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp' 2025-10-10T01:29:36,209 adding 'flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp' 2025-10-10T01:29:36,212 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp' 2025-10-10T01:29:36,215 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp' 2025-10-10T01:29:36,217 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp' 2025-10-10T01:29:36,220 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h' 2025-10-10T01:29:36,221 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h' 2025-10-10T01:29:36,224 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h' 2025-10-10T01:29:36,227 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h' 2025-10-10T01:29:36,231 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h' 2025-10-10T01:29:36,233 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h' 2025-10-10T01:29:36,235 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h' 2025-10-10T01:29:36,240 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h' 2025-10-10T01:29:36,243 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h' 2025-10-10T01:29:36,244 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h' 2025-10-10T01:29:36,247 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h' 2025-10-10T01:29:36,251 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h' 2025-10-10T01:29:36,254 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h' 2025-10-10T01:29:36,257 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h' 2025-10-10T01:29:36,259 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h' 2025-10-10T01:29:36,261 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h' 2025-10-10T01:29:36,262 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h' 2025-10-10T01:29:36,264 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h' 2025-10-10T01:29:36,266 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h' 2025-10-10T01:29:36,269 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h' 2025-10-10T01:29:36,272 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h' 2025-10-10T01:29:36,274 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h' 2025-10-10T01:29:36,276 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h' 2025-10-10T01:29:36,278 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h' 2025-10-10T01:29:36,280 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h' 2025-10-10T01:29:36,284 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h' 2025-10-10T01:29:36,285 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h' 2025-10-10T01:29:36,288 adding 'flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h' 2025-10-10T01:29:36,290 adding 'flashinfer/data/cutlass/python/setup_cutlass.py' 2025-10-10T01:29:36,292 adding 'flashinfer/data/cutlass/python/setup_library.py' 2025-10-10T01:29:36,293 adding 'flashinfer/data/cutlass/python/setup_pycute.py' 2025-10-10T01:29:36,296 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/__init__.py' 2025-10-10T01:29:36,298 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/ast_helpers.py' 2025-10-10T01:29:36,307 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/ast_preprocessor.py' 2025-10-10T01:29:36,310 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/cache_helpers.py' 2025-10-10T01:29:36,311 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/common.py' 2025-10-10T01:29:36,314 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/compiler.py' 2025-10-10T01:29:36,321 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/dsl.py' 2025-10-10T01:29:36,324 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/env_manager.py' 2025-10-10T01:29:36,326 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/jit_executor.py' 2025-10-10T01:29:36,333 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/typing.py' 2025-10-10T01:29:36,336 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/__init__.py' 2025-10-10T01:29:36,338 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/arith.py' 2025-10-10T01:29:36,340 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/gpu.py' 2025-10-10T01:29:36,341 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/lru_cache_ir.py' 2025-10-10T01:29:36,343 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/_mlir_helpers/op.py' 2025-10-10T01:29:36,345 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/__init__.py' 2025-10-10T01:29:36,347 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/cuda.py' 2025-10-10T01:29:36,348 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/device_tensor.py' 2025-10-10T01:29:36,350 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/dlpack_types.py' 2025-10-10T01:29:36,351 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/jit_arg_adapters.py' 2025-10-10T01:29:36,353 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/runtime/tensor_descriptor.py' 2025-10-10T01:29:36,355 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/__init__.py' 2025-10-10T01:29:36,357 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/logger.py' 2025-10-10T01:29:36,358 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/stacktrace.py' 2025-10-10T01:29:36,360 adding 'flashinfer/data/cutlass/python/CuTeDSL/base_dsl/utils/timer.py' 2025-10-10T01:29:36,362 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py' 2025-10-10T01:29:36,363 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py' 2025-10-10T01:29:36,365 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py' 2025-10-10T01:29:36,368 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py' 2025-10-10T01:29:36,396 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py' 2025-10-10T01:29:36,400 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py' 2025-10-10T01:29:36,402 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py' 2025-10-10T01:29:36,405 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py' 2025-10-10T01:29:36,407 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py' 2025-10-10T01:29:36,409 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py' 2025-10-10T01:29:36,411 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py' 2025-10-10T01:29:36,413 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py' 2025-10-10T01:29:36,415 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py' 2025-10-10T01:29:36,417 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py' 2025-10-10T01:29:36,418 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py' 2025-10-10T01:29:36,421 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py' 2025-10-10T01:29:36,422 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py' 2025-10-10T01:29:36,424 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py' 2025-10-10T01:29:36,426 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py' 2025-10-10T01:29:36,428 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py' 2025-10-10T01:29:36,430 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py' 2025-10-10T01:29:36,433 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py' 2025-10-10T01:29:36,434 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py' 2025-10-10T01:29:36,436 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py' 2025-10-10T01:29:36,439 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py' 2025-10-10T01:29:36,441 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py' 2025-10-10T01:29:36,443 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py' 2025-10-10T01:29:36,444 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py' 2025-10-10T01:29:36,446 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py' 2025-10-10T01:29:36,448 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py' 2025-10-10T01:29:36,450 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py' 2025-10-10T01:29:36,452 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py' 2025-10-10T01:29:36,455 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py' 2025-10-10T01:29:36,457 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py' 2025-10-10T01:29:36,460 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py' 2025-10-10T01:29:36,463 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py' 2025-10-10T01:29:36,464 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/ampere_helpers.py' 2025-10-10T01:29:36,468 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py' 2025-10-10T01:29:36,470 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py' 2025-10-10T01:29:36,471 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed_helpers.py' 2025-10-10T01:29:36,474 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py' 2025-10-10T01:29:36,476 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py' 2025-10-10T01:29:36,477 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py' 2025-10-10T01:29:36,479 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py' 2025-10-10T01:29:36,481 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py' 2025-10-10T01:29:36,482 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_capacity.py' 2025-10-10T01:29:36,484 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py' 2025-10-10T01:29:36,486 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py' 2025-10-10T01:29:36,488 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/__init__.py' 2025-10-10T01:29:36,495 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/cutlass.py' 2025-10-10T01:29:36,498 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/cutlass_ast_decorators.py' 2025-10-10T01:29:36,501 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass_dsl/tree_utils.py' 2025-10-10T01:29:36,504 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py' 2025-10-10T01:29:36,507 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py' 2025-10-10T01:29:36,509 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/shape.py' 2025-10-10T01:29:36,511 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py' 2025-10-10T01:29:36,513 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py' 2025-10-10T01:29:36,515 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py' 2025-10-10T01:29:36,517 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py' 2025-10-10T01:29:36,520 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py' 2025-10-10T01:29:36,523 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py' 2025-10-10T01:29:36,526 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py' 2025-10-10T01:29:36,527 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py' 2025-10-10T01:29:36,535 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py' 2025-10-10T01:29:36,538 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py' 2025-10-10T01:29:36,539 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py' 2025-10-10T01:29:36,541 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py' 2025-10-10T01:29:36,543 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py' 2025-10-10T01:29:36,545 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py' 2025-10-10T01:29:36,547 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py' 2025-10-10T01:29:36,549 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py' 2025-10-10T01:29:36,551 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py' 2025-10-10T01:29:36,553 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py' 2025-10-10T01:29:36,554 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py' 2025-10-10T01:29:36,556 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py' 2025-10-10T01:29:36,557 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py' 2025-10-10T01:29:36,559 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py' 2025-10-10T01:29:36,560 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py' 2025-10-10T01:29:36,562 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py' 2025-10-10T01:29:36,564 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py' 2025-10-10T01:29:36,567 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py' 2025-10-10T01:29:36,569 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py' 2025-10-10T01:29:36,571 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py' 2025-10-10T01:29:36,572 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py' 2025-10-10T01:29:36,574 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py' 2025-10-10T01:29:36,577 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py' 2025-10-10T01:29:36,579 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py' 2025-10-10T01:29:36,581 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py' 2025-10-10T01:29:36,583 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py' 2025-10-10T01:29:36,585 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py' 2025-10-10T01:29:36,587 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py' 2025-10-10T01:29:36,589 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py' 2025-10-10T01:29:36,591 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py' 2025-10-10T01:29:36,592 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py' 2025-10-10T01:29:36,594 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py' 2025-10-10T01:29:36,596 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py' 2025-10-10T01:29:36,597 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py' 2025-10-10T01:29:36,599 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py' 2025-10-10T01:29:36,601 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py' 2025-10-10T01:29:36,602 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py' 2025-10-10T01:29:36,604 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py' 2025-10-10T01:29:36,605 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py' 2025-10-10T01:29:36,607 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py' 2025-10-10T01:29:36,609 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py' 2025-10-10T01:29:36,611 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py' 2025-10-10T01:29:36,612 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py' 2025-10-10T01:29:36,614 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py' 2025-10-10T01:29:36,616 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py' 2025-10-10T01:29:36,619 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py' 2025-10-10T01:29:36,622 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py' 2025-10-10T01:29:36,623 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py' 2025-10-10T01:29:36,625 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py' 2025-10-10T01:29:36,627 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py' 2025-10-10T01:29:36,632 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py' 2025-10-10T01:29:36,636 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py' 2025-10-10T01:29:36,638 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py' 2025-10-10T01:29:36,641 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py' 2025-10-10T01:29:36,643 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py' 2025-10-10T01:29:36,645 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py' 2025-10-10T01:29:36,647 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py' 2025-10-10T01:29:36,648 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py' 2025-10-10T01:29:36,650 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py' 2025-10-10T01:29:36,652 adding 'flashinfer/data/cutlass/python/cutlass_library/__init__.py' 2025-10-10T01:29:36,655 adding 'flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py' 2025-10-10T01:29:36,658 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py' 2025-10-10T01:29:36,660 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py' 2025-10-10T01:29:36,664 adding 'flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py' 2025-10-10T01:29:36,670 adding 'flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py' 2025-10-10T01:29:36,703 adding 'flashinfer/data/cutlass/python/cutlass_library/generator.py' 2025-10-10T01:29:36,709 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics.py' 2025-10-10T01:29:36,711 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py' 2025-10-10T01:29:36,716 adding 'flashinfer/data/cutlass/python/cutlass_library/library.py' 2025-10-10T01:29:36,720 adding 'flashinfer/data/cutlass/python/cutlass_library/manifest.py' 2025-10-10T01:29:36,723 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py' 2025-10-10T01:29:36,726 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py' 2025-10-10T01:29:36,728 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py' 2025-10-10T01:29:36,730 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py' 2025-10-10T01:29:36,732 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py' 2025-10-10T01:29:36,735 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py' 2025-10-10T01:29:36,737 adding 'flashinfer/data/cutlass/python/cutlass_library/symm_operation.py' 2025-10-10T01:29:36,740 adding 'flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py' 2025-10-10T01:29:36,742 adding 'flashinfer/data/cutlass/python/docs_src/source/conf.py' 2025-10-10T01:29:36,744 adding 'flashinfer/data/cutlass/python/pycute/__init__.py' 2025-10-10T01:29:36,746 adding 'flashinfer/data/cutlass/python/pycute/int_tuple.py' 2025-10-10T01:29:36,748 adding 'flashinfer/data/cutlass/python/pycute/layout.py' 2025-10-10T01:29:36,750 adding 'flashinfer/data/cutlass/python/pycute/swizzle.py' 2025-10-10T01:29:36,752 adding 'flashinfer/data/cutlass/python/pycute/typing.py' 2025-10-10T01:29:36,755 adding 'flashinfer/data/cutlass/test/python/cutlass/installation.py' 2025-10-10T01:29:36,758 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py' 2025-10-10T01:29:36,760 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py' 2025-10-10T01:29:36,762 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py' 2025-10-10T01:29:36,764 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py' 2025-10-10T01:29:36,766 adding 'flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py' 2025-10-10T01:29:36,769 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py' 2025-10-10T01:29:36,770 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py' 2025-10-10T01:29:36,772 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py' 2025-10-10T01:29:36,774 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py' 2025-10-10T01:29:36,775 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py' 2025-10-10T01:29:36,777 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py' 2025-10-10T01:29:36,779 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py' 2025-10-10T01:29:36,782 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py' 2025-10-10T01:29:36,783 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py' 2025-10-10T01:29:36,785 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py' 2025-10-10T01:29:36,787 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py' 2025-10-10T01:29:36,788 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py' 2025-10-10T01:29:36,790 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py' 2025-10-10T01:29:36,791 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py' 2025-10-10T01:29:36,793 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py' 2025-10-10T01:29:36,794 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py' 2025-10-10T01:29:36,796 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py' 2025-10-10T01:29:36,798 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py' 2025-10-10T01:29:36,800 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py' 2025-10-10T01:29:36,802 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py' 2025-10-10T01:29:36,805 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py' 2025-10-10T01:29:36,807 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py' 2025-10-10T01:29:36,809 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py' 2025-10-10T01:29:36,811 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/utils.py' 2025-10-10T01:29:36,813 adding 'flashinfer/data/cutlass/test/python/pycute/run_all_tests.py' 2025-10-10T01:29:36,815 adding 'flashinfer/data/cutlass/test/python/pycute/test_coalesce.py' 2025-10-10T01:29:36,816 adding 'flashinfer/data/cutlass/test/python/pycute/test_complement.py' 2025-10-10T01:29:36,818 adding 'flashinfer/data/cutlass/test/python/pycute/test_composition.py' 2025-10-10T01:29:36,819 adding 'flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py' 2025-10-10T01:29:36,820 adding 'flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py' 2025-10-10T01:29:36,822 adding 'flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py' 2025-10-10T01:29:36,823 adding 'flashinfer/data/cutlass/test/python/pycute/test_typing.py' 2025-10-10T01:29:36,827 adding 'flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py' 2025-10-10T01:29:36,832 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp' 2025-10-10T01:29:36,834 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h' 2025-10-10T01:29:36,836 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp' 2025-10-10T01:29:36,837 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h' 2025-10-10T01:29:36,839 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h' 2025-10-10T01:29:36,841 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h' 2025-10-10T01:29:36,843 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h' 2025-10-10T01:29:36,845 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h' 2025-10-10T01:29:36,847 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h' 2025-10-10T01:29:36,850 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h' 2025-10-10T01:29:36,852 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h' 2025-10-10T01:29:36,854 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h' 2025-10-10T01:29:36,855 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h' 2025-10-10T01:29:36,857 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h' 2025-10-10T01:29:36,859 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h' 2025-10-10T01:29:36,860 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h' 2025-10-10T01:29:36,862 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp' 2025-10-10T01:29:36,864 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp' 2025-10-10T01:29:36,866 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h' 2025-10-10T01:29:36,868 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h' 2025-10-10T01:29:36,871 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h' 2025-10-10T01:29:36,872 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h' 2025-10-10T01:29:36,874 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h' 2025-10-10T01:29:36,877 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp' 2025-10-10T01:29:36,879 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp' 2025-10-10T01:29:36,882 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp' 2025-10-10T01:29:36,883 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h' 2025-10-10T01:29:36,885 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h' 2025-10-10T01:29:36,888 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h' 2025-10-10T01:29:36,890 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h' 2025-10-10T01:29:36,894 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h' 2025-10-10T01:29:36,896 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h' 2025-10-10T01:29:36,898 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h' 2025-10-10T01:29:36,900 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h' 2025-10-10T01:29:36,901 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp' 2025-10-10T01:29:36,903 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h' 2025-10-10T01:29:36,905 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h' 2025-10-10T01:29:36,909 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h' 2025-10-10T01:29:36,911 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h' 2025-10-10T01:29:36,913 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h' 2025-10-10T01:29:36,915 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h' 2025-10-10T01:29:36,917 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h' 2025-10-10T01:29:36,919 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h' 2025-10-10T01:29:36,920 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h' 2025-10-10T01:29:36,923 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h' 2025-10-10T01:29:36,927 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp' 2025-10-10T01:29:36,929 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h' 2025-10-10T01:29:36,931 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h' 2025-10-10T01:29:36,933 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h' 2025-10-10T01:29:36,935 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h' 2025-10-10T01:29:36,937 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h' 2025-10-10T01:29:36,941 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp' 2025-10-10T01:29:36,943 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h' 2025-10-10T01:29:36,945 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h' 2025-10-10T01:29:36,947 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h' 2025-10-10T01:29:36,949 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h' 2025-10-10T01:29:36,951 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h' 2025-10-10T01:29:36,953 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h' 2025-10-10T01:29:36,954 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp' 2025-10-10T01:29:36,956 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h' 2025-10-10T01:29:36,958 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h' 2025-10-10T01:29:36,962 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h' 2025-10-10T01:29:36,964 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp' 2025-10-10T01:29:36,966 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h' 2025-10-10T01:29:36,967 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h' 2025-10-10T01:29:36,969 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h' 2025-10-10T01:29:36,970 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp' 2025-10-10T01:29:36,972 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h' 2025-10-10T01:29:36,974 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h' 2025-10-10T01:29:36,977 adding 'flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py' 2025-10-10T01:29:36,980 adding 'flashinfer/data/include/flashinfer/activation.cuh' 2025-10-10T01:29:36,981 adding 'flashinfer/data/include/flashinfer/allocator.h' 2025-10-10T01:29:36,983 adding 'flashinfer/data/include/flashinfer/arch_condition.h' 2025-10-10T01:29:36,984 adding 'flashinfer/data/include/flashinfer/attention_impl.cuh' 2025-10-10T01:29:36,986 adding 'flashinfer/data/include/flashinfer/cp_async.cuh' 2025-10-10T01:29:36,987 adding 'flashinfer/data/include/flashinfer/cubin_loader.h' 2025-10-10T01:29:36,989 adding 'flashinfer/data/include/flashinfer/cutlass_utils.cuh' 2025-10-10T01:29:36,990 adding 'flashinfer/data/include/flashinfer/exception.h' 2025-10-10T01:29:36,992 adding 'flashinfer/data/include/flashinfer/fastdiv.cuh' 2025-10-10T01:29:36,994 adding 'flashinfer/data/include/flashinfer/fp16.h' 2025-10-10T01:29:36,995 adding 'flashinfer/data/include/flashinfer/fp4_layout.cuh' 2025-10-10T01:29:36,996 adding 'flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh' 2025-10-10T01:29:36,998 adding 'flashinfer/data/include/flashinfer/layout.cuh' 2025-10-10T01:29:36,999 adding 'flashinfer/data/include/flashinfer/logging.h' 2025-10-10T01:29:37,001 adding 'flashinfer/data/include/flashinfer/math.cuh' 2025-10-10T01:29:37,003 adding 'flashinfer/data/include/flashinfer/mma.cuh' 2025-10-10T01:29:37,005 adding 'flashinfer/data/include/flashinfer/norm.cuh' 2025-10-10T01:29:37,008 adding 'flashinfer/data/include/flashinfer/page.cuh' 2025-10-10T01:29:37,010 adding 'flashinfer/data/include/flashinfer/permuted_smem.cuh' 2025-10-10T01:29:37,014 adding 'flashinfer/data/include/flashinfer/pos_enc.cuh' 2025-10-10T01:29:37,016 adding 'flashinfer/data/include/flashinfer/profiler.cuh' 2025-10-10T01:29:37,017 adding 'flashinfer/data/include/flashinfer/quantization.cuh' 2025-10-10T01:29:37,024 adding 'flashinfer/data/include/flashinfer/sampling.cuh' 2025-10-10T01:29:37,027 adding 'flashinfer/data/include/flashinfer/utils.cuh' 2025-10-10T01:29:37,032 adding 'flashinfer/data/include/flashinfer/vec_dtypes.cuh' 2025-10-10T01:29:37,036 adding 'flashinfer/data/include/flashinfer/attention/cascade.cuh' 2025-10-10T01:29:37,038 adding 'flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh' 2025-10-10T01:29:37,043 adding 'flashinfer/data/include/flashinfer/attention/decode.cuh' 2025-10-10T01:29:37,046 adding 'flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh' 2025-10-10T01:29:37,048 adding 'flashinfer/data/include/flashinfer/attention/default_decode_params.cuh' 2025-10-10T01:29:37,050 adding 'flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh' 2025-10-10T01:29:37,052 adding 'flashinfer/data/include/flashinfer/attention/heap.h' 2025-10-10T01:29:37,054 adding 'flashinfer/data/include/flashinfer/attention/hopper.cuh' 2025-10-10T01:29:37,055 adding 'flashinfer/data/include/flashinfer/attention/mask.cuh' 2025-10-10T01:29:37,060 adding 'flashinfer/data/include/flashinfer/attention/mla.cuh' 2025-10-10T01:29:37,064 adding 'flashinfer/data/include/flashinfer/attention/mla_hopper.cuh' 2025-10-10T01:29:37,066 adding 'flashinfer/data/include/flashinfer/attention/mla_params.cuh' 2025-10-10T01:29:37,069 adding 'flashinfer/data/include/flashinfer/attention/persistent.cuh' 2025-10-10T01:29:37,071 adding 'flashinfer/data/include/flashinfer/attention/persistent_template.cuh' 2025-10-10T01:29:37,074 adding 'flashinfer/data/include/flashinfer/attention/pod.cuh' 2025-10-10T01:29:37,084 adding 'flashinfer/data/include/flashinfer/attention/prefill.cuh' 2025-10-10T01:29:37,092 adding 'flashinfer/data/include/flashinfer/attention/scheduler.cuh' 2025-10-10T01:29:37,094 adding 'flashinfer/data/include/flashinfer/attention/state.cuh' 2025-10-10T01:29:37,096 adding 'flashinfer/data/include/flashinfer/attention/variant_helper.cuh' 2025-10-10T01:29:37,097 adding 'flashinfer/data/include/flashinfer/attention/variants.cuh' 2025-10-10T01:29:37,100 adding 'flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh' 2025-10-10T01:29:37,101 adding 'flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh' 2025-10-10T01:29:37,104 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp' 2025-10-10T01:29:37,106 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp' 2025-10-10T01:29:37,108 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp' 2025-10-10T01:29:37,112 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp' 2025-10-10T01:29:37,114 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp' 2025-10-10T01:29:37,118 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp' 2025-10-10T01:29:37,121 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp' 2025-10-10T01:29:37,123 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp' 2025-10-10T01:29:37,125 adding 'flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp' 2025-10-10T01:29:37,127 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp' 2025-10-10T01:29:37,129 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp' 2025-10-10T01:29:37,132 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp' 2025-10-10T01:29:37,133 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp' 2025-10-10T01:29:37,135 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp' 2025-10-10T01:29:37,138 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp' 2025-10-10T01:29:37,140 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp' 2025-10-10T01:29:37,142 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp' 2025-10-10T01:29:37,150 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp' 2025-10-10T01:29:37,152 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp' 2025-10-10T01:29:37,155 adding 'flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh' 2025-10-10T01:29:37,157 adding 'flashinfer/data/include/flashinfer/attention/hopper/block_sparse_gather.cuh' 2025-10-10T01:29:37,158 adding 'flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh' 2025-10-10T01:29:37,160 adding 'flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh' 2025-10-10T01:29:37,162 adding 'flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh' 2025-10-10T01:29:37,164 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh' 2025-10-10T01:29:37,166 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh' 2025-10-10T01:29:37,168 adding 'flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh' 2025-10-10T01:29:37,170 adding 'flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh' 2025-10-10T01:29:37,173 adding 'flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh' 2025-10-10T01:29:37,174 adding 'flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh' 2025-10-10T01:29:37,176 adding 'flashinfer/data/include/flashinfer/attention/hopper/utils.cuh' 2025-10-10T01:29:37,178 adding 'flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh' 2025-10-10T01:29:37,179 adding 'flashinfer/data/include/flashinfer/attention/hopper/variants.cuh' 2025-10-10T01:29:37,182 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh' 2025-10-10T01:29:37,184 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh' 2025-10-10T01:29:37,186 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh' 2025-10-10T01:29:37,188 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh' 2025-10-10T01:29:37,190 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh' 2025-10-10T01:29:37,193 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh' 2025-10-10T01:29:37,200 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh' 2025-10-10T01:29:37,205 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh' 2025-10-10T01:29:37,210 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh' 2025-10-10T01:29:37,212 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh' 2025-10-10T01:29:37,214 adding 'flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh' 2025-10-10T01:29:37,220 adding 'flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh' 2025-10-10T01:29:37,223 adding 'flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh' 2025-10-10T01:29:37,226 adding 'flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh' 2025-10-10T01:29:37,228 adding 'flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h' 2025-10-10T01:29:37,230 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h' 2025-10-10T01:29:37,232 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h' 2025-10-10T01:29:37,234 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h' 2025-10-10T01:29:37,236 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h' 2025-10-10T01:29:37,239 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h' 2025-10-10T01:29:37,240 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h' 2025-10-10T01:29:37,242 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h' 2025-10-10T01:29:37,244 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h' 2025-10-10T01:29:37,246 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh' 2025-10-10T01:29:37,248 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh' 2025-10-10T01:29:37,250 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm.cuh' 2025-10-10T01:29:37,252 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh' 2025-10-10T01:29:37,254 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh' 2025-10-10T01:29:37,255 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh' 2025-10-10T01:29:37,258 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh' 2025-10-10T01:29:37,260 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh' 2025-10-10T01:29:37,261 adding 'flashinfer/data/include/flashinfer/gemm/group_gemv.cuh' 2025-10-10T01:29:37,270 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh' 2025-10-10T01:29:37,272 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h' 2025-10-10T01:29:37,274 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h' 2025-10-10T01:29:37,276 adding 'flashinfer/data/include/flashinfer/trtllm/common.h' 2025-10-10T01:29:37,279 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h' 2025-10-10T01:29:37,281 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmEnums.h' 2025-10-10T01:29:37,285 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmInterface.h' 2025-10-10T01:29:37,288 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/BatchedGemmOptions.h' 2025-10-10T01:29:37,289 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/Enums.h' 2025-10-10T01:29:37,291 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/GemmGatedActOptions.h' 2025-10-10T01:29:37,297 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/GemmOptions.h' 2025-10-10T01:29:37,300 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelParams.h' 2025-10-10T01:29:37,304 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelParamsDecl.h' 2025-10-10T01:29:37,307 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/KernelTraits.h' 2025-10-10T01:29:37,309 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/TmaDescriptor.h' 2025-10-10T01:29:37,311 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/CommonUtils.h' 2025-10-10T01:29:37,313 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/CudaKernelLauncher.h' 2025-10-10T01:29:37,315 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/DtypeDecl.h' 2025-10-10T01:29:37,316 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/MmaDecl.h' 2025-10-10T01:29:37,318 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/trtllmGen_bmm_export/trtllm/gen/SfLayoutDecl.h' 2025-10-10T01:29:37,320 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh' 2025-10-10T01:29:37,322 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h' 2025-10-10T01:29:37,323 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h' 2025-10-10T01:29:37,326 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh' 2025-10-10T01:29:37,327 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h' 2025-10-10T01:29:37,330 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h' 2025-10-10T01:29:37,331 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h' 2025-10-10T01:29:37,335 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh' 2025-10-10T01:29:37,337 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h' 2025-10-10T01:29:37,339 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh' 2025-10-10T01:29:37,341 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h' 2025-10-10T01:29:37,345 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h' 2025-10-10T01:29:37,347 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h' 2025-10-10T01:29:37,348 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh' 2025-10-10T01:29:37,351 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h' 2025-10-10T01:29:37,353 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h' 2025-10-10T01:29:37,356 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh' 2025-10-10T01:29:37,358 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h' 2025-10-10T01:29:37,360 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh' 2025-10-10T01:29:37,362 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h' 2025-10-10T01:29:37,365 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h' 2025-10-10T01:29:37,368 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h' 2025-10-10T01:29:37,374 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h' 2025-10-10T01:29:37,377 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h' 2025-10-10T01:29:37,379 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h' 2025-10-10T01:29:37,382 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h' 2025-10-10T01:29:37,384 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h' 2025-10-10T01:29:37,387 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h' 2025-10-10T01:29:37,388 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h' 2025-10-10T01:29:37,390 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h' 2025-10-10T01:29:37,392 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h' 2025-10-10T01:29:37,393 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h' 2025-10-10T01:29:37,397 adding 'flashinfer/data/spdlog/include/spdlog/async.h' 2025-10-10T01:29:37,398 adding 'flashinfer/data/spdlog/include/spdlog/async_logger-inl.h' 2025-10-10T01:29:37,400 adding 'flashinfer/data/spdlog/include/spdlog/async_logger.h' 2025-10-10T01:29:37,401 adding 'flashinfer/data/spdlog/include/spdlog/common-inl.h' 2025-10-10T01:29:37,403 adding 'flashinfer/data/spdlog/include/spdlog/common.h' 2025-10-10T01:29:37,405 adding 'flashinfer/data/spdlog/include/spdlog/formatter.h' 2025-10-10T01:29:37,406 adding 'flashinfer/data/spdlog/include/spdlog/fwd.h' 2025-10-10T01:29:37,408 adding 'flashinfer/data/spdlog/include/spdlog/logger-inl.h' 2025-10-10T01:29:37,410 adding 'flashinfer/data/spdlog/include/spdlog/logger.h' 2025-10-10T01:29:37,412 adding 'flashinfer/data/spdlog/include/spdlog/mdc.h' 2025-10-10T01:29:37,416 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h' 2025-10-10T01:29:37,418 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter.h' 2025-10-10T01:29:37,419 adding 'flashinfer/data/spdlog/include/spdlog/spdlog-inl.h' 2025-10-10T01:29:37,421 adding 'flashinfer/data/spdlog/include/spdlog/spdlog.h' 2025-10-10T01:29:37,423 adding 'flashinfer/data/spdlog/include/spdlog/stopwatch.h' 2025-10-10T01:29:37,424 adding 'flashinfer/data/spdlog/include/spdlog/tweakme.h' 2025-10-10T01:29:37,426 adding 'flashinfer/data/spdlog/include/spdlog/version.h' 2025-10-10T01:29:37,428 adding 'flashinfer/data/spdlog/include/spdlog/cfg/argv.h' 2025-10-10T01:29:37,429 adding 'flashinfer/data/spdlog/include/spdlog/cfg/env.h' 2025-10-10T01:29:37,431 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h' 2025-10-10T01:29:37,432 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers.h' 2025-10-10T01:29:37,435 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h' 2025-10-10T01:29:37,436 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer.h' 2025-10-10T01:29:37,438 adding 'flashinfer/data/spdlog/include/spdlog/details/circular_q.h' 2025-10-10T01:29:37,439 adding 'flashinfer/data/spdlog/include/spdlog/details/console_globals.h' 2025-10-10T01:29:37,441 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h' 2025-10-10T01:29:37,442 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper.h' 2025-10-10T01:29:37,444 adding 'flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h' 2025-10-10T01:29:37,445 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h' 2025-10-10T01:29:37,446 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg.h' 2025-10-10T01:29:37,448 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h' 2025-10-10T01:29:37,449 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h' 2025-10-10T01:29:37,451 adding 'flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h' 2025-10-10T01:29:37,452 adding 'flashinfer/data/spdlog/include/spdlog/details/null_mutex.h' 2025-10-10T01:29:37,455 adding 'flashinfer/data/spdlog/include/spdlog/details/os-inl.h' 2025-10-10T01:29:37,456 adding 'flashinfer/data/spdlog/include/spdlog/details/os.h' 2025-10-10T01:29:37,458 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h' 2025-10-10T01:29:37,459 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h' 2025-10-10T01:29:37,461 adding 'flashinfer/data/spdlog/include/spdlog/details/registry-inl.h' 2025-10-10T01:29:37,463 adding 'flashinfer/data/spdlog/include/spdlog/details/registry.h' 2025-10-10T01:29:37,464 adding 'flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h' 2025-10-10T01:29:37,466 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h' 2025-10-10T01:29:37,467 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client.h' 2025-10-10T01:29:37,469 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h' 2025-10-10T01:29:37,470 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool.h' 2025-10-10T01:29:37,472 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h' 2025-10-10T01:29:37,473 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client.h' 2025-10-10T01:29:37,474 adding 'flashinfer/data/spdlog/include/spdlog/details/windows_include.h' 2025-10-10T01:29:37,477 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h' 2025-10-10T01:29:37,478 adding 'flashinfer/data/spdlog/include/spdlog/fmt/chrono.h' 2025-10-10T01:29:37,480 adding 'flashinfer/data/spdlog/include/spdlog/fmt/compile.h' 2025-10-10T01:29:37,481 adding 'flashinfer/data/spdlog/include/spdlog/fmt/fmt.h' 2025-10-10T01:29:37,482 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ostr.h' 2025-10-10T01:29:37,483 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ranges.h' 2025-10-10T01:29:37,485 adding 'flashinfer/data/spdlog/include/spdlog/fmt/std.h' 2025-10-10T01:29:37,486 adding 'flashinfer/data/spdlog/include/spdlog/fmt/xchar.h' 2025-10-10T01:29:37,489 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h' 2025-10-10T01:29:37,497 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h' 2025-10-10T01:29:37,500 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h' 2025-10-10T01:29:37,503 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h' 2025-10-10T01:29:37,516 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h' 2025-10-10T01:29:37,518 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst' 2025-10-10T01:29:37,528 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h' 2025-10-10T01:29:37,550 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h' 2025-10-10T01:29:37,553 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h' 2025-10-10T01:29:37,556 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h' 2025-10-10T01:29:37,557 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h' 2025-10-10T01:29:37,560 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h' 2025-10-10T01:29:37,564 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h' 2025-10-10T01:29:37,566 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h' 2025-10-10T01:29:37,568 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h' 2025-10-10T01:29:37,571 adding 'flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h' 2025-10-10T01:29:37,573 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h' 2025-10-10T01:29:37,574 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h' 2025-10-10T01:29:37,575 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h' 2025-10-10T01:29:37,577 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h' 2025-10-10T01:29:37,578 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h' 2025-10-10T01:29:37,580 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h' 2025-10-10T01:29:37,581 adding 'flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h' 2025-10-10T01:29:37,583 adding 'flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h' 2025-10-10T01:29:37,584 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h' 2025-10-10T01:29:37,586 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h' 2025-10-10T01:29:37,587 adding 'flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h' 2025-10-10T01:29:37,589 adding 'flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h' 2025-10-10T01:29:37,590 adding 'flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h' 2025-10-10T01:29:37,592 adding 'flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h' 2025-10-10T01:29:37,593 adding 'flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h' 2025-10-10T01:29:37,594 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h' 2025-10-10T01:29:37,596 adding 'flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h' 2025-10-10T01:29:37,598 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h' 2025-10-10T01:29:37,599 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h' 2025-10-10T01:29:37,601 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h' 2025-10-10T01:29:37,602 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h' 2025-10-10T01:29:37,604 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink.h' 2025-10-10T01:29:37,605 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h' 2025-10-10T01:29:37,606 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h' 2025-10-10T01:29:37,608 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h' 2025-10-10T01:29:37,609 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h' 2025-10-10T01:29:37,611 adding 'flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h' 2025-10-10T01:29:37,612 adding 'flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h' 2025-10-10T01:29:37,614 adding 'flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h' 2025-10-10T01:29:37,615 adding 'flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h' 2025-10-10T01:29:37,617 adding 'flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h' 2025-10-10T01:29:37,619 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h' 2025-10-10T01:29:37,621 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h' 2025-10-10T01:29:37,622 adding 'flashinfer/data/spdlog/scripts/extract_version.py' 2025-10-10T01:29:37,624 adding 'flashinfer/fused_moe/__init__.py' 2025-10-10T01:29:37,631 adding 'flashinfer/fused_moe/core.py' 2025-10-10T01:29:37,633 adding 'flashinfer/fused_moe/utils.py' 2025-10-10T01:29:37,635 adding 'flashinfer/jit/__init__.py' 2025-10-10T01:29:37,637 adding 'flashinfer/jit/activation.py' 2025-10-10T01:29:37,638 adding 'flashinfer/jit/cascade.py' 2025-10-10T01:29:37,639 adding 'flashinfer/jit/comm.py' 2025-10-10T01:29:37,642 adding 'flashinfer/jit/core.py' 2025-10-10T01:29:37,644 adding 'flashinfer/jit/cpp_ext.py' 2025-10-10T01:29:37,646 adding 'flashinfer/jit/cubin_loader.py' 2025-10-10T01:29:37,647 adding 'flashinfer/jit/env.py' 2025-10-10T01:29:37,649 adding 'flashinfer/jit/fp4_quantization.py' 2025-10-10T01:29:37,650 adding 'flashinfer/jit/fp8_quantization.py' 2025-10-10T01:29:37,652 adding 'flashinfer/jit/fused_moe.py' 2025-10-10T01:29:37,653 adding 'flashinfer/jit/mla.py' 2025-10-10T01:29:37,654 adding 'flashinfer/jit/norm.py' 2025-10-10T01:29:37,655 adding 'flashinfer/jit/page.py' 2025-10-10T01:29:37,657 adding 'flashinfer/jit/quantization.py' 2025-10-10T01:29:37,658 adding 'flashinfer/jit/rope.py' 2025-10-10T01:29:37,659 adding 'flashinfer/jit/sampling.py' 2025-10-10T01:29:37,660 adding 'flashinfer/jit/spdlog.py' 2025-10-10T01:29:37,662 adding 'flashinfer/jit/tllm_utils.py' 2025-10-10T01:29:37,663 adding 'flashinfer/jit/utils.py' 2025-10-10T01:29:37,665 adding 'flashinfer/jit/xqa.py' 2025-10-10T01:29:37,667 adding 'flashinfer/jit/attention/__init__.py' 2025-10-10T01:29:37,670 adding 'flashinfer/jit/attention/modules.py' 2025-10-10T01:29:37,672 adding 'flashinfer/jit/attention/utils.py' 2025-10-10T01:29:37,674 adding 'flashinfer/jit/attention/variants.py' 2025-10-10T01:29:37,676 adding 'flashinfer/jit/gemm/__init__.py' 2025-10-10T01:29:37,678 adding 'flashinfer/jit/gemm/core.py' 2025-10-10T01:29:37,679 adding 'flashinfer/jit/gemm/deepgemm.py' 2025-10-10T01:29:37,681 adding 'flashinfer/jit/gemm/cutlass/__init__.py' 2025-10-10T01:29:37,686 adding 'flashinfer/jit/gemm/cutlass/cutlass_library.py' 2025-10-10T01:29:37,690 adding 'flashinfer/jit/gemm/cutlass/generate_kernels.py' 2025-10-10T01:29:37,692 adding 'flashinfer/logits_processor/__init__.py' 2025-10-10T01:29:37,693 adding 'flashinfer/logits_processor/compiler.py' 2025-10-10T01:29:37,695 adding 'flashinfer/logits_processor/fusion_rules.py' 2025-10-10T01:29:37,696 adding 'flashinfer/logits_processor/legalization.py' 2025-10-10T01:29:37,698 adding 'flashinfer/logits_processor/op.py' 2025-10-10T01:29:37,700 adding 'flashinfer/logits_processor/operators.py' 2025-10-10T01:29:37,702 adding 'flashinfer/logits_processor/pipeline.py' 2025-10-10T01:29:37,704 adding 'flashinfer/logits_processor/processors.py' 2025-10-10T01:29:37,705 adding 'flashinfer/logits_processor/types.py' 2025-10-10T01:29:37,707 adding 'flashinfer/logits_processor/validators.py' 2025-10-10T01:29:37,709 adding 'flashinfer/profiler/__init__.py' 2025-10-10T01:29:37,711 adding 'flashinfer/testing/__init__.py' 2025-10-10T01:29:37,715 adding 'flashinfer/testing/utils.py' 2025-10-10T01:29:37,717 adding 'flashinfer/triton/__init__.py' 2025-10-10T01:29:37,719 adding 'flashinfer/triton/activation.py' 2025-10-10T01:29:37,720 adding 'flashinfer/triton/cascade.py' 2025-10-10T01:29:37,722 adding 'flashinfer/triton/gemm.py' 2025-10-10T01:29:37,723 adding 'flashinfer/triton/norm.py' 2025-10-10T01:29:37,724 adding 'flashinfer/triton/page.py' 2025-10-10T01:29:37,726 adding 'flashinfer/triton/sm_constraint_gemm.py' 2025-10-10T01:29:37,728 adding 'flashinfer/triton/utils.py' 2025-10-10T01:29:37,730 adding 'flashinfer/triton/kernels/__init__.py' 2025-10-10T01:29:37,731 adding 'flashinfer/triton/kernels/activation.py' 2025-10-10T01:29:37,733 adding 'flashinfer/triton/kernels/cascade.py' 2025-10-10T01:29:37,734 adding 'flashinfer/triton/kernels/norm.py' 2025-10-10T01:29:37,735 adding 'flashinfer/triton/kernels/quant.py' 2025-10-10T01:29:37,737 adding 'flashinfer/triton/kernels/sm_constraint_gemm.py' 2025-10-10T01:29:37,739 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py' 2025-10-10T01:29:37,741 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py' 2025-10-10T01:29:37,745 adding 'flashinfer_python-0.4.0.dist-info/licenses/LICENSE' 2025-10-10T01:29:37,747 adding 'flashinfer_python-0.4.0.dist-info/licenses/licenses/LICENSE.cutlass.txt' 2025-10-10T01:29:37,749 adding 'flashinfer_python-0.4.0.dist-info/licenses/licenses/LICENSE.flashattention3.txt' 2025-10-10T01:29:37,750 adding 'flashinfer_python-0.4.0.dist-info/licenses/licenses/LICENSE.fmt.txt' 2025-10-10T01:29:37,752 adding 'flashinfer_python-0.4.0.dist-info/licenses/licenses/LICENSE.spdlog.txt' 2025-10-10T01:29:37,754 adding 'flashinfer_python-0.4.0.dist-info/METADATA' 2025-10-10T01:29:37,755 adding 'flashinfer_python-0.4.0.dist-info/WHEEL' 2025-10-10T01:29:37,756 adding 'flashinfer_python-0.4.0.dist-info/entry_points.txt' 2025-10-10T01:29:37,757 adding 'flashinfer_python-0.4.0.dist-info/top_level.txt' 2025-10-10T01:29:37,793 adding 'flashinfer_python-0.4.0.dist-info/RECORD' 2025-10-10T01:29:37,899 removing build/bdist.linux-armv7l/wheel 2025-10-10T01:29:38,592 Building wheel for flashinfer-python (pyproject.toml): finished with status 'done' 2025-10-10T01:29:38,753 Created wheel for flashinfer-python: filename=flashinfer_python-0.4.0-py3-none-any.whl size=6769103 sha256=da0141b2163f9703e49972728eeb502d45eda60c25529a460d0d0d61963eedb2 2025-10-10T01:29:38,755 Stored in directory: /tmp/pip-ephem-wheel-cache-4qmj4zzp/wheels/0b/82/2b/ed8f803cab85790bcb920b1b045c4e216b051d95f01fa6d976 2025-10-10T01:29:38,824 Successfully built flashinfer-python 2025-10-10T01:29:38,996 Removed build tracker: '/tmp/pip-build-tracker-7ss9dt55'