2026-04-24T15:41:51,180 Created temporary directory: /tmp/pip-ephem-wheel-cache-sqrrcvqu
2026-04-24T15:41:51,182 Created temporary directory: /tmp/pip-build-tracker-0l9cu84t
2026-04-24T15:41:51,183 Initialized build tracking at /tmp/pip-build-tracker-0l9cu84t
2026-04-24T15:41:51,183 Created build tracker: /tmp/pip-build-tracker-0l9cu84t
2026-04-24T15:41:51,183 Entered build tracker: /tmp/pip-build-tracker-0l9cu84t
2026-04-24T15:41:51,184 Created temporary directory: /tmp/pip-wheel-va5s0at6
2026-04-24T15:41:51,188 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453
2026-04-24T15:41:51,190 Created temporary directory: /tmp/pip-ephem-wheel-cache-63k8cme3
2026-04-24T15:41:51,211 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple
2026-04-24T15:41:51,215 2 location(s) to search for versions of flashinfer-python:
2026-04-24T15:41:51,215 * https://pypi.org/simple/flashinfer-python/
2026-04-24T15:41:51,215 * https://www.piwheels.org/simple/flashinfer-python/
2026-04-24T15:41:51,215 Fetching project page and analyzing links: https://pypi.org/simple/flashinfer-python/
2026-04-24T15:41:51,216 Getting page https://pypi.org/simple/flashinfer-python/
2026-04-24T15:41:51,218 Found index url https://pypi.org/simple
2026-04-24T15:41:51,369 Fetched page https://pypi.org/simple/flashinfer-python/ as application/vnd.pypi.simple.v1+json
2026-04-24T15:41:51,386 Found link https://files.pythonhosted.org/packages/6c/e9/5d6adcf888922a17c6fc52a0e5bed78785239af1219f41e1073b063a07ff/flashinfer_python-0.2.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8),
version: 0.2.0.post1 2026-04-24T15:41:51,387 Found link https://files.pythonhosted.org/packages/c8/39/bac839234a3beaab4292e489b4d8941cc97ba4f76474aff0407d7b05a84f/flashinfer_python-0.2.0.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post2 2026-04-24T15:41:51,388 Found link https://files.pythonhosted.org/packages/94/74/4dda2a7a7aa08bcfb8039faf2202bf0fea6b378d0d4968864737400fc329/flashinfer_python-0.2.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1 2026-04-24T15:41:51,389 Found link https://files.pythonhosted.org/packages/7f/3d/aab500609825108d3f6a4b440a7eeb6436d578d3e781e97ea015fd49a530/flashinfer_python-0.2.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post1 2026-04-24T15:41:51,391 Found link https://files.pythonhosted.org/packages/30/ac/afd1d2c472857be8f83389eb506e1413a2ac3a603889bea3cf24d5ab5be5/flashinfer_python-0.2.1.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post2 2026-04-24T15:41:51,392 Found link https://files.pythonhosted.org/packages/90/00/833dd50745bc15bb7a7451b77589d444ce963d48c0cb730b4760bfebffad/flashinfer_python-0.2.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2 2026-04-24T15:41:51,393 Found link https://files.pythonhosted.org/packages/02/cc/db9635c56653d3fa5a28f14ac858e0801de621aa33d3b528e4781aee906f/flashinfer_python-0.2.2.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2.post1 2026-04-24T15:41:51,394 Found link https://files.pythonhosted.org/packages/b6/10/2a63f1d09c5b337705236005dc9ccce513dcc08b7fd037cb40426f1695b1/flashinfer_python-0.2.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.3 2026-04-24T15:41:51,395 Found link 
https://files.pythonhosted.org/packages/a4/e5/8d193ccf65b92c009c4be50fdffa88fa0edc8fd6e6169bacaca6bab84d89/flashinfer_python-0.2.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.4 2026-04-24T15:41:51,396 Found link https://files.pythonhosted.org/packages/b2/c4/9ec0f79e2480fc5c93307c4a1ac903e5cf33c551c0eaeb648196234b55af/flashinfer_python-0.2.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.5 2026-04-24T15:41:51,398 Found link https://files.pythonhosted.org/packages/95/4a/a3109d57463d25a153b16c0d0f06495e4d18b727c81f8e08e42e97faaf45/flashinfer_python-0.2.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6 2026-04-24T15:41:51,398 Found link https://files.pythonhosted.org/packages/34/26/3c6f12ffaefbfa0c453030d6e15941269b3a4ffcd267daec32d1a10dda96/flashinfer_python-0.2.6.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6.post1 2026-04-24T15:41:51,399 Found link https://files.pythonhosted.org/packages/f9/a0/5e700751f2393a504bc5eb2879e77d783a5b70778a254289711323126abc/flashinfer_python-0.2.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7 2026-04-24T15:41:51,400 Found link https://files.pythonhosted.org/packages/c0/10/43cf1ea7a03ca8e75a185190708e48286e1583d781e93d1de130e5d450ca/flashinfer_python-0.2.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7.post1 2026-04-24T15:41:51,401 Found link https://files.pythonhosted.org/packages/f1/80/8dfae62d04af4597d7615b892f346ace68bcb07dfbef2a9e614219d96a8a/flashinfer_python-0.2.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8rc1 2026-04-24T15:41:51,402 Found link 
https://files.pythonhosted.org/packages/72/0e/827624993516e80f62ba88dd368ad5e180c41324f063c00d27fa638a430e/flashinfer_python-0.2.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8 2026-04-24T15:41:51,403 Found link https://files.pythonhosted.org/packages/17/50/42afc9a81031939140fcbfd93e5a3652dc4995e338b4e6d007b0dda04f93/flashinfer_python-0.2.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc1 2026-04-24T15:41:51,405 Found link https://files.pythonhosted.org/packages/ed/1a/9f30eda3178ed2f5f7e311ae0011d02c4542d087f84c9247e4b30668b767/flashinfer_python-0.2.9rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc2 2026-04-24T15:41:51,406 Found link https://files.pythonhosted.org/packages/45/fc/4deff13f1420cc6e5871b7505a6c0d9031eb49cd09571ae576aec59bed61/flashinfer_python-0.2.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9 2026-04-24T15:41:51,407 Found link https://files.pythonhosted.org/packages/74/e4/2c6d6a19d13ed13d4863f6900febe72b502334e43292d5fe9a1ac2f6c5be/flashinfer_python-0.2.10.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.10 2026-04-24T15:41:51,408 Found link https://files.pythonhosted.org/packages/72/8b/f315dda5993d1c018ca5ecfef0775c6a3c7a8f59ac426fabb7f3f6b93482/flashinfer_python-0.2.11.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11 2026-04-24T15:41:51,409 Found link https://files.pythonhosted.org/packages/37/e3/2e8e31f7f7ee26f39968264e4fcf74f9810d90e940859016d974106ed5c6/flashinfer_python-0.2.11.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post1 2026-04-24T15:41:51,410 Found link 
https://files.pythonhosted.org/packages/b6/01/fa069f076cfe5bed34ddc3b7f772aa09c70e03e572dd9d3569ff887f33b1/flashinfer_python-0.2.11.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post2 2026-04-24T15:41:51,411 Found link https://files.pythonhosted.org/packages/a3/09/5d89ef0bc2d19d3ebcf3b9fa621c945909f681818c9d55aa3181921db874/flashinfer_python-0.2.11.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post3 2026-04-24T15:41:51,412 Found link https://files.pythonhosted.org/packages/b9/5a/7a839afb07af313549b9d9f1057b02aaf067f020267d5a9d128e50596bf4/flashinfer_python-0.2.12.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.12 2026-04-24T15:41:51,413 Found link https://files.pythonhosted.org/packages/f2/20/e79142a9f26aab61b17e2c906a49e9a3d3c656d97608c8773785c3b13140/flashinfer_python-0.2.13.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.13 2026-04-24T15:41:51,414 Found link https://files.pythonhosted.org/packages/ed/26/d1eac56b37d225cb3f84495bd897829dece21f62463487f3c1d9cafe78a0/flashinfer_python-0.2.14.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14 2026-04-24T15:41:51,415 Found link https://files.pythonhosted.org/packages/94/d4/4a2bf3d49f84b2d975925c1c024790b4e4768bdefbc5e27529d68368355a/flashinfer_python-0.2.14.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14.post1 2026-04-24T15:41:51,416 Found link https://files.pythonhosted.org/packages/56/e3/7c0a4df2640a97ecfed45fe9110ecc6a67d4967278723abf8e6531b6bc1f/flashinfer_python-0.3.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0rc1 2026-04-24T15:41:51,417 Found link 
https://files.pythonhosted.org/packages/1f/b4/5c4cbb0f3cbc5e8d4c19b3f163c048eed959a0ac0c603cfb3939a3079c52/flashinfer_python-0.3.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0 2026-04-24T15:41:51,418 Found link https://files.pythonhosted.org/packages/59/1b/83a9c58432b4a5d6ff04b97d4873bedfb5e35d38972ca8946b3acdbffeb4/flashinfer_python-0.3.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0.post1 2026-04-24T15:41:51,419 Found link https://files.pythonhosted.org/packages/ba/71/dd3001b8be8174d90561764a5f3be4ca219517bde2841189ea6973a3873f/flashinfer_python-0.3.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1 2026-04-24T15:41:51,421 Found link https://files.pythonhosted.org/packages/49/a7/f5bd3878f94fc47e25ecc0828f910233022366f7e832dfa02f3617fad41f/flashinfer_python-0.3.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1.post1 2026-04-24T15:41:51,422 Found link https://files.pythonhosted.org/packages/df/b4/f113bb950e5244d1c72c3d73c03fac0db939f085670e3a45a41fe92ffde0/flashinfer_python-0.4.0rc0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc0 2026-04-24T15:41:51,423 Found link https://files.pythonhosted.org/packages/2e/a8/adceccda3aae01b7bdb5f99c68a2b401c58600f34a6386d9489ff736cdbc/flashinfer_python-0.4.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc1 2026-04-24T15:41:51,424 Found link https://files.pythonhosted.org/packages/15/c0/5fb88fc273fed23dbf3b0ef0bffa7db26e2df24e016202df1b4e98b95879/flashinfer_python-0.4.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc2 2026-04-24T15:41:51,425 Found link 
https://files.pythonhosted.org/packages/65/91/cf9e3a0a2626711bfab18ea4a4c739e0eb823e9513addc0e9e1b8f929538/flashinfer_python-0.4.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc3 2026-04-24T15:41:51,426 Found link https://files.pythonhosted.org/packages/94/ec/bdcc0ec502994d544cbe69763d999458ae2deda67e58c1cb2d85867677c4/flashinfer_python-0.4.0rc4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc4 2026-04-24T15:41:51,427 Found link https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0 2026-04-24T15:41:51,428 Found link https://files.pythonhosted.org/packages/64/cf/f82142abd7c819fb84a53f18fe1ac9e7cf1af8790b93c06dbf430001473b/flashinfer_python-0.4.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.1 2026-04-24T15:41:51,429 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/c7/92/126dacc3476fab07478bdfc9944abd22aafa1000088d93bf86fb9ec78a29/flashinfer_python-0.5.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,430 Found link https://files.pythonhosted.org/packages/53/47/a759f1ae9ef4ceb4e12895665b65dfacea2085494626e764627dd3548fa8/flashinfer_python-0.5.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc1 2026-04-24T15:41:51,431 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/fb/aa/7b5d28c2aec11acfce18f2655d0b4614c7e34547fab218b4f2fd0d57bdce/flashinfer_python-0.5.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,432 Found link 
https://files.pythonhosted.org/packages/3d/5a/58a7b60f79a1ac9c652b4055b06e88b5f57e8ef4c7dd4830ef48fa4cc265/flashinfer_python-0.5.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc2 2026-04-24T15:41:51,432 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/5f/8f/7077cf0a44056a65045a793d6d55845d95818fb6455bfebb44ddea7e1f12/flashinfer_python-0.5.0rc3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,433 Found link https://files.pythonhosted.org/packages/60/d1/8c90d6dfc95ab609028e9d541a6cdb3483f5c1475b07d97465ff3f0db14c/flashinfer_python-0.5.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc3 2026-04-24T15:41:51,434 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/eb/8a/425b75b44ce5eeefe01dd61d4ee260b8e5f9dcf1a500d5f08d6cd4095d3a/flashinfer_python-0.5.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,435 Found link https://files.pythonhosted.org/packages/e3/1d/b82cd2606f4f0033e2fb28194dc3b04fd8101643e4ceb1d13fb1466cfd28/flashinfer_python-0.5.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0 2026-04-24T15:41:51,436 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f4/f1/33dedad087a2bc3d66244126bd5d1c79721ea22d1f2124299f9e5bdaf3b1/flashinfer_python-0.5.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,437 Found link https://files.pythonhosted.org/packages/6c/bb/897c3b9d683dcf6490f70e468efb585eebcd673970b13a04ed947b491982/flashinfer_python-0.5.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.1 
2026-04-24T15:41:51,438 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/8d/0c/4a8ffbbc0d85e314f534cf5c32711f2af5d5e6e49225a5a414400a67b684/flashinfer_python-0.5.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,439 Found link https://files.pythonhosted.org/packages/d8/04/e357eaa50238e12c49e66fcf47f83e066e741ef19a117c136782b32eafbb/flashinfer_python-0.5.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.2 2026-04-24T15:41:51,439 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/78/6dc7e7da8cb87c9965644ea0d2439457a1bc9256c45ceda0044595be4143/flashinfer_python-0.5.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,440 Found link https://files.pythonhosted.org/packages/b4/91/cca69baeff24bb3efd12c7479a026432c8717ee47193694010494c528b22/flashinfer_python-0.5.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.5.3 2026-04-24T15:41:51,441 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/b2/0c/cb2d60eb86f0171451d676f17b90484ab66baf73c54cefe15c9a7c800739/flashinfer_python-0.6.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,442 Found link https://files.pythonhosted.org/packages/53/2a/e855be4851ad6bfcebed929807fb541715f9a3a7d7b239b696e635b49d0e/flashinfer_python-0.6.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc1 2026-04-24T15:41:51,443 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/05/22/9193f1da2468acec8ba99c4bee8aeacbda489777acf00b5871a73209acf7/flashinfer_python-0.6.0rc2-py3-none-any.whl (from 
https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,444 Found link https://files.pythonhosted.org/packages/1b/71/dd1bb86ea531e5c1a34f8ad851901bf2e2ce500618b5a4da19bd69f7de11/flashinfer_python-0.6.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc2 2026-04-24T15:41:51,445 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/90/5834597488f5ea62b1cc874338125c79ce21c11d777ac6f7b47f12cf2bb3/flashinfer_python-0.6.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,445 Found link https://files.pythonhosted.org/packages/ad/8d/c7330f27f09b9110af2f6c44c6f68d7b536f525f8ac539210073bfcdb965/flashinfer_python-0.6.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0 2026-04-24T15:41:51,446 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/d5/bca632bb5781689415186421bbee2ad39ae8a39b0996d579c76901e5c66f/flashinfer_python-0.6.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,447 Found link https://files.pythonhosted.org/packages/68/81/5a84e14df7358d2c2903b18c6f2779bd4b4a6739076d01a847d4c18fb102/flashinfer_python-0.6.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.1 2026-04-24T15:41:51,448 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/aa/c0/ee819d16f6b40e287727bb3db471f4eaa9e0372e233bf2f7343faaa3009f/flashinfer_python-0.6.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,449 Found link https://files.pythonhosted.org/packages/89/86/b25115177606ae3b6cec373d290798c28e185d033b66f6b80a89589e7786/flashinfer_python-0.6.2.tar.gz 
(from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.2 2026-04-24T15:41:51,450 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/13/2d95248101d8cb978db9000a4dceafb5b122484a694b53e84df1ac2a7b3d/flashinfer_python-0.6.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,451 Found link https://files.pythonhosted.org/packages/d6/aa/c564313b42dee7573da4ed0e441844f0c2bd827aecc9f29ea02c3838ffae/flashinfer_python-0.6.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.3 2026-04-24T15:41:51,452 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/17/9a/d2bab76d2bb15062c6a2329614653e4f8bec9c78eec9069856ef0c7c0a79/flashinfer_python-0.6.4-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,453 Found link https://files.pythonhosted.org/packages/77/45/15645d2a4ee81d08206f3e132a77323e48312f510462415d7cd1122eba43/flashinfer_python-0.6.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.4 2026-04-24T15:41:51,454 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/4f/83/eea2a74700b5fcae36ee2b748db9c3554a83a3f9e2dc4f3816369c5cb653/flashinfer_python-0.6.5-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,455 Found link https://files.pythonhosted.org/packages/e2/2f/5c52276af3cc40ac1f6eaf823ccd8e257f77e2fe5d465fa641ad3dba4d1b/flashinfer_python-0.6.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.5 2026-04-24T15:41:51,455 Skipping link: No binaries permitted for flashinfer-python: 
https://files.pythonhosted.org/packages/e0/61/385d06755f3ab66333018285657adf0daf8a90a129448231fd09e315bd2e/flashinfer_python-0.6.6-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,456 Found link https://files.pythonhosted.org/packages/03/70/c5a235297351021f5d3d3233523a85f5a6468495587489ad2f257e8eafe2/flashinfer_python-0.6.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.6 2026-04-24T15:41:51,457 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f1/e8/91361a5f07667f36181cfd08e2d7d28be4cae2aa5a24016339174b308c38/flashinfer_python-0.6.7-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,457 Found link https://files.pythonhosted.org/packages/d9/2d/aa36fa1fee744c46fef99436baea5cda4a34244846c1df0fea97eaa9a856/flashinfer_python-0.6.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7 2026-04-24T15:41:51,458 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/16/92/516c79e5d8d1f0b41793e499c37a9299115ac8bc05171661b30d4a94beb8/flashinfer_python-0.6.7.post1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,459 Found link https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post1 2026-04-24T15:41:51,460 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/62/9e/bf26a95bb219eb3d43cc6f3cd1dde6f560081fbcb50f846535c9f571a807/flashinfer_python-0.6.7.post2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 
2026-04-24T15:41:51,461 Found link https://files.pythonhosted.org/packages/cc/95/81eafb78574312db79ef7144a4e77f2fee015343f413ef3000f279c8a118/flashinfer_python-0.6.7.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post2 2026-04-24T15:41:51,462 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/01/6b/4117cd7cbeff07818ae7c6b8bf5a6d1ee3eed29356672b731b55af3d4453/flashinfer_python-0.6.7.post3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,463 Found link https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post3 2026-04-24T15:41:51,463 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/6a/0a/e8ae05fd59f800e74ec24fa6a58a04c6c0d9308917880c42f2b53cfe36bb/flashinfer_python-0.6.8rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,464 Found link https://files.pythonhosted.org/packages/68/e1/67b0b5eb9f3ea23e05e7d454571ad7a186ede6a9c30fec55e51291bfa461/flashinfer_python-0.6.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.8rc1 2026-04-24T15:41:51,465 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/9e/f8/54f8764748f1ba7d45a1915a1a51ad08f63b68a2f2141e399bdb0379d146/flashinfer_python-0.6.8-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,466 Found link https://files.pythonhosted.org/packages/7e/14/869ae016b4249db0b312203e4ba19b86406ce98417abf80fd2003af0a1a7/flashinfer_python-0.6.8.tar.gz (from 
https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.8 2026-04-24T15:41:51,467 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/73/6d/1e8a8533913e33a50a486332ce0673f4fdb860f6eb9ed450327c5c1762cb/flashinfer_python-0.6.8.post1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,468 Found link https://files.pythonhosted.org/packages/53/1e/2760fef9e74abc4480961048e5790b4c9e955872fb4d7d97900cfddced5a/flashinfer_python-0.6.8.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.8.post1 2026-04-24T15:41:51,468 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/ab/96/f64c9c8845cfb04acb6766c8f0b12488fc5d439c3c67f5710f24e44cfcf8/flashinfer_python-0.6.9rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,469 Found link https://files.pythonhosted.org/packages/48/aa/4ed362e1ee900a78b9255cd556adff395cd77a605c2dfe5741685f72bf4d/flashinfer_python-0.6.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.9rc1 2026-04-24T15:41:51,470 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/fa/fd/1d1b03d696cee94b387b2e3ee58c23a00fc74bb34b77a7bbdaf63012c530/flashinfer_python-0.6.9-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,471 Found link https://files.pythonhosted.org/packages/dc/38/64c39bc71ec538061f707bc30d36d1a2c4d1e95d662fd6282fb74224f2e3/flashinfer_python-0.6.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.9 2026-04-24T15:41:51,472 Fetching project page and analyzing links: https://www.piwheels.org/simple/flashinfer-python/ 
2026-04-24T15:41:51,472 Getting page https://www.piwheels.org/simple/flashinfer-python/ 2026-04-24T15:41:51,474 Found index url https://www.piwheels.org/simple 2026-04-24T15:41:51,660 Fetched page https://www.piwheels.org/simple/flashinfer-python/ as text/html 2026-04-24T15:41:51,670 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.9rc1-py3-none-any.whl#sha256=4bb2c38a3b3996432afd32e8509c4970660b964666d24ac9a11f6e605fe3812d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,671 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.8.post1-py3-none-any.whl#sha256=8bdb31c966879fb7814fd3025875c6da218ecf7d5021878a92c391e903379693 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,672 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.8-py3-none-any.whl#sha256=98f06ee98dd03f9d20637980976634ad9f62e7072236284bf8ef0f6ea644d6f1 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,672 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.8rc1-py3-none-any.whl#sha256=03391613bde22d44aa6cc1c6e53ba651f46f264a4dee2b4b6b1088a35f872db7 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,673 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post3-py3-none-any.whl#sha256=7a81720af5bdc04efcb67207f3867adb1b068f961d2e048e55baf32fb8e2cfc5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,674 Skipping link: 
No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post1-py3-none-any.whl#sha256=c9bf5183228f6636ddb26d7354f250af4b2385876527538a0ff7f94fd48207d2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,674 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7-py3-none-any.whl#sha256=9b349825a2d26c3e4653c594d7a1d7b2126a43b29a4a70a6d48f3aaac23b96f3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,675 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.6-py3-none-any.whl#sha256=94791e01c31510c057b4decabff24cbc62466682667867e84214c62c45d9b343 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,675 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.5-py3-none-any.whl#sha256=4b0a6c246959ca2dbc232fa1fe2f17ff857fd258de5dfacfa45033f21b6b7b93 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,676 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.4-py3-none-any.whl#sha256=22ee7972266bb31ce1583330769efc0ecd001fb70371531ce4c77f2d6eda0d59 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,676 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.3-py3-none-any.whl#sha256=ed3282188580afd663819924a772b2b531ac5bb88438bbe89d0baf67fe8c9fa5 (from https://www.piwheels.org/simple/flashinfer-python/) 
(requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,677 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.1-py3-none-any.whl#sha256=9e0e308062a81d4e4c462313bfe33edce7712309e8c89aed722065249e644833 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,678 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0-py3-none-any.whl#sha256=7ebc0582df714a933fc4c58ed4d12f4e61b4ad30b22b9155f290e96ee3eee3a0 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,678 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc2-py3-none-any.whl#sha256=63057b7ee43a4f6764c6ed8fe4c4c6de5a94da058fe0975bf279db0567c26204 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,679 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc1-py3-none-any.whl#sha256=e30a125bf89f8155f83aca80e5fb88a3d81224225485ce70f0f4c4c3a27da92c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,679 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.3-py3-none-any.whl#sha256=1de562233dfbd8de835c2eb757275a7759eda034460093c1eb9ff3c7d5c0845d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-24T15:41:51,680 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.2-py3-none-any.whl#sha256=bd3d206d1243bee523cf6cda27e0219e8fdf9026ade2e32045c8d9d4b7f7bf7a (from 
https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,681 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.1-py3-none-any.whl#sha256=8d73e4b66b7eb7fc4500f7f7e61aa194efebc769e7da1635a86506c97bf6fa0d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,681 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0-py3-none-any.whl#sha256=ac991d1911cff4a7453f02d88922803e7ca794a0af1dceaa920e33b81c78f5c8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,682 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc3-py3-none-any.whl#sha256=8799f4a93afc14042ac6f521f6fb682e4d62d738dc18a1e8798b7a2ba5b2e4ec (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,682 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc2-py3-none-any.whl#sha256=4ee4d438c8c7fdc242a917c3f97076562f3c44411dcaceb4f7d29082c41c0f8c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,683 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc1-py3-none-any.whl#sha256=a9d675075f3cb79ac1b5cba9e8430496d3983127609dc780a117b2b44bdb025d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,684 Skipping link: No binaries permitted for flashinfer-python: 
https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.1-py3-none-any.whl#sha256=8fc8fc3233781e384689c5f202124ae7d266cb8dee14055cbb3c90fca530bf7f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,684 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.0-py3-none-any.whl#sha256=da0141b2163f9703e49972728eeb502d45eda60c25529a460d0d0d61963eedb2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-24T15:41:51,685 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.5-py3-none-any.whl#sha256=cb2a17c3ea5f47f8129f6410e2892f30051e15665f2ae54db540c8677c187d31 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,685 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.4-py3-none-any.whl#sha256=4a85bd6ac785f106f0ad9fe213abf42f96ab84ccd04aec3ab9acf76d47d2aa3f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,686 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.3-py3-none-any.whl#sha256=b8ead688a4857a2b360c992fb46ae2930fc4c43b50a092b7e42a13b40ee195da (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,686 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2.post1-py3-none-any.whl#sha256=0097a08376ae147084ea6bd0848fc2ea1764f524c510a48755aa8c63259b4466 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,688 Skipping link: No 
binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2-py3-none-any.whl#sha256=c109a340b7e60cb57d8c9ccec2c10e303a36b82a56ba8dcaaa0efbee2a48b97f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,688 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post2-py3-none-any.whl#sha256=dc91f387ba09e4df899238705ec37bbe3648395d828240b77db84378d1b91e9e (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,689 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post1-py3-none-any.whl#sha256=a44b9d872cf2ba6812d3c0750d98ad01b73e9ccbede933c7eade01b6c27b6232 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,689 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1-py3-none-any.whl#sha256=e07427d9eff1b8d091b5837c3ffc4fe7885dbf01d271d7225f7a89a2e3925f27 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,690 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post2-py3-none-any.whl#sha256=52c20b84ef1e848dd49c726ffc27801df8acccb4038aea61a2d73fa685bf75f8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,690 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post1-py3-none-any.whl#sha256=783c1039e0a7db0478a579d5cc54894def70ae601b1e5b90a3c3de2209334bf3 (from https://www.piwheels.org/simple/flashinfer-python/) 
(requires-python:<4.0,>=3.8) 2026-04-24T15:41:51,691 Skipping link: not a file: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-24T15:41:51,691 Skipping link: not a file: https://pypi.org/simple/flashinfer-python/ 2026-04-24T15:41:51,719 Given no hashes to check 1 links for project 'flashinfer-python': discarding no candidates 2026-04-24T15:41:51,739 Collecting flashinfer-python==0.6.9 2026-04-24T15:41:51,742 Created temporary directory: /tmp/pip-unpack-4w0mtgsx 2026-04-24T15:41:51,972 Downloading flashinfer_python-0.6.9.tar.gz (6.8 MB) 2026-04-24T15:41:59,066 Added flashinfer-python==0.6.9 from https://files.pythonhosted.org/packages/dc/38/64c39bc71ec538061f707bc30d36d1a2c4d1e95d662fd6282fb74224f2e3/flashinfer_python-0.6.9.tar.gz to build tracker '/tmp/pip-build-tracker-0l9cu84t' 2026-04-24T15:41:59,075 Created temporary directory: /tmp/pip-build-env-tslinz9w 2026-04-24T15:41:59,080 Installing build dependencies: started 2026-04-24T15:41:59,081 Running command pip subprocess to install build dependencies 2026-04-24T15:42:00,209 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-24T15:42:00,618 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. 
Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-24T15:42:00,641 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-24T15:42:02,367 Collecting setuptools>=77 2026-04-24T15:42:02,461 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-24T15:42:02,678 Collecting packaging>=24 2026-04-24T15:42:02,697 Using cached https://www.piwheels.org/simple/packaging/packaging-26.1-py3-none-any.whl (95 kB) 2026-04-24T15:42:03,333 Collecting apache-tvm-ffi!=0.1.8,!=0.1.8.post0,<0.2,>=0.1.6 2026-04-24T15:42:03,555 Downloading https://archive1.piwheels.org/simple/apache-tvm-ffi/apache_tvm_ffi-0.1.10-cp311-cp311-linux_armv7l.whl (2.6 MB) 2026-04-24T15:42:03,764 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.6/2.6 MB 12.8 MB/s eta 0:00:00 2026-04-24T15:42:03,995 Collecting typing-extensions>=4.5 2026-04-24T15:42:04,012 Using cached https://www.piwheels.org/simple/typing-extensions/typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2026-04-24T15:42:06,985 Installing collected packages: typing-extensions, setuptools, packaging, apache-tvm-ffi 2026-04-24T15:42:11,526 Creating /tmp/pip-build-env-tslinz9w/overlay/local/bin 2026-04-24T15:42:11,528 changing mode of /tmp/pip-build-env-tslinz9w/overlay/local/bin/tvm-ffi-config to 755 2026-04-24T15:42:11,531 changing mode of /tmp/pip-build-env-tslinz9w/overlay/local/bin/tvm-ffi-stubgen to 755 2026-04-24T15:42:11,564 Successfully installed apache-tvm-ffi-0.1.10 packaging-26.1 setuptools-82.0.1 typing-extensions-4.15.0 2026-04-24T15:42:11,868 Installing build dependencies: finished with status 'done' 2026-04-24T15:42:11,875 Getting requirements to build wheel: started 2026-04-24T15:42:11,877 Running command Getting requirements to build wheel 2026-04-24T15:42:17,440 Build metadata file already exists (not in git repo), keeping it 2026-04-24T15:42:17,527 Getting requirements to build wheel: finished with status 'done' 
2026-04-24T15:42:17,530 Created temporary directory: /tmp/pip-modern-metadata-myte07lg 2026-04-24T15:42:17,533 Preparing metadata (pyproject.toml): started 2026-04-24T15:42:17,534 Running command Preparing metadata (pyproject.toml) 2026-04-24T15:42:23,516 /tmp/pip-build-env-tslinz9w/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-24T15:42:23,516 !! 2026-04-24T15:42:23,518 ******************************************************************************** 2026-04-24T15:42:23,518 Pattern 'LICENSE*.txt' did not match any files. 2026-04-24T15:42:23,519 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-24T15:42:23,520 or your builds will no longer be supported. 2026-04-24T15:42:23,521 ******************************************************************************** 2026-04-24T15:42:23,522 !! 2026-04-24T15:42:23,522 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-24T15:42:23,527 Build metadata file already exists (not in git repo), keeping it 2026-04-24T15:42:23,528 running dist_info 2026-04-24T15:42:23,541 creating /tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info 2026-04-24T15:42:23,542 writing /tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/PKG-INFO 2026-04-24T15:42:23,547 writing dependency_links to /tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/dependency_links.txt 2026-04-24T15:42:23,549 writing entry points to /tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/entry_points.txt 2026-04-24T15:42:23,552 writing requirements to /tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/requires.txt 2026-04-24T15:42:23,553 writing top-level names to /tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/top_level.txt 2026-04-24T15:42:23,555 writing manifest file '/tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/SOURCES.txt' 2026-04-24T15:42:24,360 
reading manifest file '/tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/SOURCES.txt' 2026-04-24T15:42:24,362 adding license file 'LICENSE' 2026-04-24T15:42:24,439 writing manifest file '/tmp/pip-modern-metadata-myte07lg/flashinfer_python.egg-info/SOURCES.txt' 2026-04-24T15:42:24,444 creating '/tmp/pip-modern-metadata-myte07lg/flashinfer_python-0.6.9.dist-info' 2026-04-24T15:42:24,574 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-24T15:42:24,579 Source in /tmp/pip-wheel-va5s0at6/flashinfer-python_2620d60cde274446966421f16f1993cb has version 0.6.9, which satisfies requirement flashinfer-python==0.6.9 from https://files.pythonhosted.org/packages/dc/38/64c39bc71ec538061f707bc30d36d1a2c4d1e95d662fd6282fb74224f2e3/flashinfer_python-0.6.9.tar.gz 2026-04-24T15:42:24,580 Removed flashinfer-python==0.6.9 from https://files.pythonhosted.org/packages/dc/38/64c39bc71ec538061f707bc30d36d1a2c4d1e95d662fd6282fb74224f2e3/flashinfer_python-0.6.9.tar.gz from build tracker '/tmp/pip-build-tracker-0l9cu84t' 2026-04-24T15:42:24,587 Created temporary directory: /tmp/pip-unpack-2yurvek6 2026-04-24T15:42:24,588 Building wheels for collected packages: flashinfer-python 2026-04-24T15:42:24,592 Created temporary directory: /tmp/pip-wheel-eodox3n_ 2026-04-24T15:42:24,593 Destination directory: /tmp/pip-wheel-eodox3n_ 2026-04-24T15:42:24,595 Building wheel for flashinfer-python (pyproject.toml): started 2026-04-24T15:42:24,596 Running command Building wheel for flashinfer-python (pyproject.toml) 2026-04-24T15:42:30,244 /tmp/pip-build-env-tslinz9w/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-24T15:42:30,245 !! 2026-04-24T15:42:30,246 ******************************************************************************** 2026-04-24T15:42:30,247 Pattern 'LICENSE*.txt' did not match any files. 
2026-04-24T15:42:30,248 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-24T15:42:30,249 or your builds will no longer be supported. 2026-04-24T15:42:30,250 ******************************************************************************** 2026-04-24T15:42:30,251 !! 2026-04-24T15:42:30,252 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-24T15:42:30,252 Build metadata file already exists (not in git repo), keeping it 2026-04-24T15:42:30,253 running bdist_wheel 2026-04-24T15:42:30,275 running build 2026-04-24T15:42:30,275 running build_py 2026-04-24T15:42:30,282 creating build/lib 2026-04-24T15:42:30,284 copying build_backend.py -> build/lib 2026-04-24T15:42:30,287 copying build_utils.py -> build/lib 2026-04-24T15:42:30,291 creating build/lib/flashinfer 2026-04-24T15:42:30,292 copying flashinfer/sampling.py -> build/lib/flashinfer 2026-04-24T15:42:30,297 copying flashinfer/compilation_context.py -> build/lib/flashinfer 2026-04-24T15:42:30,299 copying flashinfer/pod.py -> build/lib/flashinfer 2026-04-24T15:42:30,302 copying flashinfer/gdn_prefill.py -> build/lib/flashinfer 2026-04-24T15:42:30,305 copying flashinfer/__init__.py -> build/lib/flashinfer 2026-04-24T15:42:30,308 copying flashinfer/gdn_decode.py -> build/lib/flashinfer 2026-04-24T15:42:30,311 copying flashinfer/decode.py -> build/lib/flashinfer 2026-04-24T15:42:30,316 copying flashinfer/fp8_quantization.py -> build/lib/flashinfer 2026-04-24T15:42:30,318 copying flashinfer/version.py -> build/lib/flashinfer 2026-04-24T15:42:30,320 copying flashinfer/_build_meta.py -> build/lib/flashinfer 2026-04-24T15:42:30,322 copying flashinfer/deep_gemm.py -> build/lib/flashinfer 2026-04-24T15:42:30,326 copying flashinfer/sparse.py -> build/lib/flashinfer 2026-04-24T15:42:30,330 copying flashinfer/xqa.py -> build/lib/flashinfer 2026-04-24T15:42:30,332 copying flashinfer/tllm_enums.py -> build/lib/flashinfer 2026-04-24T15:42:30,335 copying 
flashinfer/trtllm_low_latency_gemm.py -> build/lib/flashinfer 2026-04-24T15:42:30,338 copying flashinfer/autotuner.py -> build/lib/flashinfer 2026-04-24T15:42:30,341 copying flashinfer/cuda_utils.py -> build/lib/flashinfer 2026-04-24T15:42:30,344 copying flashinfer/fp4_quantization.py -> build/lib/flashinfer 2026-04-24T15:42:30,346 copying flashinfer/prefill.py -> build/lib/flashinfer 2026-04-24T15:42:30,352 copying flashinfer/concat_ops.py -> build/lib/flashinfer 2026-04-24T15:42:30,354 copying flashinfer/page.py -> build/lib/flashinfer 2026-04-24T15:42:30,357 copying flashinfer/rope.py -> build/lib/flashinfer 2026-04-24T15:42:30,361 copying flashinfer/activation.py -> build/lib/flashinfer 2026-04-24T15:42:30,363 copying flashinfer/artifacts.py -> build/lib/flashinfer 2026-04-24T15:42:30,366 copying flashinfer/topk.py -> build/lib/flashinfer 2026-04-24T15:42:30,369 copying flashinfer/green_ctx.py -> build/lib/flashinfer 2026-04-24T15:42:30,371 copying flashinfer/cascade.py -> build/lib/flashinfer 2026-04-24T15:42:30,375 copying flashinfer/aot.py -> build/lib/flashinfer 2026-04-24T15:42:30,378 copying flashinfer/__main__.py -> build/lib/flashinfer 2026-04-24T15:42:30,380 copying flashinfer/api_logging.py -> build/lib/flashinfer 2026-04-24T15:42:30,384 copying flashinfer/attention.py -> build/lib/flashinfer 2026-04-24T15:42:30,387 copying flashinfer/tllm_utils.py -> build/lib/flashinfer 2026-04-24T15:42:30,389 copying flashinfer/utils.py -> build/lib/flashinfer 2026-04-24T15:42:30,393 creating build/lib/flashinfer/cudnn 2026-04-24T15:42:30,394 copying flashinfer/cudnn/__init__.py -> build/lib/flashinfer/cudnn 2026-04-24T15:42:30,396 copying flashinfer/cudnn/decode.py -> build/lib/flashinfer/cudnn 2026-04-24T15:42:30,399 copying flashinfer/cudnn/prefill.py -> build/lib/flashinfer/cudnn 2026-04-24T15:42:30,402 copying flashinfer/cudnn/utils.py -> build/lib/flashinfer/cudnn 2026-04-24T15:42:30,405 creating build/lib/flashinfer/norm 2026-04-24T15:42:30,406 copying 
flashinfer/norm/__init__.py -> build/lib/flashinfer/norm 2026-04-24T15:42:30,409 copying flashinfer/norm/utils.py -> build/lib/flashinfer/norm 2026-04-24T15:42:30,413 creating build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,415 copying flashinfer/logits_processor/pipeline.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,417 copying flashinfer/logits_processor/legalization.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,419 copying flashinfer/logits_processor/fusion_rules.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,422 copying flashinfer/logits_processor/__init__.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,424 copying flashinfer/logits_processor/operators.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,427 copying flashinfer/logits_processor/validators.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,429 copying flashinfer/logits_processor/op.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,431 copying flashinfer/logits_processor/compiler.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,434 copying flashinfer/logits_processor/processors.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,437 copying flashinfer/logits_processor/types.py -> build/lib/flashinfer/logits_processor 2026-04-24T15:42:30,440 creating build/lib/flashinfer/fused_moe 2026-04-24T15:42:30,441 copying flashinfer/fused_moe/__init__.py -> build/lib/flashinfer/fused_moe 2026-04-24T15:42:30,444 copying flashinfer/fused_moe/fused_routing_dsv3.py -> build/lib/flashinfer/fused_moe 2026-04-24T15:42:30,446 copying flashinfer/fused_moe/core.py -> build/lib/flashinfer/fused_moe 2026-04-24T15:42:30,452 copying flashinfer/fused_moe/utils.py -> build/lib/flashinfer/fused_moe 2026-04-24T15:42:30,455 creating build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,457 copying flashinfer/cute_dsl/__init__.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,459 
copying flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,462 copying flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,467 copying flashinfer/cute_dsl/blockscaled_gemm.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,469 copying flashinfer/cute_dsl/fp4_common.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,473 copying flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,476 copying flashinfer/cute_dsl/utils.py -> build/lib/flashinfer/cute_dsl 2026-04-24T15:42:30,480 creating build/lib/flashinfer/gdn_kernels 2026-04-24T15:42:30,481 copying flashinfer/gdn_kernels/__init__.py -> build/lib/flashinfer/gdn_kernels 2026-04-24T15:42:30,484 copying flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-24T15:42:30,487 copying flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-24T15:42:30,490 copying flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/lib/flashinfer/gdn_kernels 2026-04-24T15:42:30,495 copying flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/lib/flashinfer/gdn_kernels 2026-04-24T15:42:30,500 creating build/lib/flashinfer/testing 2026-04-24T15:42:30,502 copying flashinfer/testing/__init__.py -> build/lib/flashinfer/testing 2026-04-24T15:42:30,504 copying flashinfer/testing/utils.py -> build/lib/flashinfer/testing 2026-04-24T15:42:30,508 creating build/lib/flashinfer/tuning_configs 2026-04-24T15:42:30,510 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/lib/flashinfer/tuning_configs 2026-04-24T15:42:30,512 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/lib/flashinfer/tuning_configs 2026-04-24T15:42:30,516 creating build/lib/flashinfer/data 2026-04-24T15:42:30,517 copying ./build_backend.py -> build/lib/flashinfer/data 2026-04-24T15:42:30,519 copying 
./build_utils.py -> build/lib/flashinfer/data 2026-04-24T15:42:30,522 creating build/lib/flashinfer/gemm 2026-04-24T15:42:30,523 copying flashinfer/gemm/__init__.py -> build/lib/flashinfer/gemm 2026-04-24T15:42:30,526 copying flashinfer/gemm/gemm_base.py -> build/lib/flashinfer/gemm 2026-04-24T15:42:30,534 copying flashinfer/gemm/routergemm.py -> build/lib/flashinfer/gemm 2026-04-24T15:42:30,538 creating build/lib/flashinfer/profiler 2026-04-24T15:42:30,539 copying flashinfer/profiler/__init__.py -> build/lib/flashinfer/profiler 2026-04-24T15:42:30,542 creating build/lib/flashinfer/comm 2026-04-24T15:42:30,543 copying flashinfer/comm/allreduce.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,546 copying flashinfer/comm/dlpack_utils.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,549 copying flashinfer/comm/mapping.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,551 copying flashinfer/comm/trtllm_moe_alltoall.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,554 copying flashinfer/comm/__init__.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,556 copying flashinfer/comm/vllm_ar.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,558 copying flashinfer/comm/workspace_base.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,560 copying flashinfer/comm/trtllm_ar.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,563 copying flashinfer/comm/nvshmem.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,565 copying flashinfer/comm/mnnvl.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,568 copying flashinfer/comm/nvshmem_allreduce.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,570 copying flashinfer/comm/trtllm_mnnvl_ar.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,573 copying flashinfer/comm/cuda_ipc.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,576 copying flashinfer/comm/trtllm_alltoall.py -> build/lib/flashinfer/comm 2026-04-24T15:42:30,579 creating build/lib/flashinfer/triton 2026-04-24T15:42:30,580 copying flashinfer/triton/norm.py 
-> build/lib/flashinfer/triton 2026-04-24T15:42:30,582 copying flashinfer/triton/__init__.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,584 copying flashinfer/triton/gemm.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,586 copying flashinfer/triton/sm_constraint_gemm.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,588 copying flashinfer/triton/page.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,590 copying flashinfer/triton/activation.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,591 copying flashinfer/triton/cascade.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,593 copying flashinfer/triton/utils.py -> build/lib/flashinfer/triton 2026-04-24T15:42:30,595 creating build/lib/flashinfer/mla 2026-04-24T15:42:30,596 copying flashinfer/mla/__init__.py -> build/lib/flashinfer/mla 2026-04-24T15:42:30,598 copying flashinfer/mla/_core.py -> build/lib/flashinfer/mla 2026-04-24T15:42:30,601 creating build/lib/flashinfer/dsv3_ops 2026-04-24T15:42:30,602 copying flashinfer/dsv3_ops/__init__.py -> build/lib/flashinfer/dsv3_ops 2026-04-24T15:42:30,605 creating build/lib/flashinfer/mamba 2026-04-24T15:42:30,606 copying flashinfer/mamba/__init__.py -> build/lib/flashinfer/mamba 2026-04-24T15:42:30,608 copying flashinfer/mamba/ssd_tile_scheduler.py -> build/lib/flashinfer/mamba 2026-04-24T15:42:30,610 copying flashinfer/mamba/ssd_kernel.py -> build/lib/flashinfer/mamba 2026-04-24T15:42:30,618 copying flashinfer/mamba/selective_state_update.py -> build/lib/flashinfer/mamba 2026-04-24T15:42:30,621 copying flashinfer/mamba/ssd_combined.py -> build/lib/flashinfer/mamba 2026-04-24T15:42:30,625 creating build/lib/flashinfer/jit 2026-04-24T15:42:30,626 copying flashinfer/jit/sampling.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,628 copying flashinfer/jit/norm.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,630 copying flashinfer/jit/comm.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,632 copying flashinfer/jit/fp4_kv_quantization.py -> 
build/lib/flashinfer/jit 2026-04-24T15:42:30,634 copying flashinfer/jit/__init__.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,637 copying flashinfer/jit/fused_moe.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,639 copying flashinfer/jit/fp8_quantization.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,641 copying flashinfer/jit/mla.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,644 copying flashinfer/jit/quantization.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,646 copying flashinfer/jit/xqa.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,649 copying flashinfer/jit/gdn.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,651 copying flashinfer/jit/spdlog.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,653 copying flashinfer/jit/moe_utils.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,655 copying flashinfer/jit/fp4_quantization.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,657 copying flashinfer/jit/cpp_ext.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,660 copying flashinfer/jit/page.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,661 copying flashinfer/jit/fp4_kv_dequantization.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,663 copying flashinfer/jit/rope.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,666 copying flashinfer/jit/activation.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,668 copying flashinfer/jit/env.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,670 copying flashinfer/jit/topk.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,672 copying flashinfer/jit/cascade.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,674 copying flashinfer/jit/tinygemm2.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,676 copying flashinfer/jit/core.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,679 copying flashinfer/jit/cubin_loader.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,681 copying flashinfer/jit/tllm_utils.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,683 copying 
flashinfer/jit/dsv3_optimizations.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,685 copying flashinfer/jit/rmsnorm_silu.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,688 copying flashinfer/jit/utils.py -> build/lib/flashinfer/jit 2026-04-24T15:42:30,690 creating build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,691 copying flashinfer/parallel_attention/parallel_attention.py -> build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,693 copying flashinfer/parallel_attention/__init__.py -> build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,695 copying flashinfer/parallel_attention/parallel_wrapper.py -> build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,698 copying flashinfer/parallel_attention/parallel_config.py -> build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,700 copying flashinfer/parallel_attention/attention_ops.py -> build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,702 copying flashinfer/parallel_attention/utils.py -> build/lib/flashinfer/parallel_attention 2026-04-24T15:42:30,705 creating build/lib/flashinfer/quantization 2026-04-24T15:42:30,706 copying flashinfer/quantization/__init__.py -> build/lib/flashinfer/quantization 2026-04-24T15:42:30,708 copying flashinfer/quantization/fp8_quantization.py -> build/lib/flashinfer/quantization 2026-04-24T15:42:30,710 copying flashinfer/quantization/fp4_quantization.py -> build/lib/flashinfer/quantization 2026-04-24T15:42:30,713 copying flashinfer/quantization/packbits.py -> build/lib/flashinfer/quantization 2026-04-24T15:42:30,716 copying flashinfer/quantization/quantization_cute_dsl_utils.py -> build/lib/flashinfer/quantization 2026-04-24T15:42:30,719 creating build/lib/flashinfer/norm/kernels 2026-04-24T15:42:30,720 copying flashinfer/norm/kernels/__init__.py -> build/lib/flashinfer/norm/kernels 2026-04-24T15:42:30,722 copying flashinfer/norm/kernels/layernorm.py -> build/lib/flashinfer/norm/kernels 2026-04-24T15:42:30,724 copying 
flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-24T15:42:30,728 copying flashinfer/norm/kernels/rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-24T15:42:30,731 creating build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,732 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,735 copying flashinfer/fused_moe/cute_dsl/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,737 copying flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,740 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,742 copying flashinfer/fused_moe/cute_dsl/tuner.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,745 copying flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,748 copying flashinfer/fused_moe/cute_dsl/b12x_moe.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:30,751 creating build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,752 copying flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_dispatch.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,755 copying flashinfer/fused_moe/cute_dsl/blackwell_sm12x/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,757 copying flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_micro_kernel.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,760 copying flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_static_kernel.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,765 copying flashinfer/fused_moe/cute_dsl/blackwell_sm12x/triton_compact.py -> 
build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,767 copying flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_dynamic_kernel.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:30,771 creating build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:30,772 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:30,776 copying flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:30,778 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:30,783 copying flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:30,785 copying flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:30,788 creating build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,790 copying flashinfer/cute_dsl/attention/pipeline_topology.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,793 copying flashinfer/cute_dsl/attention/__init__.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,795 copying flashinfer/cute_dsl/attention/tmem_layout.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,797 copying flashinfer/cute_dsl/attention/mla_decode_fp8.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,800 copying flashinfer/cute_dsl/attention/config.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,803 copying flashinfer/cute_dsl/attention/mla_warp_schedule.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,806 copying flashinfer/cute_dsl/attention/warp_schedule.py -> build/lib/flashinfer/cute_dsl/attention 
2026-04-24T15:42:30,808 copying flashinfer/cute_dsl/attention/collective_builder.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,812 copying flashinfer/cute_dsl/attention/mla_config.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,814 copying flashinfer/cute_dsl/attention/prefill.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,817 copying flashinfer/cute_dsl/attention/compat.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,819 copying flashinfer/cute_dsl/attention/mla_decode.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,823 copying flashinfer/cute_dsl/attention/mainloop_spec.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-24T15:42:30,826 creating build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,827 copying flashinfer/cute_dsl/attention/roles/__init__.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,829 copying flashinfer/cute_dsl/attention/roles/softmax.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,832 copying flashinfer/cute_dsl/attention/roles/correction.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,835 copying flashinfer/cute_dsl/attention/roles/mla_mma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,838 copying flashinfer/cute_dsl/attention/roles/softmax_math.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,840 copying flashinfer/cute_dsl/attention/roles/loader_tma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,843 copying flashinfer/cute_dsl/attention/roles/epilogue.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,846 copying flashinfer/cute_dsl/attention/roles/mla_loader.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,849 copying flashinfer/cute_dsl/attention/roles/mla_pt_loader.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,851 
copying flashinfer/cute_dsl/attention/roles/mla_correction.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,854 copying flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,857 copying flashinfer/cute_dsl/attention/roles/mla_compute.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,860 copying flashinfer/cute_dsl/attention/roles/mma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,863 copying flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:30,866 creating build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:30,867 copying flashinfer/cute_dsl/attention/fusion/__init__.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:30,870 copying flashinfer/cute_dsl/attention/fusion/mask.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:30,872 copying flashinfer/cute_dsl/attention/fusion/variant.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:30,876 creating build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:30,877 copying flashinfer/cute_dsl/attention/wrappers/batch_prefill.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:30,880 copying flashinfer/cute_dsl/attention/wrappers/__init__.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:30,882 copying flashinfer/cute_dsl/attention/wrappers/batch_mla.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:30,886 creating build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:30,887 copying flashinfer/cute_dsl/attention/scheduler/mla_persistent.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:30,890 copying flashinfer/cute_dsl/attention/scheduler/__init__.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:30,892 copying 
flashinfer/cute_dsl/attention/scheduler/persistent.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:30,895 creating build/lib/flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:30,896 copying flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:30,902 copying flashinfer/gdn_kernels/blackwell/gdn_prefill.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:30,904 copying flashinfer/gdn_kernels/blackwell/__init__.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:30,906 copying flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:30,911 creating build/lib/flashinfer/data/spdlog/scripts 2026-04-24T15:42:30,913 copying 3rdparty/spdlog/scripts/extract_version.py -> build/lib/flashinfer/data/spdlog/scripts 2026-04-24T15:42:30,918 creating build/lib/flashinfer/data/cutlass/python 2026-04-24T15:42:30,920 copying 3rdparty/cutlass/python/setup_pycute.py -> build/lib/flashinfer/data/cutlass/python 2026-04-24T15:42:30,922 copying 3rdparty/cutlass/python/setup_library.py -> build/lib/flashinfer/data/cutlass/python 2026-04-24T15:42:30,924 copying 3rdparty/cutlass/python/setup_cutlass.py -> build/lib/flashinfer/data/cutlass/python 2026-04-24T15:42:30,928 creating build/lib/flashinfer/data/cutlass/test/utils 2026-04-24T15:42:30,929 copying 3rdparty/cutlass/test/utils/test_sharding.py -> build/lib/flashinfer/data/cutlass/test/utils 2026-04-24T15:42:30,934 creating build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,935 copying 3rdparty/cutlass/test/python/pycute/test_complement.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,938 copying 3rdparty/cutlass/test/python/pycute/test_int_tuple.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,940 copying 3rdparty/cutlass/test/python/pycute/test_composition.py -> 
build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,942 copying 3rdparty/cutlass/test/python/pycute/test_typing.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,944 copying 3rdparty/cutlass/test/python/pycute/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,946 copying 3rdparty/cutlass/test/python/pycute/test_left_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,948 copying 3rdparty/cutlass/test/python/pycute/test_right_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,950 copying 3rdparty/cutlass/test/python/pycute/test_coalesce.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:30,953 creating build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-24T15:42:30,954 copying 3rdparty/cutlass/test/python/cutlass/installation.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-24T15:42:30,957 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,958 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,960 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,962 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,965 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,967 copying 3rdparty/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,969 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> 
build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,971 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,974 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,976 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,979 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,981 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,984 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,986 copying 3rdparty/cutlass/test/python/cutlass/gemm/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:30,989 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:30,991 copying 3rdparty/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:30,994 copying 3rdparty/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:30,996 copying 3rdparty/cutlass/test/python/cutlass/interface/evt_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:30,999 copying 3rdparty/cutlass/test/python/cutlass/interface/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:31,001 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,003 copying 
3rdparty/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,005 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,007 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,010 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,012 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,015 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:31,017 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-24T15:42:31,019 copying 3rdparty/cutlass/test/python/cutlass/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-24T15:42:31,022 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:31,023 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:31,026 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:31,029 copying 3rdparty/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:31,031 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:31,034 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-24T15:42:31,036 copying 
3rdparty/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-24T15:42:31,039 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-24T15:42:31,041 copying 3rdparty/cutlass/test/examples/CuTeDSL/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-24T15:42:31,044 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:31,046 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:31,048 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:31,051 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:31,053 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:31,056 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:31,059 creating build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-24T15:42:31,061 copying 3rdparty/cutlass/test/unit/gemm/device/simt_sm50.py -> build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-24T15:42:31,065 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:31,066 copying 3rdparty/cutlass/python/cutlass_cppgen/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:31,069 copying 3rdparty/cutlass/python/cutlass_cppgen/swizzle.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:31,071 copying 
3rdparty/cutlass/python/cutlass_cppgen/shape.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:31,074 copying 3rdparty/cutlass/python/cutlass_cppgen/library_defaults.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:31,078 creating build/lib/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:31,079 copying 3rdparty/cutlass/python/pycute/__init__.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:31,081 copying 3rdparty/cutlass/python/pycute/swizzle.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:31,084 copying 3rdparty/cutlass/python/pycute/layout.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:31,087 copying 3rdparty/cutlass/python/pycute/typing.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:31,089 copying 3rdparty/cutlass/python/pycute/int_tuple.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:31,092 creating build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,094 copying 3rdparty/cutlass/python/cutlass_library/generator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,104 copying 3rdparty/cutlass/python/cutlass_library/rank_k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,106 copying 3rdparty/cutlass/python/cutlass_library/trmm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,109 copying 3rdparty/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,112 copying 3rdparty/cutlass/python/cutlass_library/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,114 copying 3rdparty/cutlass/python/cutlass_library/conv3x_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,116 copying 
3rdparty/cutlass/python/cutlass_library/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,118 copying 3rdparty/cutlass/python/cutlass_library/manifest.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,121 copying 3rdparty/cutlass/python/cutlass_library/sm100_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,124 copying 3rdparty/cutlass/python/cutlass_library/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,127 copying 3rdparty/cutlass/python/cutlass_library/sm90_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,129 copying 3rdparty/cutlass/python/cutlass_library/heuristics.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,132 copying 3rdparty/cutlass/python/cutlass_library/sm100_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,134 copying 3rdparty/cutlass/python/cutlass_library/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,137 copying 3rdparty/cutlass/python/cutlass_library/sm90_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,140 copying 3rdparty/cutlass/python/cutlass_library/heuristics_provider.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,142 copying 3rdparty/cutlass/python/cutlass_library/symm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,145 copying 3rdparty/cutlass/python/cutlass_library/rank_2k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,148 copying 3rdparty/cutlass/python/cutlass_library/conv3d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:31,151 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL 
2026-04-24T15:42:31,152 copying 3rdparty/cutlass/python/CuTeDSL/prep_editable_install.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-24T15:42:31,156 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:31,157 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:31,159 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:31,161 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:31,164 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:31,164 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:31,166 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:31,168 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/check.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:31,170 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:31,172 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:31,175 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,176 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,178 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,180 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,182 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,184 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,186 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,189 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,191 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,195 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,196 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,199 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,200 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,203 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:31,205 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:31,206 copying 
3rdparty/cutlass/python/cutlass_cppgen/op/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:31,208 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:31,211 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:31,213 copying 3rdparty/cutlass/python/cutlass_cppgen/op/op.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:31,216 copying 3rdparty/cutlass/python/cutlass_cppgen/op/conv.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:31,219 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:31,220 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:31,222 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:31,224 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/common.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:31,227 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-24T15:42:31,228 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-24T15:42:31,230 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-24T15:42:31,232 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-24T15:42:31,233 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-24T15:42:31,235 copying 
3rdparty/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-24T15:42:31,237 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,238 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,240 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,242 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,244 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,246 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,248 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,250 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,252 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:31,255 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,256 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,258 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,260 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,262 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,263 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,265 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,268 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,270 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,272 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,274 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,277 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,279 copying 
3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,280 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:31,283 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,284 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,286 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,288 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,290 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,292 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,295 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,297 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,299 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,301 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:31,304 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:31,305 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:31,307 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:31,309 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:31,312 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:31,313 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:31,315 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/torch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:31,317 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:31,320 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,320 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,323 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,325 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,327 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,329 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,331 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,333 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,336 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,338 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,340 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,342 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,344 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,346 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,348 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,351 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,353 copying 
3rdparty/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,357 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:31,360 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,361 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,363 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,365 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,367 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,371 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,374 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,376 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,380 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,382 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,385 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,386 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,389 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:31,393 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,394 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,397 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,399 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,401 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,405 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,407 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,410 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,413 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,416 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,422 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,426 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,430 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:31,434 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,435 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,438 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,440 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,442 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,445 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,448 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:31,452 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:31,453 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:31,456 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:31,460 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:31,463 copying 
3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:31,467 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,468 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,472 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,476 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,478 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,481 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,483 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:31,486 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-24T15:42:31,487 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-24T15:42:31,490 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-24T15:42:31,493 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:31,495 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 
2026-04-24T15:42:31,497 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:31,500 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:31,502 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:31,505 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:31,507 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:31,510 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,511 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,514 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,515 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,517 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,519 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,521 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,523 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:31,525 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:31,526 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:31,528 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:31,531 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:31,532 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:31,535 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:31,538 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:31,539 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:31,541 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:31,543 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:31,545 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:31,548 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:31,549 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:31,551 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:31,553 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:31,555 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:31,557 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:31,560 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,561 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,564 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,566 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,568 copying 
3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,570 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,572 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,574 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:31,576 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,577 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,580 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,581 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,584 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,585 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,589 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,591 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,592 copying 
3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:31,595 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:31,596 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:31,598 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:31,600 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:31,602 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:31,603 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:31,605 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:31,607 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:31,609 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:31,611 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:31,614 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:31,615 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:31,617 copying 
3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:31,619 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:31,621 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:31,622 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:31,625 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:31,626 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:31,629 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:31,632 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:31,633 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:31,636 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:31,637 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:31,640 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:31,641 copying 
3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:31,643 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:31,645 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:31,648 creating build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-24T15:42:31,649 copying 3rdparty/cutlass/python/docs_src/source/conf.py -> build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-24T15:42:31,652 creating build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-24T15:42:31,655 copying 3rdparty/cutlass/tools/util/scripts/split_test_cmake.py -> build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-24T15:42:31,661 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:31,663 copying 3rdparty/cutlass/examples/40_cutlass_py/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:31,665 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:31,667 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:31,670 creating build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-24T15:42:31,671 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-24T15:42:31,673 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-24T15:42:31,676 
creating build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,677 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,679 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,682 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,683 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,685 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,688 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,689 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,691 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,693 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,696 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> 
build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,699 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,701 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:31,703 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:31,704 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:31,707 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:31,709 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:31,713 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,715 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,718 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,720 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,723 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,725 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,728 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,730 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,733 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,736 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,738 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,740 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,742 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,744 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:31,746 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:31,747 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:31,749 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:31,752 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 
2026-04-24T15:42:31,754 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:31,756 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-24T15:42:31,757 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-24T15:42:31,761 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-24T15:42:31,762 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-24T15:42:31,764 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-24T15:42:31,766 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:31,767 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:31,770 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:31,774 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:31,777 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:31,781 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,782 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,784 copying 
3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,788 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,792 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,795 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,797 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,799 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:31,803 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:31,804 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:31,806 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:31,808 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:31,810 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:31,813 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 
2026-04-24T15:42:31,814 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,819 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,823 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,827 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,830 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,837 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,840 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,845 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,849 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,853 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,857 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 
2026-04-24T15:42:31,860 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,863 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,865 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,869 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,872 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,875 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,878 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:31,882 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-24T15:42:31,883 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-24T15:42:31,884 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-24T15:42:31,887 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-24T15:42:31,889 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 
2026-04-24T15:42:31,891 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:31,892 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:31,896 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:31,898 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:31,900 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:31,903 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:31,906 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,908 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,910 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,912 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,913 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,916 copying 
3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,918 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,919 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,921 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:31,923 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-24T15:42:31,925 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-24T15:42:31,928 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-24T15:42:31,929 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-24T15:42:31,930 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-24T15:42:31,933 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:31,934 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:31,936 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:31,940 copying 
3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:31,943 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:31,947 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:31,948 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:31,951 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:31,955 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:31,957 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:31,960 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:31,964 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:31,965 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:31,968 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:31,971 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:31,974 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:31,976 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:31,979 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:31,980 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:31,984 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:31,989 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:31,994 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:31,995 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:32,000 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:32,005 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:32,009 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:32,010 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:32,015 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:32,017 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:32,020 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:32,021 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:32,024 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:32,028 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:32,032 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:32,090 creating build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,092 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm120.py -> build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,095 copying 
flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,100 copying flashinfer/gemm/kernels/__init__.py -> build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,102 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,107 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,111 copying flashinfer/gemm/kernels/utils.py -> build/lib/flashinfer/gemm/kernels 2026-04-24T15:42:32,113 creating build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,115 copying flashinfer/triton/kernels/norm.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,117 copying flashinfer/triton/kernels/ssd_chunk_state.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,119 copying flashinfer/triton/kernels/__init__.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,120 copying flashinfer/triton/kernels/sm_constraint_gemm.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,123 copying flashinfer/triton/kernels/activation.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,125 copying flashinfer/triton/kernels/quant.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,127 copying flashinfer/triton/kernels/cascade.py -> build/lib/flashinfer/triton/kernels 2026-04-24T15:42:32,130 creating build/lib/flashinfer/jit/gemm 2026-04-24T15:42:32,131 copying flashinfer/jit/gemm/fp8_blockscale.py -> build/lib/flashinfer/jit/gemm 2026-04-24T15:42:32,133 copying flashinfer/jit/gemm/__init__.py -> build/lib/flashinfer/jit/gemm 2026-04-24T15:42:32,135 copying flashinfer/jit/gemm/deepgemm.py -> build/lib/flashinfer/jit/gemm 2026-04-24T15:42:32,137 copying flashinfer/jit/gemm/core.py -> build/lib/flashinfer/jit/gemm 2026-04-24T15:42:32,140 creating build/lib/flashinfer/jit/mamba 2026-04-24T15:42:32,141 copying flashinfer/jit/mamba/__init__.py -> 
build/lib/flashinfer/jit/mamba 2026-04-24T15:42:32,143 copying flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/lib/flashinfer/jit/mamba 2026-04-24T15:42:32,145 copying flashinfer/jit/mamba/selective_state_update.py -> build/lib/flashinfer/jit/mamba 2026-04-24T15:42:32,148 creating build/lib/flashinfer/jit/attention 2026-04-24T15:42:32,149 copying flashinfer/jit/attention/__init__.py -> build/lib/flashinfer/jit/attention 2026-04-24T15:42:32,151 copying flashinfer/jit/attention/modules.py -> build/lib/flashinfer/jit/attention 2026-04-24T15:42:32,155 copying flashinfer/jit/attention/variants.py -> build/lib/flashinfer/jit/attention 2026-04-24T15:42:32,158 copying flashinfer/jit/attention/utils.py -> build/lib/flashinfer/jit/attention 2026-04-24T15:42:32,160 creating build/lib/flashinfer/jit/gemm/cutlass 2026-04-24T15:42:32,161 copying flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-24T15:42:32,165 copying flashinfer/jit/gemm/cutlass/__init__.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-24T15:42:32,167 copying flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-24T15:42:32,170 creating build/lib/flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:32,171 copying flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:32,174 copying flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:32,177 copying flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:32,183 copying flashinfer/jit/attention/fmha_v2/utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:32,187 creating build/lib/flashinfer/quantization/kernels 2026-04-24T15:42:32,188 copying flashinfer/quantization/kernels/mxfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-24T15:42:32,191 copying 
flashinfer/quantization/kernels/__init__.py -> build/lib/flashinfer/quantization/kernels 2026-04-24T15:42:32,193 copying flashinfer/quantization/kernels/nvfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-24T15:42:32,196 copying flashinfer/quantization/kernels/mxfp8_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-24T15:42:32,744 copying flashinfer/py.typed -> build/lib/flashinfer 2026-04-24T15:42:32,747 creating build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,747 copying ./csrc/trtllm_batched_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,750 copying ./csrc/norm.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,753 copying ./csrc/single_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,755 copying ./csrc/fp4_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,757 copying ./csrc/single_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,758 copying ./csrc/fmha_v2_run.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,761 copying ./csrc/batch_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,763 copying ./csrc/pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,765 copying ./csrc/tgv_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,768 copying ./csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,769 copying ./csrc/flashinfer_cascade_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,771 copying ./csrc/trtllm_alltoall_prepare.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,774 copying ./csrc/single_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,776 copying ./csrc/seq_chunk_cumsum.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,778 copying ./csrc/vllm_custom_all_reduce.cu -> build/lib/flashinfer/data/csrc 
2026-04-24T15:42:32,780 copying ./csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,782 copying ./csrc/batch_mla_sm90_run.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,785 copying ./csrc/flashinfer_fast_topk_clusters_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,787 copying ./csrc/flashinfer_gemm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,789 copying ./csrc/rope.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,792 copying ./csrc/batch_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,794 copying ./csrc/batch_attention_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,796 copying ./csrc/gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,797 copying ./csrc/sampling_utils.h -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,799 copying ./csrc/single_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,801 copying ./csrc/group_gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,803 copying ./csrc/tvm_ffi_utils.h -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,806 copying ./csrc/trtllm_low_latency_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,809 copying ./csrc/selective_state_update_kernel_inst.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,810 copying ./csrc/batch_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,813 copying ./csrc/batch_mla_sm90_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,815 copying ./csrc/batch_pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,817 copying ./csrc/flashinfer_rope_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,819 copying ./csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,820 copying ./csrc/gemm_sm100_binding.cu -> 
build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,822 copying ./csrc/group_gemm_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,825 copying ./csrc/blackwell_fmha_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,827 copying ./csrc/batch_decode_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,829 copying ./csrc/selective_state_update.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,831 copying ./csrc/fmhaReduction.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,834 copying ./csrc/batch_decode_mla_cute_sm80.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,836 copying ./csrc/rmsnorm_silu.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,838 creating build/lib/flashinfer/data/csrc/fused_moe 2026-04-24T15:42:32,839 copying ./csrc/fused_moe/noAuxTcKernels.cu -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-24T15:42:32,842 creating build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:32,842 copying ./csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:32,846 copying ./csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:32,848 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:32,854 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:32,856 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:32,857 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:32,860 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> 
build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:32,863 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:32,866 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:32,868 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:32,871 copying ./csrc/fused_moe/moeTopKFuncs.cuh -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-24T15:42:32,873 copying ./csrc/renorm.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,876 copying ./csrc/mxfp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,878 copying ./csrc/cutlass_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,880 copying ./csrc/page.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,883 copying ./csrc/single_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,885 copying ./csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,887 copying ./csrc/batch_attention_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,889 copying ./csrc/runtime_utils.h -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,891 copying ./csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,893 copying ./csrc/group_gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,895 copying ./csrc/single_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,897 copying ./csrc/flashinfer_xqa_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,899 copying ./csrc/flashinfer_quantization_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,901 copying ./csrc/batch_pod_customize_config.jinja -> 
build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,903 copying ./csrc/selective_state_update_dtype_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,905 copying ./csrc/trtllm_moe_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,907 copying ./csrc/gdn_prefill_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,909 copying ./csrc/bf16_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,912 copying ./csrc/batch_pod.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,914 copying ./csrc/group_gemm_mxfp4_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,916 copying ./csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,918 copying ./csrc/moe_utils_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,921 copying ./csrc/concat_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,923 copying ./csrc/flashinfer_rmsnorm_silu_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,925 copying ./csrc/batch_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,926 copying ./csrc/group_gemm_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,928 copying ./csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,930 copying ./csrc/flashinfer_page_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,932 copying ./csrc/batch_pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,933 copying ./csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,936 copying ./csrc/trtllm_fused_moe_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,940 copying ./csrc/flashinfer_topk_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,942 copying ./csrc/mxfp8_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 
2026-04-24T15:42:32,945 copying ./csrc/prefill_kernel_delta_rule_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:32,947 creating build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,948 copying ./csrc/xqa/barriers.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,950 copying ./csrc/xqa/utils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,953 copying ./csrc/xqa/tensorMap.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,955 copying ./csrc/xqa/mla_sm120.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,959 copying ./csrc/xqa/defines.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,962 copying ./csrc/xqa/platform.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,963 copying ./csrc/xqa/utils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,966 copying ./csrc/xqa/mhaUtils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,969 copying ./csrc/xqa/gmma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,971 copying ./csrc/xqa/mha.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,974 copying ./csrc/xqa/mha_components.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,976 copying ./csrc/xqa/ldgsts.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,978 copying ./csrc/xqa/mha.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,982 copying ./csrc/xqa/tensorMap.cpp -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,985 copying ./csrc/xqa/mha_stdheaders.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,987 copying ./csrc/xqa/mha_sm90.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,992 copying ./csrc/xqa/mla_sm120.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,994 copying ./csrc/xqa/mma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,996 copying ./csrc/xqa/xqa_wrapper.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:32,998 copying ./csrc/xqa/tma.h -> 
build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:33,001 copying ./csrc/xqa/hostUtils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:33,002 copying ./csrc/xqa/cuda_hint.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:33,004 copying ./csrc/xqa/specDec.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:33,006 copying ./csrc/xqa/gmma_impl.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-24T15:42:33,013 copying ./csrc/bmm_fp8.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,016 copying ./csrc/batch_decode_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,018 copying ./csrc/batch_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,020 copying ./csrc/pod.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,022 copying ./csrc/trtllm_fused_moe_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,025 copying ./csrc/batch_mla_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,027 copying ./csrc/batch_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,029 copying ./csrc/batch_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,031 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:33,033 copying ./csrc/nv_internal/cpp/common/memoryUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:33,037 copying ./csrc/nv_internal/cpp/common/envUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:33,040 copying ./csrc/nv_internal/cpp/common/tllmException.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:33,041 copying ./csrc/nv_internal/cpp/common/logger.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:33,043 copying ./csrc/nv_internal/cpp/common/stringUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:33,045 creating 
build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-24T15:42:33,046 copying ./csrc/nv_internal/cpp/kernels/quantization.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-24T15:42:33,049 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,052 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,054 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-24T15:42:33,056 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-24T15:42:33,058 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-24T15:42:33,059 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-24T15:42:33,062 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-24T15:42:33,063 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-24T15:42:33,066 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-24T15:42:33,069 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,071 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:33,072 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:33,074 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:33,076 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:33,079 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:33,081 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:33,083 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,086 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,088 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-24T15:42:33,090 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-24T15:42:33,093 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,094 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,096 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,099 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,101 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 
2026-04-24T15:42:33,103 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,106 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,109 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,111 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,114 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,117 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,119 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,121 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,124 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:33,127 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:33,128 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:33,130 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:33,132 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:33,135 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,136 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,139 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,142 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,146 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,148 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,150 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:33,151 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:33,154 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:33,156 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:33,159 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,162 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,164 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,166 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,168 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:33,170 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,171 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,174 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,177 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,179 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,182 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,184 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,186 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,189 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,191 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,194 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,196 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,198 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:33,200 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-24T15:42:33,202 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-24T15:42:33,205 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-24T15:42:33,206 copying 
./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-24T15:42:33,209 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,211 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:33,213 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-24T15:42:33,214 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-24T15:42:33,217 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,218 copying ./csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,220 copying ./csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,222 copying ./csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,225 copying ./csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,227 copying ./csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,229 
copying ./csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,231 copying ./csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,233 copying ./csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,235 copying ./csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:33,238 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-24T15:42:33,239 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-24T15:42:33,242 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-24T15:42:33,244 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-24T15:42:33,245 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-24T15:42:33,247 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-24T15:42:33,250 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,252 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,254 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,256 copying 
./csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,258 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:33,260 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:33,263 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-24T15:42:33,264 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-24T15:42:33,267 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-24T15:42:33,270 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:33,272 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:33,276 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:33,278 copying 
./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:33,281 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:33,283 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:33,285 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,286 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,288 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,290 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,293 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,295 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,297 copying 
./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,299 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,301 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,303 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,305 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,307 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,309 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,311 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,313 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,315 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,317 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,320 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,322 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,324 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,326 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:33,328 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-24T15:42:33,329 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-24T15:42:33,331 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-24T15:42:33,333 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:33,336 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:33,337 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:33,341 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:33,343 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:33,345 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:33,347 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:33,349 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,350 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,353 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,356 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,358 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,360 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,362 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,364 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,366 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,368 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,370 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,372 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,374 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,376 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,378 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,380 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,382 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,384 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,386 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,388 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,390 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:33,392 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,393 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,395 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,397 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,399 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,402 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,407 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:33,409 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-24T15:42:33,410 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 
2026-04-24T15:42:33,412 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-24T15:42:33,415 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,417 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,419 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:33,422 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,423 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,427 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,429 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,432 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,435 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,437 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,439 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,442 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,445 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,448 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,451 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:33,453 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,454 copying ./csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,456 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,458 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,461 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,463 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,465 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,468 copying ./csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:33,470 creating build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,472 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,475 copying 
./csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,477 copying ./csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,479 copying ./csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,482 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,484 copying ./csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,485 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,488 copying ./csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,490 copying ./csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,493 copying ./csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,495 copying ./csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:33,497 copying ./csrc/cascade.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,499 copying ./csrc/single_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,501 copying ./csrc/topk.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,503 copying ./csrc/single_decode_kernel_inst.jinja -> 
build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,505 copying ./csrc/single_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,507 copying ./csrc/group_gemm_fp8_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,510 copying ./csrc/fp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,512 copying ./csrc/batch_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,514 copying ./csrc/bf16_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,516 copying ./csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,518 copying ./csrc/trtllm_moe_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,520 copying ./csrc/batch_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,523 copying ./csrc/seq_chunk_cumsum_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,525 copying ./csrc/dsv3_router_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,527 copying ./csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,529 copying ./csrc/fp4_gemm_cutlass_sm103.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,532 copying ./csrc/mxfp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,534 copying ./csrc/single_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,536 copying ./csrc/flashinfer_sampling_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,538 copying ./csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,540 copying ./csrc/sampling.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,543 copying ./csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,545 copying ./csrc/batch_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,548 copying 
./csrc/fp4_gemm_cutlass_sm103.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,550 copying ./csrc/group_gemm_nvfp4_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,552 copying ./csrc/flashinfer_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,554 copying ./csrc/batch_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,556 copying ./csrc/batch_prefill_ragged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,558 copying ./csrc/trtllm_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,561 copying ./csrc/batch_attention_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,563 copying ./csrc/gemm_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,565 copying ./csrc/flashinfer_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,567 copying ./csrc/fp4_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,569 copying ./csrc/trtllm_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,572 copying ./csrc/single_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,574 copying ./csrc/flashinfer_norm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,576 copying ./csrc/fp4_kv_dequantization.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,578 copying ./csrc/tinygemm2.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,581 copying ./csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,584 copying ./csrc/fp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,586 copying ./csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,588 copying ./csrc/logging.cc -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,590 copying ./csrc/group_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,592 
copying ./csrc/trtllm_fmha_v2_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,595 copying ./csrc/batch_attention.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,598 copying ./csrc/trtllm_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,600 copying ./csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,602 copying ./csrc/fmha_v2_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,604 creating build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,605 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,608 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,611 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,614 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,616 copying ./csrc/fmha_v2/fused_multihead_attention_utils.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,619 copying ./csrc/fmha_v2/fused_multihead_cross_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,621 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,624 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,627 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,630 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,633 copying ./csrc/fmha_v2/fused_multihead_attention_kernel.h -> 
build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,636 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,639 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,642 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,645 creating build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:33,646 copying ./csrc/fmha_v2/templates/kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:33,649 copying ./csrc/fmha_v2/templates/kernel_hopper.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:33,652 copying ./csrc/fmha_v2/templates/fa_kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:33,655 copying ./csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:33,658 copying ./csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,661 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,664 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,665 copying ./csrc/fmha_v2/fmha/paged_kv_cache.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,668 copying ./csrc/fmha_v2/fmha/softmax.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,674 copying ./csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,678 copying ./csrc/fmha_v2/fmha/gemm.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,680 copying ./csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,684 copying 
./csrc/fmha_v2/fmha/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,689 copying ./csrc/fmha_v2/fmha/gmem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,692 copying ./csrc/fmha_v2/fmha/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,696 copying ./csrc/fmha_v2/fmha/utils.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,700 copying ./csrc/fmha_v2/fmha/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,703 copying ./csrc/fmha_v2/fmha/mask.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,706 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,708 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,712 copying ./csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,716 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,720 copying ./csrc/fmha_v2/fmha/hopper/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,723 copying ./csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,727 copying ./csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,731 copying ./csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,735 copying ./csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,738 copying ./csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,742 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> 
build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,746 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,750 copying ./csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,754 copying ./csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,758 copying ./csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,762 copying ./csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,770 copying ./csrc/fmha_v2/fmha/hopper/tma_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,774 copying ./csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,778 copying ./csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:33,781 copying ./csrc/fmha_v2/fmha/traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,786 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,791 copying ./csrc/fmha_v2/fmha/numeric_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,794 copying ./csrc/fmha_v2/fmha/alibi_params.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,797 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:33,799 copying ./csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:33,803 copying ./csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:33,808 copying ./csrc/fmha_v2/fmha/warpspec/compute.h -> 
build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:33,812 copying ./csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:33,817 copying ./csrc/fmha_v2/fmha/warpspec/dma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:33,821 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,825 copying ./csrc/fmha_v2/fmha/smem_tile_v.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,829 copying ./csrc/fmha_v2/fmha/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,832 copying ./csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:33,835 copying ./csrc/fmha_v2/fused_multihead_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,838 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:33,840 copying ./csrc/gemm_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,843 copying ./csrc/pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,845 copying ./csrc/single_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,848 copying ./csrc/mxfp8_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,850 copying ./csrc/selective_state_update_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,852 copying ./csrc/batch_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,854 copying ./csrc/tgv_gemm.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,856 copying ./csrc/batch_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,858 copying ./csrc/trtllm_fmha_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,861 copying ./csrc/trtllm_allreduce_fusion.cu -> 
build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,863 copying ./csrc/batch_prefill_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,865 copying ./csrc/flashinfer_mamba_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,867 copying ./csrc/fmha_cutlass_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,869 copying ./csrc/group_gemm_fp8_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,871 copying ./csrc/fmha_cutlass_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,873 copying ./csrc/batch_decode_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,875 copying ./csrc/single_prefill_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,877 copying ./csrc/batch_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,880 copying ./csrc/fp4_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,882 copying ./csrc/quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,884 copying ./csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,885 copying ./csrc/fp4_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,888 copying ./csrc/batch_decode_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,890 copying ./csrc/pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,892 copying ./csrc/cudnn_sdpa_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,895 copying ./csrc/trtllm_mnnvl_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,897 copying ./csrc/cudnn_sdpa_utils.h -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,901 copying ./csrc/batch_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,903 copying ./csrc/fp4_kv_quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-24T15:42:33,906 creating 
build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,907 copying ./include/flashinfer/topk.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,912 copying ./include/flashinfer/fastdiv.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,914 creating build/lib/flashinfer/data/include/flashinfer/norm 2026-04-24T15:42:33,915 copying ./include/flashinfer/norm/ln_fwd_silu_kernel.cuh -> build/lib/flashinfer/data/include/flashinfer/norm 2026-04-24T15:42:33,919 copying ./include/flashinfer/norm/ln_silu_headers.cuh -> build/lib/flashinfer/data/include/flashinfer/norm 2026-04-24T15:42:33,922 copying ./include/flashinfer/math.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,925 copying ./include/flashinfer/cubin_loader.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,927 creating build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-24T15:42:33,930 copying ./include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-24T15:42:33,933 copying ./include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-24T15:42:33,935 copying ./include/flashinfer/flat/unused.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,937 creating build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-24T15:42:33,938 copying ./include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-24T15:42:33,941 copying ./include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-24T15:42:33,943 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-24T15:42:33,945 copying 
./include/flashinfer/flat/hopper/device/device_universal.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-24T15:42:33,947 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:33,948 copying ./include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:33,951 copying ./include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:33,953 copying ./include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:33,956 copying ./include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:33,958 copying ./include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:33,962 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:33,963 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:33,966 copying ./include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:33,968 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:33,970 copying ./include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:33,973 copying ./include/flashinfer/flat/type_traits.hpp -> 
build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,975 copying ./include/flashinfer/flat/cute_ext.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,978 copying ./include/flashinfer/flat/math.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,980 copying ./include/flashinfer/flat/math_order_barrier.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,983 copying ./include/flashinfer/flat/common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,985 copying ./include/flashinfer/flat/debug.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:33,987 copying ./include/flashinfer/fp16.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,990 copying ./include/flashinfer/vec_dtypes.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,994 copying ./include/flashinfer/cutlass_utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,997 copying ./include/flashinfer/allocator.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:33,999 copying ./include/flashinfer/activation.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,002 copying ./include/flashinfer/topk_common.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,004 copying ./include/flashinfer/exception.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,007 copying ./include/flashinfer/utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,010 copying ./include/flashinfer/quantization.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,013 copying ./include/flashinfer/norm.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,016 creating build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,017 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> 
build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,021 copying ./include/flashinfer/gemm/tgv_gemm_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,023 copying ./include/flashinfer/gemm/tgv_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,025 copying ./include/flashinfer/gemm/bmm_fp8.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,028 copying ./include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,031 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,034 copying ./include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,036 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,039 copying ./include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,042 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,045 copying ./include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,047 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,049 copying ./include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,051 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,054 copying ./include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,056 copying 
./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,059 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,061 copying ./include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,064 copying ./include/flashinfer/gemm/group_gemv.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,066 copying ./include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,069 copying ./include/flashinfer/gemm/group_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,072 copying ./include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,074 copying ./include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,076 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,079 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,082 copying ./include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,084 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,087 copying ./include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,090 copying ./include/flashinfer/gemm/cutlass_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,092 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> 
build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,095 copying ./include/flashinfer/gemm/group_gemm_lora.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,097 copying ./include/flashinfer/gemm/tgv_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,100 copying ./include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,103 copying ./include/flashinfer/gemm/group_gemm_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,105 copying ./include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:34,108 copying ./include/flashinfer/attention_impl.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,110 copying ./include/flashinfer/fast_topk_clusters_exact.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,113 copying ./include/flashinfer/arch_condition.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,116 copying ./include/flashinfer/sampling.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,119 copying ./include/flashinfer/fp4_layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,121 creating build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,122 copying ./include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,126 copying ./include/flashinfer/comm/trtllm_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,130 copying ./include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,133 copying ./include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,136 copying 
./include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,139 copying ./include/flashinfer/comm/trtllm_alltoall.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,143 copying ./include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:34,146 copying ./include/flashinfer/pos_enc.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,149 copying ./include/flashinfer/layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,152 copying ./include/flashinfer/air_top_p.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,155 copying ./include/flashinfer/permuted_smem.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,158 copying ./include/flashinfer/concat_mla.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,161 creating build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,162 copying ./include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,165 copying ./include/flashinfer/mamba/create_tensor_map.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,167 copying ./include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,170 copying ./include/flashinfer/mamba/common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,172 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,176 copying ./include/flashinfer/mamba/selective_state_update.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,178 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh -> 
build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,180 copying ./include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,184 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,187 copying ./include/flashinfer/mamba/ssu_mtp_common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,189 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,192 copying ./include/flashinfer/mamba/conversion.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:34,194 copying ./include/flashinfer/profiler.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,197 copying ./include/flashinfer/mma.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,200 copying ./include/flashinfer/frag_layout_swizzle.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,201 creating build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,202 copying ./include/flashinfer/attention/state.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,205 copying ./include/flashinfer/attention/mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,208 copying ./include/flashinfer/attention/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,210 copying ./include/flashinfer/attention/persistent.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,213 copying ./include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,216 copying ./include/flashinfer/attention/heap.h -> 
build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,218 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,219 copying ./include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,222 copying ./include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,225 copying ./include/flashinfer/attention/hopper/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,227 copying ./include/flashinfer/attention/hopper/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,229 copying ./include/flashinfer/attention/hopper/mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,232 copying ./include/flashinfer/attention/hopper/utils.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,234 copying ./include/flashinfer/attention/hopper/attention_updater.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,236 copying ./include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,239 copying ./include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,242 copying ./include/flashinfer/attention/hopper/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,244 copying ./include/flashinfer/attention/hopper/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,246 copying ./include/flashinfer/attention/hopper/named_barrier.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,248 
copying ./include/flashinfer/attention/hopper/default_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:34,250 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,251 copying ./include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,254 copying ./include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,257 copying ./include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,259 copying ./include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,261 copying ./include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,264 copying ./include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:34,267 copying ./include/flashinfer/attention/mla_hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,270 copying ./include/flashinfer/attention/default_prefill_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,272 copying ./include/flashinfer/attention/mask.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,274 copying ./include/flashinfer/attention/default_decode_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,277 copying ./include/flashinfer/attention/hopper.cuh -> 
build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,279 copying ./include/flashinfer/attention/cutlass_mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,281 copying ./include/flashinfer/attention/decode.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,285 copying ./include/flashinfer/attention/cascade.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,287 copying ./include/flashinfer/attention/pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,290 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-24T15:42:34,291 copying ./include/flashinfer/attention/blackwell/plan.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-24T15:42:34,294 copying ./include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-24T15:42:34,296 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-24T15:42:34,297 copying ./include/flashinfer/attention/blackwell/device/fmha.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-24T15:42:34,300 copying ./include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-24T15:42:34,302 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-24T15:42:34,303 copying ./include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-24T15:42:34,305 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,306 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> 
build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,309 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,312 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,314 copying ./include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,317 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,319 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,322 copying ./include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,324 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:34,328 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,329 copying ./include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,331 copying ./include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,334 copying 
./include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,337 copying ./include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,339 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,341 copying ./include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,343 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,347 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:34,351 copying ./include/flashinfer/attention/batch_pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,354 copying ./include/flashinfer/attention/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,356 copying ./include/flashinfer/attention/prefill.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,362 copying ./include/flashinfer/attention/mla_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,364 copying ./include/flashinfer/attention/scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,368 copying ./include/flashinfer/attention/persistent_template.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:34,370 copying ./include/flashinfer/cp_async.cuh -> 
build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,372 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,374 copying ./include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,377 copying ./include/flashinfer/trtllm/fused_moe/runner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,379 copying ./include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,382 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,386 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,388 copying ./include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,391 copying ./include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,392 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,395 copying ./include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:34,397 creating build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,398 copying ./include/flashinfer/trtllm/common/cudaUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,401 copying ./include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,403 copying ./include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> 
build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,406 copying ./include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,408 copying ./include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,410 copying ./include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:34,413 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-24T15:42:34,414 copying ./include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-24T15:42:34,417 copying ./include/flashinfer/trtllm/common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm 2026-04-24T15:42:34,419 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,420 copying ./include/flashinfer/trtllm/fmha/kernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,424 copying ./include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,427 copying ./include/flashinfer/trtllm/fmha/decoder_params.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,429 copying ./include/flashinfer/trtllm/fmha/lse.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,431 copying ./include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,434 copying ./include/flashinfer/trtllm/fmha/kernelUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,436 copying ./include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,438 copying 
./include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,441 copying ./include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:34,443 copying ./include/flashinfer/logging.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,445 copying ./include/flashinfer/page.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-24T15:42:34,448 creating build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,449 copying 3rdparty/spdlog/include/spdlog/async_logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,451 creating build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,452 copying 3rdparty/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,455 copying 3rdparty/spdlog/include/spdlog/details/file_helper-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,457 copying 3rdparty/spdlog/include/spdlog/details/file_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,459 copying 3rdparty/spdlog/include/spdlog/details/backtracer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,461 copying 3rdparty/spdlog/include/spdlog/details/synchronous_factory.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,463 copying 3rdparty/spdlog/include/spdlog/details/fmt_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,466 copying 3rdparty/spdlog/include/spdlog/details/tcp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,468 copying 3rdparty/spdlog/include/spdlog/details/tcp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,470 copying 
3rdparty/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,472 copying 3rdparty/spdlog/include/spdlog/details/circular_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,475 copying 3rdparty/spdlog/include/spdlog/details/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,477 copying 3rdparty/spdlog/include/spdlog/details/null_mutex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,479 copying 3rdparty/spdlog/include/spdlog/details/os-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,481 copying 3rdparty/spdlog/include/spdlog/details/thread_pool-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,484 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,486 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,488 copying 3rdparty/spdlog/include/spdlog/details/console_globals.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,491 copying 3rdparty/spdlog/include/spdlog/details/windows_include.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,492 copying 3rdparty/spdlog/include/spdlog/details/udp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,494 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,496 copying 3rdparty/spdlog/include/spdlog/details/backtracer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,498 copying 3rdparty/spdlog/include/spdlog/details/thread_pool.h -> 
build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,500 copying 3rdparty/spdlog/include/spdlog/details/log_msg-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,502 copying 3rdparty/spdlog/include/spdlog/details/log_msg.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,504 copying 3rdparty/spdlog/include/spdlog/details/registry-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,506 copying 3rdparty/spdlog/include/spdlog/details/udp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,508 copying 3rdparty/spdlog/include/spdlog/details/registry.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:34,511 copying 3rdparty/spdlog/include/spdlog/mdc.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,513 copying 3rdparty/spdlog/include/spdlog/pattern_formatter-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,517 copying 3rdparty/spdlog/include/spdlog/common-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,519 copying 3rdparty/spdlog/include/spdlog/tweakme.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,522 copying 3rdparty/spdlog/include/spdlog/async.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,524 copying 3rdparty/spdlog/include/spdlog/formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,526 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,528 copying 3rdparty/spdlog/include/spdlog/fmt/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,530 copying 3rdparty/spdlog/include/spdlog/fmt/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,532 copying 3rdparty/spdlog/include/spdlog/fmt/bin_to_hex.h -> 
build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,535 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,536 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,541 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/args.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,544 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/core.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,549 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,552 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,556 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/color.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,559 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,561 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,564 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,568 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,573 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,576 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,582 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/locale.h -> 
build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,585 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,588 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/printf.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:34,591 copying 3rdparty/spdlog/include/spdlog/fmt/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,594 copying 3rdparty/spdlog/include/spdlog/fmt/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,596 copying 3rdparty/spdlog/include/spdlog/fmt/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,599 copying 3rdparty/spdlog/include/spdlog/fmt/fmt.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,601 copying 3rdparty/spdlog/include/spdlog/fmt/ostr.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:34,604 creating build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,605 copying 3rdparty/spdlog/include/spdlog/sinks/null_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,608 copying 3rdparty/spdlog/include/spdlog/sinks/ostream_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,610 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,613 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,616 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,618 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,621 copying 
3rdparty/spdlog/include/spdlog/sinks/sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,623 copying 3rdparty/spdlog/include/spdlog/sinks/mongo_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,626 copying 3rdparty/spdlog/include/spdlog/sinks/android_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,628 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,630 copying 3rdparty/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,632 copying 3rdparty/spdlog/include/spdlog/sinks/qt_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,635 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,637 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,639 copying 3rdparty/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,641 copying 3rdparty/spdlog/include/spdlog/sinks/sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,643 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,646 copying 3rdparty/spdlog/include/spdlog/sinks/tcp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,648 copying 3rdparty/spdlog/include/spdlog/sinks/systemd_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,650 copying 3rdparty/spdlog/include/spdlog/sinks/syslog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,653 copying 
3rdparty/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,655 copying 3rdparty/spdlog/include/spdlog/sinks/dist_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,657 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,659 copying 3rdparty/spdlog/include/spdlog/sinks/udp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,661 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,663 copying 3rdparty/spdlog/include/spdlog/sinks/callback_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,665 copying 3rdparty/spdlog/include/spdlog/sinks/kafka_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,667 copying 3rdparty/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,669 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,671 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,674 copying 3rdparty/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,676 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,678 copying 3rdparty/spdlog/include/spdlog/sinks/msvc_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:34,679 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 
2026-04-24T15:42:34,681 copying 3rdparty/spdlog/include/spdlog/spdlog-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,683 copying 3rdparty/spdlog/include/spdlog/logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,686 copying 3rdparty/spdlog/include/spdlog/version.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,688 copying 3rdparty/spdlog/include/spdlog/common.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,690 copying 3rdparty/spdlog/include/spdlog/pattern_formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,693 copying 3rdparty/spdlog/include/spdlog/fwd.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,695 copying 3rdparty/spdlog/include/spdlog/logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,697 creating build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:34,698 copying 3rdparty/spdlog/include/spdlog/cfg/helpers-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:34,700 copying 3rdparty/spdlog/include/spdlog/cfg/helpers.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:34,702 copying 3rdparty/spdlog/include/spdlog/cfg/argv.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:34,704 copying 3rdparty/spdlog/include/spdlog/cfg/env.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:34,706 copying 3rdparty/spdlog/include/spdlog/spdlog.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,708 copying 3rdparty/spdlog/include/spdlog/async_logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,710 copying 3rdparty/spdlog/include/spdlog/stopwatch.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:34,712 creating build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,714 copying 
3rdparty/cutlass/include/cute/swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,717 creating build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,718 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,720 copying 3rdparty/cutlass/include/cute/arch/mma_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,723 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,767 copying 3rdparty/cutlass/include/cute/arch/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,769 copying 3rdparty/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,774 copying 3rdparty/cutlass/include/cute/arch/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,777 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,780 copying 3rdparty/cutlass/include/cute/arch/cluster_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,783 copying 3rdparty/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,785 copying 3rdparty/cutlass/include/cute/arch/util.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,787 copying 3rdparty/cutlass/include/cute/arch/copy_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,795 copying 3rdparty/cutlass/include/cute/arch/mma_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,799 copying 3rdparty/cutlass/include/cute/arch/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,801 copying 
3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,818 copying 3rdparty/cutlass/include/cute/arch/mma_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,820 copying 3rdparty/cutlass/include/cute/arch/copy_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,823 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,827 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,830 copying 3rdparty/cutlass/include/cute/arch/copy_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,833 copying 3rdparty/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,836 copying 3rdparty/cutlass/include/cute/arch/mma_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,841 copying 3rdparty/cutlass/include/cute/arch/mma_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,843 copying 3rdparty/cutlass/include/cute/arch/mma_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,846 copying 3rdparty/cutlass/include/cute/arch/cluster_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,848 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,851 copying 3rdparty/cutlass/include/cute/arch/mma_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,860 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,903 copying 3rdparty/cutlass/include/cute/arch/copy_sm90.hpp 
-> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,905 copying 3rdparty/cutlass/include/cute/arch/copy_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,908 copying 3rdparty/cutlass/include/cute/arch/mma_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,911 copying 3rdparty/cutlass/include/cute/arch/simd_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,913 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:34,935 copying 3rdparty/cutlass/include/cute/pointer_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,938 copying 3rdparty/cutlass/include/cute/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,940 copying 3rdparty/cutlass/include/cute/stride.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,943 copying 3rdparty/cutlass/include/cute/pointer_base.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,946 creating build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,947 copying 3rdparty/cutlass/include/cute/numeric/integer_sequence.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,949 copying 3rdparty/cutlass/include/cute/numeric/integral_ratio.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,952 copying 3rdparty/cutlass/include/cute/numeric/numeric_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,954 copying 3rdparty/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,956 copying 3rdparty/cutlass/include/cute/numeric/integral_constant.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,959 copying 
3rdparty/cutlass/include/cute/numeric/complex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,961 copying 3rdparty/cutlass/include/cute/numeric/math.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,963 copying 3rdparty/cutlass/include/cute/numeric/real.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,965 copying 3rdparty/cutlass/include/cute/numeric/int.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:34,967 creating build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,968 copying 3rdparty/cutlass/include/cute/container/array_aligned.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,971 copying 3rdparty/cutlass/include/cute/container/array_subbyte.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,973 copying 3rdparty/cutlass/include/cute/container/cuda_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,976 copying 3rdparty/cutlass/include/cute/container/type_list.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,978 copying 3rdparty/cutlass/include/cute/container/bit_field.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,980 copying 3rdparty/cutlass/include/cute/container/alignment.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,982 copying 3rdparty/cutlass/include/cute/container/tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,985 copying 3rdparty/cutlass/include/cute/container/array.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:34,987 copying 3rdparty/cutlass/include/cute/tensor_zip.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,990 copying 3rdparty/cutlass/include/cute/swizzle_layout.hpp -> 
build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,992 copying 3rdparty/cutlass/include/cute/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:34,996 creating build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:34,997 copying 3rdparty/cutlass/include/cute/algorithm/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,000 copying 3rdparty/cutlass/include/cute/algorithm/prefetch.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,002 copying 3rdparty/cutlass/include/cute/algorithm/axpby.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,005 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,007 copying 3rdparty/cutlass/include/cute/algorithm/gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,010 copying 3rdparty/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,012 copying 3rdparty/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,015 copying 3rdparty/cutlass/include/cute/algorithm/clear.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,017 copying 3rdparty/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,019 copying 3rdparty/cutlass/include/cute/algorithm/prefer.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,021 copying 3rdparty/cutlass/include/cute/algorithm/fill.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,024 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> 
build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,026 copying 3rdparty/cutlass/include/cute/algorithm/functional.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:35,029 copying 3rdparty/cutlass/include/cute/underscore.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,032 copying 3rdparty/cutlass/include/cute/int_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,035 creating build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,036 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,048 copying 3rdparty/cutlass/include/cute/atom/mma_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,051 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,057 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,060 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,063 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,065 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,067 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,079 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,082 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm89.hpp -> 
build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,084 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,089 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,092 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,094 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,097 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,099 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,102 copying 3rdparty/cutlass/include/cute/atom/partitioner.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,104 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,111 copying 3rdparty/cutlass/include/cute/atom/copy_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,113 copying 3rdparty/cutlass/include/cute/atom/copy_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,116 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,118 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,121 copying 3rdparty/cutlass/include/cute/atom/mma_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,124 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm75.hpp -> 
build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,125 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,131 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,134 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,136 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:35,139 copying 3rdparty/cutlass/include/cute/layout_composed.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,142 copying 3rdparty/cutlass/include/cute/tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,144 copying 3rdparty/cutlass/include/cute/pointer_flagged.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,146 copying 3rdparty/cutlass/include/cute/pointer_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,149 copying 3rdparty/cutlass/include/cute/tensor_impl.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,152 copying 3rdparty/cutlass/include/cute/pointer.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:35,154 creating build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,155 copying 3rdparty/cutlass/include/cute/util/type_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,158 copying 3rdparty/cutlass/include/cute/util/print_latex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,160 copying 3rdparty/cutlass/include/cute/util/print.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,163 copying 3rdparty/cutlass/include/cute/util/print_tensor.hpp -> 
build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,165 copying 3rdparty/cutlass/include/cute/util/print_svg.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,167 copying 3rdparty/cutlass/include/cute/util/debug.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:35,170 creating build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-24T15:42:35,171 copying 3rdparty/cutlass/include/cutlass/platform/platform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-24T15:42:35,174 copying 3rdparty/cutlass/include/cutlass/uint128.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,177 copying 3rdparty/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,179 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,181 copying 3rdparty/cutlass/include/cutlass/array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,185 copying 3rdparty/cutlass/include/cutlass/cutlass.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,187 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,189 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,192 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,195 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,197 copying 
3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,200 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,202 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,204 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,207 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,209 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,211 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,214 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,216 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,218 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,221 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,223 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,225 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,228 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,230 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,232 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,235 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,237 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,240 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,243 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,246 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 
2026-04-24T15:42:35,249 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,252 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,255 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,258 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,263 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,267 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,270 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,273 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,276 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,279 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:35,281 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 
2026-04-24T15:42:35,284 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:35,288 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:35,290 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:35,294 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:35,297 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,300 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,303 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,306 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,310 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,313 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,316 copying 
3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,319 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,323 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,326 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,328 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,331 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,335 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,338 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,341 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,343 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:35,345 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,346 
copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,349 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,351 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,354 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,370 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,372 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,374 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,377 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,379 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,382 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,384 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,387 copying 
3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,389 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,391 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,394 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:35,396 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,397 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,400 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,402 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/activation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,405 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,407 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,410 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,412 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,414 copying 
3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,417 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,419 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,421 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,423 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,425 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,428 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,430 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,432 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,435 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,437 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,439 copying 
3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,442 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,444 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,446 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,449 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,451 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,453 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:35,455 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,456 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,460 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,463 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,466 copying 
3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,470 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,473 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,475 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,478 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,480 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,482 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,486 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,488 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,489 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,491 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,493 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,496 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,498 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,502 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:35,505 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,508 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,512 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,514 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,517 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,519 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 
2026-04-24T15:42:35,522 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:35,525 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,526 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,529 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,534 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,536 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,539 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,542 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,545 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,548 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,551 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,554 copying 
3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,557 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,560 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,563 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:35,567 copying 3rdparty/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-24T15:42:35,569 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-24T15:42:35,571 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-24T15:42:35,574 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-24T15:42:35,577 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:35,578 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:35,580 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:35,582 copying 
3rdparty/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:35,585 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:35,586 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:35,589 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:35,591 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:35,593 copying 3rdparty/cutlass/include/cutlass/uint256.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,595 creating build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,596 copying 3rdparty/cutlass/include/cutlass/arch/cache_operation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,598 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,600 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,603 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,605 copying 3rdparty/cutlass/include/cutlass/arch/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,607 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,610 copying 
3rdparty/cutlass/include/cutlass/arch/wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,612 copying 3rdparty/cutlass/include/cutlass/arch/config.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,614 copying 3rdparty/cutlass/include/cutlass/arch/arch.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,616 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,618 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,621 copying 3rdparty/cutlass/include/cutlass/arch/memory.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,624 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,626 copying 3rdparty/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,629 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,631 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm72.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,633 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,635 copying 3rdparty/cutlass/include/cutlass/arch/synclog.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,638 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,642 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,644 copying 
3rdparty/cutlass/include/cutlass/arch/reg_reconfig.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,646 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,648 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,651 copying 3rdparty/cutlass/include/cutlass/arch/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,653 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm90.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,656 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,658 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,661 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm100.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,663 copying 3rdparty/cutlass/include/cutlass/arch/simd.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:35,665 copying 3rdparty/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,667 copying 3rdparty/cutlass/include/cutlass/numeric_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,669 copying 3rdparty/cutlass/include/cutlass/blas3_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,671 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,673 copying 3rdparty/cutlass/include/cutlass/subbyte_reference.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,676 copying 3rdparty/cutlass/include/cutlass/tensor_ref.h 
-> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,678 copying 3rdparty/cutlass/include/cutlass/floating_point_nvrtc.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,680 copying 3rdparty/cutlass/include/cutlass/real.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,682 copying 3rdparty/cutlass/include/cutlass/numeric_conversion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,688 copying 3rdparty/cutlass/include/cutlass/semaphore.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,690 copying 3rdparty/cutlass/include/cutlass/tensor_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,692 copying 3rdparty/cutlass/include/cutlass/float_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,695 copying 3rdparty/cutlass/include/cutlass/aligned_buffer.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,697 copying 3rdparty/cutlass/include/cutlass/block_striped.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,699 copying 3rdparty/cutlass/include/cutlass/constants.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,703 copying 3rdparty/cutlass/include/cutlass/functional.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,705 copying 3rdparty/cutlass/include/cutlass/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,714 copying 3rdparty/cutlass/include/cutlass/coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,716 copying 3rdparty/cutlass/include/cutlass/exmy_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,719 copying 3rdparty/cutlass/include/cutlass/cluster_launch.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,722 copying 3rdparty/cutlass/include/cutlass/integer_subbyte.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,724 copying 3rdparty/cutlass/include/cutlass/version.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,726 creating build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,727 copying 3rdparty/cutlass/include/cutlass/layout/vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,730 copying 3rdparty/cutlass/include/cutlass/layout/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,733 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,735 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,738 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,741 copying 3rdparty/cutlass/include/cutlass/layout/tensor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,744 copying 3rdparty/cutlass/include/cutlass/layout/pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,746 copying 3rdparty/cutlass/include/cutlass/layout/permute.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,749 copying 3rdparty/cutlass/include/cutlass/layout/layout.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:35,751 copying 3rdparty/cutlass/include/cutlass/device_kernel.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:35,753 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:35,754 copying 3rdparty/cutlass/include/cutlass/gemm/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:35,757 creating 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,758 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,760 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,763 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,765 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,769 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,771 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,774 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,776 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,779 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,782 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,785 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,787 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,789 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,792 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,796 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,798 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,800 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,803 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,806 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,808 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,811 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,813 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,816 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,819 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,823 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,826 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,829 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,832 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,834 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,837 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,839 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,841 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,843 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,846 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,848 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,850 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,853 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,855 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,858 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,860 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,863 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,865 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,868 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,870 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:35,872 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,873 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,876 copying 3rdparty/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,879 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,881 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,884 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,886 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,889 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,891 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,893 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,898 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,900 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,903 copying 
3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,905 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,908 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,910 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,913 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,915 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,917 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,920 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,922 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,926 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,928 copying 3rdparty/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,930 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 
2026-04-24T15:42:35,933 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,935 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,937 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,939 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,942 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,944 copying 3rdparty/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,946 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,950 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,952 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,956 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,958 copying 3rdparty/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,961 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:35,964 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,965 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,968 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,971 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,973 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,976 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,979 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,981 copying 3rdparty/cutlass/include/cutlass/gemm/device/symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,984 copying 3rdparty/cutlass/include/cutlass/gemm/device/trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,987 copying 3rdparty/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,989 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,992 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,995 copying 
3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:35,997 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,000 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,002 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,005 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,008 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,010 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,012 copying 3rdparty/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,015 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,017 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,019 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,022 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,025 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,027 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,030 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,032 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,035 copying 3rdparty/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,038 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,040 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:36,043 copying 3rdparty/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:36,045 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:36,046 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:36,048 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:36,050 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:36,053 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:36,055 creating 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,056 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,060 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,063 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,066 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,068 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,071 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,074 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,077 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,081 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,083 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,085 
copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,089 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,091 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,095 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,098 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,101 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,104 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,107 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,110 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,112 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,115 copying 3rdparty/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,118 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,120 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,124 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,127 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,131 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,134 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,137 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,140 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,143 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,146 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 
2026-04-24T15:42:36,149 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,152 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,155 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,159 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,160 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,163 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,166 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,169 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,172 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,175 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,178 copying 
3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,180 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,183 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,185 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,188 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,190 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,193 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,196 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,199 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,202 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,204 copying 
3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,208 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,210 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,213 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,216 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,218 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,222 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,224 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,227 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,230 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,232 
copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,235 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,238 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:36,240 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,244 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,247 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,250 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,253 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,256 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,259 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,262 copying 
3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,265 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,268 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,271 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,274 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,277 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,279 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,282 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,286 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,289 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,292 copying 
3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,294 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:36,297 copying 3rdparty/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:36,300 copying 3rdparty/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:36,302 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,303 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,306 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,309 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,312 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,315 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,317 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,320 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,322 
copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,325 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,328 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,330 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,333 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,335 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,337 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,339 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,342 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,344 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,347 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,349 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,352 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,355 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,358 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,360 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,363 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,365 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,368 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,371 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,373 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,376 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,379 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,382 copying 
3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,384 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,386 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,389 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,392 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,395 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,398 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,400 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,403 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,406 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,409 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,411 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,414 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,416 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,419 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,421 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,424 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,426 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,428 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,431 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,434 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,437 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,439 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,442 copying 
3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,444 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,447 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,450 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,452 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,455 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,457 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,459 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,462 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,464 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,467 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,470 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,472 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,475 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,477 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,480 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,482 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,484 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,487 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,490 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,493 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,496 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,499 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 
2026-04-24T15:42:36,502 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,504 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,507 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,510 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,512 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,515 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,518 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,520 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,523 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,525 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,528 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,531 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,533 
copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,536 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,539 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,541 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,544 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,546 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,549 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,552 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,555 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,558 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,561 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,564 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,567 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,569 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,572 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,575 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,577 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,580 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,582 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,585 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,588 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,591 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,594 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,596 copying 
3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,599 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,602 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,604 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:36,606 copying 3rdparty/cutlass/include/cutlass/workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:36,608 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,610 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,613 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,616 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,619 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,621 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,623 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,626 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,628 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,631 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,634 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,636 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,639 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,641 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,644 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,647 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,649 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,652 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,655 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,658 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,661 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,664 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,666 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,669 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,671 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,674 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:36,677 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-24T15:42:36,678 
copying 3rdparty/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-24T15:42:36,680 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-24T15:42:36,681 copying 3rdparty/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-24T15:42:36,684 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-24T15:42:36,684 copying 3rdparty/cutlass/include/cutlass/transform/thread/transpose.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-24T15:42:36,687 copying 3rdparty/cutlass/include/cutlass/transform/thread/unary_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-24T15:42:36,689 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-24T15:42:36,690 copying 3rdparty/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-24T15:42:36,693 copying 3rdparty/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform 2026-04-24T15:42:36,696 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:36,697 copying 3rdparty/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:36,699 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:36,702 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:36,705 copying 
3rdparty/cutlass/include/cutlass/wmma_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:36,707 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,708 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,711 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,714 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,716 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,719 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,721 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,724 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,727 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,730 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,732 copying 
3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,735 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,738 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,740 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,743 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,746 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,749 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,752 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,756 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,759 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,763 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,766 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,771 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,774 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,778 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,781 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,785 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,788 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,791 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,794 copying 
3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,798 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,801 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,804 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,808 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,811 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,814 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,818 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,821 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,825 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 
2026-04-24T15:42:36,828 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,831 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,835 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,838 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,841 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,844 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,847 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,850 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:36,853 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:36,854 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:36,857 copying 
3rdparty/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:36,859 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:36,862 copying 3rdparty/cutlass/include/cutlass/conv/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:36,864 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:36,865 copying 3rdparty/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:36,868 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:36,870 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:36,873 copying 3rdparty/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:36,875 copying 3rdparty/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:36,878 copying 3rdparty/cutlass/include/cutlass/conv/convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:36,880 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-24T15:42:36,881 copying 3rdparty/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-24T15:42:36,884 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:36,885 copying 3rdparty/cutlass/include/cutlass/conv/collective/detail.hpp -> 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:36,887 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:36,890 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:36,893 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:36,895 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:36,897 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:36,898 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:36,900 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:36,903 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:36,905 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:36,907 copying 3rdparty/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:36,909 copying 3rdparty/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:36,912 creating 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,913 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,915 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,918 copying 3rdparty/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,921 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,923 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,926 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,928 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,931 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,934 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,937 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,939 copying 3rdparty/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,941 copying 
3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,944 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,946 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,949 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,952 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,954 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,957 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,959 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,962 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,964 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,966 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,968 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,971 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,974 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,977 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,979 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,982 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,985 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:36,987 copying 3rdparty/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:36,990 copying 3rdparty/cutlass/include/cutlass/complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:36,993 creating build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-24T15:42:36,994 copying 3rdparty/cutlass/include/cutlass/thread/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-24T15:42:36,996 copying 3rdparty/cutlass/include/cutlass/half.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:36,999 copying 3rdparty/cutlass/include/cutlass/array_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,001 copying 3rdparty/cutlass/include/cutlass/tensor_view.h -> build/lib/flashinfer/data/cutlass/include/cutlass 
2026-04-24T15:42:37,003 copying 3rdparty/cutlass/include/cutlass/numeric_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,006 copying 3rdparty/cutlass/include/cutlass/matrix_shape.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,008 copying 3rdparty/cutlass/include/cutlass/quaternion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,011 copying 3rdparty/cutlass/include/cutlass/float8.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,014 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,015 copying 3rdparty/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,018 copying 3rdparty/cutlass/include/cutlass/detail/collective.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,020 copying 3rdparty/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,023 copying 3rdparty/cutlass/include/cutlass/detail/dependent_false.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,025 copying 3rdparty/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,027 copying 3rdparty/cutlass/include/cutlass/detail/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,029 copying 3rdparty/cutlass/include/cutlass/detail/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,032 copying 3rdparty/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,035 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:37,036 copying 
3rdparty/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:37,038 copying 3rdparty/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:37,040 copying 3rdparty/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:37,044 copying 3rdparty/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,046 copying 3rdparty/cutlass/include/cutlass/detail/cluster.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,049 copying 3rdparty/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,051 copying 3rdparty/cutlass/include/cutlass/detail/helper_macros.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:37,054 copying 3rdparty/cutlass/include/cutlass/blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,056 copying 3rdparty/cutlass/include/cutlass/fast_math.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,059 copying 3rdparty/cutlass/include/cutlass/tfloat32.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,062 copying 3rdparty/cutlass/include/cutlass/array_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,064 copying 3rdparty/cutlass/include/cutlass/bfloat16.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,067 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-24T15:42:37,068 copying 3rdparty/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-24T15:42:37,071 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:37,072 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:37,075 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:37,077 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:37,080 copying 3rdparty/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:37,082 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-24T15:42:37,083 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-24T15:42:37,086 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-24T15:42:37,088 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:37,089 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:37,092 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:37,095 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:37,098 copying 
3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:37,100 copying 3rdparty/cutlass/include/cutlass/relatively_equal.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,103 copying 3rdparty/cutlass/include/cutlass/gemm_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,105 copying 3rdparty/cutlass/include/cutlass/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,108 copying 3rdparty/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,110 copying 3rdparty/cutlass/include/cutlass/predicate_vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,113 copying 3rdparty/cutlass/include/cutlass/core_io.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,116 copying 3rdparty/cutlass/include/cutlass/pitch_linear_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,118 copying 3rdparty/cutlass/include/cutlass/matrix_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,120 copying 3rdparty/cutlass/include/cutlass/kernel_launch.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,123 creating build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:37,124 copying 3rdparty/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:37,126 copying 3rdparty/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:37,129 copying 3rdparty/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:37,133 copying 3rdparty/cutlass/include/cutlass/trace.h -> 
build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,135 copying 3rdparty/cutlass/include/cutlass/gemm_coord.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:37,138 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,140 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,142 copying 3rdparty/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,145 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,148 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,150 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,152 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,155 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,158 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,160 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,163 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,166 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,168 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,171 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,174 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,176 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-24T15:42:37,177 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-24T15:42:37,180 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:37,181 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:37,184 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:37,186 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 
2026-04-24T15:42:37,189 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,191 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:37,194 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,195 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,198 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,200 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,204 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,206 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,209 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,211 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,214 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,216 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,219 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,221 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,224 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,226 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,229 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,231 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,234 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,237 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,239 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,242 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,245 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,247 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,249 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,251 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,254 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:37,256 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-24T15:42:37,258 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-24T15:42:37,260 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-24T15:42:37,262 copying 3rdparty/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,265 copying 
3rdparty/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,267 copying 3rdparty/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,270 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,272 copying 3rdparty/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,275 copying 3rdparty/cutlass/tools/util/include/cutlass/util/command_line.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,277 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,279 copying 3rdparty/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,282 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,285 copying 3rdparty/cutlass/tools/util/include/cutlass/util/distribution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,288 copying 3rdparty/cutlass/tools/util/include/cutlass/util/debug.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,290 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,293 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,296 
copying 3rdparty/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,300 copying 3rdparty/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,303 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,305 copying 3rdparty/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,308 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,312 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,316 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,320 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,325 copying 3rdparty/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,329 copying 3rdparty/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,331 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:37,461 installing to build/bdist.linux-armv7l/wheel 2026-04-24T15:42:37,462 running install 2026-04-24T15:42:37,484 running install_lib 2026-04-24T15:42:37,491 
creating build/bdist.linux-armv7l/wheel 2026-04-24T15:42:37,492 copying build/lib/build_backend.py -> build/bdist.linux-armv7l/wheel/. 2026-04-24T15:42:37,496 creating build/bdist.linux-armv7l/wheel/flashinfer 2026-04-24T15:42:37,497 copying build/lib/flashinfer/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,501 creating build/bdist.linux-armv7l/wheel/flashinfer/cudnn 2026-04-24T15:42:37,502 copying build/lib/flashinfer/cudnn/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-24T15:42:37,504 copying build/lib/flashinfer/cudnn/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-24T15:42:37,506 copying build/lib/flashinfer/cudnn/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-24T15:42:37,508 copying build/lib/flashinfer/cudnn/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-24T15:42:37,510 copying build/lib/flashinfer/compilation_context.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,512 copying build/lib/flashinfer/pod.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,515 creating build/bdist.linux-armv7l/wheel/flashinfer/norm 2026-04-24T15:42:37,516 copying build/lib/flashinfer/norm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-24T15:42:37,519 creating build/bdist.linux-armv7l/wheel/flashinfer/norm/kernels 2026-04-24T15:42:37,521 copying build/lib/flashinfer/norm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-24T15:42:37,522 copying build/lib/flashinfer/norm/kernels/layernorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-24T15:42:37,525 copying build/lib/flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-24T15:42:37,528 copying build/lib/flashinfer/norm/kernels/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-24T15:42:37,530 
copying build/lib/flashinfer/norm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-24T15:42:37,534 creating build/bdist.linux-armv7l/wheel/flashinfer/logits_processor 2026-04-24T15:42:37,535 copying build/lib/flashinfer/logits_processor/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,537 copying build/lib/flashinfer/logits_processor/legalization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,539 copying build/lib/flashinfer/logits_processor/fusion_rules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,541 copying build/lib/flashinfer/logits_processor/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,543 copying build/lib/flashinfer/logits_processor/operators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,545 copying build/lib/flashinfer/logits_processor/validators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,547 copying build/lib/flashinfer/logits_processor/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,549 copying build/lib/flashinfer/logits_processor/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,551 copying build/lib/flashinfer/logits_processor/processors.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,553 copying build/lib/flashinfer/logits_processor/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-24T15:42:37,556 copying build/lib/flashinfer/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,559 copying build/lib/flashinfer/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,561 copying build/lib/flashinfer/gdn_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 
2026-04-24T15:42:37,564 copying build/lib/flashinfer/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,572 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe 2026-04-24T15:42:37,573 copying build/lib/flashinfer/fused_moe/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-24T15:42:37,576 copying build/lib/flashinfer/fused_moe/fused_routing_dsv3.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-24T15:42:37,580 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,581 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,584 copying build/lib/flashinfer/fused_moe/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,587 copying build/lib/flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,590 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,594 copying build/lib/flashinfer/fused_moe/cute_dsl/tuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,598 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,599 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_dispatch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,604 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,606 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_micro_kernel.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,612 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_static_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,618 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x/triton_compact.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,620 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_dynamic_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell_sm12x 2026-04-24T15:42:37,626 copying build/lib/flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,629 copying build/lib/flashinfer/fused_moe/cute_dsl/b12x_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-24T15:42:37,633 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:37,635 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:37,640 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:37,643 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:37,649 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-24T15:42:37,652 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 
2026-04-24T15:42:37,655 copying build/lib/flashinfer/fused_moe/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-24T15:42:37,659 copying build/lib/flashinfer/fused_moe/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-24T15:42:37,661 copying build/lib/flashinfer/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,663 copying build/lib/flashinfer/version.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,665 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl 2026-04-24T15:42:37,666 copying build/lib/flashinfer/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,668 copying build/lib/flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,671 copying build/lib/flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,675 copying build/lib/flashinfer/cute_dsl/blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,677 copying build/lib/flashinfer/cute_dsl/fp4_common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,680 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention 2026-04-24T15:42:37,681 copying build/lib/flashinfer/cute_dsl/attention/pipeline_topology.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,684 copying build/lib/flashinfer/cute_dsl/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,686 copying build/lib/flashinfer/cute_dsl/attention/tmem_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,688 copying build/lib/flashinfer/cute_dsl/attention/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,690 copying 
build/lib/flashinfer/cute_dsl/attention/config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,693 copying build/lib/flashinfer/cute_dsl/attention/mla_warp_schedule.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,696 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,697 copying build/lib/flashinfer/cute_dsl/attention/roles/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,699 copying build/lib/flashinfer/cute_dsl/attention/roles/softmax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,701 copying build/lib/flashinfer/cute_dsl/attention/roles/correction.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,704 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,706 copying build/lib/flashinfer/cute_dsl/attention/roles/softmax_math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,708 copying build/lib/flashinfer/cute_dsl/attention/roles/loader_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,710 copying build/lib/flashinfer/cute_dsl/attention/roles/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,712 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,715 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_pt_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,717 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_correction.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,719 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,721 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_compute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,724 copying build/lib/flashinfer/cute_dsl/attention/roles/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,726 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-24T15:42:37,729 copying build/lib/flashinfer/cute_dsl/attention/warp_schedule.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,730 copying build/lib/flashinfer/cute_dsl/attention/collective_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,733 copying build/lib/flashinfer/cute_dsl/attention/mla_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,735 copying build/lib/flashinfer/cute_dsl/attention/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,738 copying build/lib/flashinfer/cute_dsl/attention/compat.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,740 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:37,741 copying build/lib/flashinfer/cute_dsl/attention/fusion/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:37,743 copying build/lib/flashinfer/cute_dsl/attention/fusion/mask.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:37,746 copying 
build/lib/flashinfer/cute_dsl/attention/fusion/variant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-24T15:42:37,749 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:37,750 copying build/lib/flashinfer/cute_dsl/attention/wrappers/batch_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:37,752 copying build/lib/flashinfer/cute_dsl/attention/wrappers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:37,754 copying build/lib/flashinfer/cute_dsl/attention/wrappers/batch_mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-24T15:42:37,757 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:37,758 copying build/lib/flashinfer/cute_dsl/attention/scheduler/mla_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:37,761 copying build/lib/flashinfer/cute_dsl/attention/scheduler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:37,762 copying build/lib/flashinfer/cute_dsl/attention/scheduler/persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-24T15:42:37,764 copying build/lib/flashinfer/cute_dsl/attention/mla_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,767 copying build/lib/flashinfer/cute_dsl/attention/mainloop_spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-24T15:42:37,769 copying build/lib/flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,772 copying build/lib/flashinfer/cute_dsl/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-24T15:42:37,774 copying 
build/lib/flashinfer/_build_meta.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,776 copying build/lib/flashinfer/deep_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,779 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels 2026-04-24T15:42:37,780 copying build/lib/flashinfer/gdn_kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-24T15:42:37,782 copying build/lib/flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-24T15:42:37,785 copying build/lib/flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-24T15:42:37,788 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:37,789 copying build/lib/flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:37,794 copying build/lib/flashinfer/gdn_kernels/blackwell/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:37,796 copying build/lib/flashinfer/gdn_kernels/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:37,798 copying build/lib/flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-24T15:42:37,800 copying build/lib/flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-24T15:42:37,804 copying build/lib/flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-24T15:42:37,809 creating build/bdist.linux-armv7l/wheel/flashinfer/testing 2026-04-24T15:42:37,810 copying build/lib/flashinfer/testing/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 
2026-04-24T15:42:37,812 copying build/lib/flashinfer/testing/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-24T15:42:37,815 copying build/lib/flashinfer/sparse.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,818 copying build/lib/flashinfer/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,821 creating build/bdist.linux-armv7l/wheel/flashinfer/tuning_configs 2026-04-24T15:42:37,822 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-24T15:42:37,824 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-24T15:42:37,826 copying build/lib/flashinfer/tllm_enums.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:37,829 creating build/bdist.linux-armv7l/wheel/flashinfer/data 2026-04-24T15:42:37,830 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog 2026-04-24T15:42:37,832 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include 2026-04-24T15:42:37,834 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,835 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,838 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,839 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,842 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,844 copying 
build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,846 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,848 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,849 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,852 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,854 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,855 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,857 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/circular_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,859 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,861 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/null_mutex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,863 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 
2026-04-24T15:42:37,865 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,867 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,869 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,871 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/console_globals.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,873 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/windows_include.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,874 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,876 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,878 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,880 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,882 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,883 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,885 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,887 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,889 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-24T15:42:37,891 copying build/lib/flashinfer/data/spdlog/include/spdlog/mdc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,893 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,895 copying build/lib/flashinfer/data/spdlog/include/spdlog/common-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,897 copying build/lib/flashinfer/data/spdlog/include/spdlog/tweakme.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,899 copying build/lib/flashinfer/data/spdlog/include/spdlog/async.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,901 copying build/lib/flashinfer/data/spdlog/include/spdlog/formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:37,904 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,905 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,906 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/compile.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,908 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,911 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,912 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,915 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,917 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,921 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,924 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,926 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,929 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,931 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,933 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,936 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,939 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,941 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,947 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,949 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,951 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-24T15:42:37,953 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,955 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,957 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,958 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/fmt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,960 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ostr.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-24T15:42:37,963 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,964 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,966 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,967 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,969 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,972 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,974 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,975 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,977 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,979 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,982 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 
2026-04-24T15:42:37,984 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,985 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,988 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,989 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,991 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,993 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,994 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,997 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:37,998 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,001 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,003 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,005 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,007 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,008 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,010 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,012 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,014 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,016 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,018 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,020 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,022 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,025 copying 
build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,026 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,028 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-24T15:42:38,030 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,032 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,034 copying build/lib/flashinfer/data/spdlog/include/spdlog/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,036 copying build/lib/flashinfer/data/spdlog/include/spdlog/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,038 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,040 copying build/lib/flashinfer/data/spdlog/include/spdlog/fwd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,042 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,044 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:38,045 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:38,047 copying 
build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:38,049 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/argv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:38,051 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/env.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-24T15:42:38,052 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,055 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,057 copying build/lib/flashinfer/data/spdlog/include/spdlog/stopwatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-24T15:42:38,059 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/scripts 2026-04-24T15:42:38,060 copying build/lib/flashinfer/data/spdlog/scripts/extract_version.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/scripts 2026-04-24T15:42:38,062 copying build/lib/flashinfer/data/build_backend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-24T15:42:38,064 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass 2026-04-24T15:42:38,066 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test 2026-04-24T15:42:38,067 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/utils 2026-04-24T15:42:38,069 copying build/lib/flashinfer/data/cutlass/test/utils/test_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/utils 2026-04-24T15:42:38,072 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python 2026-04-24T15:42:38,073 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,074 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_complement.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,076 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,078 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_composition.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,080 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,082 copying build/lib/flashinfer/data/cutlass/test/python/pycute/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,084 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,086 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,088 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_coalesce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-24T15:42:38,090 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass 2026-04-24T15:42:38,091 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,092 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,095 copying 
build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,097 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,099 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,101 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,102 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,104 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,106 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,109 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,111 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,113 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,116 copying 
build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,118 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-24T15:42:38,120 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:38,122 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:38,125 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:38,127 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:38,129 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-24T15:42:38,131 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/installation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass 2026-04-24T15:42:38,134 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,135 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-24T15:42:38,137 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-24T15:42:38,139 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,141 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,143 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,145 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,148 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,150 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-24T15:42:38,153 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-24T15:42:38,154 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-24T15:42:38,158 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:38,159 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:38,162 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:38,164 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:38,166 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-24T15:42:38,169 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples 2026-04-24T15:42:38,171 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-24T15:42:38,172 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-24T15:42:38,175 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:38,176 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:38,178 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:38,180 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:38,182 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:38,184 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-24T15:42:38,187 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit 2026-04-24T15:42:38,188 
creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm 2026-04-24T15:42:38,189 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-24T15:42:38,191 copying build/lib/flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/unit/gemm/device 2026-04-24T15:42:38,194 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python 2026-04-24T15:42:38,195 copying build/lib/flashinfer/data/cutlass/python/setup_pycute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-24T15:42:38,197 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:38,199 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:38,200 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:38,202 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:38,204 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-24T15:42:38,206 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:38,207 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:38,209 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:38,212 copying 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:38,214 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:38,216 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-24T15:42:38,218 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:38,221 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,223 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-24T15:42:38,224 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-24T15:42:38,226 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-24T15:42:38,228 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,230 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,231 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 
2026-04-24T15:42:38,234 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,236 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,239 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,241 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,244 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,248 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-24T15:42:38,249 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-24T15:42:38,251 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,253 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,255 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,257 copying 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,259 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,261 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,263 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,266 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,268 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-24T15:42:38,270 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,272 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,273 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,275 copying 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,277 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,279 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,281 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,283 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,286 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,288 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,290 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,292 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,295 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,296 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-24T15:42:38,299 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,300 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,302 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,305 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,306 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,309 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,311 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,313 copying 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,315 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,318 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-24T15:42:38,320 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-24T15:42:38,323 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:38,324 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:38,326 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:38,328 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-24T15:42:38,330 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,332 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,335 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,337 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,339 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-24T15:42:38,341 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:38,343 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/shape.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:38,346 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-24T15:42:38,349 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:38,350 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:38,352 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:38,355 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:38,357 copying 
build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:38,360 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-24T15:42:38,364 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:38,366 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:38,368 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:38,371 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-24T15:42:38,375 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:38,376 copying build/lib/flashinfer/data/cutlass/python/pycute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:38,379 copying build/lib/flashinfer/data/cutlass/python/pycute/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:38,382 copying build/lib/flashinfer/data/cutlass/python/pycute/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:38,385 copying build/lib/flashinfer/data/cutlass/python/pycute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:38,387 copying build/lib/flashinfer/data/cutlass/python/pycute/int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-24T15:42:38,390 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,392 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,405 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,408 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,411 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,415 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,417 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,420 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,423 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/manifest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,426 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,429 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,433 copying 
build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,436 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,439 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,442 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,447 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,450 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,453 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/symm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,456 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,459 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-24T15:42:38,462 copying build/lib/flashinfer/data/cutlass/python/setup_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-24T15:42:38,465 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL 2026-04-24T15:42:38,467 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:38,469 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,470 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,473 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,476 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,479 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,482 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,485 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,488 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,492 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,494 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,497 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,500 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,502 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,505 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-24T15:42:38,507 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-24T15:42:38,510 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-24T15:42:38,513 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,516 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,519 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,522 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,525 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-24T15:42:38,528 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:38,531 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,533 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:38,535 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:38,537 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:38,540 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:38,543 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:38,545 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-24T15:42:38,547 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 
2026-04-24T15:42:38,550 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,553 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,554 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,557 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,559 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,561 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,564 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,566 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,567 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-24T15:42:38,570 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:38,571 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:38,574 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:38,576 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:38,578 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:38,581 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-24T15:42:38,584 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,587 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,589 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,592 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,594 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,597 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:38,598 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:38,600 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:38,602 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:38,605 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-24T15:42:38,607 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,610 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,613 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,616 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:38,617 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:38,619 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:38,621 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:38,623 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:38,625 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-24T15:42:38,628 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,630 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,632 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-24T15:42:38,637 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 
2026-04-24T15:42:38,638 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,641 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,643 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,645 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,647 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,649 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,651 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,654 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,656 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,658 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-24T15:42:38,661 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,662 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,664 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,666 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,668 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,670 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,674 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,676 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,678 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-24T15:42:38,680 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,682 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,685 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,688 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:38,689 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:38,691 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:38,693 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:38,695 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-24T15:42:38,697 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:38,699 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-24T15:42:38,701 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 
2026-04-24T15:42:38,704 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:38,705 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:38,708 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:38,710 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:38,712 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-24T15:42:38,716 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:38,717 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:38,720 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:38,722 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-24T15:42:38,725 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:38,726 copying 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:38,728 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:38,730 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-24T15:42:38,732 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:38,733 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:38,735 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:38,737 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:38,739 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:38,741 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-24T15:42:38,743 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 
2026-04-24T15:42:38,746 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,748 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,751 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,755 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,759 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,762 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-24T15:42:38,765 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,766 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,768 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,770 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,773 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,775 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,777 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-24T15:42:38,779 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:38,782 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:38,783 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:38,786 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:38,789 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:38,791 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-24T15:42:38,794 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,795 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,799 copying 
build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,802 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,804 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,807 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,808 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-24T15:42:38,811 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-24T15:42:38,813 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL 2026-04-24T15:42:38,815 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src 2026-04-24T15:42:38,817 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src/source 2026-04-24T15:42:38,818 copying build/lib/flashinfer/data/cutlass/python/docs_src/source/conf.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/docs_src/source 2026-04-24T15:42:38,820 copying build/lib/flashinfer/data/cutlass/python/setup_cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-24T15:42:38,823 
creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools 2026-04-24T15:42:38,825 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util 2026-04-24T15:42:38,826 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include 2026-04-24T15:42:38,828 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass 2026-04-24T15:42:38,830 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,831 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,833 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,835 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,838 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,840 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference 2026-04-24T15:42:38,842 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,843 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,846 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,849 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,851 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,853 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,856 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,858 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,861 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,864 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,866 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-24T15:42:38,867 copying 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-24T15:42:38,870 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:38,871 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:38,874 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:38,876 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-24T15:42:38,878 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,880 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-24T15:42:38,883 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,885 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,887 copying 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,890 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,893 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,895 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,898 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,900 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,902 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,905 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,907 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,910 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,912 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,915 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,917 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,919 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,922 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,924 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,927 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,929 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,931 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,933 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,935 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,937 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,941 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-24T15:42:38,944 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-24T15:42:38,945 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-24T15:42:38,947 copying 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-24T15:42:38,949 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,952 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,954 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,957 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,959 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,961 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,963 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,965 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,968 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,970 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,972 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,975 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,977 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,979 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,981 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,983 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,985 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,987 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,990 copying 
build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,992 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,994 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,997 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:38,999 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:39,001 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-24T15:42:39,004 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/scripts 2026-04-24T15:42:39,005 copying build/lib/flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/scripts 2026-04-24T15:42:39,008 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples 2026-04-24T15:42:39,009 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen 2026-04-24T15:42:39,011 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,012 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,014 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,017 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,019 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,021 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,023 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,025 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,027 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,029 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,032 copying 
build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,035 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,038 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-24T15:42:39,040 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:39,042 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:39,044 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:39,046 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-24T15:42:39,048 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:39,049 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:39,052 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:39,055 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-24T15:42:39,057 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python 2026-04-24T15:42:39,059 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL 2026-04-24T15:42:39,060 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental 2026-04-24T15:42:39,062 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-24T15:42:39,063 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-24T15:42:39,066 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:39,067 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:39,071 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:39,073 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:39,076 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:39,079 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-24T15:42:39,083 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,084 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,087 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,089 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,092 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,095 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,098 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,100 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,103 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 
2026-04-24T15:42:39,106 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,109 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,111 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,113 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,115 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-24T15:42:39,118 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:39,119 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:39,121 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:39,124 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:39,127 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-24T15:42:39,129 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-24T15:42:39,130 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-24T15:42:39,133 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-24T15:42:39,135 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-24T15:42:39,137 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,138 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,141 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,143 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,145 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,147 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,149 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py 
-> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,151 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,153 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-24T15:42:39,155 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-24T15:42:39,157 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-24T15:42:39,158 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-24T15:42:39,161 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-24T15:42:39,162 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-24T15:42:39,164 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-24T15:42:39,167 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:39,168 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:39,171 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:39,175 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:39,178 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-24T15:42:39,182 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,183 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,186 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,190 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,193 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,196 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,199 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,201 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-24T15:42:39,205 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:39,206 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:39,209 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:39,212 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:39,214 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-24T15:42:39,217 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,218 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,223 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,228 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:39,230 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:39,233 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:39,237 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:39,241 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-24T15:42:39,245 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:39,246 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:39,249 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:39,253 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:39,255 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:39,258 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-24T15:42:39,262 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,265 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,270 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:39,271 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:39,274 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:39,277 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:39,280 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:39,282 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-24T15:42:39,285 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,290 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,294 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,298 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,301 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,306 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:39,308 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:39,312 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:39,316 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-24T15:42:39,321 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:39,322 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:39,327 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:39,332 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-24T15:42:39,334 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,338 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,341 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,344 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,347 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,350 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,353 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,356 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:39,358 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:39,362 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:39,364 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-24T15:42:39,367 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:39,368 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:39,371 copying 
build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:39,374 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:39,378 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-24T15:42:39,381 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,384 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-24T15:42:39,388 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-24T15:42:39,389 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-24T15:42:39,391 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-24T15:42:39,394 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-24T15:42:39,395 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-24T15:42:39,398 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-24T15:42:39,400 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include 2026-04-24T15:42:39,402 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,403 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,407 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,408 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,410 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,412 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,452 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,454 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,459 copying build/lib/flashinfer/data/cutlass/include/cute/arch/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,461 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,464 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,466 copying build/lib/flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,469 copying build/lib/flashinfer/data/cutlass/include/cute/arch/util.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,471 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,479 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,482 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,484 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,502 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,504 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,507 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,510 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,513 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,515 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,518 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,521 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,524 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,526 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,528 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,531 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,539 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,583 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,585 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,587 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,590 copying build/lib/flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,592 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-24T15:42:39,609 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,611 copying build/lib/flashinfer/data/cutlass/include/cute/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,614 copying build/lib/flashinfer/data/cutlass/include/cute/stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,617 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_base.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,620 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,621 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,624 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,627 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,629 copying 
build/lib/flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,632 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,635 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,637 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,640 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/real.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,642 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/int.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-24T15:42:39,645 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,647 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_aligned.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,649 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,652 copying build/lib/flashinfer/data/cutlass/include/cute/container/cuda_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,655 copying build/lib/flashinfer/data/cutlass/include/cute/container/type_list.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,658 copying 
build/lib/flashinfer/data/cutlass/include/cute/container/bit_field.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,660 copying build/lib/flashinfer/data/cutlass/include/cute/container/alignment.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,663 copying build/lib/flashinfer/data/cutlass/include/cute/container/tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,666 copying build/lib/flashinfer/data/cutlass/include/cute/container/array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-24T15:42:39,668 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_zip.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,671 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,674 copying build/lib/flashinfer/data/cutlass/include/cute/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,678 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,680 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,683 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,685 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,688 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,691 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,694 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,696 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,700 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/clear.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,702 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,705 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,707 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,710 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,713 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/functional.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-24T15:42:39,716 copying build/lib/flashinfer/data/cutlass/include/cute/underscore.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,719 copying 
build/lib/flashinfer/data/cutlass/include/cute/int_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,723 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,724 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,741 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,745 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,753 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,756 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,759 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,761 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,763 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,777 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,780 copying 
build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,782 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,786 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,788 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,791 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,793 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,795 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,798 copying build/lib/flashinfer/data/cutlass/include/cute/atom/partitioner.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,800 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,807 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,809 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,812 
copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,815 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,818 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,820 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,822 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,827 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,830 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,832 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-24T15:42:39,835 copying build/lib/flashinfer/data/cutlass/include/cute/layout_composed.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,838 copying build/lib/flashinfer/data/cutlass/include/cute/tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,840 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_flagged.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,843 copying 
build/lib/flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,845 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_impl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,849 copying build/lib/flashinfer/data/cutlass/include/cute/pointer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-24T15:42:39,852 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,853 copying build/lib/flashinfer/data/cutlass/include/cute/util/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,856 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_latex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,859 copying build/lib/flashinfer/data/cutlass/include/cute/util/print.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,861 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,864 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_svg.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,866 copying build/lib/flashinfer/data/cutlass/include/cute/util/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-24T15:42:39,870 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:39,872 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/platform 2026-04-24T15:42:39,873 copying build/lib/flashinfer/data/cutlass/include/cutlass/platform/platform.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/platform 2026-04-24T15:42:39,876 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint128.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:39,879 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:39,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:39,884 copying build/lib/flashinfer/data/cutlass/include/cutlass/array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:39,888 copying build/lib/flashinfer/data/cutlass/include/cutlass/cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:39,891 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-24T15:42:39,894 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,895 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,903 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,906 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,908 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,911 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,914 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,916 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,924 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,929 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,932 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,934 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,937 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,940 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,942 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,945 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,947 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,950 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,952 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,964 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,966 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,971 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,976 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,979 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,983 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,986 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:39,990 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:39,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:39,996 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:39,999 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:40,002 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:40,006 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-24T15:42:40,009 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,012 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,017 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,020 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,024 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,032 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,036 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,047 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,052 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,055 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-24T15:42:40,058 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,060 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,063 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,066 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,069 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,072 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,074 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,078 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,085 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,088 
copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,091 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,093 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,095 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,097 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,100 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-24T15:42:40,105 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,106 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 
2026-04-24T15:42:40,110 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,113 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,115 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,120 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,123 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,125 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,130 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,133 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,135 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,148 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,150 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,153 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,155 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,157 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,160 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,162 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,165 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-24T15:42:40,168 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,170 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,173 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,176 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,180 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,184 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,187 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,189 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,192 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,195 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,198 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,202 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,205 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 
2026-04-24T15:42:40,206 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:40,209 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:40,211 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:40,214 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:40,216 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:40,221 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-24T15:42:40,223 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,230 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,233 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,235 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,238 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,241 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-24T15:42:40,245 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,247 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,250 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,257 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,260 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,263 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,269 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,273 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,276 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,280 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,284 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,287 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-24T15:42:40,290 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-24T15:42:40,294 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental 2026-04-24T15:42:40,296 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed 2026-04-24T15:42:40,298 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-24T15:42:40,300 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-24T15:42:40,303 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-24T15:42:40,306 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:40,308 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:40,310 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:40,313 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-24T15:42:40,316 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:40,318 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:40,320 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:40,322 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-24T15:42:40,325 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint256.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,328 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,332 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,335 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,337 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,340 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,343 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,345 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,348 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,350 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/arch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,353 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,355 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,358 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,361 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,364 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,366 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,369 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,374 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,377 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,380 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,385 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,388 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,391 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,396 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,402 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,404 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-24T15:42:40,406 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,411 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,414 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/subbyte_reference.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,420 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,423 copying build/lib/flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,425 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/real.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,428 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,476 copying build/lib/flashinfer/data/cutlass/include/cutlass/semaphore.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,478 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,481 copying build/lib/flashinfer/data/cutlass/include/cutlass/float_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,484 copying build/lib/flashinfer/data/cutlass/include/cutlass/aligned_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,486 copying build/lib/flashinfer/data/cutlass/include/cutlass/block_striped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,488 copying build/lib/flashinfer/data/cutlass/include/cutlass/constants.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/functional.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,495 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,506 copying build/lib/flashinfer/data/cutlass/include/cutlass/coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,508 copying build/lib/flashinfer/data/cutlass/include/cutlass/exmy_base.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,511 copying build/lib/flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,514 copying build/lib/flashinfer/data/cutlass/include/cutlass/integer_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,516 copying build/lib/flashinfer/data/cutlass/include/cutlass/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,519 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,520 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,522 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,527 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,530 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,533 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/permute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,540 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-24T15:42:40,542 copying build/lib/flashinfer/data/cutlass/include/cutlass/device_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:40,545 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:40,546 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:40,549 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,550 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,553 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,555 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,558 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,561 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,563 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,566 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,568 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,571 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,573 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,576 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,578 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,581 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,583 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,587 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,589 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,591 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,594 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,596 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,599 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,601 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,603 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,606 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,609 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,613 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,616 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,618 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,621 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,625 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,628 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,630 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,634 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,636 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,638 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,641 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,643 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,646 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,648 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,651 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,655 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-24T15:42:40,661 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,662 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,665 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,667 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,670 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,672 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 
2026-04-24T15:42:40,675 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,678 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,680 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,682 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,688 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,694 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,699 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,702 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,705 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,707 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,710 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,713 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,715 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,720 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,722 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,724 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 
2026-04-24T15:42:40,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,732 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,737 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,739 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,742 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,748 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,751 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,755 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,758 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,761 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-24T15:42:40,766 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,768 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,770 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,780 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,783 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,785 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,788 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 
2026-04-24T15:42:40,791 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,793 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,798 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,800 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,802 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,805 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,807 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,810 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,815 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,817 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,819 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,821 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,824 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,826 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,829 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,831 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,833 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,836 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,839 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,841 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-24T15:42:40,844 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:40,846 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:40,847 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:40,850 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:40,852 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:40,855 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-24T15:42:40,858 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,859 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,863 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,866 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,869 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,871 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,874 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,877 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,880 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,883 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,885 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,888 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,891 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,899 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,902 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,905 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,907 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,910 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,912 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,915 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,917 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,920 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,923 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,930 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,933 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,936 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,939 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,942 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,944 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,947 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,951 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,953 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:40,957 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 
2026-04-24T15:42:40,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,963 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,966 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,968 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,971 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,973 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,976 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,978 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,980 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,983 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,985 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,988 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,990 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,993 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,996 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:40,998 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,001 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,004 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,007 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,009 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,011 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,014 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,016 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,019 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,023 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-24T15:42:41,031 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,040 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,045 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,048 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,054 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,057 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,060 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,063 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,066 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 
2026-04-24T15:42:41,069 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,071 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,074 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,079 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,083 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,085 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-24T15:42:41,088 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:41,091 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-24T15:42:41,095 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,096 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,099 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,105 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,107 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,112 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,114 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,117 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,119 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,122 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,124 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,126 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,131 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,133 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,136 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,138 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,146 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,148 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,151 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,156 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,158 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,161 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,164 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,167 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,170 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,172 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,174 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,177 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,179 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,182 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,185 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,187 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,189 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,192 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,195 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,197 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,199 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,201 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,204 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,206 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,208 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,211 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,213 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,216 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,218 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,221 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,224 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,226 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,228 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,231 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,234 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,236 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,238 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,241 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,243 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,245 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,247 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,252 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 
2026-04-24T15:42:41,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,257 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,259 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,261 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,263 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,268 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,271 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,274 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,277 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,280 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,282 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,285 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,287 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,290 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,292 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,294 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,297 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,299 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,302 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,307 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,309 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,312 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,315 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,317 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,320 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,323 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 
2026-04-24T15:42:41,326 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,328 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,330 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,333 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,336 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,341 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,344 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,347 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,349 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,352 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,355 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,358 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,360 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,363 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,365 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,368 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,373 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,376 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,378 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,381 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,385 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-24T15:42:41,387 copying build/lib/flashinfer/data/cutlass/include/cutlass/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,390 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform 2026-04-24T15:42:41,392 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,393 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,396 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,401 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,404 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,406 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,411 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,414 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,417 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,419 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,422 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,430 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,433 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,436 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,438 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,441 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,443 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,447 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,449 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,452 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,454 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,456 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-24T15:42:41,460 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-24T15:42:41,461 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-24T15:42:41,464 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/device 
2026-04-24T15:42:41,465 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-24T15:42:41,468 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-24T15:42:41,469 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-24T15:42:41,471 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-24T15:42:41,474 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-24T15:42:41,475 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-24T15:42:41,478 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform 2026-04-24T15:42:41,481 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:41,483 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:41,486 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:41,489 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp 
-> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-24T15:42:41,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/wmma_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,496 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,498 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,500 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,503 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,506 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,509 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,512 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,515 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,518 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,527 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,530 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,533 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,536 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,539 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,542 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,545 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,547 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,550 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,553 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,556 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,559 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,562 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,565 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,569 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,574 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,582 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,584 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,587 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,589 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,591 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,594 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,596 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,599 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,601 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,603 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,606 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,608 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,610 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,612 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,615 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,617 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,619 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,622 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-24T15:42:41,625 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:41,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:41,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:41,631 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-24T15:42:41,633 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,636 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:41,637 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:41,639 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:41,642 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:41,644 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-24T15:42:41,647 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,650 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,652 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-24T15:42:41,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-24T15:42:41,657 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:41,658 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:41,660 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:41,663 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:41,666 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:41,668 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-24T15:42:41,670 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:41,671 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:41,674 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:41,676 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:41,678 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-24T15:42:41,681 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,683 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,687 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,688 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,690 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,693 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 
2026-04-24T15:42:41,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,698 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,700 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,703 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,706 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,708 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,711 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,714 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,716 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,718 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,720 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,723 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,725 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,732 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,734 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,736 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,738 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,741 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,751 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,754 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,757 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-24T15:42:41,760 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-24T15:42:41,762 copying build/lib/flashinfer/data/cutlass/include/cutlass/complex.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,766 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/thread 2026-04-24T15:42:41,767 copying build/lib/flashinfer/data/cutlass/include/cutlass/thread/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/thread 2026-04-24T15:42:41,769 copying build/lib/flashinfer/data/cutlass/include/cutlass/half.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,778 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_shape.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,780 copying build/lib/flashinfer/data/cutlass/include/cutlass/quaternion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,783 copying build/lib/flashinfer/data/cutlass/include/cutlass/float8.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,786 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,787 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,790 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,792 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,794 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,798 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,800 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,802 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,805 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:41,806 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:41,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:41,811 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-24T15:42:41,814 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,816 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,818 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-24T15:42:41,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/fast_math.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,827 copying build/lib/flashinfer/data/cutlass/include/cutlass/tfloat32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,832 copying build/lib/flashinfer/data/cutlass/include/cutlass/bfloat16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,835 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-24T15:42:41,836 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction 2026-04-24T15:42:41,839 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:41,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:41,842 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:41,844 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:41,847 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-24T15:42:41,849 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-24T15:42:41,850 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-24T15:42:41,853 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-24T15:42:41,856 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:41,857 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:41,859 copying 
build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:41,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:41,864 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-24T15:42:41,866 copying build/lib/flashinfer/data/cutlass/include/cutlass/relatively_equal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,871 copying build/lib/flashinfer/data/cutlass/include/cutlass/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,873 copying build/lib/flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,875 copying build/lib/flashinfer/data/cutlass/include/cutlass/predicate_vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,878 copying build/lib/flashinfer/data/cutlass/include/cutlass/core_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,880 copying build/lib/flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_coord.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,884 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_launch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,887 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:41,888 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:41,890 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:41,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-24T15:42:41,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/trace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-24T15:42:41,901 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include 2026-04-24T15:42:41,903 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer 2026-04-24T15:42:41,904 copying build/lib/flashinfer/data/include/flashinfer/topk.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,909 copying build/lib/flashinfer/data/include/flashinfer/fastdiv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,911 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/norm 2026-04-24T15:42:41,912 copying build/lib/flashinfer/data/include/flashinfer/norm/ln_fwd_silu_kernel.cuh -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/norm 2026-04-24T15:42:41,915 copying build/lib/flashinfer/data/include/flashinfer/norm/ln_silu_headers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/norm 2026-04-24T15:42:41,918 copying build/lib/flashinfer/data/include/flashinfer/math.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,920 copying build/lib/flashinfer/data/include/flashinfer/cubin_loader.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,923 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,924 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere 2026-04-24T15:42:41,926 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-24T15:42:41,927 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-24T15:42:41,930 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-24T15:42:41,932 copying build/lib/flashinfer/data/include/flashinfer/flat/unused.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,934 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/prefill 2026-04-24T15:42:41,936 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-24T15:42:41,938 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-24T15:42:41,940 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper 2026-04-24T15:42:41,942 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-24T15:42:41,943 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-24T15:42:41,946 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:41,947 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:41,950 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:41,951 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:41,954 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:41,956 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-24T15:42:41,960 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:41,961 copying 
build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:41,964 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:41,966 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:41,968 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-24T15:42:41,970 copying build/lib/flashinfer/data/include/flashinfer/flat/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,972 copying build/lib/flashinfer/data/include/flashinfer/flat/cute_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,974 copying build/lib/flashinfer/data/include/flashinfer/flat/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,975 copying build/lib/flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,978 copying build/lib/flashinfer/data/include/flashinfer/flat/common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,980 copying build/lib/flashinfer/data/include/flashinfer/flat/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-24T15:42:41,982 copying build/lib/flashinfer/data/include/flashinfer/fp16.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,985 copying build/lib/flashinfer/data/include/flashinfer/vec_dtypes.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,988 copying build/lib/flashinfer/data/include/flashinfer/cutlass_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,990 copying build/lib/flashinfer/data/include/flashinfer/allocator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,992 copying build/lib/flashinfer/data/include/flashinfer/activation.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,994 copying build/lib/flashinfer/data/include/flashinfer/topk_common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,996 copying build/lib/flashinfer/data/include/flashinfer/exception.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:41,997 copying build/lib/flashinfer/data/include/flashinfer/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,000 copying build/lib/flashinfer/data/include/flashinfer/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,002 copying build/lib/flashinfer/data/include/flashinfer/norm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,006 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,007 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,010 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 
2026-04-24T15:42:42,012 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,014 copying build/lib/flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,017 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,019 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,021 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,023 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,026 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,028 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,031 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,032 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,034 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,037 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,039 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,041 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,044 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,046 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,048 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,050 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,053 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,055 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,057 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,060 copying 
build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,062 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,065 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,067 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,069 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,072 copying build/lib/flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,075 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,077 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,078 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,082 copying build/lib/flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,084 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 
2026-04-24T15:42:42,087 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-24T15:42:42,089 copying build/lib/flashinfer/data/include/flashinfer/attention_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,090 copying build/lib/flashinfer/data/include/flashinfer/fast_topk_clusters_exact.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,093 copying build/lib/flashinfer/data/include/flashinfer/arch_condition.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,095 copying build/lib/flashinfer/data/include/flashinfer/sampling.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,099 copying build/lib/flashinfer/data/include/flashinfer/fp4_layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,101 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,102 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,105 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,109 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,112 copying build/lib/flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,115 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,118 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,121 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-24T15:42:42,123 copying build/lib/flashinfer/data/include/flashinfer/pos_enc.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,127 copying build/lib/flashinfer/data/include/flashinfer/layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,129 copying build/lib/flashinfer/data/include/flashinfer/air_top_p.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,132 copying build/lib/flashinfer/data/include/flashinfer/permuted_smem.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,134 copying build/lib/flashinfer/data/include/flashinfer/concat_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,137 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,138 copying build/lib/flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,141 copying build/lib/flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,143 copying build/lib/flashinfer/data/include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,146 copying 
build/lib/flashinfer/data/include/flashinfer/mamba/common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,148 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,150 copying build/lib/flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,152 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,155 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,158 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,161 copying build/lib/flashinfer/data/include/flashinfer/mamba/ssu_mtp_common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,163 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,166 copying build/lib/flashinfer/data/include/flashinfer/mamba/conversion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-24T15:42:42,168 copying build/lib/flashinfer/data/include/flashinfer/profiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,170 copying build/lib/flashinfer/data/include/flashinfer/mma.cuh -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,173 copying build/lib/flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,176 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,177 copying build/lib/flashinfer/data/include/flashinfer/attention/state.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,179 copying build/lib/flashinfer/data/include/flashinfer/attention/mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,182 copying build/lib/flashinfer/data/include/flashinfer/attention/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,184 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,187 copying build/lib/flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,190 copying build/lib/flashinfer/data/include/flashinfer/attention/heap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,192 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,193 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,196 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,199 copying 
build/lib/flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,201 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,204 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,206 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,208 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,211 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,214 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,216 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,218 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,221 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,223 copying 
build/lib/flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-24T15:42:42,226 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,227 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,229 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,232 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,235 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,237 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,240 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-24T15:42:42,243 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,246 copying build/lib/flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,249 copying build/lib/flashinfer/data/include/flashinfer/attention/mask.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,251 copying build/lib/flashinfer/data/include/flashinfer/attention/default_decode_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,253 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,256 copying build/lib/flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,258 copying build/lib/flashinfer/data/include/flashinfer/attention/decode.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,261 copying build/lib/flashinfer/data/include/flashinfer/attention/cascade.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,265 copying build/lib/flashinfer/data/include/flashinfer/attention/pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,268 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-24T15:42:42,269 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-24T15:42:42,272 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-24T15:42:42,275 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/device 
2026-04-24T15:42:42,276 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-24T15:42:42,279 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-24T15:42:42,282 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-24T15:42:42,283 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-24T15:42:42,287 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,289 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,292 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,296 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,299 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,302 copying 
build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,304 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,308 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,311 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-24T15:42:42,315 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,317 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,319 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,322 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,325 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,328 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,331 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,334 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,338 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-24T15:42:42,343 copying build/lib/flashinfer/data/include/flashinfer/attention/batch_pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,346 copying build/lib/flashinfer/data/include/flashinfer/attention/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,349 copying build/lib/flashinfer/data/include/flashinfer/attention/prefill.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,356 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,359 copying build/lib/flashinfer/data/include/flashinfer/attention/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,365 copying 
build/lib/flashinfer/data/include/flashinfer/attention/persistent_template.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-24T15:42:42,368 copying build/lib/flashinfer/data/include/flashinfer/cp_async.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,371 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm 2026-04-24T15:42:42,374 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,376 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,379 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,383 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,388 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,393 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,396 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,400 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,403 copying 
build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,406 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-24T15:42:42,410 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,412 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,416 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,419 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,423 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,426 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,429 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-24T15:42:42,433 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-24T15:42:42,435 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-24T15:42:42,438 copying 
build/lib/flashinfer/data/include/flashinfer/trtllm/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm 2026-04-24T15:42:42,443 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,445 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,449 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,454 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,457 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,460 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,463 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,467 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,470 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,473 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-24T15:42:42,477 copying 
build/lib/flashinfer/data/include/flashinfer/logging.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,480 copying build/lib/flashinfer/data/include/flashinfer/page.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-24T15:42:42,483 copying build/lib/flashinfer/data/build_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-24T15:42:42,489 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc 2026-04-24T15:42:42,490 copying build/lib/flashinfer/data/csrc/trtllm_batched_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,492 copying build/lib/flashinfer/data/csrc/norm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,495 copying build/lib/flashinfer/data/csrc/single_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,497 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,498 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,500 copying build/lib/flashinfer/data/csrc/fmha_v2_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,503 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,505 copying build/lib/flashinfer/data/csrc/pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,507 copying build/lib/flashinfer/data/csrc/tgv_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,509 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,511 copying 
build/lib/flashinfer/data/csrc/flashinfer_cascade_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,512 copying build/lib/flashinfer/data/csrc/trtllm_alltoall_prepare.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,515 copying build/lib/flashinfer/data/csrc/single_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,517 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,518 copying build/lib/flashinfer/data/csrc/vllm_custom_all_reduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,521 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,522 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,525 copying build/lib/flashinfer/data/csrc/flashinfer_fast_topk_clusters_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,527 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,529 copying build/lib/flashinfer/data/csrc/rope.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,532 copying build/lib/flashinfer/data/csrc/batch_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,534 copying build/lib/flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,536 copying build/lib/flashinfer/data/csrc/gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,537 copying build/lib/flashinfer/data/csrc/sampling_utils.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,539 copying build/lib/flashinfer/data/csrc/single_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,541 copying build/lib/flashinfer/data/csrc/group_gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,543 copying build/lib/flashinfer/data/csrc/tvm_ffi_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,545 copying build/lib/flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,547 copying build/lib/flashinfer/data/csrc/selective_state_update_kernel_inst.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,549 copying build/lib/flashinfer/data/csrc/batch_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,551 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,553 copying build/lib/flashinfer/data/csrc/batch_pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,555 copying build/lib/flashinfer/data/csrc/flashinfer_rope_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,557 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,559 copying build/lib/flashinfer/data/csrc/gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,560 copying build/lib/flashinfer/data/csrc/group_gemm_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,562 copying build/lib/flashinfer/data/csrc/blackwell_fmha_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,564 
copying build/lib/flashinfer/data/csrc/batch_decode_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,566 copying build/lib/flashinfer/data/csrc/selective_state_update.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,568 copying build/lib/flashinfer/data/csrc/fmhaReduction.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,570 copying build/lib/flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,572 copying build/lib/flashinfer/data/csrc/rmsnorm_silu.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,575 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe 2026-04-24T15:42:42,576 copying build/lib/flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-24T15:42:42,579 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:42,580 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:42,584 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:42,586 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:42,592 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-24T15:42:42,594 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend 
2026-04-24T15:42:42,595 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:42,598 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:42,601 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:42,604 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:42,606 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-24T15:42:42,609 copying build/lib/flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-24T15:42:42,612 copying build/lib/flashinfer/data/csrc/renorm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,614 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,617 copying build/lib/flashinfer/data/csrc/cutlass_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,618 copying build/lib/flashinfer/data/csrc/page.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,620 copying build/lib/flashinfer/data/csrc/single_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,623 copying build/lib/flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,625 copying build/lib/flashinfer/data/csrc/batch_attention_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,627 copying build/lib/flashinfer/data/csrc/runtime_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,628 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,630 copying build/lib/flashinfer/data/csrc/group_gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,632 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,633 copying build/lib/flashinfer/data/csrc/flashinfer_xqa_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,635 copying build/lib/flashinfer/data/csrc/flashinfer_quantization_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,637 copying build/lib/flashinfer/data/csrc/batch_pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,639 copying build/lib/flashinfer/data/csrc/selective_state_update_dtype_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,640 copying build/lib/flashinfer/data/csrc/trtllm_moe_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,643 copying build/lib/flashinfer/data/csrc/gdn_prefill_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,645 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,647 copying build/lib/flashinfer/data/csrc/batch_pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 
2026-04-24T15:42:42,650 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,652 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,654 copying build/lib/flashinfer/data/csrc/moe_utils_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,656 copying build/lib/flashinfer/data/csrc/concat_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,658 copying build/lib/flashinfer/data/csrc/flashinfer_rmsnorm_silu_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,660 copying build/lib/flashinfer/data/csrc/batch_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,662 copying build/lib/flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,664 copying build/lib/flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,666 copying build/lib/flashinfer/data/csrc/flashinfer_page_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,667 copying build/lib/flashinfer/data/csrc/batch_pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,669 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,672 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,676 copying build/lib/flashinfer/data/csrc/flashinfer_topk_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 
2026-04-24T15:42:42,678 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,680 copying build/lib/flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,683 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/xqa 2026-04-24T15:42:42,684 copying build/lib/flashinfer/data/csrc/xqa/barriers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,687 copying build/lib/flashinfer/data/csrc/xqa/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,689 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,691 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,696 copying build/lib/flashinfer/data/csrc/xqa/defines.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,698 copying build/lib/flashinfer/data/csrc/xqa/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,700 copying build/lib/flashinfer/data/csrc/xqa/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,703 copying build/lib/flashinfer/data/csrc/xqa/mhaUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,706 copying build/lib/flashinfer/data/csrc/xqa/gmma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,708 copying build/lib/flashinfer/data/csrc/xqa/mha.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,711 copying build/lib/flashinfer/data/csrc/xqa/mha_components.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,713 copying 
build/lib/flashinfer/data/csrc/xqa/ldgsts.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,715 copying build/lib/flashinfer/data/csrc/xqa/mha.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,721 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,723 copying build/lib/flashinfer/data/csrc/xqa/mha_stdheaders.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,726 copying build/lib/flashinfer/data/csrc/xqa/mha_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,731 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,733 copying build/lib/flashinfer/data/csrc/xqa/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,735 copying build/lib/flashinfer/data/csrc/xqa/xqa_wrapper.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,737 copying build/lib/flashinfer/data/csrc/xqa/tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,740 copying build/lib/flashinfer/data/csrc/xqa/hostUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,742 copying build/lib/flashinfer/data/csrc/xqa/cuda_hint.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,744 copying build/lib/flashinfer/data/csrc/xqa/specDec.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,746 copying build/lib/flashinfer/data/csrc/xqa/gmma_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-24T15:42:42,754 copying build/lib/flashinfer/data/csrc/bmm_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,756 copying 
build/lib/flashinfer/data/csrc/batch_decode_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,758 copying build/lib/flashinfer/data/csrc/batch_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,760 copying build/lib/flashinfer/data/csrc/pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,762 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,765 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,767 copying build/lib/flashinfer/data/csrc/batch_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,769 copying build/lib/flashinfer/data/csrc/batch_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:42,771 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal 2026-04-24T15:42:42,773 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp 2026-04-24T15:42:42,774 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:42,775 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:42,778 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:42,781 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:42,783 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:42,785 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-24T15:42:42,787 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-24T15:42:42,788 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-24T15:42:42,791 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm 2026-04-24T15:42:42,793 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions 2026-04-24T15:42:42,795 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include 2026-04-24T15:42:42,796 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,797 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,801 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue 2026-04-24T15:42:42,802 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-24T15:42:42,804 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-24T15:42:42,806 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-24T15:42:42,807 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-24T15:42:42,810 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-24T15:42:42,811 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-24T15:42:42,814 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-24T15:42:42,817 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,819 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:42,820 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:42,823 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:42,825 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:42,827 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:42,829 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-24T15:42:42,831 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,834 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 
2026-04-24T15:42:42,837 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication 2026-04-24T15:42:42,839 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-24T15:42:42,840 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-24T15:42:42,842 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,845 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm 2026-04-24T15:42:42,846 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,848 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,850 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,853 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,855 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,857 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,859 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,862 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,864 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,867 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,869 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,871 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,874 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-24T15:42:42,877 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:42,878 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:42,880 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:42,882 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-24T15:42:42,885 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,886 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,889 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,892 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,896 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 
2026-04-24T15:42:42,899 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,901 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:42,902 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:42,905 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:42,908 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-24T15:42:42,910 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,914 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,916 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,918 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,920 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-24T15:42:42,922 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,924 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,927 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,929 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,932 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,934 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,937 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,939 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,941 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 
2026-04-24T15:42:42,943 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,946 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,948 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,950 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-24T15:42:42,953 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform 2026-04-24T15:42:42,955 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-24T15:42:42,956 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-24T15:42:42,959 creating 
build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail 2026-04-24T15:42:42,960 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-24T15:42:42,962 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-24T15:42:42,965 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,967 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-24T15:42:42,969 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-24T15:42:42,970 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-24T15:42:42,973 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,974 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,976 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,978 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,980 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,982 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,985 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,986 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,989 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,991 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-24T15:42:42,993 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:42,995 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-24T15:42:42,996 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-24T15:42:42,999 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-24T15:42:43,002 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-24T15:42:43,003 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-24T15:42:43,005 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-24T15:42:43,007 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,009 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,011 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,013 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,016 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:43,017 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:43,018 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:43,022 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-24T15:42:43,023 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-24T15:42:43,026 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-24T15:42:43,028 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:43,030 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:43,034 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:43,036 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-24T15:42:43,038 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:43,040 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:43,043 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,044 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,046 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,047 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,050 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 
2026-04-24T15:42:43,052 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,053 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,055 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,057 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,059 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,060 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,062 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,064 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,066 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,068 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,069 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,071 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,074 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,076 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,077 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,079 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-24T15:42:43,081 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-24T15:42:43,082 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-24T15:42:43,085 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-24T15:42:43,087 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-24T15:42:43,090 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:43,091 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:43,094 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:43,097 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:43,098 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:43,100 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-24T15:42:43,103 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,104 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,107 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,110 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,111 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,113 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,115 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,116 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,118 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,121 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,123 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,124 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,126 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,128 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,130 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,133 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,134 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,136 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,138 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 
2026-04-24T15:42:43,140 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,142 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-24T15:42:43,145 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,146 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,148 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,150 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,152 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,154 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl 
-> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,159 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-24T15:42:43,162 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-24T15:42:43,163 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-24T15:42:43,165 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-24T15:42:43,168 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,170 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,172 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-24T15:42:43,175 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,176 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,179 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,182 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,184 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,186 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,188 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,190 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,193 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,198 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,201 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,205 copying 
build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-24T15:42:43,209 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,211 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,213 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,216 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,218 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,221 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,224 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,227 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-24T15:42:43,230 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include 2026-04-24T15:42:43,232 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm 2026-04-24T15:42:43,234 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 
2026-04-24T15:42:43,236 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,240 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,242 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,245 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,248 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,250 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,252 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,255 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,258 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,261 copying 
build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,264 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-24T15:42:43,266 copying build/lib/flashinfer/data/csrc/cascade.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,269 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,272 copying build/lib/flashinfer/data/csrc/topk.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,275 copying build/lib/flashinfer/data/csrc/single_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,277 copying build/lib/flashinfer/data/csrc/single_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,280 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,283 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,285 copying build/lib/flashinfer/data/csrc/batch_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,287 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,290 copying build/lib/flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,292 copying build/lib/flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 
2026-04-24T15:42:43,294 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,296 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,298 copying build/lib/flashinfer/data/csrc/dsv3_router_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,301 copying build/lib/flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,302 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,305 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,306 copying build/lib/flashinfer/data/csrc/single_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,308 copying build/lib/flashinfer/data/csrc/flashinfer_sampling_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,310 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,312 copying build/lib/flashinfer/data/csrc/sampling.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,314 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,316 copying build/lib/flashinfer/data/csrc/batch_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,319 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,321 copying 
build/lib/flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,323 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,325 copying build/lib/flashinfer/data/csrc/batch_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,328 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,329 copying build/lib/flashinfer/data/csrc/trtllm_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,332 copying build/lib/flashinfer/data/csrc/batch_attention_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,334 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,336 copying build/lib/flashinfer/data/csrc/flashinfer_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,338 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,341 copying build/lib/flashinfer/data/csrc/trtllm_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,343 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,345 copying build/lib/flashinfer/data/csrc/flashinfer_norm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,347 copying build/lib/flashinfer/data/csrc/fp4_kv_dequantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,349 copying build/lib/flashinfer/data/csrc/tinygemm2.cu -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,352 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,353 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,355 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,357 copying build/lib/flashinfer/data/csrc/logging.cc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,359 copying build/lib/flashinfer/data/csrc/group_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,361 copying build/lib/flashinfer/data/csrc/trtllm_fmha_v2_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,363 copying build/lib/flashinfer/data/csrc/batch_attention.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,366 copying build/lib/flashinfer/data/csrc/trtllm_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,368 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,370 copying build/lib/flashinfer/data/csrc/fmha_v2_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,372 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,373 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,376 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,378 
copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,381 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,383 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,386 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,388 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,390 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,393 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,395 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,398 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,400 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,403 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,405 copying 
build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,407 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:43,408 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:43,411 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:43,413 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:43,415 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-24T15:42:43,418 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,420 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,422 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,423 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,425 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,430 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,433 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gemm.h -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,435 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,438 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,441 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,444 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,448 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,452 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,455 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/mask.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,459 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,460 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,464 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,469 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,472 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,475 
copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,478 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,481 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,484 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,486 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,489 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,491 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,494 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,497 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,500 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,503 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,507 copying 
build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,509 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,512 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-24T15:42:43,514 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,517 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,521 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,523 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,526 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:43,528 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:43,531 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:43,534 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:43,537 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:43,540 copying 
build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-24T15:42:43,544 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,547 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,550 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,554 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-24T15:42:43,557 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,559 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-24T15:42:43,562 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,565 copying build/lib/flashinfer/data/csrc/pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,567 copying build/lib/flashinfer/data/csrc/single_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,569 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,572 copying build/lib/flashinfer/data/csrc/selective_state_update_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,574 copying build/lib/flashinfer/data/csrc/batch_mla_plan.cu -> 
build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,576 copying build/lib/flashinfer/data/csrc/tgv_gemm.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,578 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,580 copying build/lib/flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,584 copying build/lib/flashinfer/data/csrc/trtllm_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,586 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,589 copying build/lib/flashinfer/data/csrc/flashinfer_mamba_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,591 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,594 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,596 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,599 copying build/lib/flashinfer/data/csrc/batch_decode_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,601 copying build/lib/flashinfer/data/csrc/single_prefill_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,603 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,606 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 
2026-04-24T15:42:43,608 copying build/lib/flashinfer/data/csrc/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,611 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,613 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,615 copying build/lib/flashinfer/data/csrc/batch_decode_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,618 copying build/lib/flashinfer/data/csrc/pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,620 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,624 copying build/lib/flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,626 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,629 copying build/lib/flashinfer/data/csrc/batch_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,631 copying build/lib/flashinfer/data/csrc/fp4_kv_quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-24T15:42:43,635 copying build/lib/flashinfer/trtllm_low_latency_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,638 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm 2026-04-24T15:42:43,639 copying build/lib/flashinfer/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-24T15:42:43,642 copying build/lib/flashinfer/gemm/gemm_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-24T15:42:43,650 copying build/lib/flashinfer/gemm/routergemm.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-24T15:42:43,654 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm/kernels 2026-04-24T15:42:43,655 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm120.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-24T15:42:43,659 copying build/lib/flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-24T15:42:43,664 copying build/lib/flashinfer/gemm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-24T15:42:43,666 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-24T15:42:43,671 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-24T15:42:43,676 copying build/lib/flashinfer/gemm/kernels/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-24T15:42:43,679 creating build/bdist.linux-armv7l/wheel/flashinfer/profiler 2026-04-24T15:42:43,680 copying build/lib/flashinfer/profiler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/profiler 2026-04-24T15:42:43,683 copying build/lib/flashinfer/autotuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,687 copying build/lib/flashinfer/cuda_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,689 copying build/lib/flashinfer/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,692 copying build/lib/flashinfer/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,697 creating build/bdist.linux-armv7l/wheel/flashinfer/comm 2026-04-24T15:42:43,698 copying build/lib/flashinfer/comm/allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,701 copying build/lib/flashinfer/comm/dlpack_utils.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,704 copying build/lib/flashinfer/comm/mapping.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,706 copying build/lib/flashinfer/comm/trtllm_moe_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,708 copying build/lib/flashinfer/comm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,710 copying build/lib/flashinfer/comm/vllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,712 copying build/lib/flashinfer/comm/workspace_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,714 copying build/lib/flashinfer/comm/trtllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,717 copying build/lib/flashinfer/comm/nvshmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,719 copying build/lib/flashinfer/comm/mnnvl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,722 copying build/lib/flashinfer/comm/nvshmem_allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,724 copying build/lib/flashinfer/comm/trtllm_mnnvl_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,727 copying build/lib/flashinfer/comm/cuda_ipc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,729 copying build/lib/flashinfer/comm/trtllm_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-24T15:42:43,732 copying build/lib/flashinfer/concat_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,734 creating build/bdist.linux-armv7l/wheel/flashinfer/triton 2026-04-24T15:42:43,735 copying build/lib/flashinfer/triton/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,737 copying build/lib/flashinfer/triton/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 
2026-04-24T15:42:43,739 copying build/lib/flashinfer/triton/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,741 copying build/lib/flashinfer/triton/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,743 creating build/bdist.linux-armv7l/wheel/flashinfer/triton/kernels 2026-04-24T15:42:43,744 copying build/lib/flashinfer/triton/kernels/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,746 copying build/lib/flashinfer/triton/kernels/ssd_chunk_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,749 copying build/lib/flashinfer/triton/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,750 copying build/lib/flashinfer/triton/kernels/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,752 copying build/lib/flashinfer/triton/kernels/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,754 copying build/lib/flashinfer/triton/kernels/quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,756 copying build/lib/flashinfer/triton/kernels/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-24T15:42:43,758 copying build/lib/flashinfer/triton/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,761 copying build/lib/flashinfer/triton/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,762 copying build/lib/flashinfer/triton/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,764 copying build/lib/flashinfer/triton/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-24T15:42:43,766 copying build/lib/flashinfer/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,769 copying 
build/lib/flashinfer/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,772 copying build/lib/flashinfer/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,774 copying build/lib/flashinfer/artifacts.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,777 creating build/bdist.linux-armv7l/wheel/flashinfer/mla 2026-04-24T15:42:43,777 copying build/lib/flashinfer/mla/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mla 2026-04-24T15:42:43,779 copying build/lib/flashinfer/mla/_core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mla 2026-04-24T15:42:43,782 copying build/lib/flashinfer/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,784 copying build/lib/flashinfer/green_ctx.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,787 creating build/bdist.linux-armv7l/wheel/flashinfer/dsv3_ops 2026-04-24T15:42:43,788 copying build/lib/flashinfer/dsv3_ops/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/dsv3_ops 2026-04-24T15:42:43,790 copying build/lib/flashinfer/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,793 copying build/lib/flashinfer/aot.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,796 creating build/bdist.linux-armv7l/wheel/flashinfer/mamba 2026-04-24T15:42:43,797 copying build/lib/flashinfer/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-24T15:42:43,799 copying build/lib/flashinfer/mamba/ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-24T15:42:43,801 copying build/lib/flashinfer/mamba/ssd_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-24T15:42:43,806 copying build/lib/flashinfer/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-24T15:42:43,809 copying build/lib/flashinfer/mamba/ssd_combined.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-24T15:42:43,812 creating build/bdist.linux-armv7l/wheel/flashinfer/jit 2026-04-24T15:42:43,813 copying build/lib/flashinfer/jit/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,815 copying build/lib/flashinfer/jit/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,817 copying build/lib/flashinfer/jit/comm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,819 copying build/lib/flashinfer/jit/fp4_kv_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,821 copying build/lib/flashinfer/jit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,823 copying build/lib/flashinfer/jit/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,825 copying build/lib/flashinfer/jit/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,827 copying build/lib/flashinfer/jit/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,829 copying build/lib/flashinfer/jit/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,831 copying build/lib/flashinfer/jit/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,833 copying build/lib/flashinfer/jit/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,835 copying build/lib/flashinfer/jit/spdlog.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,837 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm 2026-04-24T15:42:43,838 copying build/lib/flashinfer/jit/gemm/fp8_blockscale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-24T15:42:43,840 copying build/lib/flashinfer/jit/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-24T15:42:43,842 copying build/lib/flashinfer/jit/gemm/deepgemm.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-24T15:42:43,845 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm/cutlass 2026-04-24T15:42:43,846 copying build/lib/flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-24T15:42:43,849 copying build/lib/flashinfer/jit/gemm/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-24T15:42:43,851 copying build/lib/flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-24T15:42:43,853 copying build/lib/flashinfer/jit/gemm/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-24T15:42:43,856 copying build/lib/flashinfer/jit/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,858 copying build/lib/flashinfer/jit/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,860 copying build/lib/flashinfer/jit/cpp_ext.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,862 copying build/lib/flashinfer/jit/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,863 copying build/lib/flashinfer/jit/fp4_kv_dequantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,865 copying build/lib/flashinfer/jit/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,867 copying build/lib/flashinfer/jit/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,868 copying build/lib/flashinfer/jit/env.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,870 copying build/lib/flashinfer/jit/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,872 copying build/lib/flashinfer/jit/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,874 copying build/lib/flashinfer/jit/tinygemm2.py 
-> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,876 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/mamba 2026-04-24T15:42:43,877 copying build/lib/flashinfer/jit/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-24T15:42:43,879 copying build/lib/flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-24T15:42:43,880 copying build/lib/flashinfer/jit/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-24T15:42:43,883 copying build/lib/flashinfer/jit/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,886 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention 2026-04-24T15:42:43,887 copying build/lib/flashinfer/jit/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-24T15:42:43,888 copying build/lib/flashinfer/jit/attention/modules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-24T15:42:43,891 copying build/lib/flashinfer/jit/attention/variants.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-24T15:42:43,894 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:43,895 copying build/lib/flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:43,897 copying build/lib/flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:43,900 copying build/lib/flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:43,908 copying build/lib/flashinfer/jit/attention/fmha_v2/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-24T15:42:43,911 copying build/lib/flashinfer/jit/attention/utils.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-24T15:42:43,913 copying build/lib/flashinfer/jit/cubin_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,915 copying build/lib/flashinfer/jit/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,917 copying build/lib/flashinfer/jit/dsv3_optimizations.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,919 copying build/lib/flashinfer/jit/rmsnorm_silu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,921 copying build/lib/flashinfer/jit/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-24T15:42:43,923 copying build/lib/flashinfer/__main__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,926 creating build/bdist.linux-armv7l/wheel/flashinfer/parallel_attention 2026-04-24T15:42:43,927 copying build/lib/flashinfer/parallel_attention/parallel_attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer/parallel_attention 2026-04-24T15:42:43,929 copying build/lib/flashinfer/parallel_attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/parallel_attention 2026-04-24T15:42:43,931 copying build/lib/flashinfer/parallel_attention/parallel_wrapper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/parallel_attention 2026-04-24T15:42:43,933 copying build/lib/flashinfer/parallel_attention/parallel_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/parallel_attention 2026-04-24T15:42:43,936 copying build/lib/flashinfer/parallel_attention/attention_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/parallel_attention 2026-04-24T15:42:43,938 copying build/lib/flashinfer/parallel_attention/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/parallel_attention 2026-04-24T15:42:43,940 copying build/lib/flashinfer/api_logging.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,944 copying build/lib/flashinfer/attention.py -> 
build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,946 copying build/lib/flashinfer/py.typed -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,948 copying build/lib/flashinfer/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,950 copying build/lib/flashinfer/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-24T15:42:43,953 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization 2026-04-24T15:42:43,954 copying build/lib/flashinfer/quantization/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-24T15:42:43,957 copying build/lib/flashinfer/quantization/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-24T15:42:43,959 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization/kernels 2026-04-24T15:42:43,960 copying build/lib/flashinfer/quantization/kernels/mxfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-24T15:42:43,963 copying build/lib/flashinfer/quantization/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-24T15:42:43,965 copying build/lib/flashinfer/quantization/kernels/nvfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-24T15:42:43,968 copying build/lib/flashinfer/quantization/kernels/mxfp8_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-24T15:42:43,971 copying build/lib/flashinfer/quantization/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-24T15:42:43,974 copying build/lib/flashinfer/quantization/packbits.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-24T15:42:43,977 copying build/lib/flashinfer/quantization/quantization_cute_dsl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-24T15:42:43,980 copying build/lib/build_utils.py -> 
build/bdist.linux-armv7l/wheel/. 2026-04-24T15:42:43,982 running install_egg_info 2026-04-24T15:42:43,994 running egg_info 2026-04-24T15:42:44,002 writing flashinfer_python.egg-info/PKG-INFO 2026-04-24T15:42:44,018 writing dependency_links to flashinfer_python.egg-info/dependency_links.txt 2026-04-24T15:42:44,020 writing entry points to flashinfer_python.egg-info/entry_points.txt 2026-04-24T15:42:44,022 writing requirements to flashinfer_python.egg-info/requires.txt 2026-04-24T15:42:44,024 writing top-level names to flashinfer_python.egg-info/top_level.txt 2026-04-24T15:42:44,823 reading manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-24T15:42:44,944 adding license file 'LICENSE' 2026-04-24T15:42:45,067 writing manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-24T15:42:45,072 Copying flashinfer_python.egg-info to build/bdist.linux-armv7l/wheel/./flashinfer_python-0.6.9-py3.11.egg-info 2026-04-24T15:42:45,086 running install_scripts 2026-04-24T15:42:45,098 creating build/bdist.linux-armv7l/wheel/flashinfer_python-0.6.9.dist-info/WHEEL 2026-04-24T15:42:45,100 creating '/tmp/pip-wheel-eodox3n_/.tmp-eiq8aqvw/flashinfer_python-0.6.9-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-24T15:42:45,102 adding 'build_backend.py' 2026-04-24T15:42:45,104 adding 'build_utils.py' 2026-04-24T15:42:45,107 adding 'flashinfer/__init__.py' 2026-04-24T15:42:45,109 adding 'flashinfer/__main__.py' 2026-04-24T15:42:45,111 adding 'flashinfer/_build_meta.py' 2026-04-24T15:42:45,112 adding 'flashinfer/activation.py' 2026-04-24T15:42:45,116 adding 'flashinfer/aot.py' 2026-04-24T15:42:45,123 adding 'flashinfer/api_logging.py' 2026-04-24T15:42:45,125 adding 'flashinfer/artifacts.py' 2026-04-24T15:42:45,127 adding 'flashinfer/attention.py' 2026-04-24T15:42:45,136 adding 'flashinfer/autotuner.py' 2026-04-24T15:42:45,140 adding 'flashinfer/cascade.py' 2026-04-24T15:42:45,142 adding 'flashinfer/compilation_context.py' 2026-04-24T15:42:45,143 
adding 'flashinfer/concat_ops.py' 2026-04-24T15:42:45,144 adding 'flashinfer/cuda_utils.py' 2026-04-24T15:42:45,155 adding 'flashinfer/decode.py' 2026-04-24T15:42:45,161 adding 'flashinfer/deep_gemm.py' 2026-04-24T15:42:45,162 adding 'flashinfer/fp4_quantization.py' 2026-04-24T15:42:45,163 adding 'flashinfer/fp8_quantization.py' 2026-04-24T15:42:45,166 adding 'flashinfer/gdn_decode.py' 2026-04-24T15:42:45,168 adding 'flashinfer/gdn_prefill.py' 2026-04-24T15:42:45,170 adding 'flashinfer/green_ctx.py' 2026-04-24T15:42:45,172 adding 'flashinfer/page.py' 2026-04-24T15:42:45,176 adding 'flashinfer/pod.py' 2026-04-24T15:42:45,194 adding 'flashinfer/prefill.py' 2026-04-24T15:42:45,196 adding 'flashinfer/py.typed' 2026-04-24T15:42:45,200 adding 'flashinfer/rope.py' 2026-04-24T15:42:45,207 adding 'flashinfer/sampling.py' 2026-04-24T15:42:45,211 adding 'flashinfer/sparse.py' 2026-04-24T15:42:45,213 adding 'flashinfer/tllm_enums.py' 2026-04-24T15:42:45,214 adding 'flashinfer/tllm_utils.py' 2026-04-24T15:42:45,217 adding 'flashinfer/topk.py' 2026-04-24T15:42:45,219 adding 'flashinfer/trtllm_low_latency_gemm.py' 2026-04-24T15:42:45,224 adding 'flashinfer/utils.py' 2026-04-24T15:42:45,225 adding 'flashinfer/version.py' 2026-04-24T15:42:45,228 adding 'flashinfer/xqa.py' 2026-04-24T15:42:45,230 adding 'flashinfer/comm/__init__.py' 2026-04-24T15:42:45,234 adding 'flashinfer/comm/allreduce.py' 2026-04-24T15:42:45,236 adding 'flashinfer/comm/cuda_ipc.py' 2026-04-24T15:42:45,238 adding 'flashinfer/comm/dlpack_utils.py' 2026-04-24T15:42:45,240 adding 'flashinfer/comm/mapping.py' 2026-04-24T15:42:45,246 adding 'flashinfer/comm/mnnvl.py' 2026-04-24T15:42:45,248 adding 'flashinfer/comm/nvshmem.py' 2026-04-24T15:42:45,249 adding 'flashinfer/comm/nvshmem_allreduce.py' 2026-04-24T15:42:45,251 adding 'flashinfer/comm/trtllm_alltoall.py' 2026-04-24T15:42:45,256 adding 'flashinfer/comm/trtllm_ar.py' 2026-04-24T15:42:45,259 adding 'flashinfer/comm/trtllm_mnnvl_ar.py' 2026-04-24T15:42:45,262 
adding 'flashinfer/comm/trtllm_moe_alltoall.py' 2026-04-24T15:42:45,264 adding 'flashinfer/comm/vllm_ar.py' 2026-04-24T15:42:45,266 adding 'flashinfer/comm/workspace_base.py' 2026-04-24T15:42:45,267 adding 'flashinfer/cudnn/__init__.py' 2026-04-24T15:42:45,269 adding 'flashinfer/cudnn/decode.py' 2026-04-24T15:42:45,272 adding 'flashinfer/cudnn/prefill.py' 2026-04-24T15:42:45,274 adding 'flashinfer/cudnn/utils.py' 2026-04-24T15:42:45,275 adding 'flashinfer/cute_dsl/__init__.py' 2026-04-24T15:42:45,280 adding 'flashinfer/cute_dsl/add_rmsnorm_fp4quant.py' 2026-04-24T15:42:45,282 adding 'flashinfer/cute_dsl/blockscaled_gemm.py' 2026-04-24T15:42:45,286 adding 'flashinfer/cute_dsl/fp4_common.py' 2026-04-24T15:42:45,294 adding 'flashinfer/cute_dsl/gemm_allreduce_two_shot.py' 2026-04-24T15:42:45,298 adding 'flashinfer/cute_dsl/rmsnorm_fp4quant.py' 2026-04-24T15:42:45,301 adding 'flashinfer/cute_dsl/utils.py' 2026-04-24T15:42:45,303 adding 'flashinfer/cute_dsl/attention/__init__.py' 2026-04-24T15:42:45,306 adding 'flashinfer/cute_dsl/attention/collective_builder.py' 2026-04-24T15:42:45,308 adding 'flashinfer/cute_dsl/attention/compat.py' 2026-04-24T15:42:45,309 adding 'flashinfer/cute_dsl/attention/config.py' 2026-04-24T15:42:45,311 adding 'flashinfer/cute_dsl/attention/mainloop_spec.py' 2026-04-24T15:42:45,313 adding 'flashinfer/cute_dsl/attention/mla_config.py' 2026-04-24T15:42:45,316 adding 'flashinfer/cute_dsl/attention/mla_decode.py' 2026-04-24T15:42:45,319 adding 'flashinfer/cute_dsl/attention/mla_decode_fp8.py' 2026-04-24T15:42:45,321 adding 'flashinfer/cute_dsl/attention/mla_warp_schedule.py' 2026-04-24T15:42:45,323 adding 'flashinfer/cute_dsl/attention/pipeline_topology.py' 2026-04-24T15:42:45,326 adding 'flashinfer/cute_dsl/attention/prefill.py' 2026-04-24T15:42:45,327 adding 'flashinfer/cute_dsl/attention/tmem_layout.py' 2026-04-24T15:42:45,329 adding 'flashinfer/cute_dsl/attention/warp_schedule.py' 2026-04-24T15:42:45,331 adding 
'flashinfer/cute_dsl/attention/fusion/__init__.py' 2026-04-24T15:42:45,332 adding 'flashinfer/cute_dsl/attention/fusion/mask.py' 2026-04-24T15:42:45,335 adding 'flashinfer/cute_dsl/attention/fusion/variant.py' 2026-04-24T15:42:45,337 adding 'flashinfer/cute_dsl/attention/roles/__init__.py' 2026-04-24T15:42:45,339 adding 'flashinfer/cute_dsl/attention/roles/correction.py' 2026-04-24T15:42:45,341 adding 'flashinfer/cute_dsl/attention/roles/epilogue.py' 2026-04-24T15:42:45,343 adding 'flashinfer/cute_dsl/attention/roles/loader_tma.py' 2026-04-24T15:42:45,346 adding 'flashinfer/cute_dsl/attention/roles/mla_compute.py' 2026-04-24T15:42:45,348 adding 'flashinfer/cute_dsl/attention/roles/mla_correction.py' 2026-04-24T15:42:45,351 adding 'flashinfer/cute_dsl/attention/roles/mla_loader.py' 2026-04-24T15:42:45,353 adding 'flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py' 2026-04-24T15:42:45,356 adding 'flashinfer/cute_dsl/attention/roles/mla_mma.py' 2026-04-24T15:42:45,358 adding 'flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py' 2026-04-24T15:42:45,359 adding 'flashinfer/cute_dsl/attention/roles/mla_pt_loader.py' 2026-04-24T15:42:45,362 adding 'flashinfer/cute_dsl/attention/roles/mma.py' 2026-04-24T15:42:45,365 adding 'flashinfer/cute_dsl/attention/roles/softmax.py' 2026-04-24T15:42:45,366 adding 'flashinfer/cute_dsl/attention/roles/softmax_math.py' 2026-04-24T15:42:45,368 adding 'flashinfer/cute_dsl/attention/scheduler/__init__.py' 2026-04-24T15:42:45,370 adding 'flashinfer/cute_dsl/attention/scheduler/mla_persistent.py' 2026-04-24T15:42:45,371 adding 'flashinfer/cute_dsl/attention/scheduler/persistent.py' 2026-04-24T15:42:45,373 adding 'flashinfer/cute_dsl/attention/wrappers/__init__.py' 2026-04-24T15:42:45,376 adding 'flashinfer/cute_dsl/attention/wrappers/batch_mla.py' 2026-04-24T15:42:45,379 adding 'flashinfer/cute_dsl/attention/wrappers/batch_prefill.py' 2026-04-24T15:42:45,381 adding 'flashinfer/data/build_backend.py' 2026-04-24T15:42:45,382 adding 
'flashinfer/data/build_utils.py' 2026-04-24T15:42:45,388 adding 'flashinfer/data/csrc/batch_attention.cu' 2026-04-24T15:42:45,389 adding 'flashinfer/data/csrc/batch_attention_customize_config.jinja' 2026-04-24T15:42:45,390 adding 'flashinfer/data/csrc/batch_attention_jit_binding.cu' 2026-04-24T15:42:45,392 adding 'flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja' 2026-04-24T15:42:45,393 adding 'flashinfer/data/csrc/batch_decode.cu' 2026-04-24T15:42:45,395 adding 'flashinfer/data/csrc/batch_decode_customize_config.jinja' 2026-04-24T15:42:45,396 adding 'flashinfer/data/csrc/batch_decode_jit_binding.cu' 2026-04-24T15:42:45,397 adding 'flashinfer/data/csrc/batch_decode_kernel_inst.jinja' 2026-04-24T15:42:45,399 adding 'flashinfer/data/csrc/batch_decode_mla_binding.cu' 2026-04-24T15:42:45,400 adding 'flashinfer/data/csrc/batch_decode_mla_config.jinja' 2026-04-24T15:42:45,401 adding 'flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu' 2026-04-24T15:42:45,402 adding 'flashinfer/data/csrc/batch_decode_mla_plan.cu' 2026-04-24T15:42:45,404 adding 'flashinfer/data/csrc/batch_decode_mla_run.cu' 2026-04-24T15:42:45,405 adding 'flashinfer/data/csrc/batch_mla_binding.cu' 2026-04-24T15:42:45,407 adding 'flashinfer/data/csrc/batch_mla_config.jinja' 2026-04-24T15:42:45,408 adding 'flashinfer/data/csrc/batch_mla_plan.cu' 2026-04-24T15:42:45,409 adding 'flashinfer/data/csrc/batch_mla_run.cu' 2026-04-24T15:42:45,411 adding 'flashinfer/data/csrc/batch_mla_sm90_binding.cu' 2026-04-24T15:42:45,412 adding 'flashinfer/data/csrc/batch_mla_sm90_plan.cu' 2026-04-24T15:42:45,414 adding 'flashinfer/data/csrc/batch_mla_sm90_run.cu' 2026-04-24T15:42:45,416 adding 'flashinfer/data/csrc/batch_pod.cu' 2026-04-24T15:42:45,417 adding 'flashinfer/data/csrc/batch_pod_customize_config.jinja' 2026-04-24T15:42:45,419 adding 'flashinfer/data/csrc/batch_pod_jit_binding.cu' 2026-04-24T15:42:45,420 adding 'flashinfer/data/csrc/batch_pod_kernel_inst.jinja' 2026-04-24T15:42:45,422 adding 
'flashinfer/data/csrc/batch_prefill.cu' 2026-04-24T15:42:45,423 adding 'flashinfer/data/csrc/batch_prefill_customize_config.jinja' 2026-04-24T15:42:45,424 adding 'flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,425 adding 'flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,427 adding 'flashinfer/data/csrc/batch_prefill_fp8_sm90.cu' 2026-04-24T15:42:45,429 adding 'flashinfer/data/csrc/batch_prefill_jit_binding.cu' 2026-04-24T15:42:45,430 adding 'flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja' 2026-04-24T15:42:45,431 adding 'flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,432 adding 'flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja' 2026-04-24T15:42:45,434 adding 'flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,435 adding 'flashinfer/data/csrc/batch_prefill_sm90.cu' 2026-04-24T15:42:45,437 adding 'flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja' 2026-04-24T15:42:45,438 adding 'flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu' 2026-04-24T15:42:45,440 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.cu' 2026-04-24T15:42:45,441 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.jinja' 2026-04-24T15:42:45,442 adding 'flashinfer/data/csrc/blackwell_fmha_plan.cu' 2026-04-24T15:42:45,444 adding 'flashinfer/data/csrc/bmm_fp8.cu' 2026-04-24T15:42:45,445 adding 'flashinfer/data/csrc/cascade.cu' 2026-04-24T15:42:45,447 adding 'flashinfer/data/csrc/concat_mla.cu' 2026-04-24T15:42:45,452 adding 'flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu' 2026-04-24T15:42:45,454 adding 'flashinfer/data/csrc/cudnn_sdpa_utils.h' 2026-04-24T15:42:45,456 adding 'flashinfer/data/csrc/cutlass_mla.cu' 2026-04-24T15:42:45,457 adding 'flashinfer/data/csrc/dsv3_router_gemm.cu' 2026-04-24T15:42:45,459 adding 'flashinfer/data/csrc/flashinfer_cascade_binding.cu' 2026-04-24T15:42:45,460 adding 
'flashinfer/data/csrc/flashinfer_fast_topk_clusters_binding.cu' 2026-04-24T15:42:45,462 adding 'flashinfer/data/csrc/flashinfer_gemm_binding.cu' 2026-04-24T15:42:45,463 adding 'flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu' 2026-04-24T15:42:45,465 adding 'flashinfer/data/csrc/flashinfer_mamba_binding.cu' 2026-04-24T15:42:45,466 adding 'flashinfer/data/csrc/flashinfer_mla_binding.cu' 2026-04-24T15:42:45,467 adding 'flashinfer/data/csrc/flashinfer_norm_binding.cu' 2026-04-24T15:42:45,469 adding 'flashinfer/data/csrc/flashinfer_page_binding.cu' 2026-04-24T15:42:45,470 adding 'flashinfer/data/csrc/flashinfer_quantization_binding.cu' 2026-04-24T15:42:45,471 adding 'flashinfer/data/csrc/flashinfer_rmsnorm_silu_binding.cu' 2026-04-24T15:42:45,472 adding 'flashinfer/data/csrc/flashinfer_rope_binding.cu' 2026-04-24T15:42:45,474 adding 'flashinfer/data/csrc/flashinfer_sampling_binding.cu' 2026-04-24T15:42:45,475 adding 'flashinfer/data/csrc/flashinfer_topk_binding.cu' 2026-04-24T15:42:45,476 adding 'flashinfer/data/csrc/flashinfer_xqa_binding.cu' 2026-04-24T15:42:45,478 adding 'flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc' 2026-04-24T15:42:45,480 adding 'flashinfer/data/csrc/fmhaReduction.cu' 2026-04-24T15:42:45,482 adding 'flashinfer/data/csrc/fmha_cutlass_sm100.cu' 2026-04-24T15:42:45,483 adding 'flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu' 2026-04-24T15:42:45,485 adding 'flashinfer/data/csrc/fmha_v2_jit_binding.cu' 2026-04-24T15:42:45,488 adding 'flashinfer/data/csrc/fmha_v2_run.cu' 2026-04-24T15:42:45,490 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.cu' 2026-04-24T15:42:45,491 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.jinja' 2026-04-24T15:42:45,493 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu' 2026-04-24T15:42:45,494 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja' 2026-04-24T15:42:45,496 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu' 2026-04-24T15:42:45,497 adding 
'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja' 2026-04-24T15:42:45,499 adding 'flashinfer/data/csrc/fp4_kv_dequantization.cu' 2026-04-24T15:42:45,501 adding 'flashinfer/data/csrc/fp4_kv_quantization.cu' 2026-04-24T15:42:45,503 adding 'flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu' 2026-04-24T15:42:45,504 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.cu' 2026-04-24T15:42:45,506 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.jinja' 2026-04-24T15:42:45,507 adding 'flashinfer/data/csrc/gdn_prefill_launcher.cu' 2026-04-24T15:42:45,509 adding 'flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,511 adding 'flashinfer/data/csrc/gemm_groupwise_sm100.cu' 2026-04-24T15:42:45,512 adding 'flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja' 2026-04-24T15:42:45,514 adding 'flashinfer/data/csrc/gemm_groupwise_sm120.cu' 2026-04-24T15:42:45,515 adding 'flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja' 2026-04-24T15:42:45,516 adding 'flashinfer/data/csrc/gemm_sm100_binding.cu' 2026-04-24T15:42:45,518 adding 'flashinfer/data/csrc/gemm_sm120_binding.cu' 2026-04-24T15:42:45,519 adding 'flashinfer/data/csrc/group_gemm.cu' 2026-04-24T15:42:45,521 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu' 2026-04-24T15:42:45,522 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja' 2026-04-24T15:42:45,524 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu' 2026-04-24T15:42:45,525 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja' 2026-04-24T15:42:45,527 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu' 2026-04-24T15:42:45,528 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja' 2026-04-24T15:42:45,530 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120.cu' 2026-04-24T15:42:45,531 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja' 2026-04-24T15:42:45,533 adding 
'flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120.cu' 2026-04-24T15:42:45,534 adding 'flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja' 2026-04-24T15:42:45,536 adding 'flashinfer/data/csrc/group_gemm_sm100_binding.cu' 2026-04-24T15:42:45,537 adding 'flashinfer/data/csrc/group_gemm_sm120_binding.cu' 2026-04-24T15:42:45,538 adding 'flashinfer/data/csrc/group_gemm_sm90.cu' 2026-04-24T15:42:45,540 adding 'flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,541 adding 'flashinfer/data/csrc/logging.cc' 2026-04-24T15:42:45,543 adding 'flashinfer/data/csrc/moe_utils_binding.cu' 2026-04-24T15:42:45,545 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.cu' 2026-04-24T15:42:45,546 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja' 2026-04-24T15:42:45,548 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.cu' 2026-04-24T15:42:45,550 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.jinja' 2026-04-24T15:42:45,551 adding 'flashinfer/data/csrc/norm.cu' 2026-04-24T15:42:45,553 adding 'flashinfer/data/csrc/page.cu' 2026-04-24T15:42:45,555 adding 'flashinfer/data/csrc/pod.cu' 2026-04-24T15:42:45,556 adding 'flashinfer/data/csrc/pod_customize_config.jinja' 2026-04-24T15:42:45,557 adding 'flashinfer/data/csrc/pod_jit_binding.cu' 2026-04-24T15:42:45,559 adding 'flashinfer/data/csrc/pod_kernel_inst.jinja' 2026-04-24T15:42:45,560 adding 'flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu' 2026-04-24T15:42:45,562 adding 'flashinfer/data/csrc/quantization.cu' 2026-04-24T15:42:45,563 adding 'flashinfer/data/csrc/renorm.cu' 2026-04-24T15:42:45,565 adding 'flashinfer/data/csrc/rmsnorm_silu.cu' 2026-04-24T15:42:45,567 adding 'flashinfer/data/csrc/rope.cu' 2026-04-24T15:42:45,569 adding 'flashinfer/data/csrc/runtime_utils.h' 2026-04-24T15:42:45,571 adding 'flashinfer/data/csrc/sampling.cu' 2026-04-24T15:42:45,572 adding 'flashinfer/data/csrc/sampling_utils.h' 2026-04-24T15:42:45,575 adding 
'flashinfer/data/csrc/selective_state_update.cu' 2026-04-24T15:42:45,576 adding 'flashinfer/data/csrc/selective_state_update_customize_config.jinja' 2026-04-24T15:42:45,578 adding 'flashinfer/data/csrc/selective_state_update_dtype_inst.jinja' 2026-04-24T15:42:45,579 adding 'flashinfer/data/csrc/selective_state_update_kernel_inst.cu' 2026-04-24T15:42:45,580 adding 'flashinfer/data/csrc/seq_chunk_cumsum.cu' 2026-04-24T15:42:45,582 adding 'flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu' 2026-04-24T15:42:45,583 adding 'flashinfer/data/csrc/single_decode.cu' 2026-04-24T15:42:45,584 adding 'flashinfer/data/csrc/single_decode_customize_config.jinja' 2026-04-24T15:42:45,586 adding 'flashinfer/data/csrc/single_decode_jit_binding.cu' 2026-04-24T15:42:45,587 adding 'flashinfer/data/csrc/single_decode_kernel_inst.jinja' 2026-04-24T15:42:45,588 adding 'flashinfer/data/csrc/single_prefill.cu' 2026-04-24T15:42:45,590 adding 'flashinfer/data/csrc/single_prefill_customize_config.jinja' 2026-04-24T15:42:45,591 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90.cu' 2026-04-24T15:42:45,592 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,594 adding 'flashinfer/data/csrc/single_prefill_jit_binding.cu' 2026-04-24T15:42:45,595 adding 'flashinfer/data/csrc/single_prefill_kernel_inst.jinja' 2026-04-24T15:42:45,596 adding 'flashinfer/data/csrc/single_prefill_sm90.cu' 2026-04-24T15:42:45,598 adding 'flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja' 2026-04-24T15:42:45,599 adding 'flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu' 2026-04-24T15:42:45,600 adding 'flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja' 2026-04-24T15:42:45,602 adding 'flashinfer/data/csrc/tgv_gemm.cu' 2026-04-24T15:42:45,603 adding 'flashinfer/data/csrc/tgv_gemm.jinja' 2026-04-24T15:42:45,606 adding 'flashinfer/data/csrc/tinygemm2.cu' 2026-04-24T15:42:45,608 adding 'flashinfer/data/csrc/topk.cu' 2026-04-24T15:42:45,610 adding 
'flashinfer/data/csrc/trtllm_allreduce.cu' 2026-04-24T15:42:45,611 adding 'flashinfer/data/csrc/trtllm_allreduce_fusion.cu' 2026-04-24T15:42:45,614 adding 'flashinfer/data/csrc/trtllm_alltoall.cu' 2026-04-24T15:42:45,617 adding 'flashinfer/data/csrc/trtllm_alltoall_prepare.cu' 2026-04-24T15:42:45,620 adding 'flashinfer/data/csrc/trtllm_batched_gemm_runner.cu' 2026-04-24T15:42:45,623 adding 'flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu' 2026-04-24T15:42:45,626 adding 'flashinfer/data/csrc/trtllm_fmha_v2_binding.cu' 2026-04-24T15:42:45,635 adding 'flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu' 2026-04-24T15:42:45,639 adding 'flashinfer/data/csrc/trtllm_fused_moe_runner.cu' 2026-04-24T15:42:45,642 adding 'flashinfer/data/csrc/trtllm_gemm_runner.cu' 2026-04-24T15:42:45,644 adding 'flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu' 2026-04-24T15:42:45,645 adding 'flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu' 2026-04-24T15:42:45,647 adding 'flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu' 2026-04-24T15:42:45,649 adding 'flashinfer/data/csrc/trtllm_moe_alltoall.cu' 2026-04-24T15:42:45,651 adding 'flashinfer/data/csrc/tvm_ffi_utils.h' 2026-04-24T15:42:45,652 adding 'flashinfer/data/csrc/vllm_custom_all_reduce.cu' 2026-04-24T15:42:45,655 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h' 2026-04-24T15:42:45,657 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h' 2026-04-24T15:42:45,658 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h' 2026-04-24T15:42:45,660 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h' 2026-04-24T15:42:45,662 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h' 2026-04-24T15:42:45,664 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h' 2026-04-24T15:42:45,666 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h' 2026-04-24T15:42:45,669 adding 
'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h' 2026-04-24T15:42:45,671 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h' 2026-04-24T15:42:45,673 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h' 2026-04-24T15:42:45,675 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h' 2026-04-24T15:42:45,680 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h' 2026-04-24T15:42:45,682 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h' 2026-04-24T15:42:45,684 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h' 2026-04-24T15:42:45,686 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h' 2026-04-24T15:42:45,688 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h' 2026-04-24T15:42:45,691 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h' 2026-04-24T15:42:45,694 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h' 2026-04-24T15:42:45,696 adding 'flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h' 2026-04-24T15:42:45,701 adding 'flashinfer/data/csrc/fmha_v2/fmha/fragment.h' 2026-04-24T15:42:45,702 adding 'flashinfer/data/csrc/fmha_v2/fmha/gemm.h' 2026-04-24T15:42:45,704 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h' 2026-04-24T15:42:45,708 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h' 2026-04-24T15:42:45,711 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h' 2026-04-24T15:42:45,713 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h' 2026-04-24T15:42:45,717 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h' 2026-04-24T15:42:45,720 adding 'flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h' 2026-04-24T15:42:45,722 adding 'flashinfer/data/csrc/fmha_v2/fmha/mask.h' 2026-04-24T15:42:45,724 
adding 'flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h' 2026-04-24T15:42:45,725 adding 'flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h' 2026-04-24T15:42:45,730 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h' 2026-04-24T15:42:45,735 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h' 2026-04-24T15:42:45,737 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h' 2026-04-24T15:42:45,740 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h' 2026-04-24T15:42:45,751 adding 'flashinfer/data/csrc/fmha_v2/fmha/softmax.h' 2026-04-24T15:42:45,755 adding 'flashinfer/data/csrc/fmha_v2/fmha/traits.h' 2026-04-24T15:42:45,761 adding 'flashinfer/data/csrc/fmha_v2/fmha/utils.h' 2026-04-24T15:42:45,764 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h' 2026-04-24T15:42:45,767 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h' 2026-04-24T15:42:45,769 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h' 2026-04-24T15:42:45,773 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h' 2026-04-24T15:42:45,775 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h' 2026-04-24T15:42:45,777 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h' 2026-04-24T15:42:45,779 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h' 2026-04-24T15:42:45,786 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h' 2026-04-24T15:42:45,788 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h' 2026-04-24T15:42:45,790 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h' 2026-04-24T15:42:45,791 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h' 2026-04-24T15:42:45,793 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h' 2026-04-24T15:42:45,795 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h' 2026-04-24T15:42:45,798 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h' 2026-04-24T15:42:45,800 adding 
'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h' 2026-04-24T15:42:45,805 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h' 2026-04-24T15:42:45,807 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h' 2026-04-24T15:42:45,808 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h' 2026-04-24T15:42:45,811 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h' 2026-04-24T15:42:45,814 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h' 2026-04-24T15:42:45,818 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h' 2026-04-24T15:42:45,822 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h' 2026-04-24T15:42:45,825 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h' 2026-04-24T15:42:45,827 adding 'flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja' 2026-04-24T15:42:45,829 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel.jinja' 2026-04-24T15:42:45,831 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja' 2026-04-24T15:42:45,833 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja' 2026-04-24T15:42:45,836 adding 'flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh' 2026-04-24T15:42:45,839 adding 'flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu' 2026-04-24T15:42:45,841 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu' 2026-04-24T15:42:45,865 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh' 2026-04-24T15:42:45,868 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu' 2026-04-24T15:42:45,873 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu' 2026-04-24T15:42:45,879 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu' 2026-04-24T15:42:45,881 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu' 2026-04-24T15:42:45,886 adding 
'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu' 2026-04-24T15:42:45,890 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu' 2026-04-24T15:42:45,894 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu' 2026-04-24T15:42:45,897 adding 'flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp' 2026-04-24T15:42:45,898 adding 'flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp' 2026-04-24T15:42:45,902 adding 'flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu' 2026-04-24T15:42:45,904 adding 'flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp' 2026-04-24T15:42:45,905 adding 'flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp' 2026-04-24T15:42:45,908 adding 'flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu' 2026-04-24T15:42:45,911 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h' 2026-04-24T15:42:45,912 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h' 2026-04-24T15:42:45,914 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h' 2026-04-24T15:42:45,915 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h' 2026-04-24T15:42:45,916 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h' 2026-04-24T15:42:45,920 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h' 2026-04-24T15:42:45,922 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h' 2026-04-24T15:42:45,923 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h' 2026-04-24T15:42:45,925 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h' 2026-04-24T15:42:45,927 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h' 2026-04-24T15:42:45,928 adding 
'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h' 2026-04-24T15:42:45,930 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h' 2026-04-24T15:42:45,932 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh' 2026-04-24T15:42:45,934 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h' 2026-04-24T15:42:45,936 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh' 2026-04-24T15:42:45,937 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h' 2026-04-24T15:42:45,939 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h' 2026-04-24T15:42:45,940 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh' 2026-04-24T15:42:45,942 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh' 2026-04-24T15:42:45,943 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h' 2026-04-24T15:42:45,946 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h' 2026-04-24T15:42:45,948 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h' 2026-04-24T15:42:45,950 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h' 2026-04-24T15:42:45,952 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h' 2026-04-24T15:42:45,954 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h' 2026-04-24T15:42:45,956 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h' 2026-04-24T15:42:45,957 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h' 2026-04-24T15:42:45,959 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp' 2026-04-24T15:42:45,961 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp' 2026-04-24T15:42:45,962 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp' 2026-04-24T15:42:45,964 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h' 2026-04-24T15:42:45,965 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h' 2026-04-24T15:42:45,968 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp' 2026-04-24T15:42:45,972 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp' 2026-04-24T15:42:45,976 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp' 2026-04-24T15:42:45,979 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp' 2026-04-24T15:42:45,982 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp' 2026-04-24T15:42:45,984 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h' 2026-04-24T15:42:45,987 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp' 2026-04-24T15:42:45,988 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp' 2026-04-24T15:42:45,989 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp' 2026-04-24T15:42:45,991 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp' 2026-04-24T15:42:45,992 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp' 2026-04-24T15:42:45,994 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp' 2026-04-24T15:42:46,001 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp' 2026-04-24T15:42:46,004 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp' 2026-04-24T15:42:46,008 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-24T15:42:46,015 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-24T15:42:46,018 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl' 2026-04-24T15:42:46,020 
adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl' 2026-04-24T15:42:46,022 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl' 2026-04-24T15:42:46,024 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h' 2026-04-24T15:42:46,026 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh' 2026-04-24T15:42:46,029 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh' 2026-04-24T15:42:46,031 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh' 2026-04-24T15:42:46,032 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h' 2026-04-24T15:42:46,033 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp' 2026-04-24T15:42:46,035 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h' 2026-04-24T15:42:46,036 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh' 2026-04-24T15:42:46,039 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h' 2026-04-24T15:42:46,041 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h' 2026-04-24T15:42:46,044 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp' 2026-04-24T15:42:46,047 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp' 2026-04-24T15:42:46,049 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h' 2026-04-24T15:42:46,051 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h' 2026-04-24T15:42:46,053 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h' 2026-04-24T15:42:46,055 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h' 2026-04-24T15:42:46,057 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h' 2026-04-24T15:42:46,058 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h' 2026-04-24T15:42:46,060 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h' 2026-04-24T15:42:46,063 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h' 2026-04-24T15:42:46,066 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h' 2026-04-24T15:42:46,067 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h' 
2026-04-24T15:42:46,070 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h' 2026-04-24T15:42:46,072 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h' 2026-04-24T15:42:46,075 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h' 2026-04-24T15:42:46,077 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h' 2026-04-24T15:42:46,079 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h' 2026-04-24T15:42:46,081 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h' 2026-04-24T15:42:46,084 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp' 2026-04-24T15:42:46,086 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh' 2026-04-24T15:42:46,089 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh' 2026-04-24T15:42:46,092 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh' 2026-04-24T15:42:46,094 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh' 2026-04-24T15:42:46,097 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh' 2026-04-24T15:42:46,104 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh' 2026-04-24T15:42:46,106 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh' 2026-04-24T15:42:46,107 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh' 
2026-04-24T15:42:46,110 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh' 2026-04-24T15:42:46,111 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh' 2026-04-24T15:42:46,113 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh' 2026-04-24T15:42:46,115 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu' 2026-04-24T15:42:46,116 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h' 2026-04-24T15:42:46,118 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu' 2026-04-24T15:42:46,119 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h' 2026-04-24T15:42:46,122 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh' 2026-04-24T15:42:46,123 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h' 2026-04-24T15:42:46,127 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh' 2026-04-24T15:42:46,131 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu' 2026-04-24T15:42:46,133 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h' 2026-04-24T15:42:46,136 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu' 2026-04-24T15:42:46,137 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h' 2026-04-24T15:42:46,141 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp' 2026-04-24T15:42:46,143 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h' 2026-04-24T15:42:46,144 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h' 2026-04-24T15:42:46,147 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu' 2026-04-24T15:42:46,148 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h' 2026-04-24T15:42:46,155 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh' 2026-04-24T15:42:46,158 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh' 2026-04-24T15:42:46,160 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh' 2026-04-24T15:42:46,163 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh' 2026-04-24T15:42:46,165 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh' 2026-04-24T15:42:46,167 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu' 2026-04-24T15:42:46,168 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu' 2026-04-24T15:42:46,169 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu' 2026-04-24T15:42:46,170 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu' 2026-04-24T15:42:46,171 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu' 2026-04-24T15:42:46,173 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu' 2026-04-24T15:42:46,174 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu' 2026-04-24T15:42:46,175 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu' 2026-04-24T15:42:46,176 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu' 2026-04-24T15:42:46,178 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu' 2026-04-24T15:42:46,179 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu' 2026-04-24T15:42:46,180 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu' 2026-04-24T15:42:46,182 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu' 2026-04-24T15:42:46,183 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu' 2026-04-24T15:42:46,184 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu' 2026-04-24T15:42:46,186 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu' 2026-04-24T15:42:46,187 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu' 2026-04-24T15:42:46,188 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h' 2026-04-24T15:42:46,191 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h' 2026-04-24T15:42:46,194 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h' 2026-04-24T15:42:46,195 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h' 2026-04-24T15:42:46,198 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl' 2026-04-24T15:42:46,200 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h' 2026-04-24T15:42:46,201 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h' 2026-04-24T15:42:46,203 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h' 2026-04-24T15:42:46,209 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h' 2026-04-24T15:42:46,210 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h' 2026-04-24T15:42:46,212 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu' 2026-04-24T15:42:46,214 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu' 2026-04-24T15:42:46,215 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu' 2026-04-24T15:42:46,216 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu' 2026-04-24T15:42:46,217 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu' 2026-04-24T15:42:46,219 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu' 2026-04-24T15:42:46,220 adding 
'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu' 2026-04-24T15:42:46,221 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu' 2026-04-24T15:42:46,222 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu' 2026-04-24T15:42:46,224 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu' 2026-04-24T15:42:46,225 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu' 2026-04-24T15:42:46,226 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu' 2026-04-24T15:42:46,227 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu' 2026-04-24T15:42:46,229 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu' 2026-04-24T15:42:46,233 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h' 2026-04-24T15:42:46,236 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h' 2026-04-24T15:42:46,238 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h' 2026-04-24T15:42:46,240 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu' 2026-04-24T15:42:46,242 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh' 2026-04-24T15:42:46,243 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h' 2026-04-24T15:42:46,245 
adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h' 2026-04-24T15:42:46,247 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl' 2026-04-24T15:42:46,248 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h' 2026-04-24T15:42:46,256 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl' 2026-04-24T15:42:46,258 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h' 2026-04-24T15:42:46,260 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl' 2026-04-24T15:42:46,262 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp' 2026-04-24T15:42:46,264 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h' 2026-04-24T15:42:46,267 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp' 2026-04-24T15:42:46,269 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp' 2026-04-24T15:42:46,270 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h' 2026-04-24T15:42:46,272 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp' 2026-04-24T15:42:46,273 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h' 2026-04-24T15:42:46,274 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h' 2026-04-24T15:42:46,275 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h' 2026-04-24T15:42:46,278 adding 'flashinfer/data/csrc/xqa/barriers.cuh' 2026-04-24T15:42:46,279 adding 'flashinfer/data/csrc/xqa/cuda_hint.cuh' 2026-04-24T15:42:46,281 adding 
'flashinfer/data/csrc/xqa/defines.h' 2026-04-24T15:42:46,282 adding 'flashinfer/data/csrc/xqa/gmma.cuh' 2026-04-24T15:42:46,292 adding 'flashinfer/data/csrc/xqa/gmma_impl.cuh' 2026-04-24T15:42:46,296 adding 'flashinfer/data/csrc/xqa/hostUtils.h' 2026-04-24T15:42:46,297 adding 'flashinfer/data/csrc/xqa/ldgsts.cuh' 2026-04-24T15:42:46,311 adding 'flashinfer/data/csrc/xqa/mha.cu' 2026-04-24T15:42:46,313 adding 'flashinfer/data/csrc/xqa/mha.h' 2026-04-24T15:42:46,316 adding 'flashinfer/data/csrc/xqa/mhaUtils.cuh' 2026-04-24T15:42:46,318 adding 'flashinfer/data/csrc/xqa/mha_components.cuh' 2026-04-24T15:42:46,331 adding 'flashinfer/data/csrc/xqa/mha_sm90.cu' 2026-04-24T15:42:46,335 adding 'flashinfer/data/csrc/xqa/mha_stdheaders.cuh' 2026-04-24T15:42:46,343 adding 'flashinfer/data/csrc/xqa/mla_sm120.cu' 2026-04-24T15:42:46,345 adding 'flashinfer/data/csrc/xqa/mla_sm120.cuh' 2026-04-24T15:42:46,346 adding 'flashinfer/data/csrc/xqa/mma.cuh' 2026-04-24T15:42:46,348 adding 'flashinfer/data/csrc/xqa/platform.h' 2026-04-24T15:42:46,349 adding 'flashinfer/data/csrc/xqa/specDec.h' 2026-04-24T15:42:46,350 adding 'flashinfer/data/csrc/xqa/tensorMap.cpp' 2026-04-24T15:42:46,352 adding 'flashinfer/data/csrc/xqa/tensorMap.h' 2026-04-24T15:42:46,353 adding 'flashinfer/data/csrc/xqa/tma.h' 2026-04-24T15:42:46,357 adding 'flashinfer/data/csrc/xqa/utils.cuh' 2026-04-24T15:42:46,359 adding 'flashinfer/data/csrc/xqa/utils.h' 2026-04-24T15:42:46,360 adding 'flashinfer/data/csrc/xqa/xqa_wrapper.cu' 2026-04-24T15:42:46,364 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py' 2026-04-24T15:42:46,365 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py' 2026-04-24T15:42:46,367 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py' 2026-04-24T15:42:46,370 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py' 2026-04-24T15:42:46,372 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py' 2026-04-24T15:42:46,374 
adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py' 2026-04-24T15:42:46,377 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py' 2026-04-24T15:42:46,378 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py' 2026-04-24T15:42:46,381 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py' 2026-04-24T15:42:46,383 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py' 2026-04-24T15:42:46,384 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py' 2026-04-24T15:42:46,387 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py' 2026-04-24T15:42:46,389 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py' 2026-04-24T15:42:46,391 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py' 2026-04-24T15:42:46,393 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py' 2026-04-24T15:42:46,397 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py' 2026-04-24T15:42:46,400 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py' 2026-04-24T15:42:46,401 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py' 2026-04-24T15:42:46,403 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py' 2026-04-24T15:42:46,404 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py' 2026-04-24T15:42:46,407 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py' 2026-04-24T15:42:46,409 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py' 2026-04-24T15:42:46,412 adding 
'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py' 2026-04-24T15:42:46,413 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py' 2026-04-24T15:42:46,416 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py' 2026-04-24T15:42:46,418 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py' 2026-04-24T15:42:46,420 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py' 2026-04-24T15:42:46,425 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py' 2026-04-24T15:42:46,429 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py' 2026-04-24T15:42:46,431 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py' 2026-04-24T15:42:46,435 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py' 2026-04-24T15:42:46,437 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py' 2026-04-24T15:42:46,442 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py' 2026-04-24T15:42:46,453 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py' 2026-04-24T15:42:46,463 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py' 2026-04-24T15:42:46,474 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-24T15:42:46,481 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py' 2026-04-24T15:42:46,490 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py' 2026-04-24T15:42:46,497 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py' 2026-04-24T15:42:46,505 adding 
'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py' 2026-04-24T15:42:46,514 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py' 2026-04-24T15:42:46,521 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py' 2026-04-24T15:42:46,532 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py' 2026-04-24T15:42:46,544 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py' 2026-04-24T15:42:46,556 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py' 2026-04-24T15:42:46,566 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py' 2026-04-24T15:42:46,583 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py' 2026-04-24T15:42:46,587 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py' 2026-04-24T15:42:46,590 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py' 2026-04-24T15:42:46,593 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py' 2026-04-24T15:42:46,604 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py' 2026-04-24T15:42:46,615 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py' 2026-04-24T15:42:46,626 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py' 2026-04-24T15:42:46,636 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py' 2026-04-24T15:42:46,640 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py' 2026-04-24T15:42:46,648 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py' 
2026-04-24T15:42:46,655 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py' 2026-04-24T15:42:46,658 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py' 2026-04-24T15:42:46,660 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py' 2026-04-24T15:42:46,672 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py' 2026-04-24T15:42:46,675 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py' 2026-04-24T15:42:46,677 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py' 2026-04-24T15:42:46,685 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py' 2026-04-24T15:42:46,693 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py' 2026-04-24T15:42:46,701 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py' 2026-04-24T15:42:46,703 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py' 2026-04-24T15:42:46,713 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py' 2026-04-24T15:42:46,723 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py' 2026-04-24T15:42:46,733 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py' 2026-04-24T15:42:46,735 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py' 2026-04-24T15:42:46,750 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py' 2026-04-24T15:42:46,766 adding 
'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py' 2026-04-24T15:42:46,769 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py' 2026-04-24T15:42:46,771 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py' 2026-04-24T15:42:46,774 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py' 2026-04-24T15:42:46,777 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py' 2026-04-24T15:42:46,781 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py' 2026-04-24T15:42:46,784 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py' 2026-04-24T15:42:46,789 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py' 2026-04-24T15:42:46,791 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py' 2026-04-24T15:42:46,793 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py' 2026-04-24T15:42:46,795 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py' 2026-04-24T15:42:46,796 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py' 2026-04-24T15:42:46,799 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py' 2026-04-24T15:42:46,801 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py' 2026-04-24T15:42:46,802 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py' 2026-04-24T15:42:46,803 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py' 2026-04-24T15:42:46,805 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py' 2026-04-24T15:42:46,806 adding 
'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py' 2026-04-24T15:42:46,807 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py' 2026-04-24T15:42:46,808 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py' 2026-04-24T15:42:46,810 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py' 2026-04-24T15:42:46,813 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py' 2026-04-24T15:42:46,815 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py' 2026-04-24T15:42:46,818 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py' 2026-04-24T15:42:46,820 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py' 2026-04-24T15:42:46,829 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py' 2026-04-24T15:42:46,839 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py' 2026-04-24T15:42:46,850 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py' 2026-04-24T15:42:46,853 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py' 2026-04-24T15:42:46,857 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py' 2026-04-24T15:42:46,865 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py' 2026-04-24T15:42:46,867 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py' 2026-04-24T15:42:46,875 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py' 2026-04-24T15:42:46,879 adding 
'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py' 2026-04-24T15:42:46,881 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py' 2026-04-24T15:42:46,884 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py' 2026-04-24T15:42:46,887 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py' 2026-04-24T15:42:46,893 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py' 2026-04-24T15:42:46,899 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py' 2026-04-24T15:42:46,908 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py' 2026-04-24T15:42:46,911 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py' 2026-04-24T15:42:46,913 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py' 2026-04-24T15:42:46,915 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py' 2026-04-24T15:42:46,917 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py' 2026-04-24T15:42:46,919 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py' 2026-04-24T15:42:46,922 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py' 2026-04-24T15:42:46,924 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py' 2026-04-24T15:42:46,926 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py' 2026-04-24T15:42:46,929 adding 'flashinfer/data/cutlass/include/cute/config.hpp' 2026-04-24T15:42:46,932 adding 'flashinfer/data/cutlass/include/cute/int_tuple.hpp' 2026-04-24T15:42:46,938 adding 'flashinfer/data/cutlass/include/cute/layout.hpp' 2026-04-24T15:42:46,941 adding 'flashinfer/data/cutlass/include/cute/layout_composed.hpp' 2026-04-24T15:42:46,943 adding 'flashinfer/data/cutlass/include/cute/pointer.hpp' 
2026-04-24T15:42:46,945 adding 'flashinfer/data/cutlass/include/cute/pointer_base.hpp' 2026-04-24T15:42:46,946 adding 'flashinfer/data/cutlass/include/cute/pointer_flagged.hpp' 2026-04-24T15:42:46,948 adding 'flashinfer/data/cutlass/include/cute/pointer_sparse.hpp' 2026-04-24T15:42:46,949 adding 'flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp' 2026-04-24T15:42:46,952 adding 'flashinfer/data/cutlass/include/cute/stride.hpp' 2026-04-24T15:42:46,954 adding 'flashinfer/data/cutlass/include/cute/swizzle.hpp' 2026-04-24T15:42:46,957 adding 'flashinfer/data/cutlass/include/cute/swizzle_layout.hpp' 2026-04-24T15:42:46,958 adding 'flashinfer/data/cutlass/include/cute/tensor.hpp' 2026-04-24T15:42:46,962 adding 'flashinfer/data/cutlass/include/cute/tensor_impl.hpp' 2026-04-24T15:42:46,964 adding 'flashinfer/data/cutlass/include/cute/tensor_zip.hpp' 2026-04-24T15:42:46,966 adding 'flashinfer/data/cutlass/include/cute/underscore.hpp' 2026-04-24T15:42:46,968 adding 'flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp' 2026-04-24T15:42:46,969 adding 'flashinfer/data/cutlass/include/cute/algorithm/clear.hpp' 2026-04-24T15:42:46,971 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp' 2026-04-24T15:42:46,974 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp' 2026-04-24T15:42:46,977 adding 'flashinfer/data/cutlass/include/cute/algorithm/copy.hpp' 2026-04-24T15:42:46,978 adding 'flashinfer/data/cutlass/include/cute/algorithm/fill.hpp' 2026-04-24T15:42:46,980 adding 'flashinfer/data/cutlass/include/cute/algorithm/functional.hpp' 2026-04-24T15:42:46,982 adding 'flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp' 2026-04-24T15:42:46,984 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp' 2026-04-24T15:42:46,985 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp' 2026-04-24T15:42:46,987 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp' 2026-04-24T15:42:46,988 
adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp' 2026-04-24T15:42:46,991 adding 'flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp' 2026-04-24T15:42:46,994 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp' 2026-04-24T15:42:46,995 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp' 2026-04-24T15:42:46,997 adding 'flashinfer/data/cutlass/include/cute/arch/config.hpp' 2026-04-24T15:42:46,999 adding 'flashinfer/data/cutlass/include/cute/arch/copy.hpp' 2026-04-24T15:42:47,011 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp' 2026-04-24T15:42:47,015 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp' 2026-04-24T15:42:47,017 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp' 2026-04-24T15:42:47,018 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp' 2026-04-24T15:42:47,020 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp' 2026-04-24T15:42:47,021 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp' 2026-04-24T15:42:47,024 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp' 2026-04-24T15:42:47,027 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp' 2026-04-24T15:42:47,029 adding 'flashinfer/data/cutlass/include/cute/arch/mma.hpp' 2026-04-24T15:42:47,030 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp' 2026-04-24T15:42:47,033 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp' 2026-04-24T15:42:47,037 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp' 2026-04-24T15:42:47,042 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp' 2026-04-24T15:42:47,048 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp' 2026-04-24T15:42:47,050 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp' 2026-04-24T15:42:47,051 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp' 
2026-04-24T15:42:47,053 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp' 2026-04-24T15:42:47,056 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp' 2026-04-24T15:42:47,058 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp' 2026-04-24T15:42:47,070 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp' 2026-04-24T15:42:47,074 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp' 2026-04-24T15:42:47,110 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp' 2026-04-24T15:42:47,200 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp' 2026-04-24T15:42:47,254 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp' 2026-04-24T15:42:47,349 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp' 2026-04-24T15:42:47,369 adding 'flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp' 2026-04-24T15:42:47,370 adding 'flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp' 2026-04-24T15:42:47,372 adding 'flashinfer/data/cutlass/include/cute/arch/util.hpp' 2026-04-24T15:42:47,376 adding 'flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp' 2026-04-24T15:42:47,378 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp' 2026-04-24T15:42:47,386 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp' 2026-04-24T15:42:47,389 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp' 2026-04-24T15:42:47,391 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp' 2026-04-24T15:42:47,393 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp' 2026-04-24T15:42:47,394 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp' 2026-04-24T15:42:47,395 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp' 2026-04-24T15:42:47,397 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp' 
2026-04-24T15:42:47,401 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp' 2026-04-24T15:42:47,408 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp' 2026-04-24T15:42:47,410 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp' 2026-04-24T15:42:47,413 adding 'flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp' 2026-04-24T15:42:47,415 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp' 2026-04-24T15:42:47,425 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp' 2026-04-24T15:42:47,429 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp' 2026-04-24T15:42:47,430 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp' 2026-04-24T15:42:47,432 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp' 2026-04-24T15:42:47,434 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp' 2026-04-24T15:42:47,435 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp' 2026-04-24T15:42:47,437 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp' 2026-04-24T15:42:47,439 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp' 2026-04-24T15:42:47,440 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp' 2026-04-24T15:42:47,451 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp' 2026-04-24T15:42:47,473 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp' 2026-04-24T15:42:47,486 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp' 2026-04-24T15:42:47,505 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp' 2026-04-24T15:42:47,510 adding 'flashinfer/data/cutlass/include/cute/atom/partitioner.hpp' 2026-04-24T15:42:47,512 adding 'flashinfer/data/cutlass/include/cute/container/alignment.hpp' 2026-04-24T15:42:47,514 adding 
'flashinfer/data/cutlass/include/cute/container/array.hpp' 2026-04-24T15:42:47,515 adding 'flashinfer/data/cutlass/include/cute/container/array_aligned.hpp' 2026-04-24T15:42:47,517 adding 'flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp' 2026-04-24T15:42:47,519 adding 'flashinfer/data/cutlass/include/cute/container/bit_field.hpp' 2026-04-24T15:42:47,520 adding 'flashinfer/data/cutlass/include/cute/container/cuda_types.hpp' 2026-04-24T15:42:47,523 adding 'flashinfer/data/cutlass/include/cute/container/tuple.hpp' 2026-04-24T15:42:47,524 adding 'flashinfer/data/cutlass/include/cute/container/type_list.hpp' 2026-04-24T15:42:47,527 adding 'flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp' 2026-04-24T15:42:47,528 adding 'flashinfer/data/cutlass/include/cute/numeric/complex.hpp' 2026-04-24T15:42:47,530 adding 'flashinfer/data/cutlass/include/cute/numeric/int.hpp' 2026-04-24T15:42:47,531 adding 'flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp' 2026-04-24T15:42:47,533 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp' 2026-04-24T15:42:47,535 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp' 2026-04-24T15:42:47,537 adding 'flashinfer/data/cutlass/include/cute/numeric/math.hpp' 2026-04-24T15:42:47,538 adding 'flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp' 2026-04-24T15:42:47,540 adding 'flashinfer/data/cutlass/include/cute/numeric/real.hpp' 2026-04-24T15:42:47,542 adding 'flashinfer/data/cutlass/include/cute/util/debug.hpp' 2026-04-24T15:42:47,543 adding 'flashinfer/data/cutlass/include/cute/util/print.hpp' 2026-04-24T15:42:47,545 adding 'flashinfer/data/cutlass/include/cute/util/print_latex.hpp' 2026-04-24T15:42:47,547 adding 'flashinfer/data/cutlass/include/cute/util/print_svg.hpp' 2026-04-24T15:42:47,549 adding 'flashinfer/data/cutlass/include/cute/util/print_tensor.hpp' 2026-04-24T15:42:47,550 adding 
'flashinfer/data/cutlass/include/cute/util/type_traits.hpp' 2026-04-24T15:42:47,553 adding 'flashinfer/data/cutlass/include/cutlass/aligned_buffer.h' 2026-04-24T15:42:47,557 adding 'flashinfer/data/cutlass/include/cutlass/array.h' 2026-04-24T15:42:47,559 adding 'flashinfer/data/cutlass/include/cutlass/array_planar_complex.h' 2026-04-24T15:42:47,561 adding 'flashinfer/data/cutlass/include/cutlass/array_subbyte.h' 2026-04-24T15:42:47,563 adding 'flashinfer/data/cutlass/include/cutlass/barrier.h' 2026-04-24T15:42:47,565 adding 'flashinfer/data/cutlass/include/cutlass/bfloat16.h' 2026-04-24T15:42:47,567 adding 'flashinfer/data/cutlass/include/cutlass/blas3.h' 2026-04-24T15:42:47,568 adding 'flashinfer/data/cutlass/include/cutlass/blas3_types.h' 2026-04-24T15:42:47,570 adding 'flashinfer/data/cutlass/include/cutlass/block_striped.h' 2026-04-24T15:42:47,572 adding 'flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp' 2026-04-24T15:42:47,576 adding 'flashinfer/data/cutlass/include/cutlass/complex.h' 2026-04-24T15:42:47,579 adding 'flashinfer/data/cutlass/include/cutlass/constants.h' 2026-04-24T15:42:47,581 adding 'flashinfer/data/cutlass/include/cutlass/coord.h' 2026-04-24T15:42:47,583 adding 'flashinfer/data/cutlass/include/cutlass/core_io.h' 2026-04-24T15:42:47,585 adding 'flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp' 2026-04-24T15:42:47,587 adding 'flashinfer/data/cutlass/include/cutlass/cutlass.h' 2026-04-24T15:42:47,588 adding 'flashinfer/data/cutlass/include/cutlass/device_kernel.h' 2026-04-24T15:42:47,592 adding 'flashinfer/data/cutlass/include/cutlass/exmy_base.h' 2026-04-24T15:42:47,596 adding 'flashinfer/data/cutlass/include/cutlass/fast_math.h' 2026-04-24T15:42:47,600 adding 'flashinfer/data/cutlass/include/cutlass/float8.h' 2026-04-24T15:42:47,603 adding 'flashinfer/data/cutlass/include/cutlass/float_subbyte.h' 2026-04-24T15:42:47,604 adding 'flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h' 2026-04-24T15:42:47,607 adding 
'flashinfer/data/cutlass/include/cutlass/functional.h' 2026-04-24T15:42:47,609 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.h' 2026-04-24T15:42:47,610 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp' 2026-04-24T15:42:47,613 adding 'flashinfer/data/cutlass/include/cutlass/half.h' 2026-04-24T15:42:47,615 adding 'flashinfer/data/cutlass/include/cutlass/integer_subbyte.h' 2026-04-24T15:42:47,616 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h' 2026-04-24T15:42:47,617 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp' 2026-04-24T15:42:47,619 adding 'flashinfer/data/cutlass/include/cutlass/kernel_launch.h' 2026-04-24T15:42:47,640 adding 'flashinfer/data/cutlass/include/cutlass/matrix.h' 2026-04-24T15:42:47,644 adding 'flashinfer/data/cutlass/include/cutlass/matrix_coord.h' 2026-04-24T15:42:47,645 adding 'flashinfer/data/cutlass/include/cutlass/matrix_shape.h' 2026-04-24T15:42:47,661 adding 'flashinfer/data/cutlass/include/cutlass/numeric_conversion.h' 2026-04-24T15:42:47,665 adding 'flashinfer/data/cutlass/include/cutlass/numeric_size.h' 2026-04-24T15:42:47,666 adding 'flashinfer/data/cutlass/include/cutlass/numeric_types.h' 2026-04-24T15:42:47,667 adding 'flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h' 2026-04-24T15:42:47,670 adding 'flashinfer/data/cutlass/include/cutlass/predicate_vector.h' 2026-04-24T15:42:47,672 adding 'flashinfer/data/cutlass/include/cutlass/quaternion.h' 2026-04-24T15:42:47,674 adding 'flashinfer/data/cutlass/include/cutlass/real.h' 2026-04-24T15:42:47,675 adding 'flashinfer/data/cutlass/include/cutlass/relatively_equal.h' 2026-04-24T15:42:47,677 adding 'flashinfer/data/cutlass/include/cutlass/semaphore.h' 2026-04-24T15:42:47,680 adding 'flashinfer/data/cutlass/include/cutlass/subbyte_reference.h' 2026-04-24T15:42:47,682 adding 'flashinfer/data/cutlass/include/cutlass/tensor_coord.h' 2026-04-24T15:42:47,684 adding 
'flashinfer/data/cutlass/include/cutlass/tensor_ref.h' 2026-04-24T15:42:47,686 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h' 2026-04-24T15:42:47,688 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view.h' 2026-04-24T15:42:47,689 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h' 2026-04-24T15:42:47,691 adding 'flashinfer/data/cutlass/include/cutlass/tfloat32.h' 2026-04-24T15:42:47,693 adding 'flashinfer/data/cutlass/include/cutlass/trace.h' 2026-04-24T15:42:47,695 adding 'flashinfer/data/cutlass/include/cutlass/uint128.h' 2026-04-24T15:42:47,696 adding 'flashinfer/data/cutlass/include/cutlass/uint256.h' 2026-04-24T15:42:47,698 adding 'flashinfer/data/cutlass/include/cutlass/version.h' 2026-04-24T15:42:47,699 adding 'flashinfer/data/cutlass/include/cutlass/wmma_array.h' 2026-04-24T15:42:47,701 adding 'flashinfer/data/cutlass/include/cutlass/workspace.h' 2026-04-24T15:42:47,703 adding 'flashinfer/data/cutlass/include/cutlass/arch/arch.h' 2026-04-24T15:42:47,706 adding 'flashinfer/data/cutlass/include/cutlass/arch/barrier.h' 2026-04-24T15:42:47,707 adding 'flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h' 2026-04-24T15:42:47,709 adding 'flashinfer/data/cutlass/include/cutlass/arch/config.h' 2026-04-24T15:42:47,711 adding 'flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h' 2026-04-24T15:42:47,713 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory.h' 2026-04-24T15:42:47,714 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h' 2026-04-24T15:42:47,716 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h' 2026-04-24T15:42:47,718 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma.h' 2026-04-24T15:42:47,720 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h' 2026-04-24T15:42:47,721 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h' 2026-04-24T15:42:47,723 adding 
'flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h' 2026-04-24T15:42:47,724 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h' 2026-04-24T15:42:47,726 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h' 2026-04-24T15:42:47,728 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h' 2026-04-24T15:42:47,730 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h' 2026-04-24T15:42:47,732 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h' 2026-04-24T15:42:47,734 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h' 2026-04-24T15:42:47,736 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h' 2026-04-24T15:42:47,738 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h' 2026-04-24T15:42:47,739 adding 'flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h' 2026-04-24T15:42:47,741 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd.h' 2026-04-24T15:42:47,742 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h' 2026-04-24T15:42:47,743 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h' 2026-04-24T15:42:47,747 adding 'flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp' 2026-04-24T15:42:47,749 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma.h' 2026-04-24T15:42:47,750 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h' 2026-04-24T15:42:47,752 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h' 2026-04-24T15:42:47,753 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h' 2026-04-24T15:42:47,756 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h' 2026-04-24T15:42:47,759 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h' 2026-04-24T15:42:47,761 adding 'flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp' 2026-04-24T15:42:47,763 adding 'flashinfer/data/cutlass/include/cutlass/conv/convolution.h' 
2026-04-24T15:42:47,765 adding 'flashinfer/data/cutlass/include/cutlass/conv/detail.hpp' 2026-04-24T15:42:47,766 adding 'flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp' 2026-04-24T15:42:47,768 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp' 2026-04-24T15:42:47,770 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp' 2026-04-24T15:42:47,771 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp' 2026-04-24T15:42:47,776 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp' 2026-04-24T15:42:47,780 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp' 2026-04-24T15:42:47,782 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl' 2026-04-24T15:42:47,784 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl' 2026-04-24T15:42:47,786 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl' 2026-04-24T15:42:47,788 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl' 2026-04-24T15:42:47,791 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp' 2026-04-24T15:42:47,793 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h' 2026-04-24T15:42:47,795 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h' 2026-04-24T15:42:47,797 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h' 2026-04-24T15:42:47,799 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp' 2026-04-24T15:42:47,801 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h' 2026-04-24T15:42:47,804 adding 
'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h' 2026-04-24T15:42:47,807 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h' 2026-04-24T15:42:47,809 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h' 2026-04-24T15:42:47,810 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h' 2026-04-24T15:42:47,811 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h' 2026-04-24T15:42:47,813 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h' 2026-04-24T15:42:47,815 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h' 2026-04-24T15:42:47,817 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h' 2026-04-24T15:42:47,819 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h' 2026-04-24T15:42:47,821 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h' 2026-04-24T15:42:47,823 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h' 2026-04-24T15:42:47,825 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h' 2026-04-24T15:42:47,826 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h' 2026-04-24T15:42:47,828 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h' 2026-04-24T15:42:47,830 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h' 2026-04-24T15:42:47,832 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h' 2026-04-24T15:42:47,834 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h' 2026-04-24T15:42:47,836 adding 
'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h' 2026-04-24T15:42:47,838 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h' 2026-04-24T15:42:47,840 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h' 2026-04-24T15:42:47,842 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h' 2026-04-24T15:42:47,845 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h' 2026-04-24T15:42:47,847 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h' 2026-04-24T15:42:47,850 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h' 2026-04-24T15:42:47,852 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h' 2026-04-24T15:42:47,856 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp' 2026-04-24T15:42:47,858 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp' 2026-04-24T15:42:47,860 adding 'flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h' 2026-04-24T15:42:47,864 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,866 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,868 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,871 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,873 adding 
'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,875 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h' 2026-04-24T15:42:47,877 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h' 2026-04-24T15:42:47,879 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,881 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,883 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h' 2026-04-24T15:42:47,885 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h' 2026-04-24T15:42:47,887 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,889 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h' 2026-04-24T15:42:47,891 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h' 2026-04-24T15:42:47,893 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,895 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,897 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,899 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,900 adding 
'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,902 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,904 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,906 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,908 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,910 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,912 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,914 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,916 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h' 2026-04-24T15:42:47,918 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,920 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,922 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-24T15:42:47,924 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-24T15:42:47,926 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h' 2026-04-24T15:42:47,928 adding 
'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h' 2026-04-24T15:42:47,930 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h' 2026-04-24T15:42:47,932 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h' 2026-04-24T15:42:47,934 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h' 2026-04-24T15:42:47,936 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h' 2026-04-24T15:42:47,938 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h' 2026-04-24T15:42:47,941 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h' 2026-04-24T15:42:47,944 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h' 2026-04-24T15:42:47,947 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h' 2026-04-24T15:42:47,949 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h' 2026-04-24T15:42:47,952 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h' 2026-04-24T15:42:47,954 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-24T15:42:47,956 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-24T15:42:47,958 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h' 2026-04-24T15:42:47,960 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h' 2026-04-24T15:42:47,963 adding 
'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h' 2026-04-24T15:42:47,965 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h' 2026-04-24T15:42:47,968 adding 'flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp' 2026-04-24T15:42:47,970 adding 'flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp' 2026-04-24T15:42:47,972 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective.hpp' 2026-04-24T15:42:47,973 adding 'flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp' 2026-04-24T15:42:47,975 adding 'flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp' 2026-04-24T15:42:47,977 adding 'flashinfer/data/cutlass/include/cutlass/detail/layout.hpp' 2026-04-24T15:42:47,979 adding 'flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp' 2026-04-24T15:42:47,980 adding 'flashinfer/data/cutlass/include/cutlass/detail/mma.hpp' 2026-04-24T15:42:47,982 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp' 2026-04-24T15:42:47,984 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp' 2026-04-24T15:42:47,986 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp' 2026-04-24T15:42:47,987 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp' 2026-04-24T15:42:47,993 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp' 2026-04-24T15:42:47,994 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp' 2026-04-24T15:42:47,996 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp' 2026-04-24T15:42:47,999 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp' 2026-04-24T15:42:48,001 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp' 
2026-04-24T15:42:48,003 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp' 2026-04-24T15:42:48,005 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp' 2026-04-24T15:42:48,008 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp' 2026-04-24T15:42:48,011 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp' 2026-04-24T15:42:48,013 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp' 2026-04-24T15:42:48,017 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp' 2026-04-24T15:42:48,019 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp' 2026-04-24T15:42:48,025 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp' 2026-04-24T15:42:48,033 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp' 2026-04-24T15:42:48,037 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp' 2026-04-24T15:42:48,041 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp' 2026-04-24T15:42:48,047 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp' 2026-04-24T15:42:48,050 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp' 2026-04-24T15:42:48,052 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp' 2026-04-24T15:42:48,058 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp' 2026-04-24T15:42:48,063 adding 
'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp' 2026-04-24T15:42:48,065 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp' 2026-04-24T15:42:48,072 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl' 2026-04-24T15:42:48,074 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl' 2026-04-24T15:42:48,077 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl' 2026-04-24T15:42:48,078 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl' 2026-04-24T15:42:48,082 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl' 2026-04-24T15:42:48,083 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl' 2026-04-24T15:42:48,085 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp' 2026-04-24T15:42:48,088 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp' 2026-04-24T15:42:48,091 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp' 2026-04-24T15:42:48,094 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp' 2026-04-24T15:42:48,097 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp' 2026-04-24T15:42:48,100 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp' 2026-04-24T15:42:48,104 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp' 2026-04-24T15:42:48,110 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp' 2026-04-24T15:42:48,114 adding 
'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp' 2026-04-24T15:42:48,119 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp' 2026-04-24T15:42:48,125 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp' 2026-04-24T15:42:48,129 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp' 2026-04-24T15:42:48,133 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp' 2026-04-24T15:42:48,137 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h' 2026-04-24T15:42:48,138 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h' 2026-04-24T15:42:48,139 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp' 2026-04-24T15:42:48,141 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h' 2026-04-24T15:42:48,144 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h' 2026-04-24T15:42:48,146 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h' 2026-04-24T15:42:48,149 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h' 2026-04-24T15:42:48,150 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h' 2026-04-24T15:42:48,152 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h' 2026-04-24T15:42:48,154 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h' 2026-04-24T15:42:48,156 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h' 2026-04-24T15:42:48,158 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h' 
2026-04-24T15:42:48,159 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h' 2026-04-24T15:42:48,161 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h' 2026-04-24T15:42:48,162 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h' 2026-04-24T15:42:48,164 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h' 2026-04-24T15:42:48,166 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h' 2026-04-24T15:42:48,169 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h' 2026-04-24T15:42:48,170 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h' 2026-04-24T15:42:48,172 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h' 2026-04-24T15:42:48,173 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h' 2026-04-24T15:42:48,175 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp' 2026-04-24T15:42:48,177 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h' 2026-04-24T15:42:48,178 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h' 2026-04-24T15:42:48,180 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h' 2026-04-24T15:42:48,183 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h' 2026-04-24T15:42:48,184 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h' 2026-04-24T15:42:48,186 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h' 2026-04-24T15:42:48,187 adding 
'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h' 2026-04-24T15:42:48,189 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h' 2026-04-24T15:42:48,192 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h' 2026-04-24T15:42:48,193 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h' 2026-04-24T15:42:48,195 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h' 2026-04-24T15:42:48,197 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h' 2026-04-24T15:42:48,198 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h' 2026-04-24T15:42:48,200 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h' 2026-04-24T15:42:48,202 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h' 2026-04-24T15:42:48,203 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h' 2026-04-24T15:42:48,205 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h' 2026-04-24T15:42:48,206 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h' 2026-04-24T15:42:48,208 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h' 2026-04-24T15:42:48,210 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h' 2026-04-24T15:42:48,212 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h' 2026-04-24T15:42:48,214 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h' 2026-04-24T15:42:48,216 adding 
'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h' 2026-04-24T15:42:48,218 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h' 2026-04-24T15:42:48,220 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h' 2026-04-24T15:42:48,221 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h' 2026-04-24T15:42:48,224 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h' 2026-04-24T15:42:48,225 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h' 2026-04-24T15:42:48,227 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h' 2026-04-24T15:42:48,230 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h' 2026-04-24T15:42:48,233 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h' 2026-04-24T15:42:48,238 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h' 2026-04-24T15:42:48,241 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h' 2026-04-24T15:42:48,243 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h' 2026-04-24T15:42:48,246 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h' 2026-04-24T15:42:48,248 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h' 2026-04-24T15:42:48,250 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h' 2026-04-24T15:42:48,252 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h' 2026-04-24T15:42:48,254 adding 
'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h' 2026-04-24T15:42:48,256 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h' 2026-04-24T15:42:48,260 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h' 2026-04-24T15:42:48,263 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h' 2026-04-24T15:42:48,265 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h' 2026-04-24T15:42:48,267 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h' 2026-04-24T15:42:48,269 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h' 2026-04-24T15:42:48,271 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h' 2026-04-24T15:42:48,274 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h' 2026-04-24T15:42:48,275 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h' 2026-04-24T15:42:48,278 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h' 2026-04-24T15:42:48,279 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h' 2026-04-24T15:42:48,281 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h' 2026-04-24T15:42:48,283 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h' 2026-04-24T15:42:48,286 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp' 2026-04-24T15:42:48,287 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp' 
2026-04-24T15:42:48,290 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp' 2026-04-24T15:42:48,293 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp' 2026-04-24T15:42:48,295 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp' 2026-04-24T15:42:48,297 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h' 2026-04-24T15:42:48,299 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h' 2026-04-24T15:42:48,300 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h' 2026-04-24T15:42:48,302 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h' 2026-04-24T15:42:48,304 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h' 2026-04-24T15:42:48,306 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h' 2026-04-24T15:42:48,307 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h' 2026-04-24T15:42:48,309 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h' 2026-04-24T15:42:48,311 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h' 2026-04-24T15:42:48,313 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h' 2026-04-24T15:42:48,316 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h' 2026-04-24T15:42:48,318 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h' 2026-04-24T15:42:48,320 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h' 2026-04-24T15:42:48,322 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h' 
2026-04-24T15:42:48,323 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h' 2026-04-24T15:42:48,326 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp' 2026-04-24T15:42:48,329 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp' 2026-04-24T15:42:48,331 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp' 2026-04-24T15:42:48,333 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp' 2026-04-24T15:42:48,334 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp' 2026-04-24T15:42:48,336 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp' 2026-04-24T15:42:48,338 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp' 2026-04-24T15:42:48,341 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp' 2026-04-24T15:42:48,346 adding 'flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp' 2026-04-24T15:42:48,348 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm.h' 2026-04-24T15:42:48,349 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h' 2026-04-24T15:42:48,351 adding 'flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp' 2026-04-24T15:42:48,354 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp' 2026-04-24T15:42:48,355 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp' 2026-04-24T15:42:48,357 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp' 2026-04-24T15:42:48,358 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp' 2026-04-24T15:42:48,360 
adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp' 2026-04-24T15:42:48,366 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp' 2026-04-24T15:42:48,373 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp' 2026-04-24T15:42:48,379 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-24T15:42:48,384 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp' 2026-04-24T15:42:48,391 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp' 2026-04-24T15:42:48,396 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp' 2026-04-24T15:42:48,403 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp' 2026-04-24T15:42:48,409 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp' 2026-04-24T15:42:48,415 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-24T15:42:48,421 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-24T15:42:48,426 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp' 2026-04-24T15:42:48,431 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp' 2026-04-24T15:42:48,434 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp' 2026-04-24T15:42:48,438 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-24T15:42:48,441 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp' 2026-04-24T15:42:48,447 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp' 2026-04-24T15:42:48,453 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp' 2026-04-24T15:42:48,459 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-24T15:42:48,464 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-24T15:42:48,470 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp' 2026-04-24T15:42:48,475 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp' 2026-04-24T15:42:48,480 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp' 2026-04-24T15:42:48,488 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp' 2026-04-24T15:42:48,494 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp' 2026-04-24T15:42:48,500 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp' 2026-04-24T15:42:48,505 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp' 2026-04-24T15:42:48,511 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp' 2026-04-24T15:42:48,516 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp' 2026-04-24T15:42:48,520 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp' 2026-04-24T15:42:48,524 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp' 2026-04-24T15:42:48,529 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp' 2026-04-24T15:42:48,531 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp' 2026-04-24T15:42:48,534 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp' 2026-04-24T15:42:48,537 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp' 2026-04-24T15:42:48,543 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-24T15:42:48,548 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp' 2026-04-24T15:42:48,552 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-24T15:42:48,557 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-24T15:42:48,561 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp' 2026-04-24T15:42:48,564 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp' 2026-04-24T15:42:48,568 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp' 2026-04-24T15:42:48,573 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-24T15:42:48,576 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp' 2026-04-24T15:42:48,579 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-24T15:42:48,582 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-24T15:42:48,587 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-24T15:42:48,591 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-24T15:42:48,595 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-24T15:42:48,599 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl' 2026-04-24T15:42:48,601 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl' 2026-04-24T15:42:48,603 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl' 2026-04-24T15:42:48,605 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl' 2026-04-24T15:42:48,607 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl' 2026-04-24T15:42:48,610 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl' 2026-04-24T15:42:48,613 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl' 2026-04-24T15:42:48,615 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl' 2026-04-24T15:42:48,617 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl' 2026-04-24T15:42:48,620 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl' 2026-04-24T15:42:48,621 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl' 2026-04-24T15:42:48,623 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl' 2026-04-24T15:42:48,624 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl' 2026-04-24T15:42:48,626 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl' 2026-04-24T15:42:48,629 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl' 2026-04-24T15:42:48,631 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl' 2026-04-24T15:42:48,634 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl' 2026-04-24T15:42:48,636 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl' 2026-04-24T15:42:48,639 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl' 2026-04-24T15:42:48,641 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl' 2026-04-24T15:42:48,642 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl' 2026-04-24T15:42:48,644 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl' 2026-04-24T15:42:48,646 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl' 2026-04-24T15:42:48,650 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl' 2026-04-24T15:42:48,652 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl' 2026-04-24T15:42:48,655 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl' 2026-04-24T15:42:48,659 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl' 2026-04-24T15:42:48,661 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl' 2026-04-24T15:42:48,663 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl' 2026-04-24T15:42:48,666 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h' 2026-04-24T15:42:48,669 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h' 2026-04-24T15:42:48,672 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h' 2026-04-24T15:42:48,675 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h' 2026-04-24T15:42:48,677 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h' 2026-04-24T15:42:48,680 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h' 2026-04-24T15:42:48,683 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h' 2026-04-24T15:42:48,686 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h' 2026-04-24T15:42:48,687 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h' 2026-04-24T15:42:48,689 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h' 2026-04-24T15:42:48,691 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h' 2026-04-24T15:42:48,693 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h' 2026-04-24T15:42:48,695 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h' 2026-04-24T15:42:48,697 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h' 2026-04-24T15:42:48,698 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h' 2026-04-24T15:42:48,701 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h' 2026-04-24T15:42:48,703 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h' 2026-04-24T15:42:48,706 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h' 2026-04-24T15:42:48,709 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h' 2026-04-24T15:42:48,711 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h' 2026-04-24T15:42:48,713 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h' 2026-04-24T15:42:48,715 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h' 2026-04-24T15:42:48,717 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h' 2026-04-24T15:42:48,719 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h' 2026-04-24T15:42:48,720 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h' 2026-04-24T15:42:48,723 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h' 2026-04-24T15:42:48,724 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h' 2026-04-24T15:42:48,726 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h' 2026-04-24T15:42:48,729 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h' 2026-04-24T15:42:48,732 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h' 2026-04-24T15:42:48,737 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h' 2026-04-24T15:42:48,740 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h' 2026-04-24T15:42:48,742 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h' 2026-04-24T15:42:48,744 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h' 2026-04-24T15:42:48,746 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h' 2026-04-24T15:42:48,748 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h' 2026-04-24T15:42:48,749 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h' 2026-04-24T15:42:48,751 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h' 2026-04-24T15:42:48,753 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h' 2026-04-24T15:42:48,754 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h' 2026-04-24T15:42:48,756 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h' 2026-04-24T15:42:48,758 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h' 2026-04-24T15:42:48,759 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h' 2026-04-24T15:42:48,761 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h' 2026-04-24T15:42:48,762 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h' 2026-04-24T15:42:48,764 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h' 2026-04-24T15:42:48,766 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h' 2026-04-24T15:42:48,767 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h' 2026-04-24T15:42:48,769 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h' 
2026-04-24T15:42:48,770 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h' 2026-04-24T15:42:48,772 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h' 2026-04-24T15:42:48,774 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h' 2026-04-24T15:42:48,775 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h' 2026-04-24T15:42:48,777 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h' 2026-04-24T15:42:48,779 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h' 2026-04-24T15:42:48,781 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h' 2026-04-24T15:42:48,783 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h' 2026-04-24T15:42:48,785 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h' 2026-04-24T15:42:48,786 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h' 2026-04-24T15:42:48,788 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h' 2026-04-24T15:42:48,790 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h' 2026-04-24T15:42:48,792 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h' 2026-04-24T15:42:48,794 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h' 2026-04-24T15:42:48,795 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h' 2026-04-24T15:42:48,797 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h' 2026-04-24T15:42:48,800 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h' 2026-04-24T15:42:48,802 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h' 2026-04-24T15:42:48,804 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h' 2026-04-24T15:42:48,805 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h' 2026-04-24T15:42:48,807 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h' 2026-04-24T15:42:48,809 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h' 2026-04-24T15:42:48,811 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h' 2026-04-24T15:42:48,813 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h' 2026-04-24T15:42:48,815 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h' 2026-04-24T15:42:48,818 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h' 2026-04-24T15:42:48,820 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h' 2026-04-24T15:42:48,821 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h' 2026-04-24T15:42:48,824 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h' 2026-04-24T15:42:48,827 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h' 2026-04-24T15:42:48,830 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h' 2026-04-24T15:42:48,833 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h' 2026-04-24T15:42:48,834 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h' 2026-04-24T15:42:48,843 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h' 2026-04-24T15:42:48,845 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h' 2026-04-24T15:42:48,847 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h' 2026-04-24T15:42:48,849 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp' 2026-04-24T15:42:48,851 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h' 2026-04-24T15:42:48,852 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h' 2026-04-24T15:42:48,857 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h' 2026-04-24T15:42:48,859 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h' 2026-04-24T15:42:48,862 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h' 2026-04-24T15:42:48,865 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h' 2026-04-24T15:42:48,869 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h' 2026-04-24T15:42:48,872 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h' 2026-04-24T15:42:48,874 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h' 2026-04-24T15:42:48,876 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h' 2026-04-24T15:42:48,880 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h' 2026-04-24T15:42:48,882 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h' 2026-04-24T15:42:48,884 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h' 2026-04-24T15:42:48,886 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h' 2026-04-24T15:42:48,889 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h' 2026-04-24T15:42:48,891 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h' 2026-04-24T15:42:48,893 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h' 2026-04-24T15:42:48,895 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h' 2026-04-24T15:42:48,898 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h' 2026-04-24T15:42:48,905 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp' 2026-04-24T15:42:48,911 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp' 2026-04-24T15:42:48,917 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp' 2026-04-24T15:42:48,921 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp' 2026-04-24T15:42:48,926 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-24T15:42:48,930 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp' 2026-04-24T15:42:48,936 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp' 2026-04-24T15:42:48,941 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp' 2026-04-24T15:42:48,946 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp' 2026-04-24T15:42:48,951 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp' 2026-04-24T15:42:48,953 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp' 2026-04-24T15:42:48,956 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp' 2026-04-24T15:42:48,959 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp' 2026-04-24T15:42:48,963 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp' 2026-04-24T15:42:48,969 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp' 2026-04-24T15:42:48,974 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp' 2026-04-24T15:42:48,978 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp' 2026-04-24T15:42:48,981 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp' 2026-04-24T15:42:48,983 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp' 2026-04-24T15:42:48,988 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp' 2026-04-24T15:42:48,993 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp' 2026-04-24T15:42:48,996 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp' 2026-04-24T15:42:48,998 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp' 2026-04-24T15:42:49,003 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp' 2026-04-24T15:42:49,007 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp' 2026-04-24T15:42:49,010 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp' 2026-04-24T15:42:49,013 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp' 2026-04-24T15:42:49,016 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp' 2026-04-24T15:42:49,018 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp' 2026-04-24T15:42:49,021 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp' 2026-04-24T15:42:49,027 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp' 2026-04-24T15:42:49,029 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h' 2026-04-24T15:42:49,032 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h' 2026-04-24T15:42:49,033 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h' 2026-04-24T15:42:49,036 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp' 2026-04-24T15:42:49,039 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h' 2026-04-24T15:42:49,041 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp' 2026-04-24T15:42:49,042 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp' 2026-04-24T15:42:49,051 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h' 2026-04-24T15:42:49,054 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h' 2026-04-24T15:42:49,056 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h' 2026-04-24T15:42:49,058 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h' 2026-04-24T15:42:49,060 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h' 2026-04-24T15:42:49,062 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h' 2026-04-24T15:42:49,066 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h' 2026-04-24T15:42:49,068 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h' 2026-04-24T15:42:49,070 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h' 2026-04-24T15:42:49,072 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h' 2026-04-24T15:42:49,075 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h' 2026-04-24T15:42:49,077 
adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h' 2026-04-24T15:42:49,080 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h' 2026-04-24T15:42:49,085 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h' 2026-04-24T15:42:49,088 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h' 2026-04-24T15:42:49,090 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h' 2026-04-24T15:42:49,092 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h' 2026-04-24T15:42:49,094 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h' 2026-04-24T15:42:49,096 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h' 2026-04-24T15:42:49,097 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h' 2026-04-24T15:42:49,099 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h' 2026-04-24T15:42:49,100 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h' 2026-04-24T15:42:49,102 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h' 2026-04-24T15:42:49,104 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h' 2026-04-24T15:42:49,105 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h' 2026-04-24T15:42:49,107 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h' 2026-04-24T15:42:49,110 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h' 2026-04-24T15:42:49,113 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h' 2026-04-24T15:42:49,115 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h' 2026-04-24T15:42:49,117 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h' 2026-04-24T15:42:49,120 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h' 2026-04-24T15:42:49,122 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h' 2026-04-24T15:42:49,124 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h' 2026-04-24T15:42:49,125 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h' 2026-04-24T15:42:49,127 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h' 2026-04-24T15:42:49,130 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h' 2026-04-24T15:42:49,133 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h' 2026-04-24T15:42:49,136 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h' 2026-04-24T15:42:49,139 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h' 2026-04-24T15:42:49,141 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h' 2026-04-24T15:42:49,143 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h' 2026-04-24T15:42:49,146 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h' 2026-04-24T15:42:49,148 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h' 2026-04-24T15:42:49,150 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h' 2026-04-24T15:42:49,153 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h' 2026-04-24T15:42:49,155 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h' 2026-04-24T15:42:49,157 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h' 2026-04-24T15:42:49,160 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h' 2026-04-24T15:42:49,162 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h' 2026-04-24T15:42:49,165 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h' 2026-04-24T15:42:49,168 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h' 2026-04-24T15:42:49,170 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h' 2026-04-24T15:42:49,171 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h' 2026-04-24T15:42:49,173 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h' 2026-04-24T15:42:49,174 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h' 2026-04-24T15:42:49,176 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h' 2026-04-24T15:42:49,177 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h' 2026-04-24T15:42:49,179 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h' 2026-04-24T15:42:49,182 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h' 2026-04-24T15:42:49,185 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h' 2026-04-24T15:42:49,190 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h' 2026-04-24T15:42:49,192 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h' 2026-04-24T15:42:49,194 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h' 2026-04-24T15:42:49,197 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h' 2026-04-24T15:42:49,199 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h' 2026-04-24T15:42:49,201 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h' 2026-04-24T15:42:49,202 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h' 2026-04-24T15:42:49,206 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h' 2026-04-24T15:42:49,208 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h' 2026-04-24T15:42:49,211 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h' 2026-04-24T15:42:49,213 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h' 2026-04-24T15:42:49,216 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h' 2026-04-24T15:42:49,217 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h' 2026-04-24T15:42:49,219 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h' 2026-04-24T15:42:49,221 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h' 2026-04-24T15:42:49,230 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h' 2026-04-24T15:42:49,237 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h' 2026-04-24T15:42:49,242 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h' 2026-04-24T15:42:49,244 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h' 2026-04-24T15:42:49,246 adding 
'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h' 2026-04-24T15:42:49,248 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h' 2026-04-24T15:42:49,250 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h' 2026-04-24T15:42:49,252 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h' 2026-04-24T15:42:49,254 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h' 2026-04-24T15:42:49,255 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h' 2026-04-24T15:42:49,257 adding 'flashinfer/data/cutlass/include/cutlass/layout/layout.h' 2026-04-24T15:42:49,260 adding 'flashinfer/data/cutlass/include/cutlass/layout/matrix.h' 2026-04-24T15:42:49,263 adding 'flashinfer/data/cutlass/include/cutlass/layout/permute.h' 2026-04-24T15:42:49,264 adding 'flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h' 2026-04-24T15:42:49,266 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor.h' 2026-04-24T15:42:49,269 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h' 2026-04-24T15:42:49,271 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h' 2026-04-24T15:42:49,274 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h' 2026-04-24T15:42:49,275 adding 'flashinfer/data/cutlass/include/cutlass/layout/vector.h' 2026-04-24T15:42:49,277 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp' 2026-04-24T15:42:49,281 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp' 2026-04-24T15:42:49,285 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp' 2026-04-24T15:42:49,289 adding 'flashinfer/data/cutlass/include/cutlass/platform/platform.h' 2026-04-24T15:42:49,291 adding 
'flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h' 2026-04-24T15:42:49,293 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h' 2026-04-24T15:42:49,295 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h' 2026-04-24T15:42:49,297 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h' 2026-04-24T15:42:49,299 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h' 2026-04-24T15:42:49,301 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h' 2026-04-24T15:42:49,303 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h' 2026-04-24T15:42:49,306 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h' 2026-04-24T15:42:49,308 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h' 2026-04-24T15:42:49,310 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h' 2026-04-24T15:42:49,312 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h' 2026-04-24T15:42:49,314 adding 'flashinfer/data/cutlass/include/cutlass/thread/matrix.h' 2026-04-24T15:42:49,318 adding 'flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h' 2026-04-24T15:42:49,322 adding 'flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp' 2026-04-24T15:42:49,325 adding 'flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp' 2026-04-24T15:42:49,327 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp' 2026-04-24T15:42:49,330 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp' 2026-04-24T15:42:49,332 adding 
'flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp' 2026-04-24T15:42:49,334 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h' 2026-04-24T15:42:49,335 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h' 2026-04-24T15:42:49,338 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h' 2026-04-24T15:42:49,341 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h' 2026-04-24T15:42:49,344 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h' 2026-04-24T15:42:49,347 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-24T15:42:49,348 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-24T15:42:49,353 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h' 2026-04-24T15:42:49,356 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h' 2026-04-24T15:42:49,358 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h' 2026-04-24T15:42:49,361 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h' 2026-04-24T15:42:49,365 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h' 2026-04-24T15:42:49,368 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h' 2026-04-24T15:42:49,370 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h' 2026-04-24T15:42:49,372 adding 
'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h' 2026-04-24T15:42:49,374 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h' 2026-04-24T15:42:49,376 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h' 2026-04-24T15:42:49,378 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h' 2026-04-24T15:42:49,380 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h' 2026-04-24T15:42:49,382 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h' 2026-04-24T15:42:49,385 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h' 2026-04-24T15:42:49,387 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h' 2026-04-24T15:42:49,389 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h' 2026-04-24T15:42:49,391 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h' 2026-04-24T15:42:49,394 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h' 2026-04-24T15:42:49,397 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h' 2026-04-24T15:42:49,399 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h' 2026-04-24T15:42:49,401 adding 'flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h' 2026-04-24T15:42:49,404 adding 'flashinfer/data/cutlass/python/setup_cutlass.py' 2026-04-24T15:42:49,405 adding 'flashinfer/data/cutlass/python/setup_library.py' 
2026-04-24T15:42:49,406 adding 'flashinfer/data/cutlass/python/setup_pycute.py' 2026-04-24T15:42:49,409 adding 'flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py' 2026-04-24T15:42:49,411 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py' 2026-04-24T15:42:49,412 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py' 2026-04-24T15:42:49,414 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py' 2026-04-24T15:42:49,416 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py' 2026-04-24T15:42:49,417 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py' 2026-04-24T15:42:49,420 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py' 2026-04-24T15:42:49,431 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py' 2026-04-24T15:42:49,434 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py' 2026-04-24T15:42:49,436 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py' 2026-04-24T15:42:49,439 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py' 2026-04-24T15:42:49,449 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py' 2026-04-24T15:42:49,451 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py' 2026-04-24T15:42:49,457 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py' 2026-04-24T15:42:49,465 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py' 2026-04-24T15:42:49,466 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py' 2026-04-24T15:42:49,468 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py' 2026-04-24T15:42:49,471 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py' 2026-04-24T15:42:49,472 adding 
'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py' 2026-04-24T15:42:49,474 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py' 2026-04-24T15:42:49,475 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py' 2026-04-24T15:42:49,477 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py' 2026-04-24T15:42:49,479 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py' 2026-04-24T15:42:49,481 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py' 2026-04-24T15:42:49,483 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py' 2026-04-24T15:42:49,485 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py' 2026-04-24T15:42:49,488 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py' 2026-04-24T15:42:49,490 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py' 2026-04-24T15:42:49,491 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py' 2026-04-24T15:42:49,493 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py' 2026-04-24T15:42:49,495 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py' 2026-04-24T15:42:49,496 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py' 2026-04-24T15:42:49,499 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py' 2026-04-24T15:42:49,501 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py' 2026-04-24T15:42:49,503 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py' 2026-04-24T15:42:49,506 adding 
'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py' 2026-04-24T15:42:49,514 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py' 2026-04-24T15:42:49,516 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py' 2026-04-24T15:42:49,517 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py' 2026-04-24T15:42:49,518 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py' 2026-04-24T15:42:49,520 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py' 2026-04-24T15:42:49,521 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py' 2026-04-24T15:42:49,524 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py' 2026-04-24T15:42:49,527 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py' 2026-04-24T15:42:49,529 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py' 2026-04-24T15:42:49,532 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py' 2026-04-24T15:42:49,536 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py' 2026-04-24T15:42:49,556 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py' 2026-04-24T15:42:49,559 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py' 2026-04-24T15:42:49,560 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py' 2026-04-24T15:42:49,565 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py' 2026-04-24T15:42:49,574 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py' 2026-04-24T15:42:49,580 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py' 2026-04-24T15:42:49,583 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py' 2026-04-24T15:42:49,585 adding 
'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py' 2026-04-24T15:42:49,587 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py' 2026-04-24T15:42:49,588 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py' 2026-04-24T15:42:49,589 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py' 2026-04-24T15:42:49,591 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py' 2026-04-24T15:42:49,593 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py' 2026-04-24T15:42:49,600 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py' 2026-04-24T15:42:49,602 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py' 2026-04-24T15:42:49,604 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py' 2026-04-24T15:42:49,606 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py' 2026-04-24T15:42:49,608 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py' 2026-04-24T15:42:49,609 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py' 2026-04-24T15:42:49,611 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py' 2026-04-24T15:42:49,612 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py' 2026-04-24T15:42:49,615 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py' 2026-04-24T15:42:49,617 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py' 2026-04-24T15:42:49,619 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py' 2026-04-24T15:42:49,620 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py' 2026-04-24T15:42:49,622 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py' 2026-04-24T15:42:49,624 adding 
'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py' 2026-04-24T15:42:49,625 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py' 2026-04-24T15:42:49,627 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py' 2026-04-24T15:42:49,629 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py' 2026-04-24T15:42:49,630 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py' 2026-04-24T15:42:49,632 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py' 2026-04-24T15:42:49,635 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py' 2026-04-24T15:42:49,637 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py' 2026-04-24T15:42:49,639 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py' 2026-04-24T15:42:49,641 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py' 2026-04-24T15:42:49,643 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py' 2026-04-24T15:42:49,646 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py' 2026-04-24T15:42:49,648 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py' 2026-04-24T15:42:49,650 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py' 2026-04-24T15:42:49,652 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py' 2026-04-24T15:42:49,654 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py' 2026-04-24T15:42:49,655 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py' 2026-04-24T15:42:49,657 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py' 2026-04-24T15:42:49,659 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py' 
2026-04-24T15:42:49,661 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py' 2026-04-24T15:42:49,662 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py' 2026-04-24T15:42:49,671 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py' 2026-04-24T15:42:49,675 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py' 2026-04-24T15:42:49,678 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py' 2026-04-24T15:42:49,680 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py' 2026-04-24T15:42:49,683 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py' 2026-04-24T15:42:49,685 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py' 2026-04-24T15:42:49,686 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py' 2026-04-24T15:42:49,688 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py' 2026-04-24T15:42:49,690 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py' 2026-04-24T15:42:49,692 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py' 2026-04-24T15:42:49,695 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py' 2026-04-24T15:42:49,698 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py' 2026-04-24T15:42:49,703 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py' 2026-04-24T15:42:49,706 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py' 2026-04-24T15:42:49,709 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py' 2026-04-24T15:42:49,711 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py' 2026-04-24T15:42:49,713 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py' 2026-04-24T15:42:49,715 adding 
'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py' 2026-04-24T15:42:49,719 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py' 2026-04-24T15:42:49,722 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py' 2026-04-24T15:42:49,724 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py' 2026-04-24T15:42:49,726 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py' 2026-04-24T15:42:49,727 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py' 2026-04-24T15:42:49,731 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py' 2026-04-24T15:42:49,733 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py' 2026-04-24T15:42:49,735 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py' 2026-04-24T15:42:49,739 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py' 2026-04-24T15:42:49,740 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py' 2026-04-24T15:42:49,742 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py' 2026-04-24T15:42:49,744 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py' 2026-04-24T15:42:49,746 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py' 2026-04-24T15:42:49,749 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py' 2026-04-24T15:42:49,751 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py' 2026-04-24T15:42:49,754 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py' 2026-04-24T15:42:49,756 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/shape.py' 2026-04-24T15:42:49,757 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py' 2026-04-24T15:42:49,759 adding 
'flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py' 2026-04-24T15:42:49,761 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py' 2026-04-24T15:42:49,763 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py' 2026-04-24T15:42:49,766 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py' 2026-04-24T15:42:49,769 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py' 2026-04-24T15:42:49,771 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py' 2026-04-24T15:42:49,773 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py' 2026-04-24T15:42:49,780 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py' 2026-04-24T15:42:49,783 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py' 2026-04-24T15:42:49,785 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py' 2026-04-24T15:42:49,786 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py' 2026-04-24T15:42:49,788 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py' 2026-04-24T15:42:49,790 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py' 2026-04-24T15:42:49,792 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py' 2026-04-24T15:42:49,794 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py' 2026-04-24T15:42:49,796 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py' 2026-04-24T15:42:49,797 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py' 2026-04-24T15:42:49,799 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py' 2026-04-24T15:42:49,800 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py' 2026-04-24T15:42:49,802 adding 
'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py' 2026-04-24T15:42:49,803 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py' 2026-04-24T15:42:49,805 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py' 2026-04-24T15:42:49,806 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py' 2026-04-24T15:42:49,808 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py' 2026-04-24T15:42:49,809 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py' 2026-04-24T15:42:49,811 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py' 2026-04-24T15:42:49,813 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py' 2026-04-24T15:42:49,814 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py' 2026-04-24T15:42:49,816 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py' 2026-04-24T15:42:49,818 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py' 2026-04-24T15:42:49,820 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py' 2026-04-24T15:42:49,821 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py' 2026-04-24T15:42:49,823 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py' 2026-04-24T15:42:49,825 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py' 2026-04-24T15:42:49,826 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py' 2026-04-24T15:42:49,828 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py' 2026-04-24T15:42:49,830 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py' 2026-04-24T15:42:49,831 adding 
'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py' 2026-04-24T15:42:49,833 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py' 2026-04-24T15:42:49,835 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py' 2026-04-24T15:42:49,836 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py' 2026-04-24T15:42:49,838 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py' 2026-04-24T15:42:49,839 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py' 2026-04-24T15:42:49,841 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py' 2026-04-24T15:42:49,842 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py' 2026-04-24T15:42:49,843 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py' 2026-04-24T15:42:49,845 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py' 2026-04-24T15:42:49,847 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py' 2026-04-24T15:42:49,849 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py' 2026-04-24T15:42:49,850 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py' 2026-04-24T15:42:49,852 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py' 2026-04-24T15:42:49,854 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py' 2026-04-24T15:42:49,857 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py' 2026-04-24T15:42:49,859 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py' 2026-04-24T15:42:49,861 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py' 2026-04-24T15:42:49,862 
adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py' 2026-04-24T15:42:49,864 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py' 2026-04-24T15:42:49,869 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py' 2026-04-24T15:42:49,873 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py' 2026-04-24T15:42:49,875 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py' 2026-04-24T15:42:49,878 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py' 2026-04-24T15:42:49,880 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py' 2026-04-24T15:42:49,882 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py' 2026-04-24T15:42:49,884 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py' 2026-04-24T15:42:49,885 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py' 2026-04-24T15:42:49,886 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py' 2026-04-24T15:42:49,888 adding 'flashinfer/data/cutlass/python/cutlass_library/__init__.py' 2026-04-24T15:42:49,891 adding 'flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py' 2026-04-24T15:42:49,894 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py' 2026-04-24T15:42:49,896 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py' 2026-04-24T15:42:49,900 adding 'flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py' 2026-04-24T15:42:49,906 adding 'flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py' 2026-04-24T15:42:49,933 adding 'flashinfer/data/cutlass/python/cutlass_library/generator.py' 2026-04-24T15:42:49,938 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics.py' 2026-04-24T15:42:49,940 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py' 2026-04-24T15:42:49,946 adding 'flashinfer/data/cutlass/python/cutlass_library/library.py' 
2026-04-24T15:42:49,950 adding 'flashinfer/data/cutlass/python/cutlass_library/manifest.py' 2026-04-24T15:42:49,953 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py' 2026-04-24T15:42:49,955 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py' 2026-04-24T15:42:49,957 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py' 2026-04-24T15:42:49,959 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py' 2026-04-24T15:42:49,961 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py' 2026-04-24T15:42:49,964 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py' 2026-04-24T15:42:49,966 adding 'flashinfer/data/cutlass/python/cutlass_library/symm_operation.py' 2026-04-24T15:42:49,968 adding 'flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py' 2026-04-24T15:42:49,971 adding 'flashinfer/data/cutlass/python/docs_src/source/conf.py' 2026-04-24T15:42:49,973 adding 'flashinfer/data/cutlass/python/pycute/__init__.py' 2026-04-24T15:42:49,975 adding 'flashinfer/data/cutlass/python/pycute/int_tuple.py' 2026-04-24T15:42:49,977 adding 'flashinfer/data/cutlass/python/pycute/layout.py' 2026-04-24T15:42:49,978 adding 'flashinfer/data/cutlass/python/pycute/swizzle.py' 2026-04-24T15:42:49,980 adding 'flashinfer/data/cutlass/python/pycute/typing.py' 2026-04-24T15:42:49,982 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py' 2026-04-24T15:42:49,984 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py' 2026-04-24T15:42:49,986 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-24T15:42:49,988 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py' 2026-04-24T15:42:49,990 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py' 2026-04-24T15:42:49,991 adding 
'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py' 2026-04-24T15:42:49,993 adding 'flashinfer/data/cutlass/test/python/cutlass/installation.py' 2026-04-24T15:42:49,996 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py' 2026-04-24T15:42:49,998 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py' 2026-04-24T15:42:50,000 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py' 2026-04-24T15:42:50,001 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py' 2026-04-24T15:42:50,004 adding 'flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py' 2026-04-24T15:42:50,006 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py' 2026-04-24T15:42:50,007 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py' 2026-04-24T15:42:50,008 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py' 2026-04-24T15:42:50,010 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py' 2026-04-24T15:42:50,012 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py' 2026-04-24T15:42:50,013 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py' 2026-04-24T15:42:50,016 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py' 2026-04-24T15:42:50,018 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py' 2026-04-24T15:42:50,019 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py' 2026-04-24T15:42:50,021 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py' 2026-04-24T15:42:50,022 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py' 2026-04-24T15:42:50,024 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py' 2026-04-24T15:42:50,025 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py' 
2026-04-24T15:42:50,027 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py' 2026-04-24T15:42:50,028 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py' 2026-04-24T15:42:50,030 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py' 2026-04-24T15:42:50,031 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py' 2026-04-24T15:42:50,033 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py' 2026-04-24T15:42:50,035 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py' 2026-04-24T15:42:50,037 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py' 2026-04-24T15:42:50,039 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py' 2026-04-24T15:42:50,041 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py' 2026-04-24T15:42:50,043 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py' 2026-04-24T15:42:50,045 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/utils.py' 2026-04-24T15:42:50,047 adding 'flashinfer/data/cutlass/test/python/pycute/run_all_tests.py' 2026-04-24T15:42:50,048 adding 'flashinfer/data/cutlass/test/python/pycute/test_coalesce.py' 2026-04-24T15:42:50,050 adding 'flashinfer/data/cutlass/test/python/pycute/test_complement.py' 2026-04-24T15:42:50,051 adding 'flashinfer/data/cutlass/test/python/pycute/test_composition.py' 2026-04-24T15:42:50,053 adding 'flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py' 2026-04-24T15:42:50,054 adding 'flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py' 2026-04-24T15:42:50,056 adding 'flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py' 2026-04-24T15:42:50,057 adding 'flashinfer/data/cutlass/test/python/pycute/test_typing.py' 2026-04-24T15:42:50,060 adding 'flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py' 2026-04-24T15:42:50,064 adding 
'flashinfer/data/cutlass/test/utils/test_sharding.py' 2026-04-24T15:42:50,068 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp' 2026-04-24T15:42:50,069 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h' 2026-04-24T15:42:50,071 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp' 2026-04-24T15:42:50,073 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h' 2026-04-24T15:42:50,075 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h' 2026-04-24T15:42:50,077 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h' 2026-04-24T15:42:50,079 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h' 2026-04-24T15:42:50,081 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h' 2026-04-24T15:42:50,083 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h' 2026-04-24T15:42:50,085 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h' 2026-04-24T15:42:50,087 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h' 2026-04-24T15:42:50,088 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h' 2026-04-24T15:42:50,090 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h' 2026-04-24T15:42:50,091 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h' 2026-04-24T15:42:50,093 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h' 2026-04-24T15:42:50,094 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h' 2026-04-24T15:42:50,096 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp' 2026-04-24T15:42:50,098 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp' 2026-04-24T15:42:50,099 adding 
'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h' 2026-04-24T15:42:50,102 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h' 2026-04-24T15:42:50,104 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h' 2026-04-24T15:42:50,106 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h' 2026-04-24T15:42:50,107 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h' 2026-04-24T15:42:50,110 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp' 2026-04-24T15:42:50,112 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp' 2026-04-24T15:42:50,115 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp' 2026-04-24T15:42:50,116 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h' 2026-04-24T15:42:50,118 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h' 2026-04-24T15:42:50,120 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h' 2026-04-24T15:42:50,122 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h' 2026-04-24T15:42:50,126 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h' 2026-04-24T15:42:50,128 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h' 2026-04-24T15:42:50,130 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h' 2026-04-24T15:42:50,132 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h' 2026-04-24T15:42:50,133 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp' 2026-04-24T15:42:50,135 adding 
'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h' 2026-04-24T15:42:50,137 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h' 2026-04-24T15:42:50,141 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h' 2026-04-24T15:42:50,143 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h' 2026-04-24T15:42:50,145 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h' 2026-04-24T15:42:50,147 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h' 2026-04-24T15:42:50,149 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h' 2026-04-24T15:42:50,150 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h' 2026-04-24T15:42:50,152 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h' 2026-04-24T15:42:50,154 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h' 2026-04-24T15:42:50,158 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp' 2026-04-24T15:42:50,160 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h' 2026-04-24T15:42:50,162 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h' 2026-04-24T15:42:50,164 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h' 2026-04-24T15:42:50,166 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h' 2026-04-24T15:42:50,168 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h' 2026-04-24T15:42:50,171 adding 
'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp' 2026-04-24T15:42:50,173 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h' 2026-04-24T15:42:50,175 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h' 2026-04-24T15:42:50,177 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h' 2026-04-24T15:42:50,179 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h' 2026-04-24T15:42:50,180 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h' 2026-04-24T15:42:50,182 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h' 2026-04-24T15:42:50,184 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp' 2026-04-24T15:42:50,185 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h' 2026-04-24T15:42:50,187 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h' 2026-04-24T15:42:50,191 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h' 2026-04-24T15:42:50,193 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp' 2026-04-24T15:42:50,195 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h' 2026-04-24T15:42:50,196 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h' 2026-04-24T15:42:50,198 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h' 2026-04-24T15:42:50,199 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp' 2026-04-24T15:42:50,201 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h' 
2026-04-24T15:42:50,203 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h' 2026-04-24T15:42:50,206 adding 'flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py' 2026-04-24T15:42:50,208 adding 'flashinfer/data/include/flashinfer/activation.cuh' 2026-04-24T15:42:50,211 adding 'flashinfer/data/include/flashinfer/air_top_p.cuh' 2026-04-24T15:42:50,213 adding 'flashinfer/data/include/flashinfer/allocator.h' 2026-04-24T15:42:50,214 adding 'flashinfer/data/include/flashinfer/arch_condition.h' 2026-04-24T15:42:50,215 adding 'flashinfer/data/include/flashinfer/attention_impl.cuh' 2026-04-24T15:42:50,217 adding 'flashinfer/data/include/flashinfer/concat_mla.cuh' 2026-04-24T15:42:50,219 adding 'flashinfer/data/include/flashinfer/cp_async.cuh' 2026-04-24T15:42:50,220 adding 'flashinfer/data/include/flashinfer/cubin_loader.h' 2026-04-24T15:42:50,222 adding 'flashinfer/data/include/flashinfer/cutlass_utils.cuh' 2026-04-24T15:42:50,223 adding 'flashinfer/data/include/flashinfer/exception.h' 2026-04-24T15:42:50,226 adding 'flashinfer/data/include/flashinfer/fast_topk_clusters_exact.cuh' 2026-04-24T15:42:50,228 adding 'flashinfer/data/include/flashinfer/fastdiv.cuh' 2026-04-24T15:42:50,230 adding 'flashinfer/data/include/flashinfer/fp16.h' 2026-04-24T15:42:50,231 adding 'flashinfer/data/include/flashinfer/fp4_layout.cuh' 2026-04-24T15:42:50,232 adding 'flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh' 2026-04-24T15:42:50,234 adding 'flashinfer/data/include/flashinfer/layout.cuh' 2026-04-24T15:42:50,235 adding 'flashinfer/data/include/flashinfer/logging.h' 2026-04-24T15:42:50,237 adding 'flashinfer/data/include/flashinfer/math.cuh' 2026-04-24T15:42:50,239 adding 'flashinfer/data/include/flashinfer/mma.cuh' 2026-04-24T15:42:50,243 adding 'flashinfer/data/include/flashinfer/norm.cuh' 2026-04-24T15:42:50,245 adding 'flashinfer/data/include/flashinfer/page.cuh' 2026-04-24T15:42:50,247 adding 
'flashinfer/data/include/flashinfer/permuted_smem.cuh' 2026-04-24T15:42:50,253 adding 'flashinfer/data/include/flashinfer/pos_enc.cuh' 2026-04-24T15:42:50,255 adding 'flashinfer/data/include/flashinfer/profiler.cuh' 2026-04-24T15:42:50,257 adding 'flashinfer/data/include/flashinfer/quantization.cuh' 2026-04-24T15:42:50,263 adding 'flashinfer/data/include/flashinfer/sampling.cuh' 2026-04-24T15:42:50,277 adding 'flashinfer/data/include/flashinfer/topk.cuh' 2026-04-24T15:42:50,279 adding 'flashinfer/data/include/flashinfer/topk_common.cuh' 2026-04-24T15:42:50,282 adding 'flashinfer/data/include/flashinfer/utils.cuh' 2026-04-24T15:42:50,286 adding 'flashinfer/data/include/flashinfer/vec_dtypes.cuh' 2026-04-24T15:42:50,290 adding 'flashinfer/data/include/flashinfer/attention/batch_pod.cuh' 2026-04-24T15:42:50,293 adding 'flashinfer/data/include/flashinfer/attention/cascade.cuh' 2026-04-24T15:42:50,295 adding 'flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh' 2026-04-24T15:42:50,300 adding 'flashinfer/data/include/flashinfer/attention/decode.cuh' 2026-04-24T15:42:50,303 adding 'flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh' 2026-04-24T15:42:50,305 adding 'flashinfer/data/include/flashinfer/attention/default_decode_params.cuh' 2026-04-24T15:42:50,307 adding 'flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh' 2026-04-24T15:42:50,308 adding 'flashinfer/data/include/flashinfer/attention/heap.h' 2026-04-24T15:42:50,310 adding 'flashinfer/data/include/flashinfer/attention/hopper.cuh' 2026-04-24T15:42:50,312 adding 'flashinfer/data/include/flashinfer/attention/mask.cuh' 2026-04-24T15:42:50,316 adding 'flashinfer/data/include/flashinfer/attention/mla.cuh' 2026-04-24T15:42:50,321 adding 'flashinfer/data/include/flashinfer/attention/mla_hopper.cuh' 2026-04-24T15:42:50,322 adding 'flashinfer/data/include/flashinfer/attention/mla_params.cuh' 2026-04-24T15:42:50,326 adding 
'flashinfer/data/include/flashinfer/attention/persistent.cuh' 2026-04-24T15:42:50,328 adding 'flashinfer/data/include/flashinfer/attention/persistent_template.cuh' 2026-04-24T15:42:50,330 adding 'flashinfer/data/include/flashinfer/attention/pod.cuh' 2026-04-24T15:42:50,340 adding 'flashinfer/data/include/flashinfer/attention/prefill.cuh' 2026-04-24T15:42:50,348 adding 'flashinfer/data/include/flashinfer/attention/scheduler.cuh' 2026-04-24T15:42:50,350 adding 'flashinfer/data/include/flashinfer/attention/state.cuh' 2026-04-24T15:42:50,352 adding 'flashinfer/data/include/flashinfer/attention/variant_helper.cuh' 2026-04-24T15:42:50,353 adding 'flashinfer/data/include/flashinfer/attention/variants.cuh' 2026-04-24T15:42:50,355 adding 'flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh' 2026-04-24T15:42:50,357 adding 'flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh' 2026-04-24T15:42:50,359 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp' 2026-04-24T15:42:50,361 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp' 2026-04-24T15:42:50,362 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp' 2026-04-24T15:42:50,367 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp' 2026-04-24T15:42:50,369 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp' 2026-04-24T15:42:50,374 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp' 2026-04-24T15:42:50,377 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp' 2026-04-24T15:42:50,379 adding 
'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp' 2026-04-24T15:42:50,380 adding 'flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp' 2026-04-24T15:42:50,383 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp' 2026-04-24T15:42:50,385 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp' 2026-04-24T15:42:50,387 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp' 2026-04-24T15:42:50,388 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp' 2026-04-24T15:42:50,390 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp' 2026-04-24T15:42:50,393 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp' 2026-04-24T15:42:50,396 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp' 2026-04-24T15:42:50,398 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp' 2026-04-24T15:42:50,405 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp' 2026-04-24T15:42:50,408 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp' 2026-04-24T15:42:50,410 adding 'flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh' 2026-04-24T15:42:50,412 adding 'flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh' 2026-04-24T15:42:50,414 adding 'flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh' 2026-04-24T15:42:50,416 adding 'flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh' 2026-04-24T15:42:50,418 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh' 2026-04-24T15:42:50,420 adding 
'flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh' 2026-04-24T15:42:50,421 adding 'flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh' 2026-04-24T15:42:50,424 adding 'flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh' 2026-04-24T15:42:50,427 adding 'flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh' 2026-04-24T15:42:50,429 adding 'flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh' 2026-04-24T15:42:50,431 adding 'flashinfer/data/include/flashinfer/attention/hopper/utils.cuh' 2026-04-24T15:42:50,433 adding 'flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh' 2026-04-24T15:42:50,434 adding 'flashinfer/data/include/flashinfer/attention/hopper/variants.cuh' 2026-04-24T15:42:50,437 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh' 2026-04-24T15:42:50,439 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh' 2026-04-24T15:42:50,441 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh' 2026-04-24T15:42:50,443 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh' 2026-04-24T15:42:50,445 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh' 2026-04-24T15:42:50,448 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh' 2026-04-24T15:42:50,455 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh' 2026-04-24T15:42:50,461 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh' 2026-04-24T15:42:50,465 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh' 2026-04-24T15:42:50,467 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh' 2026-04-24T15:42:50,471 adding 'flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh' 2026-04-24T15:42:50,477 adding 
'flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh' 2026-04-24T15:42:50,480 adding 'flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh' 2026-04-24T15:42:50,482 adding 'flashinfer/data/include/flashinfer/flat/common.hpp' 2026-04-24T15:42:50,483 adding 'flashinfer/data/include/flashinfer/flat/cute_ext.hpp' 2026-04-24T15:42:50,484 adding 'flashinfer/data/include/flashinfer/flat/debug.hpp' 2026-04-24T15:42:50,486 adding 'flashinfer/data/include/flashinfer/flat/math.hpp' 2026-04-24T15:42:50,487 adding 'flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp' 2026-04-24T15:42:50,488 adding 'flashinfer/data/include/flashinfer/flat/type_traits.hpp' 2026-04-24T15:42:50,490 adding 'flashinfer/data/include/flashinfer/flat/unused.hpp' 2026-04-24T15:42:50,493 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp' 2026-04-24T15:42:50,494 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp' 2026-04-24T15:42:50,497 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp' 2026-04-24T15:42:50,499 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp' 2026-04-24T15:42:50,505 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp' 2026-04-24T15:42:50,507 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp' 2026-04-24T15:42:50,508 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp' 2026-04-24T15:42:50,510 adding 'flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp' 2026-04-24T15:42:50,512 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp' 2026-04-24T15:42:50,515 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp' 2026-04-24T15:42:50,516 
adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp' 2026-04-24T15:42:50,518 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp' 2026-04-24T15:42:50,520 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp' 2026-04-24T15:42:50,522 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh' 2026-04-24T15:42:50,524 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h' 2026-04-24T15:42:50,526 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h' 2026-04-24T15:42:50,528 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h' 2026-04-24T15:42:50,529 adding 'flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh' 2026-04-24T15:42:50,532 adding 'flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h' 2026-04-24T15:42:50,534 adding 'flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh' 2026-04-24T15:42:50,535 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h' 2026-04-24T15:42:50,537 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h' 2026-04-24T15:42:50,539 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h' 2026-04-24T15:42:50,541 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h' 2026-04-24T15:42:50,543 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h' 2026-04-24T15:42:50,546 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h' 2026-04-24T15:42:50,548 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h' 2026-04-24T15:42:50,550 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h' 2026-04-24T15:42:50,551 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h' 2026-04-24T15:42:50,553 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h' 2026-04-24T15:42:50,555 
adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh' 2026-04-24T15:42:50,557 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh' 2026-04-24T15:42:50,558 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm.cuh' 2026-04-24T15:42:50,560 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh' 2026-04-24T15:42:50,562 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh' 2026-04-24T15:42:50,564 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh' 2026-04-24T15:42:50,566 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh' 2026-04-24T15:42:50,568 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh' 2026-04-24T15:42:50,571 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh' 2026-04-24T15:42:50,573 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh' 2026-04-24T15:42:50,575 adding 'flashinfer/data/include/flashinfer/gemm/group_gemv.cuh' 2026-04-24T15:42:50,576 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h' 2026-04-24T15:42:50,578 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h' 2026-04-24T15:42:50,580 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h' 2026-04-24T15:42:50,582 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h' 2026-04-24T15:42:50,585 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm120.h' 2026-04-24T15:42:50,594 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh' 2026-04-24T15:42:50,596 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h' 2026-04-24T15:42:50,597 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h' 2026-04-24T15:42:50,600 adding 'flashinfer/data/include/flashinfer/mamba/common.cuh' 2026-04-24T15:42:50,602 adding 
'flashinfer/data/include/flashinfer/mamba/conversion.cuh' 2026-04-24T15:42:50,604 adding 'flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh' 2026-04-24T15:42:50,606 adding 'flashinfer/data/include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh' 2026-04-24T15:42:50,609 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh' 2026-04-24T15:42:50,612 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh' 2026-04-24T15:42:50,615 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh' 2026-04-24T15:42:50,618 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh' 2026-04-24T15:42:50,623 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh' 2026-04-24T15:42:50,625 adding 'flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh' 2026-04-24T15:42:50,627 adding 'flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh' 2026-04-24T15:42:50,628 adding 'flashinfer/data/include/flashinfer/mamba/ssu_mtp_common.cuh' 2026-04-24T15:42:50,631 adding 'flashinfer/data/include/flashinfer/norm/ln_fwd_silu_kernel.cuh' 2026-04-24T15:42:50,639 adding 'flashinfer/data/include/flashinfer/norm/ln_silu_headers.cuh' 2026-04-24T15:42:50,641 adding 'flashinfer/data/include/flashinfer/trtllm/common.h' 2026-04-24T15:42:50,644 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h' 2026-04-24T15:42:50,646 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh' 2026-04-24T15:42:50,647 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h' 2026-04-24T15:42:50,649 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h' 2026-04-24T15:42:50,651 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh' 2026-04-24T15:42:50,652 adding 
'flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h' 2026-04-24T15:42:50,654 adding 'flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh' 2026-04-24T15:42:50,657 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h' 2026-04-24T15:42:50,658 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h' 2026-04-24T15:42:50,663 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh' 2026-04-24T15:42:50,665 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h' 2026-04-24T15:42:50,666 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh' 2026-04-24T15:42:50,668 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h' 2026-04-24T15:42:50,673 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h' 2026-04-24T15:42:50,675 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h' 2026-04-24T15:42:50,676 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh' 2026-04-24T15:42:50,679 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h' 2026-04-24T15:42:50,681 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h' 2026-04-24T15:42:50,684 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh' 2026-04-24T15:42:50,687 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h' 2026-04-24T15:42:50,691 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh' 2026-04-24T15:42:50,694 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h' 2026-04-24T15:42:50,696 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh' 2026-04-24T15:42:50,697 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h' 2026-04-24T15:42:50,700 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h' 2026-04-24T15:42:50,703 adding 
'flashinfer/data/spdlog/include/spdlog/async.h' 2026-04-24T15:42:50,704 adding 'flashinfer/data/spdlog/include/spdlog/async_logger-inl.h' 2026-04-24T15:42:50,706 adding 'flashinfer/data/spdlog/include/spdlog/async_logger.h' 2026-04-24T15:42:50,707 adding 'flashinfer/data/spdlog/include/spdlog/common-inl.h' 2026-04-24T15:42:50,709 adding 'flashinfer/data/spdlog/include/spdlog/common.h' 2026-04-24T15:42:50,710 adding 'flashinfer/data/spdlog/include/spdlog/formatter.h' 2026-04-24T15:42:50,711 adding 'flashinfer/data/spdlog/include/spdlog/fwd.h' 2026-04-24T15:42:50,713 adding 'flashinfer/data/spdlog/include/spdlog/logger-inl.h' 2026-04-24T15:42:50,715 adding 'flashinfer/data/spdlog/include/spdlog/logger.h' 2026-04-24T15:42:50,716 adding 'flashinfer/data/spdlog/include/spdlog/mdc.h' 2026-04-24T15:42:50,720 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h' 2026-04-24T15:42:50,722 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter.h' 2026-04-24T15:42:50,723 adding 'flashinfer/data/spdlog/include/spdlog/spdlog-inl.h' 2026-04-24T15:42:50,725 adding 'flashinfer/data/spdlog/include/spdlog/spdlog.h' 2026-04-24T15:42:50,726 adding 'flashinfer/data/spdlog/include/spdlog/stopwatch.h' 2026-04-24T15:42:50,728 adding 'flashinfer/data/spdlog/include/spdlog/tweakme.h' 2026-04-24T15:42:50,729 adding 'flashinfer/data/spdlog/include/spdlog/version.h' 2026-04-24T15:42:50,731 adding 'flashinfer/data/spdlog/include/spdlog/cfg/argv.h' 2026-04-24T15:42:50,732 adding 'flashinfer/data/spdlog/include/spdlog/cfg/env.h' 2026-04-24T15:42:50,733 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h' 2026-04-24T15:42:50,735 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers.h' 2026-04-24T15:42:50,738 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h' 2026-04-24T15:42:50,739 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer.h' 2026-04-24T15:42:50,740 adding 
'flashinfer/data/spdlog/include/spdlog/details/circular_q.h' 2026-04-24T15:42:50,742 adding 'flashinfer/data/spdlog/include/spdlog/details/console_globals.h' 2026-04-24T15:42:50,743 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h' 2026-04-24T15:42:50,744 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper.h' 2026-04-24T15:42:50,746 adding 'flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h' 2026-04-24T15:42:50,747 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h' 2026-04-24T15:42:50,748 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg.h' 2026-04-24T15:42:50,750 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h' 2026-04-24T15:42:50,751 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h' 2026-04-24T15:42:50,752 adding 'flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h' 2026-04-24T15:42:50,753 adding 'flashinfer/data/spdlog/include/spdlog/details/null_mutex.h' 2026-04-24T15:42:50,756 adding 'flashinfer/data/spdlog/include/spdlog/details/os-inl.h' 2026-04-24T15:42:50,758 adding 'flashinfer/data/spdlog/include/spdlog/details/os.h' 2026-04-24T15:42:50,759 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h' 2026-04-24T15:42:50,760 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h' 2026-04-24T15:42:50,762 adding 'flashinfer/data/spdlog/include/spdlog/details/registry-inl.h' 2026-04-24T15:42:50,763 adding 'flashinfer/data/spdlog/include/spdlog/details/registry.h' 2026-04-24T15:42:50,764 adding 'flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h' 2026-04-24T15:42:50,766 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h' 2026-04-24T15:42:50,767 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client.h' 2026-04-24T15:42:50,769 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h' 2026-04-24T15:42:50,770 adding 
'flashinfer/data/spdlog/include/spdlog/details/thread_pool.h' 2026-04-24T15:42:50,772 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h' 2026-04-24T15:42:50,773 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client.h' 2026-04-24T15:42:50,774 adding 'flashinfer/data/spdlog/include/spdlog/details/windows_include.h' 2026-04-24T15:42:50,777 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h' 2026-04-24T15:42:50,778 adding 'flashinfer/data/spdlog/include/spdlog/fmt/chrono.h' 2026-04-24T15:42:50,779 adding 'flashinfer/data/spdlog/include/spdlog/fmt/compile.h' 2026-04-24T15:42:50,780 adding 'flashinfer/data/spdlog/include/spdlog/fmt/fmt.h' 2026-04-24T15:42:50,781 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ostr.h' 2026-04-24T15:42:50,783 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ranges.h' 2026-04-24T15:42:50,784 adding 'flashinfer/data/spdlog/include/spdlog/fmt/std.h' 2026-04-24T15:42:50,785 adding 'flashinfer/data/spdlog/include/spdlog/fmt/xchar.h' 2026-04-24T15:42:50,788 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h' 2026-04-24T15:42:50,795 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h' 2026-04-24T15:42:50,799 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h' 2026-04-24T15:42:50,801 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h' 2026-04-24T15:42:50,814 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h' 2026-04-24T15:42:50,816 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst' 2026-04-24T15:42:50,825 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h' 2026-04-24T15:42:50,845 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h' 2026-04-24T15:42:50,848 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h' 2026-04-24T15:42:50,850 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h' 2026-04-24T15:42:50,851 adding 
'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h' 2026-04-24T15:42:50,854 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h' 2026-04-24T15:42:50,857 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h' 2026-04-24T15:42:50,860 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h' 2026-04-24T15:42:50,861 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h' 2026-04-24T15:42:50,864 adding 'flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h' 2026-04-24T15:42:50,865 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h' 2026-04-24T15:42:50,867 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h' 2026-04-24T15:42:50,868 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h' 2026-04-24T15:42:50,869 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h' 2026-04-24T15:42:50,870 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h' 2026-04-24T15:42:50,872 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h' 2026-04-24T15:42:50,873 adding 'flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h' 2026-04-24T15:42:50,875 adding 'flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h' 2026-04-24T15:42:50,876 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h' 2026-04-24T15:42:50,877 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h' 2026-04-24T15:42:50,879 adding 'flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h' 2026-04-24T15:42:50,881 adding 'flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h' 2026-04-24T15:42:50,882 adding 'flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h' 2026-04-24T15:42:50,883 adding 'flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h' 2026-04-24T15:42:50,885 adding 'flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h' 2026-04-24T15:42:50,886 adding 
'flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h' 2026-04-24T15:42:50,888 adding 'flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h' 2026-04-24T15:42:50,889 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h' 2026-04-24T15:42:50,891 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h' 2026-04-24T15:42:50,892 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h' 2026-04-24T15:42:50,894 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h' 2026-04-24T15:42:50,895 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink.h' 2026-04-24T15:42:50,896 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h' 2026-04-24T15:42:50,897 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h' 2026-04-24T15:42:50,899 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h' 2026-04-24T15:42:50,900 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h' 2026-04-24T15:42:50,901 adding 'flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h' 2026-04-24T15:42:50,903 adding 'flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h' 2026-04-24T15:42:50,904 adding 'flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h' 2026-04-24T15:42:50,906 adding 'flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h' 2026-04-24T15:42:50,907 adding 'flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h' 2026-04-24T15:42:50,909 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h' 2026-04-24T15:42:50,910 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h' 2026-04-24T15:42:50,912 adding 'flashinfer/data/spdlog/scripts/extract_version.py' 2026-04-24T15:42:50,913 adding 'flashinfer/dsv3_ops/__init__.py' 2026-04-24T15:42:50,915 adding 'flashinfer/fused_moe/__init__.py' 2026-04-24T15:42:50,924 adding 'flashinfer/fused_moe/core.py' 2026-04-24T15:42:50,926 adding 
'flashinfer/fused_moe/fused_routing_dsv3.py' 2026-04-24T15:42:50,928 adding 'flashinfer/fused_moe/utils.py' 2026-04-24T15:42:50,930 adding 'flashinfer/fused_moe/cute_dsl/__init__.py' 2026-04-24T15:42:50,932 adding 'flashinfer/fused_moe/cute_dsl/b12x_moe.py' 2026-04-24T15:42:50,935 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-24T15:42:50,938 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-24T15:42:50,941 adding 'flashinfer/fused_moe/cute_dsl/fused_moe.py' 2026-04-24T15:42:50,944 adding 'flashinfer/fused_moe/cute_dsl/moe_utils.py' 2026-04-24T15:42:50,947 adding 'flashinfer/fused_moe/cute_dsl/tuner.py' 2026-04-24T15:42:50,949 adding 'flashinfer/fused_moe/cute_dsl/blackwell/__init__.py' 2026-04-24T15:42:50,961 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-24T15:42:50,972 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-24T15:42:50,975 adding 'flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py' 2026-04-24T15:42:50,977 adding 'flashinfer/fused_moe/cute_dsl/blackwell/utils.py' 2026-04-24T15:42:50,979 adding 'flashinfer/fused_moe/cute_dsl/blackwell_sm12x/__init__.py' 2026-04-24T15:42:50,984 adding 'flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_dispatch.py' 2026-04-24T15:42:50,993 adding 'flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_dynamic_kernel.py' 2026-04-24T15:42:51,002 adding 'flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_micro_kernel.py' 2026-04-24T15:42:51,011 adding 'flashinfer/fused_moe/cute_dsl/blackwell_sm12x/moe_static_kernel.py' 2026-04-24T15:42:51,013 adding 'flashinfer/fused_moe/cute_dsl/blackwell_sm12x/triton_compact.py' 2026-04-24T15:42:51,015 adding 'flashinfer/gdn_kernels/__init__.py' 2026-04-24T15:42:51,023 adding 'flashinfer/gdn_kernels/gdn_decode_bf16_state.py' 2026-04-24T15:42:51,031 
adding 'flashinfer/gdn_kernels/gdn_decode_mtp.py' 2026-04-24T15:42:51,035 adding 'flashinfer/gdn_kernels/gdn_decode_nontranspose.py' 2026-04-24T15:42:51,038 adding 'flashinfer/gdn_kernels/gdn_decode_pretranspose.py' 2026-04-24T15:42:51,040 adding 'flashinfer/gdn_kernels/blackwell/__init__.py' 2026-04-24T15:42:51,054 adding 'flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py' 2026-04-24T15:42:51,056 adding 'flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py' 2026-04-24T15:42:51,058 adding 'flashinfer/gdn_kernels/blackwell/gdn_prefill.py' 2026-04-24T15:42:51,060 adding 'flashinfer/gemm/__init__.py' 2026-04-24T15:42:51,082 adding 'flashinfer/gemm/gemm_base.py' 2026-04-24T15:42:51,086 adding 'flashinfer/gemm/routergemm.py' 2026-04-24T15:42:51,088 adding 'flashinfer/gemm/kernels/__init__.py' 2026-04-24T15:42:51,096 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py' 2026-04-24T15:42:51,105 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py' 2026-04-24T15:42:51,113 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm120.py' 2026-04-24T15:42:51,124 adding 'flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py' 2026-04-24T15:42:51,126 adding 'flashinfer/gemm/kernels/utils.py' 2026-04-24T15:42:51,128 adding 'flashinfer/jit/__init__.py' 2026-04-24T15:42:51,130 adding 'flashinfer/jit/activation.py' 2026-04-24T15:42:51,131 adding 'flashinfer/jit/cascade.py' 2026-04-24T15:42:51,132 adding 'flashinfer/jit/comm.py' 2026-04-24T15:42:51,134 adding 'flashinfer/jit/core.py' 2026-04-24T15:42:51,136 adding 'flashinfer/jit/cpp_ext.py' 2026-04-24T15:42:51,138 adding 'flashinfer/jit/cubin_loader.py' 2026-04-24T15:42:51,139 adding 'flashinfer/jit/dsv3_optimizations.py' 2026-04-24T15:42:51,141 adding 'flashinfer/jit/env.py' 2026-04-24T15:42:51,142 adding 'flashinfer/jit/fp4_kv_dequantization.py' 2026-04-24T15:42:51,143 adding 'flashinfer/jit/fp4_kv_quantization.py' 2026-04-24T15:42:51,144 adding 'flashinfer/jit/fp4_quantization.py' 
2026-04-24T15:42:51,146 adding 'flashinfer/jit/fp8_quantization.py' 2026-04-24T15:42:51,148 adding 'flashinfer/jit/fused_moe.py' 2026-04-24T15:42:51,149 adding 'flashinfer/jit/gdn.py' 2026-04-24T15:42:51,150 adding 'flashinfer/jit/mla.py' 2026-04-24T15:42:51,152 adding 'flashinfer/jit/moe_utils.py' 2026-04-24T15:42:51,153 adding 'flashinfer/jit/norm.py' 2026-04-24T15:42:51,154 adding 'flashinfer/jit/page.py' 2026-04-24T15:42:51,155 adding 'flashinfer/jit/quantization.py' 2026-04-24T15:42:51,157 adding 'flashinfer/jit/rmsnorm_silu.py' 2026-04-24T15:42:51,158 adding 'flashinfer/jit/rope.py' 2026-04-24T15:42:51,159 adding 'flashinfer/jit/sampling.py' 2026-04-24T15:42:51,161 adding 'flashinfer/jit/spdlog.py' 2026-04-24T15:42:51,162 adding 'flashinfer/jit/tinygemm2.py' 2026-04-24T15:42:51,163 adding 'flashinfer/jit/tllm_utils.py' 2026-04-24T15:42:51,164 adding 'flashinfer/jit/topk.py' 2026-04-24T15:42:51,166 adding 'flashinfer/jit/utils.py' 2026-04-24T15:42:51,167 adding 'flashinfer/jit/xqa.py' 2026-04-24T15:42:51,169 adding 'flashinfer/jit/attention/__init__.py' 2026-04-24T15:42:51,173 adding 'flashinfer/jit/attention/modules.py' 2026-04-24T15:42:51,175 adding 'flashinfer/jit/attention/utils.py' 2026-04-24T15:42:51,177 adding 'flashinfer/jit/attention/variants.py' 2026-04-24T15:42:51,182 adding 'flashinfer/jit/attention/fmha_v2/fmha_library.py' 2026-04-24T15:42:51,184 adding 'flashinfer/jit/attention/fmha_v2/generate_kernels.py' 2026-04-24T15:42:51,199 adding 'flashinfer/jit/attention/fmha_v2/generator_utils.py' 2026-04-24T15:42:51,204 adding 'flashinfer/jit/attention/fmha_v2/utils.py' 2026-04-24T15:42:51,206 adding 'flashinfer/jit/gemm/__init__.py' 2026-04-24T15:42:51,209 adding 'flashinfer/jit/gemm/core.py' 2026-04-24T15:42:51,210 adding 'flashinfer/jit/gemm/deepgemm.py' 2026-04-24T15:42:51,211 adding 'flashinfer/jit/gemm/fp8_blockscale.py' 2026-04-24T15:42:51,213 adding 'flashinfer/jit/gemm/cutlass/__init__.py' 2026-04-24T15:42:51,218 adding 
'flashinfer/jit/gemm/cutlass/cutlass_library.py' 2026-04-24T15:42:51,222 adding 'flashinfer/jit/gemm/cutlass/generate_kernels.py' 2026-04-24T15:42:51,224 adding 'flashinfer/jit/mamba/__init__.py' 2026-04-24T15:42:51,226 adding 'flashinfer/jit/mamba/selective_state_update.py' 2026-04-24T15:42:51,227 adding 'flashinfer/jit/mamba/seq_chunk_cumsum.py' 2026-04-24T15:42:51,229 adding 'flashinfer/logits_processor/__init__.py' 2026-04-24T15:42:51,231 adding 'flashinfer/logits_processor/compiler.py' 2026-04-24T15:42:51,232 adding 'flashinfer/logits_processor/fusion_rules.py' 2026-04-24T15:42:51,234 adding 'flashinfer/logits_processor/legalization.py' 2026-04-24T15:42:51,235 adding 'flashinfer/logits_processor/op.py' 2026-04-24T15:42:51,237 adding 'flashinfer/logits_processor/operators.py' 2026-04-24T15:42:51,239 adding 'flashinfer/logits_processor/pipeline.py' 2026-04-24T15:42:51,241 adding 'flashinfer/logits_processor/processors.py' 2026-04-24T15:42:51,242 adding 'flashinfer/logits_processor/types.py' 2026-04-24T15:42:51,244 adding 'flashinfer/logits_processor/validators.py' 2026-04-24T15:42:51,246 adding 'flashinfer/mamba/__init__.py' 2026-04-24T15:42:51,248 adding 'flashinfer/mamba/selective_state_update.py' 2026-04-24T15:42:51,251 adding 'flashinfer/mamba/ssd_combined.py' 2026-04-24T15:42:51,265 adding 'flashinfer/mamba/ssd_kernel.py' 2026-04-24T15:42:51,267 adding 'flashinfer/mamba/ssd_tile_scheduler.py' 2026-04-24T15:42:51,269 adding 'flashinfer/mla/__init__.py' 2026-04-24T15:42:51,273 adding 'flashinfer/mla/_core.py' 2026-04-24T15:42:51,276 adding 'flashinfer/norm/__init__.py' 2026-04-24T15:42:51,279 adding 'flashinfer/norm/utils.py' 2026-04-24T15:42:51,281 adding 'flashinfer/norm/kernels/__init__.py' 2026-04-24T15:42:51,284 adding 'flashinfer/norm/kernels/fused_add_rmsnorm.py' 2026-04-24T15:42:51,286 adding 'flashinfer/norm/kernels/layernorm.py' 2026-04-24T15:42:51,290 adding 'flashinfer/norm/kernels/rmsnorm.py' 2026-04-24T15:42:51,292 adding 
'flashinfer/parallel_attention/__init__.py' 2026-04-24T15:42:51,293 adding 'flashinfer/parallel_attention/attention_ops.py' 2026-04-24T15:42:51,295 adding 'flashinfer/parallel_attention/parallel_attention.py' 2026-04-24T15:42:51,296 adding 'flashinfer/parallel_attention/parallel_config.py' 2026-04-24T15:42:51,298 adding 'flashinfer/parallel_attention/parallel_wrapper.py' 2026-04-24T15:42:51,301 adding 'flashinfer/parallel_attention/utils.py' 2026-04-24T15:42:51,302 adding 'flashinfer/profiler/__init__.py' 2026-04-24T15:42:51,304 adding 'flashinfer/quantization/__init__.py' 2026-04-24T15:42:51,310 adding 'flashinfer/quantization/fp4_quantization.py' 2026-04-24T15:42:51,312 adding 'flashinfer/quantization/fp8_quantization.py' 2026-04-24T15:42:51,313 adding 'flashinfer/quantization/packbits.py' 2026-04-24T15:42:51,317 adding 'flashinfer/quantization/quantization_cute_dsl_utils.py' 2026-04-24T15:42:51,319 adding 'flashinfer/quantization/kernels/__init__.py' 2026-04-24T15:42:51,323 adding 'flashinfer/quantization/kernels/mxfp4_quantize.py' 2026-04-24T15:42:51,327 adding 'flashinfer/quantization/kernels/mxfp8_quantize.py' 2026-04-24T15:42:51,332 adding 'flashinfer/quantization/kernels/nvfp4_quantize.py' 2026-04-24T15:42:51,334 adding 'flashinfer/testing/__init__.py' 2026-04-24T15:42:51,340 adding 'flashinfer/testing/utils.py' 2026-04-24T15:42:51,342 adding 'flashinfer/triton/__init__.py' 2026-04-24T15:42:51,344 adding 'flashinfer/triton/activation.py' 2026-04-24T15:42:51,345 adding 'flashinfer/triton/cascade.py' 2026-04-24T15:42:51,347 adding 'flashinfer/triton/gemm.py' 2026-04-24T15:42:51,349 adding 'flashinfer/triton/norm.py' 2026-04-24T15:42:51,350 adding 'flashinfer/triton/page.py' 2026-04-24T15:42:51,352 adding 'flashinfer/triton/sm_constraint_gemm.py' 2026-04-24T15:42:51,353 adding 'flashinfer/triton/utils.py' 2026-04-24T15:42:51,355 adding 'flashinfer/triton/kernels/__init__.py' 2026-04-24T15:42:51,356 adding 'flashinfer/triton/kernels/activation.py' 
2026-04-24T15:42:51,357 adding 'flashinfer/triton/kernels/cascade.py' 2026-04-24T15:42:51,359 adding 'flashinfer/triton/kernels/norm.py' 2026-04-24T15:42:51,360 adding 'flashinfer/triton/kernels/quant.py' 2026-04-24T15:42:51,362 adding 'flashinfer/triton/kernels/sm_constraint_gemm.py' 2026-04-24T15:42:51,363 adding 'flashinfer/triton/kernels/ssd_chunk_state.py' 2026-04-24T15:42:51,365 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py' 2026-04-24T15:42:51,367 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py' 2026-04-24T15:42:51,370 adding 'flashinfer_python-0.6.9.dist-info/licenses/LICENSE' 2026-04-24T15:42:51,372 adding 'flashinfer_python-0.6.9.dist-info/METADATA' 2026-04-24T15:42:51,374 adding 'flashinfer_python-0.6.9.dist-info/WHEEL' 2026-04-24T15:42:51,375 adding 'flashinfer_python-0.6.9.dist-info/entry_points.txt' 2026-04-24T15:42:51,376 adding 'flashinfer_python-0.6.9.dist-info/top_level.txt' 2026-04-24T15:42:51,419 adding 'flashinfer_python-0.6.9.dist-info/RECORD' 2026-04-24T15:42:51,556 removing build/bdist.linux-armv7l/wheel 2026-04-24T15:42:52,274 Building wheel for flashinfer-python (pyproject.toml): finished with status 'done' 2026-04-24T15:42:52,483 Created wheel for flashinfer-python: filename=flashinfer_python-0.6.9-py3-none-any.whl size=9507967 sha256=15cd5b7efa4ac59d0d8bc1012606939c05df9ff00375ac1318bfad8305ad0fb6 2026-04-24T15:42:52,484 Stored in directory: /tmp/pip-ephem-wheel-cache-63k8cme3/wheels/dd/47/3f/3a2dd06199f5acf2d2c8f7c3f60d98e0a2eca06885538791c5 2026-04-24T15:42:52,571 Successfully built flashinfer-python 2026-04-24T15:42:52,802 Removed build tracker: '/tmp/pip-build-tracker-0l9cu84t'