2026-04-06T02:37:22,222 Created temporary directory: /tmp/pip-ephem-wheel-cache-igurac_r 2026-04-06T02:37:22,224 Created temporary directory: /tmp/pip-build-tracker-lo4oc_78 2026-04-06T02:37:22,224 Initialized build tracking at /tmp/pip-build-tracker-lo4oc_78 2026-04-06T02:37:22,225 Created build tracker: /tmp/pip-build-tracker-lo4oc_78 2026-04-06T02:37:22,225 Entered build tracker: /tmp/pip-build-tracker-lo4oc_78 2026-04-06T02:37:22,226 Created temporary directory: /tmp/pip-wheel-x54aw7uf 2026-04-06T02:37:22,229 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-06T02:37:22,231 Created temporary directory: /tmp/pip-ephem-wheel-cache-6go0d428 2026-04-06T02:37:22,253 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-06T02:37:22,257 2 location(s) to search for versions of flashinfer-python: 2026-04-06T02:37:22,257 * https://pypi.org/simple/flashinfer-python/ 2026-04-06T02:37:22,257 * https://www.piwheels.org/simple/flashinfer-python/ 2026-04-06T02:37:22,257 Fetching project page and analyzing links: https://pypi.org/simple/flashinfer-python/ 2026-04-06T02:37:22,258 Getting page https://pypi.org/simple/flashinfer-python/ 2026-04-06T02:37:22,260 Found index url https://pypi.org/simple 2026-04-06T02:37:22,413 Fetched page https://pypi.org/simple/flashinfer-python/ as application/vnd.pypi.simple.v1+json 2026-04-06T02:37:22,429 Found link https://files.pythonhosted.org/packages/6c/e9/5d6adcf888922a17c6fc52a0e5bed78785239af1219f41e1073b063a07ff/flashinfer_python-0.2.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post1 2026-04-06T02:37:22,430 Found link https://files.pythonhosted.org/packages/c8/39/bac839234a3beaab4292e489b4d8941cc97ba4f76474aff0407d7b05a84f/flashinfer_python-0.2.0.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post2 2026-04-06T02:37:22,431 Found link https://files.pythonhosted.org/packages/94/74/4dda2a7a7aa08bcfb8039faf2202bf0fea6b378d0d4968864737400fc329/flashinfer_python-0.2.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1 2026-04-06T02:37:22,432 Found link https://files.pythonhosted.org/packages/7f/3d/aab500609825108d3f6a4b440a7eeb6436d578d3e781e97ea015fd49a530/flashinfer_python-0.2.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post1 2026-04-06T02:37:22,433 Found link https://files.pythonhosted.org/packages/30/ac/afd1d2c472857be8f83389eb506e1413a2ac3a603889bea3cf24d5ab5be5/flashinfer_python-0.2.1.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post2 2026-04-06T02:37:22,435 Found link https://files.pythonhosted.org/packages/90/00/833dd50745bc15bb7a7451b77589d444ce963d48c0cb730b4760bfebffad/flashinfer_python-0.2.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2 2026-04-06T02:37:22,436 Found link https://files.pythonhosted.org/packages/02/cc/db9635c56653d3fa5a28f14ac858e0801de621aa33d3b528e4781aee906f/flashinfer_python-0.2.2.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2.post1 2026-04-06T02:37:22,437 Found link https://files.pythonhosted.org/packages/b6/10/2a63f1d09c5b337705236005dc9ccce513dcc08b7fd037cb40426f1695b1/flashinfer_python-0.2.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.3 2026-04-06T02:37:22,438 Found link https://files.pythonhosted.org/packages/a4/e5/8d193ccf65b92c009c4be50fdffa88fa0edc8fd6e6169bacaca6bab84d89/flashinfer_python-0.2.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.4 2026-04-06T02:37:22,439 Found link https://files.pythonhosted.org/packages/b2/c4/9ec0f79e2480fc5c93307c4a1ac903e5cf33c551c0eaeb648196234b55af/flashinfer_python-0.2.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.5 2026-04-06T02:37:22,441 Found link https://files.pythonhosted.org/packages/95/4a/a3109d57463d25a153b16c0d0f06495e4d18b727c81f8e08e42e97faaf45/flashinfer_python-0.2.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6 2026-04-06T02:37:22,441 Found link https://files.pythonhosted.org/packages/34/26/3c6f12ffaefbfa0c453030d6e15941269b3a4ffcd267daec32d1a10dda96/flashinfer_python-0.2.6.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6.post1 2026-04-06T02:37:22,442 Found link https://files.pythonhosted.org/packages/f9/a0/5e700751f2393a504bc5eb2879e77d783a5b70778a254289711323126abc/flashinfer_python-0.2.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7 2026-04-06T02:37:22,443 Found link https://files.pythonhosted.org/packages/c0/10/43cf1ea7a03ca8e75a185190708e48286e1583d781e93d1de130e5d450ca/flashinfer_python-0.2.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7.post1 2026-04-06T02:37:22,444 Found link https://files.pythonhosted.org/packages/f1/80/8dfae62d04af4597d7615b892f346ace68bcb07dfbef2a9e614219d96a8a/flashinfer_python-0.2.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8rc1 2026-04-06T02:37:22,445 Found link https://files.pythonhosted.org/packages/72/0e/827624993516e80f62ba88dd368ad5e180c41324f063c00d27fa638a430e/flashinfer_python-0.2.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8 2026-04-06T02:37:22,446 Found link https://files.pythonhosted.org/packages/17/50/42afc9a81031939140fcbfd93e5a3652dc4995e338b4e6d007b0dda04f93/flashinfer_python-0.2.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc1 2026-04-06T02:37:22,448 Found link https://files.pythonhosted.org/packages/ed/1a/9f30eda3178ed2f5f7e311ae0011d02c4542d087f84c9247e4b30668b767/flashinfer_python-0.2.9rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc2 2026-04-06T02:37:22,449 Found link https://files.pythonhosted.org/packages/45/fc/4deff13f1420cc6e5871b7505a6c0d9031eb49cd09571ae576aec59bed61/flashinfer_python-0.2.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9 2026-04-06T02:37:22,450 Found link https://files.pythonhosted.org/packages/74/e4/2c6d6a19d13ed13d4863f6900febe72b502334e43292d5fe9a1ac2f6c5be/flashinfer_python-0.2.10.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.10 2026-04-06T02:37:22,451 Found link https://files.pythonhosted.org/packages/72/8b/f315dda5993d1c018ca5ecfef0775c6a3c7a8f59ac426fabb7f3f6b93482/flashinfer_python-0.2.11.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11 2026-04-06T02:37:22,452 Found link https://files.pythonhosted.org/packages/37/e3/2e8e31f7f7ee26f39968264e4fcf74f9810d90e940859016d974106ed5c6/flashinfer_python-0.2.11.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post1 2026-04-06T02:37:22,453 Found link https://files.pythonhosted.org/packages/b6/01/fa069f076cfe5bed34ddc3b7f772aa09c70e03e572dd9d3569ff887f33b1/flashinfer_python-0.2.11.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post2 2026-04-06T02:37:22,454 Found link https://files.pythonhosted.org/packages/a3/09/5d89ef0bc2d19d3ebcf3b9fa621c945909f681818c9d55aa3181921db874/flashinfer_python-0.2.11.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post3 2026-04-06T02:37:22,455 Found link https://files.pythonhosted.org/packages/b9/5a/7a839afb07af313549b9d9f1057b02aaf067f020267d5a9d128e50596bf4/flashinfer_python-0.2.12.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.12 2026-04-06T02:37:22,457 Found link https://files.pythonhosted.org/packages/f2/20/e79142a9f26aab61b17e2c906a49e9a3d3c656d97608c8773785c3b13140/flashinfer_python-0.2.13.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.13 2026-04-06T02:37:22,458 Found link https://files.pythonhosted.org/packages/ed/26/d1eac56b37d225cb3f84495bd897829dece21f62463487f3c1d9cafe78a0/flashinfer_python-0.2.14.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14 2026-04-06T02:37:22,459 Found link https://files.pythonhosted.org/packages/94/d4/4a2bf3d49f84b2d975925c1c024790b4e4768bdefbc5e27529d68368355a/flashinfer_python-0.2.14.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14.post1 2026-04-06T02:37:22,460 Found link https://files.pythonhosted.org/packages/56/e3/7c0a4df2640a97ecfed45fe9110ecc6a67d4967278723abf8e6531b6bc1f/flashinfer_python-0.3.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0rc1 2026-04-06T02:37:22,461 Found link https://files.pythonhosted.org/packages/1f/b4/5c4cbb0f3cbc5e8d4c19b3f163c048eed959a0ac0c603cfb3939a3079c52/flashinfer_python-0.3.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0 2026-04-06T02:37:22,462 Found link https://files.pythonhosted.org/packages/59/1b/83a9c58432b4a5d6ff04b97d4873bedfb5e35d38972ca8946b3acdbffeb4/flashinfer_python-0.3.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0.post1 2026-04-06T02:37:22,463 Found link https://files.pythonhosted.org/packages/ba/71/dd3001b8be8174d90561764a5f3be4ca219517bde2841189ea6973a3873f/flashinfer_python-0.3.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1 2026-04-06T02:37:22,464 Found link https://files.pythonhosted.org/packages/49/a7/f5bd3878f94fc47e25ecc0828f910233022366f7e832dfa02f3617fad41f/flashinfer_python-0.3.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1.post1 2026-04-06T02:37:22,465 Found link https://files.pythonhosted.org/packages/df/b4/f113bb950e5244d1c72c3d73c03fac0db939f085670e3a45a41fe92ffde0/flashinfer_python-0.4.0rc0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc0 2026-04-06T02:37:22,466 Found link https://files.pythonhosted.org/packages/2e/a8/adceccda3aae01b7bdb5f99c68a2b401c58600f34a6386d9489ff736cdbc/flashinfer_python-0.4.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc1 2026-04-06T02:37:22,467 Found link https://files.pythonhosted.org/packages/15/c0/5fb88fc273fed23dbf3b0ef0bffa7db26e2df24e016202df1b4e98b95879/flashinfer_python-0.4.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc2 2026-04-06T02:37:22,468 Found link https://files.pythonhosted.org/packages/65/91/cf9e3a0a2626711bfab18ea4a4c739e0eb823e9513addc0e9e1b8f929538/flashinfer_python-0.4.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc3 2026-04-06T02:37:22,470 Found link https://files.pythonhosted.org/packages/94/ec/bdcc0ec502994d544cbe69763d999458ae2deda67e58c1cb2d85867677c4/flashinfer_python-0.4.0rc4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc4 2026-04-06T02:37:22,471 Found link https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0 2026-04-06T02:37:22,472 Found link https://files.pythonhosted.org/packages/64/cf/f82142abd7c819fb84a53f18fe1ac9e7cf1af8790b93c06dbf430001473b/flashinfer_python-0.4.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.1 2026-04-06T02:37:22,473 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/c7/92/126dacc3476fab07478bdfc9944abd22aafa1000088d93bf86fb9ec78a29/flashinfer_python-0.5.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,474 Found link https://files.pythonhosted.org/packages/53/47/a759f1ae9ef4ceb4e12895665b65dfacea2085494626e764627dd3548fa8/flashinfer_python-0.5.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc1 2026-04-06T02:37:22,474 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/fb/aa/7b5d28c2aec11acfce18f2655d0b4614c7e34547fab218b4f2fd0d57bdce/flashinfer_python-0.5.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,475 Found link https://files.pythonhosted.org/packages/3d/5a/58a7b60f79a1ac9c652b4055b06e88b5f57e8ef4c7dd4830ef48fa4cc265/flashinfer_python-0.5.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc2 2026-04-06T02:37:22,476 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/5f/8f/7077cf0a44056a65045a793d6d55845d95818fb6455bfebb44ddea7e1f12/flashinfer_python-0.5.0rc3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,477 Found link https://files.pythonhosted.org/packages/60/d1/8c90d6dfc95ab609028e9d541a6cdb3483f5c1475b07d97465ff3f0db14c/flashinfer_python-0.5.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc3 2026-04-06T02:37:22,478 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/eb/8a/425b75b44ce5eeefe01dd61d4ee260b8e5f9dcf1a500d5f08d6cd4095d3a/flashinfer_python-0.5.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,479 Found link https://files.pythonhosted.org/packages/e3/1d/b82cd2606f4f0033e2fb28194dc3b04fd8101643e4ceb1d13fb1466cfd28/flashinfer_python-0.5.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0 2026-04-06T02:37:22,480 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f4/f1/33dedad087a2bc3d66244126bd5d1c79721ea22d1f2124299f9e5bdaf3b1/flashinfer_python-0.5.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,481 Found link https://files.pythonhosted.org/packages/6c/bb/897c3b9d683dcf6490f70e468efb585eebcd673970b13a04ed947b491982/flashinfer_python-0.5.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.1 2026-04-06T02:37:22,482 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/8d/0c/4a8ffbbc0d85e314f534cf5c32711f2af5d5e6e49225a5a414400a67b684/flashinfer_python-0.5.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,483 Found link https://files.pythonhosted.org/packages/d8/04/e357eaa50238e12c49e66fcf47f83e066e741ef19a117c136782b32eafbb/flashinfer_python-0.5.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.2 2026-04-06T02:37:22,483 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/78/6dc7e7da8cb87c9965644ea0d2439457a1bc9256c45ceda0044595be4143/flashinfer_python-0.5.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,484 Found link https://files.pythonhosted.org/packages/b4/91/cca69baeff24bb3efd12c7479a026432c8717ee47193694010494c528b22/flashinfer_python-0.5.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.5.3 2026-04-06T02:37:22,485 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/b2/0c/cb2d60eb86f0171451d676f17b90484ab66baf73c54cefe15c9a7c800739/flashinfer_python-0.6.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,486 Found link https://files.pythonhosted.org/packages/53/2a/e855be4851ad6bfcebed929807fb541715f9a3a7d7b239b696e635b49d0e/flashinfer_python-0.6.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc1 2026-04-06T02:37:22,487 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/05/22/9193f1da2468acec8ba99c4bee8aeacbda489777acf00b5871a73209acf7/flashinfer_python-0.6.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,488 Found link https://files.pythonhosted.org/packages/1b/71/dd1bb86ea531e5c1a34f8ad851901bf2e2ce500618b5a4da19bd69f7de11/flashinfer_python-0.6.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc2 2026-04-06T02:37:22,488 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/90/5834597488f5ea62b1cc874338125c79ce21c11d777ac6f7b47f12cf2bb3/flashinfer_python-0.6.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,489 Found link https://files.pythonhosted.org/packages/ad/8d/c7330f27f09b9110af2f6c44c6f68d7b536f525f8ac539210073bfcdb965/flashinfer_python-0.6.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0 2026-04-06T02:37:22,490 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/d5/bca632bb5781689415186421bbee2ad39ae8a39b0996d579c76901e5c66f/flashinfer_python-0.6.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,491 Found link https://files.pythonhosted.org/packages/68/81/5a84e14df7358d2c2903b18c6f2779bd4b4a6739076d01a847d4c18fb102/flashinfer_python-0.6.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.1 2026-04-06T02:37:22,492 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/aa/c0/ee819d16f6b40e287727bb3db471f4eaa9e0372e233bf2f7343faaa3009f/flashinfer_python-0.6.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,493 Found link https://files.pythonhosted.org/packages/89/86/b25115177606ae3b6cec373d290798c28e185d033b66f6b80a89589e7786/flashinfer_python-0.6.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.2 2026-04-06T02:37:22,494 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/13/2d95248101d8cb978db9000a4dceafb5b122484a694b53e84df1ac2a7b3d/flashinfer_python-0.6.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,495 Found link https://files.pythonhosted.org/packages/d6/aa/c564313b42dee7573da4ed0e441844f0c2bd827aecc9f29ea02c3838ffae/flashinfer_python-0.6.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.3 2026-04-06T02:37:22,496 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/17/9a/d2bab76d2bb15062c6a2329614653e4f8bec9c78eec9069856ef0c7c0a79/flashinfer_python-0.6.4-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,497 Found link https://files.pythonhosted.org/packages/77/45/15645d2a4ee81d08206f3e132a77323e48312f510462415d7cd1122eba43/flashinfer_python-0.6.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.4 2026-04-06T02:37:22,498 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/4f/83/eea2a74700b5fcae36ee2b748db9c3554a83a3f9e2dc4f3816369c5cb653/flashinfer_python-0.6.5-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,498 Found link https://files.pythonhosted.org/packages/e2/2f/5c52276af3cc40ac1f6eaf823ccd8e257f77e2fe5d465fa641ad3dba4d1b/flashinfer_python-0.6.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.5 2026-04-06T02:37:22,499 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/e0/61/385d06755f3ab66333018285657adf0daf8a90a129448231fd09e315bd2e/flashinfer_python-0.6.6-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,500 Found link https://files.pythonhosted.org/packages/03/70/c5a235297351021f5d3d3233523a85f5a6468495587489ad2f257e8eafe2/flashinfer_python-0.6.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.6 2026-04-06T02:37:22,500 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f1/e8/91361a5f07667f36181cfd08e2d7d28be4cae2aa5a24016339174b308c38/flashinfer_python-0.6.7-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,501 Found link https://files.pythonhosted.org/packages/d9/2d/aa36fa1fee744c46fef99436baea5cda4a34244846c1df0fea97eaa9a856/flashinfer_python-0.6.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7 2026-04-06T02:37:22,502 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/16/92/516c79e5d8d1f0b41793e499c37a9299115ac8bc05171661b30d4a94beb8/flashinfer_python-0.6.7.post1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,503 Found link https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post1 2026-04-06T02:37:22,504 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/62/9e/bf26a95bb219eb3d43cc6f3cd1dde6f560081fbcb50f846535c9f571a807/flashinfer_python-0.6.7.post2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,505 Found link https://files.pythonhosted.org/packages/cc/95/81eafb78574312db79ef7144a4e77f2fee015343f413ef3000f279c8a118/flashinfer_python-0.6.7.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post2 2026-04-06T02:37:22,506 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/01/6b/4117cd7cbeff07818ae7c6b8bf5a6d1ee3eed29356672b731b55af3d4453/flashinfer_python-0.6.7.post3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,507 Found link https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post3 2026-04-06T02:37:22,507 Fetching project page and analyzing links: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-06T02:37:22,508 Getting page https://www.piwheels.org/simple/flashinfer-python/ 2026-04-06T02:37:22,510 Found index url https://www.piwheels.org/simple 2026-04-06T02:37:22,682 Fetched page https://www.piwheels.org/simple/flashinfer-python/ as text/html 2026-04-06T02:37:22,691 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post1-py3-none-any.whl#sha256=c9bf5183228f6636ddb26d7354f250af4b2385876527538a0ff7f94fd48207d2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,692 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7-py3-none-any.whl#sha256=9b349825a2d26c3e4653c594d7a1d7b2126a43b29a4a70a6d48f3aaac23b96f3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,692 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.6-py3-none-any.whl#sha256=94791e01c31510c057b4decabff24cbc62466682667867e84214c62c45d9b343 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,692 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.5-py3-none-any.whl#sha256=4b0a6c246959ca2dbc232fa1fe2f17ff857fd258de5dfacfa45033f21b6b7b93 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,693 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.4-py3-none-any.whl#sha256=22ee7972266bb31ce1583330769efc0ecd001fb70371531ce4c77f2d6eda0d59 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,693 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.3-py3-none-any.whl#sha256=ed3282188580afd663819924a772b2b531ac5bb88438bbe89d0baf67fe8c9fa5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,694 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.1-py3-none-any.whl#sha256=9e0e308062a81d4e4c462313bfe33edce7712309e8c89aed722065249e644833 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,695 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0-py3-none-any.whl#sha256=7ebc0582df714a933fc4c58ed4d12f4e61b4ad30b22b9155f290e96ee3eee3a0 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,695 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc2-py3-none-any.whl#sha256=63057b7ee43a4f6764c6ed8fe4c4c6de5a94da058fe0975bf279db0567c26204 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,696 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc1-py3-none-any.whl#sha256=e30a125bf89f8155f83aca80e5fb88a3d81224225485ce70f0f4c4c3a27da92c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,696 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.3-py3-none-any.whl#sha256=1de562233dfbd8de835c2eb757275a7759eda034460093c1eb9ff3c7d5c0845d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-06T02:37:22,697 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.2-py3-none-any.whl#sha256=bd3d206d1243bee523cf6cda27e0219e8fdf9026ade2e32045c8d9d4b7f7bf7a (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,697 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.1-py3-none-any.whl#sha256=8d73e4b66b7eb7fc4500f7f7e61aa194efebc769e7da1635a86506c97bf6fa0d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,698 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0-py3-none-any.whl#sha256=ac991d1911cff4a7453f02d88922803e7ca794a0af1dceaa920e33b81c78f5c8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,698 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc3-py3-none-any.whl#sha256=8799f4a93afc14042ac6f521f6fb682e4d62d738dc18a1e8798b7a2ba5b2e4ec (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,699 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc2-py3-none-any.whl#sha256=4ee4d438c8c7fdc242a917c3f97076562f3c44411dcaceb4f7d29082c41c0f8c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,699 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc1-py3-none-any.whl#sha256=a9d675075f3cb79ac1b5cba9e8430496d3983127609dc780a117b2b44bdb025d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,700 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.1-py3-none-any.whl#sha256=8fc8fc3233781e384689c5f202124ae7d266cb8dee14055cbb3c90fca530bf7f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,700 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.0-py3-none-any.whl#sha256=da0141b2163f9703e49972728eeb502d45eda60c25529a460d0d0d61963eedb2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-06T02:37:22,701 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.5-py3-none-any.whl#sha256=cb2a17c3ea5f47f8129f6410e2892f30051e15665f2ae54db540c8677c187d31 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,701 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.4-py3-none-any.whl#sha256=4a85bd6ac785f106f0ad9fe213abf42f96ab84ccd04aec3ab9acf76d47d2aa3f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,702 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.3-py3-none-any.whl#sha256=b8ead688a4857a2b360c992fb46ae2930fc4c43b50a092b7e42a13b40ee195da (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,703 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2.post1-py3-none-any.whl#sha256=0097a08376ae147084ea6bd0848fc2ea1764f524c510a48755aa8c63259b4466 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,703 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2-py3-none-any.whl#sha256=c109a340b7e60cb57d8c9ccec2c10e303a36b82a56ba8dcaaa0efbee2a48b97f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,704 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post2-py3-none-any.whl#sha256=dc91f387ba09e4df899238705ec37bbe3648395d828240b77db84378d1b91e9e (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,705 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post1-py3-none-any.whl#sha256=a44b9d872cf2ba6812d3c0750d98ad01b73e9ccbede933c7eade01b6c27b6232 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,705 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1-py3-none-any.whl#sha256=e07427d9eff1b8d091b5837c3ffc4fe7885dbf01d271d7225f7a89a2e3925f27 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,705 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post2-py3-none-any.whl#sha256=52c20b84ef1e848dd49c726ffc27801df8acccb4038aea61a2d73fa685bf75f8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,706 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post1-py3-none-any.whl#sha256=783c1039e0a7db0478a579d5cc54894def70ae601b1e5b90a3c3de2209334bf3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-06T02:37:22,706 Skipping link: not a file: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-06T02:37:22,707 Skipping link: not a file: https://pypi.org/simple/flashinfer-python/ 2026-04-06T02:37:22,734 Given no hashes to check 1 links for project 'flashinfer-python': discarding no candidates 2026-04-06T02:37:22,752 Collecting flashinfer-python==0.6.7.post3 2026-04-06T02:37:22,754 Created temporary directory: /tmp/pip-unpack-muwfahx4 2026-04-06T02:37:22,984 Downloading flashinfer_python-0.6.7.post3.tar.gz (6.5 MB) 2026-04-06T02:37:31,218 Added flashinfer-python==0.6.7.post3 from https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz to build tracker '/tmp/pip-build-tracker-lo4oc_78' 2026-04-06T02:37:31,226 Created temporary directory: /tmp/pip-build-env-l3zj72y8 2026-04-06T02:37:31,231 Installing build dependencies: started 2026-04-06T02:37:31,232 Running command pip subprocess to install build dependencies 2026-04-06T02:37:32,404 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-06T02:37:32,838 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-06T02:37:32,861 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-06T02:37:34,617 Collecting setuptools>=77 2026-04-06T02:37:34,694 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-06T02:37:34,917 Collecting packaging>=24 2026-04-06T02:37:34,936 Using cached https://www.piwheels.org/simple/packaging/packaging-26.0-py3-none-any.whl (74 kB) 2026-04-06T02:37:35,637 Collecting apache-tvm-ffi!=0.1.8,!=0.1.8.post0,<0.2,>=0.1.6 2026-04-06T02:37:35,655 Downloading https://www.piwheels.org/simple/apache-tvm-ffi/apache_tvm_ffi-0.1.9-cp311-cp311-linux_armv7l.whl (2.2 MB) 2026-04-06T02:37:36,308 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.2/2.2 MB 3.5 MB/s eta 0:00:00 2026-04-06T02:37:36,597 Collecting typing-extensions>=4.5 2026-04-06T02:37:36,613 Using cached https://www.piwheels.org/simple/typing-extensions/typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2026-04-06T02:37:39,881 Installing collected packages: typing-extensions, setuptools, packaging, apache-tvm-ffi 2026-04-06T02:37:44,678 Creating /tmp/pip-build-env-l3zj72y8/overlay/local/bin 2026-04-06T02:37:44,680 changing mode of /tmp/pip-build-env-l3zj72y8/overlay/local/bin/tvm-ffi-config to 755 2026-04-06T02:37:44,682 changing mode of /tmp/pip-build-env-l3zj72y8/overlay/local/bin/tvm-ffi-stubgen to 755 2026-04-06T02:37:44,715 Successfully installed apache-tvm-ffi-0.1.9 packaging-26.0 setuptools-82.0.1 typing-extensions-4.15.0 2026-04-06T02:37:45,015 Installing build dependencies: finished with status 'done' 2026-04-06T02:37:45,022 Getting requirements to build wheel: started 2026-04-06T02:37:45,024 Running command Getting requirements to build wheel 2026-04-06T02:37:52,252 Build metadata file already exists (not in git repo), keeping it 2026-04-06T02:37:52,320 Getting requirements to build wheel: finished with status 'done' 2026-04-06T02:37:52,324 Created temporary directory: /tmp/pip-modern-metadata-yow_bi5h 2026-04-06T02:37:52,326 Preparing metadata (pyproject.toml): started 2026-04-06T02:37:52,327 Running command Preparing metadata (pyproject.toml) 2026-04-06T02:38:00,203 /tmp/pip-build-env-l3zj72y8/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-06T02:38:00,204 !! 2026-04-06T02:38:00,205 ******************************************************************************** 2026-04-06T02:38:00,206 Pattern 'LICENSE*.txt' did not match any files. 2026-04-06T02:38:00,207 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-06T02:38:00,208 or your builds will no longer be supported. 2026-04-06T02:38:00,208 ******************************************************************************** 2026-04-06T02:38:00,209 !! 2026-04-06T02:38:00,210 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-06T02:38:00,215 Build metadata file already exists (not in git repo), keeping it 2026-04-06T02:38:00,216 running dist_info 2026-04-06T02:38:00,229 creating /tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info 2026-04-06T02:38:00,230 writing /tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/PKG-INFO 2026-04-06T02:38:00,234 writing dependency_links to /tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/dependency_links.txt 2026-04-06T02:38:00,236 writing entry points to /tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/entry_points.txt 2026-04-06T02:38:00,238 writing requirements to /tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/requires.txt 2026-04-06T02:38:00,240 writing top-level names to /tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/top_level.txt 2026-04-06T02:38:00,242 writing manifest file '/tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/SOURCES.txt' 2026-04-06T02:38:01,054 reading manifest file '/tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/SOURCES.txt' 2026-04-06T02:38:01,056 adding license file 'LICENSE' 2026-04-06T02:38:01,131 writing manifest file '/tmp/pip-modern-metadata-yow_bi5h/flashinfer_python.egg-info/SOURCES.txt' 2026-04-06T02:38:01,135 creating '/tmp/pip-modern-metadata-yow_bi5h/flashinfer_python-0.6.7.post3.dist-info' 2026-04-06T02:38:01,266 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-06T02:38:01,272 Source in /tmp/pip-wheel-x54aw7uf/flashinfer-python_ecd6ed0409b1419c8747d7f7125ab960 has version 0.6.7.post3, which satisfies requirement flashinfer-python==0.6.7.post3 from https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz 2026-04-06T02:38:01,273 Removed flashinfer-python==0.6.7.post3 from https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz from build tracker '/tmp/pip-build-tracker-lo4oc_78' 2026-04-06T02:38:01,279 Created temporary directory: /tmp/pip-unpack-6zc_7vm7 2026-04-06T02:38:01,280 Building wheels for collected packages: flashinfer-python 2026-04-06T02:38:01,507 Created temporary directory: /tmp/pip-wheel-sbyrc2da 2026-04-06T02:38:01,508 Destination directory: /tmp/pip-wheel-sbyrc2da 2026-04-06T02:38:01,510 Building wheel for flashinfer-python (pyproject.toml): started 2026-04-06T02:38:01,511 Running command Building wheel for flashinfer-python (pyproject.toml) 2026-04-06T02:38:07,554 /tmp/pip-build-env-l3zj72y8/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-06T02:38:07,554 !! 2026-04-06T02:38:07,555 ******************************************************************************** 2026-04-06T02:38:07,555 Pattern 'LICENSE*.txt' did not match any files. 2026-04-06T02:38:07,556 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-06T02:38:07,557 or your builds will no longer be supported. 2026-04-06T02:38:07,558 ******************************************************************************** 2026-04-06T02:38:07,559 !! 2026-04-06T02:38:07,560 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-06T02:38:07,560 Build metadata file already exists (not in git repo), keeping it 2026-04-06T02:38:07,561 running bdist_wheel 2026-04-06T02:38:07,581 running build 2026-04-06T02:38:07,582 running build_py 2026-04-06T02:38:07,588 creating build/lib 2026-04-06T02:38:07,590 copying build_backend.py -> build/lib 2026-04-06T02:38:07,593 copying build_utils.py -> build/lib 2026-04-06T02:38:07,596 creating build/lib/flashinfer 2026-04-06T02:38:07,597 copying flashinfer/decode.py -> build/lib/flashinfer 2026-04-06T02:38:07,607 copying flashinfer/topk.py -> build/lib/flashinfer 2026-04-06T02:38:07,830 copying flashinfer/_build_meta.py -> build/lib/flashinfer 2026-04-06T02:38:07,834 copying flashinfer/sampling.py -> build/lib/flashinfer 2026-04-06T02:38:07,843 copying flashinfer/__main__.py -> build/lib/flashinfer 2026-04-06T02:38:07,849 copying flashinfer/autotuner.py -> build/lib/flashinfer 2026-04-06T02:38:07,930 copying flashinfer/trtllm_low_latency_gemm.py -> build/lib/flashinfer 2026-04-06T02:38:07,936 copying flashinfer/gdn_decode.py -> build/lib/flashinfer 2026-04-06T02:38:07,942 copying flashinfer/artifacts.py -> build/lib/flashinfer 2026-04-06T02:38:07,948 copying flashinfer/tllm_enums.py -> build/lib/flashinfer 2026-04-06T02:38:07,953 copying flashinfer/fp8_quantization.py -> build/lib/flashinfer 2026-04-06T02:38:07,958 copying flashinfer/page.py -> build/lib/flashinfer 2026-04-06T02:38:07,963 copying flashinfer/version.py -> build/lib/flashinfer 2026-04-06T02:38:07,966 copying flashinfer/attention.py -> build/lib/flashinfer 2026-04-06T02:38:07,969 copying flashinfer/aot.py -> build/lib/flashinfer 2026-04-06T02:38:07,972 copying flashinfer/__init__.py -> build/lib/flashinfer 2026-04-06T02:38:07,974 copying flashinfer/prefill.py -> build/lib/flashinfer 2026-04-06T02:38:07,980 copying flashinfer/pod.py -> build/lib/flashinfer 2026-04-06T02:38:07,983 copying flashinfer/cascade.py -> build/lib/flashinfer 2026-04-06T02:38:07,987 copying flashinfer/rope.py -> build/lib/flashinfer 2026-04-06T02:38:07,990 copying flashinfer/green_ctx.py -> build/lib/flashinfer 2026-04-06T02:38:07,993 copying flashinfer/sparse.py -> build/lib/flashinfer 2026-04-06T02:38:07,996 copying flashinfer/activation.py -> build/lib/flashinfer 2026-04-06T02:38:07,998 copying flashinfer/xqa.py -> build/lib/flashinfer 2026-04-06T02:38:08,001 copying flashinfer/compilation_context.py -> build/lib/flashinfer 2026-04-06T02:38:08,003 copying flashinfer/deep_gemm.py -> build/lib/flashinfer 2026-04-06T02:38:08,006 copying flashinfer/api_logging.py -> build/lib/flashinfer 2026-04-06T02:38:08,010 copying flashinfer/cuda_utils.py -> build/lib/flashinfer 2026-04-06T02:38:08,011 copying flashinfer/concat_ops.py -> build/lib/flashinfer 2026-04-06T02:38:08,013 copying flashinfer/fp4_quantization.py -> build/lib/flashinfer 2026-04-06T02:38:08,015 copying flashinfer/tllm_utils.py -> build/lib/flashinfer 2026-04-06T02:38:08,016 copying flashinfer/utils.py -> build/lib/flashinfer 2026-04-06T02:38:08,019 copying flashinfer/mla.py -> build/lib/flashinfer 2026-04-06T02:38:08,022 copying flashinfer/gdn_prefill.py -> build/lib/flashinfer 2026-04-06T02:38:08,025 creating build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,026 copying flashinfer/cute_dsl/fp4_common.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,030 copying flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,034 copying flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,038 copying flashinfer/cute_dsl/__init__.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,040 copying flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,044 copying flashinfer/cute_dsl/blockscaled_gemm.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,047 copying flashinfer/cute_dsl/utils.py -> build/lib/flashinfer/cute_dsl 2026-04-06T02:38:08,049 creating build/lib/flashinfer/mamba 2026-04-06T02:38:08,051 copying flashinfer/mamba/ssd_combined.py -> build/lib/flashinfer/mamba 2026-04-06T02:38:08,053 copying flashinfer/mamba/ssd_kernel.py -> build/lib/flashinfer/mamba 2026-04-06T02:38:08,058 copying flashinfer/mamba/__init__.py -> build/lib/flashinfer/mamba 2026-04-06T02:38:08,060 copying flashinfer/mamba/selective_state_update.py -> build/lib/flashinfer/mamba 2026-04-06T02:38:08,063 copying flashinfer/mamba/ssd_tile_scheduler.py -> build/lib/flashinfer/mamba 2026-04-06T02:38:08,066 creating build/lib/flashinfer/testing 2026-04-06T02:38:08,067 copying flashinfer/testing/__init__.py -> build/lib/flashinfer/testing 2026-04-06T02:38:08,069 copying flashinfer/testing/utils.py -> build/lib/flashinfer/testing 2026-04-06T02:38:08,073 creating build/lib/flashinfer/tuning_configs 2026-04-06T02:38:08,075 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/lib/flashinfer/tuning_configs 2026-04-06T02:38:08,078 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/lib/flashinfer/tuning_configs 2026-04-06T02:38:08,082 creating build/lib/flashinfer/comm 2026-04-06T02:38:08,083 copying flashinfer/comm/mapping.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,086 copying flashinfer/comm/trtllm_ar.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,089 copying flashinfer/comm/nvshmem.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,091 copying flashinfer/comm/workspace_base.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,093 copying flashinfer/comm/__init__.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,095 copying flashinfer/comm/mnnvl.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,098 copying flashinfer/comm/vllm_ar.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,101 copying flashinfer/comm/dlpack_utils.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,103 copying flashinfer/comm/trtllm_alltoall.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,107 copying flashinfer/comm/allreduce.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,110 copying flashinfer/comm/trtllm_mnnvl_ar.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,114 copying flashinfer/comm/trtllm_moe_alltoall.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,117 copying flashinfer/comm/cuda_ipc.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,119 copying flashinfer/comm/nvshmem_allreduce.py -> build/lib/flashinfer/comm 2026-04-06T02:38:08,122 creating build/lib/flashinfer/gdn_kernels 2026-04-06T02:38:08,123 copying flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/lib/flashinfer/gdn_kernels 2026-04-06T02:38:08,128 copying flashinfer/gdn_kernels/__init__.py -> build/lib/flashinfer/gdn_kernels 2026-04-06T02:38:08,130 copying flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-06T02:38:08,133 copying flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-06T02:38:08,136 copying flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/lib/flashinfer/gdn_kernels 2026-04-06T02:38:08,141 creating build/lib/flashinfer/norm 2026-04-06T02:38:08,142 copying flashinfer/norm/__init__.py -> build/lib/flashinfer/norm 2026-04-06T02:38:08,145 copying flashinfer/norm/utils.py -> build/lib/flashinfer/norm 2026-04-06T02:38:08,148 creating build/lib/flashinfer/triton 2026-04-06T02:38:08,150 copying flashinfer/triton/gemm.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,152 copying flashinfer/triton/norm.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,154 copying flashinfer/triton/page.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,155 copying flashinfer/triton/sm_constraint_gemm.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,158 copying flashinfer/triton/__init__.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,160 copying flashinfer/triton/cascade.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,162 copying flashinfer/triton/activation.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,164 copying flashinfer/triton/utils.py -> build/lib/flashinfer/triton 2026-04-06T02:38:08,166 creating build/lib/flashinfer/quantization 2026-04-06T02:38:08,167 copying flashinfer/quantization/quantization_cute_dsl_utils.py -> build/lib/flashinfer/quantization 2026-04-06T02:38:08,171 copying flashinfer/quantization/fp8_quantization.py -> build/lib/flashinfer/quantization 2026-04-06T02:38:08,173 copying flashinfer/quantization/__init__.py -> build/lib/flashinfer/quantization 2026-04-06T02:38:08,175 copying flashinfer/quantization/packbits.py -> build/lib/flashinfer/quantization 2026-04-06T02:38:08,177 copying flashinfer/quantization/fp4_quantization.py -> build/lib/flashinfer/quantization 2026-04-06T02:38:08,183 creating build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,184 copying flashinfer/logits_processor/types.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,186 copying flashinfer/logits_processor/fusion_rules.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,188 copying flashinfer/logits_processor/__init__.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,190 copying flashinfer/logits_processor/compiler.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,193 copying flashinfer/logits_processor/legalization.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,195 copying flashinfer/logits_processor/operators.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,199 copying flashinfer/logits_processor/processors.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,203 copying flashinfer/logits_processor/pipeline.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,206 copying flashinfer/logits_processor/op.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,208 copying flashinfer/logits_processor/validators.py -> build/lib/flashinfer/logits_processor 2026-04-06T02:38:08,213 creating build/lib/flashinfer/jit 2026-04-06T02:38:08,215 copying flashinfer/jit/spdlog.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,218 copying flashinfer/jit/topk.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,221 copying flashinfer/jit/quantization.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,223 copying flashinfer/jit/sampling.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,226 copying flashinfer/jit/norm.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,229 copying flashinfer/jit/dsv3_optimizations.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,231 copying flashinfer/jit/fp8_quantization.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,234 copying flashinfer/jit/page.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,236 copying flashinfer/jit/gdn.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,239 copying flashinfer/jit/fp4_kv_quantization.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,241 copying flashinfer/jit/__init__.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,244 copying flashinfer/jit/env.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,247 copying flashinfer/jit/tinygemm2.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,275 copying flashinfer/jit/core.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,278 copying flashinfer/jit/fp4_kv_dequantization.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,281 copying flashinfer/jit/cascade.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,283 copying flashinfer/jit/rope.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,286 copying flashinfer/jit/cubin_loader.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,289 copying flashinfer/jit/fused_moe.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,292 copying flashinfer/jit/comm.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,304 copying flashinfer/jit/moe_utils.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,307 copying flashinfer/jit/activation.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,309 copying flashinfer/jit/xqa.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,321 copying flashinfer/jit/cpp_ext.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,324 copying flashinfer/jit/fp4_quantization.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,326 copying flashinfer/jit/tllm_utils.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,328 copying flashinfer/jit/utils.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,331 copying flashinfer/jit/mla.py -> build/lib/flashinfer/jit 2026-04-06T02:38:08,343 creating build/lib/flashinfer/dsv3_ops 2026-04-06T02:38:08,344 copying flashinfer/dsv3_ops/__init__.py -> build/lib/flashinfer/dsv3_ops 2026-04-06T02:38:08,348 creating build/lib/flashinfer/gemm 2026-04-06T02:38:08,349 copying flashinfer/gemm/routergemm.py -> build/lib/flashinfer/gemm 2026-04-06T02:38:08,352 copying flashinfer/gemm/__init__.py -> build/lib/flashinfer/gemm 2026-04-06T02:38:08,354 copying flashinfer/gemm/gemm_base.py -> build/lib/flashinfer/gemm 2026-04-06T02:38:08,364 creating build/lib/flashinfer/cudnn 2026-04-06T02:38:08,365 copying flashinfer/cudnn/decode.py -> build/lib/flashinfer/cudnn 2026-04-06T02:38:08,368 copying flashinfer/cudnn/__init__.py -> build/lib/flashinfer/cudnn 2026-04-06T02:38:08,370 copying flashinfer/cudnn/prefill.py -> build/lib/flashinfer/cudnn 2026-04-06T02:38:08,373 copying flashinfer/cudnn/utils.py -> build/lib/flashinfer/cudnn 2026-04-06T02:38:08,377 creating build/lib/flashinfer/data 2026-04-06T02:38:08,378 copying ./build_utils.py -> build/lib/flashinfer/data 2026-04-06T02:38:08,380 copying ./build_backend.py -> build/lib/flashinfer/data 2026-04-06T02:38:08,383 creating build/lib/flashinfer/fused_moe 2026-04-06T02:38:08,385 copying flashinfer/fused_moe/fused_routing_dsv3.py -> build/lib/flashinfer/fused_moe 2026-04-06T02:38:08,387 copying flashinfer/fused_moe/__init__.py -> build/lib/flashinfer/fused_moe 2026-04-06T02:38:08,391 copying flashinfer/fused_moe/core.py -> build/lib/flashinfer/fused_moe 2026-04-06T02:38:08,397 copying flashinfer/fused_moe/utils.py -> build/lib/flashinfer/fused_moe 2026-04-06T02:38:08,400 creating build/lib/flashinfer/profiler 2026-04-06T02:38:08,401 copying flashinfer/profiler/__init__.py -> build/lib/flashinfer/profiler 2026-04-06T02:38:08,403 creating build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:08,405 copying flashinfer/gdn_kernels/blackwell_prefill/gdn.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:08,419 copying flashinfer/gdn_kernels/blackwell_prefill/gdn_helpers.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:08,421 copying flashinfer/gdn_kernels/blackwell_prefill/__init__.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:08,423 copying flashinfer/gdn_kernels/blackwell_prefill/gdn_tile_scheduler.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:08,426 creating build/lib/flashinfer/norm/kernels 2026-04-06T02:38:08,427 copying flashinfer/norm/kernels/rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-06T02:38:08,430 copying flashinfer/norm/kernels/layernorm.py -> build/lib/flashinfer/norm/kernels 2026-04-06T02:38:08,432 copying flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-06T02:38:08,445 copying flashinfer/norm/kernels/__init__.py -> build/lib/flashinfer/norm/kernels 2026-04-06T02:38:08,460 creating build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,461 copying flashinfer/triton/kernels/norm.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,463 copying flashinfer/triton/kernels/quant.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,465 copying flashinfer/triton/kernels/sm_constraint_gemm.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,467 copying flashinfer/triton/kernels/__init__.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,468 copying flashinfer/triton/kernels/cascade.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,471 copying flashinfer/triton/kernels/ssd_chunk_state.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,473 copying flashinfer/triton/kernels/activation.py -> build/lib/flashinfer/triton/kernels 2026-04-06T02:38:08,475 creating build/lib/flashinfer/quantization/kernels 2026-04-06T02:38:08,476 copying flashinfer/quantization/kernels/mxfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-06T02:38:08,479 copying flashinfer/quantization/kernels/__init__.py -> build/lib/flashinfer/quantization/kernels 2026-04-06T02:38:08,481 copying flashinfer/quantization/kernels/mxfp8_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-06T02:38:08,484 creating build/lib/flashinfer/jit/mamba 2026-04-06T02:38:08,485 copying flashinfer/jit/mamba/__init__.py -> build/lib/flashinfer/jit/mamba 2026-04-06T02:38:08,487 copying flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/lib/flashinfer/jit/mamba 2026-04-06T02:38:08,489 copying flashinfer/jit/mamba/selective_state_update.py -> build/lib/flashinfer/jit/mamba 2026-04-06T02:38:08,492 creating build/lib/flashinfer/jit/gemm 2026-04-06T02:38:08,492 copying flashinfer/jit/gemm/__init__.py -> build/lib/flashinfer/jit/gemm 2026-04-06T02:38:08,494 copying flashinfer/jit/gemm/core.py -> build/lib/flashinfer/jit/gemm 2026-04-06T02:38:08,497 copying flashinfer/jit/gemm/fp8_blockscale.py -> build/lib/flashinfer/jit/gemm 2026-04-06T02:38:08,499 copying flashinfer/jit/gemm/deepgemm.py -> build/lib/flashinfer/jit/gemm 2026-04-06T02:38:08,501 creating build/lib/flashinfer/jit/attention 2026-04-06T02:38:08,503 copying flashinfer/jit/attention/modules.py -> build/lib/flashinfer/jit/attention 2026-04-06T02:38:08,506 copying flashinfer/jit/attention/__init__.py -> build/lib/flashinfer/jit/attention 2026-04-06T02:38:08,508 copying flashinfer/jit/attention/variants.py -> build/lib/flashinfer/jit/attention 2026-04-06T02:38:08,510 copying flashinfer/jit/attention/utils.py -> build/lib/flashinfer/jit/attention 2026-04-06T02:38:08,513 creating build/lib/flashinfer/jit/gemm/cutlass 2026-04-06T02:38:08,514 copying flashinfer/jit/gemm/cutlass/__init__.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-06T02:38:08,516 copying flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-06T02:38:08,519 copying flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-06T02:38:08,522 creating build/lib/flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:08,523 copying flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:08,526 copying flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:08,528 copying flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:08,534 copying flashinfer/jit/attention/fmha_v2/utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:08,537 creating build/lib/flashinfer/gemm/kernels 2026-04-06T02:38:08,538 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/lib/flashinfer/gemm/kernels 2026-04-06T02:38:08,542 copying flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/lib/flashinfer/gemm/kernels 2026-04-06T02:38:08,547 copying flashinfer/gemm/kernels/__init__.py -> build/lib/flashinfer/gemm/kernels 2026-04-06T02:38:08,549 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/lib/flashinfer/gemm/kernels 2026-04-06T02:38:08,556 creating build/lib/flashinfer/data/spdlog/scripts 2026-04-06T02:38:08,558 copying 3rdparty/spdlog/scripts/extract_version.py -> build/lib/flashinfer/data/spdlog/scripts 2026-04-06T02:38:08,584 creating build/lib/flashinfer/data/cutlass/python 2026-04-06T02:38:08,585 copying 3rdparty/cutlass/python/setup_pycute.py -> build/lib/flashinfer/data/cutlass/python 2026-04-06T02:38:08,588 copying 3rdparty/cutlass/python/setup_library.py -> build/lib/flashinfer/data/cutlass/python 2026-04-06T02:38:08,590 copying 3rdparty/cutlass/python/setup_cutlass.py -> build/lib/flashinfer/data/cutlass/python 2026-04-06T02:38:08,593 creating build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-06T02:38:08,595 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-06T02:38:08,597 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-06T02:38:08,601 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:08,602 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:08,605 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:08,607 copying 3rdparty/cutlass/examples/40_cutlass_py/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:08,611 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-06T02:38:08,614 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-06T02:38:08,615 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-06T02:38:08,619 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:08,620 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:08,625 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:08,629 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:08,633 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:08,636 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,637 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,640 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,644 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,647 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,650 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,654 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,659 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:08,662 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-06T02:38:08,664 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-06T02:38:08,668 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,670 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,674 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,676 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,678 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,681 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,684 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,687 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,690 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,692 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,696 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,699 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,701 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,705 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:08,708 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:08,709 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:08,712 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:08,714 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:08,717 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:08,720 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-06T02:38:08,721 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-06T02:38:08,723 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-06T02:38:08,726 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:08,727 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:08,730 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:08,732 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:08,735 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:08,738 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,739 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,745 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,750 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,755 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,758 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,762 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,767 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,771 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,777 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,781 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,783 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,789 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,793 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,796 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,800 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,804 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,808 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,812 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:08,817 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-06T02:38:08,819 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-06T02:38:08,821 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:08,822 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:08,825 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:08,828 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:08,831 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:08,834 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:08,838 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-06T02:38:08,839 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-06T02:38:08,843 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,844 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,846 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,848 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,850 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,852 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,854 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,856 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,857 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:08,860 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-06T02:38:08,861 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-06T02:38:08,863 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-06T02:38:08,865 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:08,866 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:08,909 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:08,914 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:08,920 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:08,921 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:08,924 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:08,927 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:08,932 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:08,936 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:09,019 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:09,020 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:09,023 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:09,026 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:09,029 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:09,032 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:09,037 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:09,038 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:09,045 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:09,051 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:09,055 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:09,059 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:09,060 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:09,079 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:09,081 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:09,170 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:09,171 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:09,175 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:09,181 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:09,188 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:09,196 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:09,198 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:09,201 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:09,204 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:09,214 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:09,215 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:09,219 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:09,223 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:09,228 creating build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,230 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,234 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,237 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,243 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,247 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,250 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,254 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,256 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,259 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,262 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,266 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,269 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:09,273 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:09,274 copying 3rdparty/cutlass/python/cutlass_cppgen/swizzle.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:09,277 copying 3rdparty/cutlass/python/cutlass_cppgen/library_defaults.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:09,281 copying 3rdparty/cutlass/python/cutlass_cppgen/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:09,284 copying 3rdparty/cutlass/python/cutlass_cppgen/shape.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:09,289 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-06T02:38:09,291 copying 3rdparty/cutlass/python/CuTeDSL/prep_editable_install.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-06T02:38:09,296 creating build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,297 copying 3rdparty/cutlass/python/cutlass_library/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,305 copying 3rdparty/cutlass/python/cutlass_library/rank_2k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,308 copying 3rdparty/cutlass/python/cutlass_library/trmm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,312 copying 3rdparty/cutlass/python/cutlass_library/heuristics_provider.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,315 copying 3rdparty/cutlass/python/cutlass_library/conv3d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,318 copying 3rdparty/cutlass/python/cutlass_library/sm100_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,322 copying 3rdparty/cutlass/python/cutlass_library/sm100_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,324 copying 3rdparty/cutlass/python/cutlass_library/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,327 copying 3rdparty/cutlass/python/cutlass_library/heuristics.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,330 copying 3rdparty/cutlass/python/cutlass_library/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,331 copying 3rdparty/cutlass/python/cutlass_library/rank_k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,334 copying 3rdparty/cutlass/python/cutlass_library/symm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,336 copying 3rdparty/cutlass/python/cutlass_library/generator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,349 copying 3rdparty/cutlass/python/cutlass_library/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,353 copying 3rdparty/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,357 copying 3rdparty/cutlass/python/cutlass_library/manifest.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,362 copying 3rdparty/cutlass/python/cutlass_library/sm90_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,365 copying 3rdparty/cutlass/python/cutlass_library/conv3x_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,367 copying 3rdparty/cutlass/python/cutlass_library/sm90_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:09,370 creating build/lib/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:09,371 copying 3rdparty/cutlass/python/pycute/typing.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:09,373 copying 3rdparty/cutlass/python/pycute/swizzle.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:09,375 copying 3rdparty/cutlass/python/pycute/__init__.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:09,377 copying 3rdparty/cutlass/python/pycute/layout.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:09,379 copying 3rdparty/cutlass/python/pycute/int_tuple.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:09,382 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:09,383 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:09,385 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:09,387 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:09,390 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:09,391 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/common.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:09,394 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:09,395 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:09,398 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:09,399 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:09,402 copying 3rdparty/cutlass/python/cutlass_cppgen/op/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:09,404 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:09,406 copying 3rdparty/cutlass/python/cutlass_cppgen/op/conv.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:09,409 copying 3rdparty/cutlass/python/cutlass_cppgen/op/op.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:09,412 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,413 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,415 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,417 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,420 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,421 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,424 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,426 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,429 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,431 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,433 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,436 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,438 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,440 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:09,442 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:09,443 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/check.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:09,445 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:09,447 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:09,449 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:09,451 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:09,453 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-06T02:38:09,454 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-06T02:38:09,456 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-06T02:38:09,458 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-06T02:38:09,459 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-06T02:38:09,461 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-06T02:38:09,464 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,465 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,467 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,469 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,471 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,473 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,475 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,477 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,479 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,481 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:09,484 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:09,485 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:09,486 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:09,489 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:09,491 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,492 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,494 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,496 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,499 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,500 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,503 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,505 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,507 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:09,509 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,510 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,512 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,514 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,516 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,518 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,520 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,521 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,524 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,526 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,527 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,529 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,531 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,533 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:09,535 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:09,536 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/torch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:09,538 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:09,540 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:09,543 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,544 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,546 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,548 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,550 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,553 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,556 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:09,559 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:09,560 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:09,563 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:09,566 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:09,569 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:09,572 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,573 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,576 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,579 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,582 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,585 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,588 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,590 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,592 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,595 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,597 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,599 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,604 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:09,606 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,607 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,610 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,613 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,615 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,617 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,619 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:09,623 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,624 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,627 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,630 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,632 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,635 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,639 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,642 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,644 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,647 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,649 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,652 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,657 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:09,660 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,661 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,664 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,667 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,669 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,671 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,675 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,677 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,678 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,681 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,683 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,685 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,688 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,690 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,692 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,694 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,696 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,699 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:09,701 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:09,702 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:09,705 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:09,707 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:09,710 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:09,713 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:09,716 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,717 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,719 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,721 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,723 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,725 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,726 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,729 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:09,732 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,733 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,735 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,737 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,739 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,740 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,742 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:09,745 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:09,746 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:09,748 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:09,750 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:09,752 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:09,754 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:09,756 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:09,757 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:09,760 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:09,762 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:09,763 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:09,766 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,767 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,769 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,771 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,773 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,775 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,777 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,780 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:09,783 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,784 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,786 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,788 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,790 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,792 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,794 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,797 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,799 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:09,801 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:09,802 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:09,805 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:09,807 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:09,809 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:09,810 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:09,812 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:09,814 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:09,816 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:09,817 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:09,820 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:09,821 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:09,823 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:09,825 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:09,828 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:09,830 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:09,832 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:09,835 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:09,837 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:09,840 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:09,841 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:09,843 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:09,845 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:09,847 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:09,848 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:09,850 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:09,853 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:09,855 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-06T02:38:09,856 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-06T02:38:09,858 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-06T02:38:09,860 creating build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-06T02:38:09,862 copying 3rdparty/cutlass/python/docs_src/source/conf.py -> build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-06T02:38:09,865 creating build/lib/flashinfer/data/cutlass/test/utils 2026-04-06T02:38:09,867 copying 3rdparty/cutlass/test/utils/test_sharding.py -> build/lib/flashinfer/data/cutlass/test/utils 2026-04-06T02:38:09,870 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-06T02:38:09,872 copying 3rdparty/cutlass/test/examples/CuTeDSL/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-06T02:38:09,875 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:09,875 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:09,878 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:09,880 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:09,882 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:09,884 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:09,886 creating build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-06T02:38:09,888 copying 3rdparty/cutlass/test/python/cutlass/installation.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-06T02:38:09,890 creating build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,891 copying 3rdparty/cutlass/test/python/pycute/test_int_tuple.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,893 copying 3rdparty/cutlass/test/python/pycute/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,895 copying 3rdparty/cutlass/test/python/pycute/test_composition.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,897 copying 3rdparty/cutlass/test/python/pycute/test_complement.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,899 copying 3rdparty/cutlass/test/python/pycute/test_typing.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,901 copying 3rdparty/cutlass/test/python/pycute/test_coalesce.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,902 copying 3rdparty/cutlass/test/python/pycute/test_right_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,904 copying 3rdparty/cutlass/test/python/pycute/test_left_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:09,906 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:09,907 copying 3rdparty/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:09,909 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:09,911 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:09,913 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:09,916 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-06T02:38:09,917 copying 3rdparty/cutlass/test/python/cutlass/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-06T02:38:09,920 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:09,921 copying 3rdparty/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:09,923 copying 3rdparty/cutlass/test/python/cutlass/interface/evt_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:09,925 copying 3rdparty/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:09,927 copying 3rdparty/cutlass/test/python/cutlass/interface/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:09,930 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,930 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,933 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,935 copying 3rdparty/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,937 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,939 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,941 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:09,944 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,945 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,948 copying 3rdparty/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,950 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,952 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,954 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,956 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,958 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,960 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,962 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,964 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,966 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,968 copying 3rdparty/cutlass/test/python/cutlass/gemm/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,970 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:09,973 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-06T02:38:09,974 copying 3rdparty/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-06T02:38:09,977 creating build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-06T02:38:09,979 copying 3rdparty/cutlass/test/unit/gemm/device/simt_sm50.py -> build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-06T02:38:10,004 creating build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-06T02:38:10,006 copying 3rdparty/cutlass/tools/util/scripts/split_test_cmake.py -> build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-06T02:38:10,030 creating build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,031 copying flashinfer/fused_moe/cute_dsl/tuner.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,035 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,038 copying flashinfer/fused_moe/cute_dsl/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,041 copying flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,045 copying flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,049 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:10,053 creating build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:10,055 copying flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:10,059 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:10,065 copying flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:10,068 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:10,073 copying flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:10,620 copying flashinfer/py.typed -> build/lib/flashinfer 2026-04-06T02:38:10,622 creating build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,623 copying ./csrc/flashinfer_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,625 copying ./csrc/single_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,627 copying ./csrc/logging.cc -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,629 copying ./csrc/single_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,631 copying ./csrc/batch_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,633 copying ./csrc/runtime_utils.h -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,635 copying ./csrc/batch_mla_sm90_run.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,638 copying ./csrc/batch_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,640 copying ./csrc/single_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,642 copying ./csrc/trtllm_fused_moe_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,645 copying ./csrc/fmhaReduction.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,648 copying ./csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,650 copying ./csrc/fmha_v2_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,652 copying ./csrc/gemm_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,654 copying ./csrc/trtllm_alltoall_prepare.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,657 copying ./csrc/single_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,659 copying ./csrc/cutlass_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,661 copying ./csrc/mxfp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,664 copying ./csrc/tgv_gemm.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,666 copying ./csrc/single_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,668 copying ./csrc/seq_chunk_cumsum_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,670 copying ./csrc/batch_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,672 copying ./csrc/batch_attention.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,674 copying ./csrc/page.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,676 copying ./csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,678 copying ./csrc/batch_attention_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,680 copying ./csrc/cudnn_sdpa_utils.h -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,683 copying ./csrc/batch_prefill_ragged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,685 copying ./csrc/fp4_kv_dequantization.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,687 copying ./csrc/fmha_cutlass_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,689 copying ./csrc/selective_state_update_kernel_inst.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,691 copying ./csrc/bf16_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,693 copying ./csrc/pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,696 copying ./csrc/vllm_custom_all_reduce.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,698 copying ./csrc/single_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,700 copying ./csrc/group_gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,701 copying ./csrc/flashinfer_gemm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,703 copying ./csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,705 copying ./csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,707 copying ./csrc/bf16_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,709 copying ./csrc/gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,711 copying ./csrc/flashinfer_mamba_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,712 copying ./csrc/batch_pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,714 copying ./csrc/norm.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,717 copying ./csrc/topk.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,719 copying ./csrc/single_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,721 copying ./csrc/trtllm_fmha_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,724 copying ./csrc/batch_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,726 copying ./csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,728 copying ./csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,730 copying ./csrc/batch_prefill_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,732 copying ./csrc/batch_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,734 copying ./csrc/batch_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,737 copying ./csrc/fp4_kv_quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:10,739 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:10,741 copying ./csrc/nv_internal/cpp/common/tllmException.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:10,744 copying ./csrc/nv_internal/cpp/common/envUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:10,746 copying ./csrc/nv_internal/cpp/common/memoryUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:10,749 copying ./csrc/nv_internal/cpp/common/stringUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:10,751 copying ./csrc/nv_internal/cpp/common/logger.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:10,753 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-06T02:38:10,754 copying ./csrc/nv_internal/cpp/kernels/quantization.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-06T02:38:10,757 creating build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,759 copying ./csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,761 copying ./csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,763 copying ./csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,766 copying ./csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,768 copying ./csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,771 copying ./csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,773 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,775 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,777 copying ./csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,780 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,783 copying ./csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:10,785 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,786 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,791 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,794 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,796 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,799 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,801 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,804 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,806 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,809 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,811 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,814 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:10,816 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,817 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,820 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,822 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,824 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,827 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,829 copying ./csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,831 copying ./csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:10,833 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,834 copying ./csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,836 copying ./csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,839 copying ./csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,841 copying ./csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,843 copying ./csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,846 copying ./csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,848 copying ./csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,851 copying ./csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,853 copying ./csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:10,855 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:10,856 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:10,858 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:10,861 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-06T02:38:10,862 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-06T02:38:10,865 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-06T02:38:10,867 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-06T02:38:10,868 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-06T02:38:10,871 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-06T02:38:10,874 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-06T02:38:10,875 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-06T02:38:10,877 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-06T02:38:10,879 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:10,881 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:10,882 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:10,885 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:10,887 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:10,890 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:10,892 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:10,894 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-06T02:38:10,895 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-06T02:38:10,898 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-06T02:38:10,901 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:10,904 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:10,907 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:10,909 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,910 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,912 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,914 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,916 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,919 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,921 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,923 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,925 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,927 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,929 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,931 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,933 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,935 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,937 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,939 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,941 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,943 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,944 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,946 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,948 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:10,951 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-06T02:38:10,952 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-06T02:38:10,954 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-06T02:38:10,956 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:10,959 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:10,960 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:10,963 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:10,965 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:10,967 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:10,971 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:10,973 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,974 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,976 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,978 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,980 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,981 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,983 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,986 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,989 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,991 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,992 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,994 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,996 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:10,999 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,001 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,004 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,006 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,008 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,011 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,012 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,014 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:11,017 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,018 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,020 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,022 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,024 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,030 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,032 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:11,034 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:11,036 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:11,038 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:11,040 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-06T02:38:11,043 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-06T02:38:11,045 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-06T02:38:11,046 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-06T02:38:11,049 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-06T02:38:11,052 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-06T02:38:11,053 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-06T02:38:11,056 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:11,057 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:11,059 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:11,061 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:11,063 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:11,065 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:11,068 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,071 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,073 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,075 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,078 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-06T02:38:11,079 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-06T02:38:11,082 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-06T02:38:11,083 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-06T02:38:11,086 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-06T02:38:11,087 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-06T02:38:11,090 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-06T02:38:11,092 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-06T02:38:11,095 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,097 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,099 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:11,100 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:11,103 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:11,105 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:11,108 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,109 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,111 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,114 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,117 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,119 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,123 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,125 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,129 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,131 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,133 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:11,134 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:11,137 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:11,140 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:11,142 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:11,144 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,145 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,148 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,150 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,153 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,155 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,157 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,160 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,163 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,166 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,168 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,171 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,174 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:11,177 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,178 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,181 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,183 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,185 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,188 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,190 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,192 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,195 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,198 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,201 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,204 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,207 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:11,210 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:11,212 copying ./csrc/batch_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,214 copying ./csrc/trtllm_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,217 copying ./csrc/moe_utils_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,219 copying ./csrc/trtllm_fused_moe_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,223 copying ./csrc/trtllm_low_latency_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,226 copying ./csrc/cudnn_sdpa_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,229 copying ./csrc/batch_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,231 copying ./csrc/batch_decode_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,233 copying ./csrc/flashinfer_cascade_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,235 copying ./csrc/group_gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,236 copying ./csrc/flashinfer_quantization_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,238 copying ./csrc/pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,240 copying ./csrc/batch_decode_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,242 copying ./csrc/batch_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,245 copying ./csrc/concat_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,247 copying ./csrc/blackwell_fmha_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,249 copying ./csrc/batch_mla_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,250 copying ./csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,253 copying ./csrc/group_gemm_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,255 copying ./csrc/batch_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,257 copying ./csrc/selective_state_update_dtype_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,259 copying ./csrc/single_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,261 copying ./csrc/fp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,263 copying ./csrc/flashinfer_sampling_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,265 copying ./csrc/batch_pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,267 copying ./csrc/flashinfer_topk_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,269 copying ./csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,270 copying ./csrc/fp4_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,272 copying ./csrc/flashinfer_norm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,274 copying ./csrc/fp4_gemm_cutlass_sm103.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,277 copying ./csrc/batch_pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,279 creating build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,280 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,283 copying ./csrc/fmha_v2/fused_multihead_attention_utils.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,286 copying ./csrc/fmha_v2/fused_multihead_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,288 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,291 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,294 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,296 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,298 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,301 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,303 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,305 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,308 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,310 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,313 copying ./csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,315 copying ./csrc/fmha_v2/fused_multihead_cross_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,317 creating build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:11,318 copying ./csrc/fmha_v2/templates/fa_kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:11,321 copying ./csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:11,324 copying ./csrc/fmha_v2/templates/kernel_hopper.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:11,326 copying ./csrc/fmha_v2/templates/kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:11,328 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,331 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,333 copying ./csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:11,336 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,337 copying ./csrc/fmha_v2/fmha/softmax.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,341 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:11,342 copying ./csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:11,345 copying ./csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:11,348 copying ./csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:11,351 copying ./csrc/fmha_v2/fmha/warpspec/dma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:11,354 copying ./csrc/fmha_v2/fmha/warpspec/compute.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:11,357 copying ./csrc/fmha_v2/fmha/gemm.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,359 copying ./csrc/fmha_v2/fmha/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,363 copying ./csrc/fmha_v2/fmha/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,367 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,368 copying ./csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,370 copying ./csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,373 copying ./csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,376 copying ./csrc/fmha_v2/fmha/hopper/tma_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,378 copying ./csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,380 copying ./csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,385 copying ./csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,388 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,391 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,393 copying ./csrc/fmha_v2/fmha/hopper/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,396 copying ./csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,399 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,402 copying ./csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,405 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,408 copying ./csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,411 copying ./csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,413 copying ./csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,415 copying ./csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:11,419 copying ./csrc/fmha_v2/fmha/traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,422 copying ./csrc/fmha_v2/fmha/paged_kv_cache.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,424 copying ./csrc/fmha_v2/fmha/gmem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,427 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,430 copying ./csrc/fmha_v2/fmha/mask.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,433 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,435 copying ./csrc/fmha_v2/fmha/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,439 copying ./csrc/fmha_v2/fmha/alibi_params.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,441 copying ./csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,444 copying ./csrc/fmha_v2/fmha/smem_tile_v.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,448 copying ./csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,450 copying ./csrc/fmha_v2/fmha/numeric_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,453 copying ./csrc/fmha_v2/fmha/utils.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,456 copying ./csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,459 copying ./csrc/fmha_v2/fmha/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:11,462 copying ./csrc/quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,464 copying ./csrc/gemm_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,466 copying ./csrc/mxfp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,469 copying ./csrc/group_gemm_fp8_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,471 copying ./csrc/seq_chunk_cumsum.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,473 copying ./csrc/batch_pod.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,476 copying ./csrc/tinygemm2.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,479 copying ./csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,481 copying ./csrc/group_gemm_fp8_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,483 copying ./csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,485 copying ./csrc/trtllm_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,488 copying ./csrc/trtllm_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,490 copying ./csrc/fp4_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,493 copying ./csrc/batch_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,495 copying ./csrc/batch_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,497 copying ./csrc/trtllm_moe_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,500 copying ./csrc/single_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,502 copying ./csrc/renorm.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,504 copying ./csrc/fp4_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,507 copying ./csrc/batch_decode_mla_cute_sm80.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,509 copying ./csrc/trtllm_moe_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,512 copying ./csrc/single_prefill_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,514 copying ./csrc/nvshmem_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,516 copying ./csrc/batch_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,519 copying ./csrc/prefill_kernel_delta_rule_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,522 copying ./csrc/selective_state_update.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,525 copying ./csrc/fmha_v2_run.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,529 copying ./csrc/fp4_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,532 copying ./csrc/batch_decode_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,534 copying ./csrc/trtllm_mnnvl_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,537 copying ./csrc/batch_decode_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,540 copying ./csrc/gdn_prefill_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,543 copying ./csrc/fp4_gemm_cutlass_sm103.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,546 copying ./csrc/trtllm_fmha_v2_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,549 copying ./csrc/flashinfer_page_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,552 copying ./csrc/trtllm_batched_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,555 copying ./csrc/fp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,558 copying ./csrc/dsv3_router_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,561 copying ./csrc/group_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,564 copying ./csrc/bmm_fp8.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,566 copying ./csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,570 copying ./csrc/single_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,572 copying ./csrc/batch_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,575 copying ./csrc/batch_attention_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,577 copying ./csrc/tgv_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,580 copying ./csrc/sampling_utils.h -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,583 copying ./csrc/rope.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,586 copying ./csrc/tvm_ffi_utils.h -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,589 copying ./csrc/flashinfer_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,592 copying ./csrc/fmha_cutlass_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,594 copying ./csrc/batch_mla_sm90_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,597 copying ./csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,599 copying ./csrc/selective_state_update_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,602 copying ./csrc/pod.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,605 copying ./csrc/pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,608 copying ./csrc/flashinfer_rope_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,610 copying ./csrc/batch_attention_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,613 copying ./csrc/trtllm_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,617 creating build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:11,618 copying ./csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:11,621 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:11,624 copying ./csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:11,628 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:11,638 copying ./csrc/fused_moe/moeTopKFuncs.cuh -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-06T02:38:11,640 copying ./csrc/fused_moe/noAuxTcKernels.cu -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-06T02:38:11,643 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:11,644 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_renormalize.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:11,647 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:11,650 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,651 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/RoutingRenormalizeCommon.cuh -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,654 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchInitExpertCounts.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,656 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchOffsetsKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,658 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchBlockKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,660 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchClusterKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,663 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramScoresKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,665 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:11,668 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,669 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/RoutingDeepSeekCommon.cuh -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,671 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchInitExpertCounts.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,673 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchOffsetsKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,676 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchMainKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,678 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchClusterKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,680 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchCoopKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,683 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchHistogramKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:11,685 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:11,688 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:11,691 copying ./csrc/single_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,693 copying ./csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,695 copying ./csrc/gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,697 copying ./csrc/flashinfer_xqa_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,699 copying ./csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,701 creating build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,702 copying ./csrc/xqa/mha.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,706 copying ./csrc/xqa/tma.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,709 copying ./csrc/xqa/mla_sm120.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,713 copying ./csrc/xqa/ldgsts.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,715 copying ./csrc/xqa/tensorMap.cpp -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,717 copying ./csrc/xqa/mha.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,720 copying ./csrc/xqa/hostUtils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,740 copying ./csrc/xqa/platform.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,742 copying ./csrc/xqa/gmma_impl.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,750 copying ./csrc/xqa/mha_components.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,753 copying ./csrc/xqa/mha_stdheaders.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,756 copying ./csrc/xqa/mma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,758 copying ./csrc/xqa/mla_sm120.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,760 copying ./csrc/xqa/utils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,764 copying ./csrc/xqa/defines.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,766 copying ./csrc/xqa/tensorMap.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,768 copying ./csrc/xqa/gmma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,770 copying ./csrc/xqa/cuda_hint.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,772 copying ./csrc/xqa/mhaUtils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,775 copying ./csrc/xqa/utils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,777 copying ./csrc/xqa/barriers.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,779 copying ./csrc/xqa/specDec.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,781 copying ./csrc/xqa/xqa_wrapper.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,784 copying ./csrc/xqa/mha_sm90.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-06T02:38:11,789 copying ./csrc/group_gemm_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,791 copying ./csrc/single_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,792 copying ./csrc/cascade.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,795 copying ./csrc/sampling.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,797 copying ./csrc/batch_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-06T02:38:11,800 creating build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,801 copying ./include/flashinfer/quantization.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,804 copying ./include/flashinfer/logging.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,806 creating build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,807 copying ./include/flashinfer/mamba/selective_state_update.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,809 copying ./include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,812 copying ./include/flashinfer/mamba/conversion.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,815 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,817 copying ./include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,820 copying ./include/flashinfer/mamba/create_tensor_map.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,822 copying ./include/flashinfer/mamba/common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:11,824 copying ./include/flashinfer/sampling.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,828 copying ./include/flashinfer/math.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,830 copying ./include/flashinfer/allocator.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,832 copying ./include/flashinfer/exception.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,834 copying ./include/flashinfer/concat_mla.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,837 copying ./include/flashinfer/mma.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,840 copying ./include/flashinfer/air_top_p.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,843 copying ./include/flashinfer/fp16.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,845 creating build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,846 copying ./include/flashinfer/comm/trtllm_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,849 copying ./include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,852 copying ./include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,855 copying ./include/flashinfer/comm/trtllm_alltoall.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,859 copying ./include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,861 copying ./include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,864 copying ./include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:11,867 copying ./include/flashinfer/attention_impl.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,869 copying ./include/flashinfer/topk.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,873 copying ./include/flashinfer/utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,875 copying ./include/flashinfer/cp_async.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,877 copying ./include/flashinfer/arch_condition.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,879 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-06T02:38:11,881 copying ./include/flashinfer/flat/hopper/device/device_universal.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-06T02:38:11,884 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:11,885 copying ./include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:11,887 copying ./include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:11,889 copying ./include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:11,891 copying ./include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:11,895 copying ./include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:11,898 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:11,899 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:11,902 copying ./include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:11,904 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:11,906 copying ./include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:11,909 creating build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-06T02:38:11,910 copying ./include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-06T02:38:11,913 copying ./include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-06T02:38:11,916 copying ./include/flashinfer/flat/math_order_barrier.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,918 copying ./include/flashinfer/flat/cute_ext.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,921 creating build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-06T02:38:11,922 copying ./include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-06T02:38:11,924 copying ./include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-06T02:38:11,926 copying ./include/flashinfer/flat/debug.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,929 copying ./include/flashinfer/flat/unused.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,930 copying ./include/flashinfer/flat/type_traits.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,933 copying ./include/flashinfer/flat/common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,934 copying ./include/flashinfer/flat/math.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:11,936 copying ./include/flashinfer/pos_enc.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,940 copying ./include/flashinfer/cutlass_utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,942 copying ./include/flashinfer/frag_layout_swizzle.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,943 copying ./include/flashinfer/fastdiv.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,945 copying ./include/flashinfer/activation.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,947 copying ./include/flashinfer/profiler.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,950 copying ./include/flashinfer/cubin_loader.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,952 copying ./include/flashinfer/permuted_smem.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,954 copying ./include/flashinfer/fp4_layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,956 copying ./include/flashinfer/page.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:11,959 creating build/lib/flashinfer/data/include/flashinfer/trtllm 2026-04-06T02:38:11,960 copying ./include/flashinfer/trtllm/common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm 2026-04-06T02:38:11,963 creating build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,964 copying ./include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,968 copying ./include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,970 copying ./include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,972 copying ./include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,975 copying ./include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,979 copying ./include/flashinfer/trtllm/common/cudaUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:11,982 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-06T02:38:11,983 copying ./include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-06T02:38:11,985 creating build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:11,987 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:11,995 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:12,002 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:12,007 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:12,011 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:12,014 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:12,017 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:12,021 creating build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,022 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,024 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,027 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,029 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,031 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaArchDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,034 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,036 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SparsityDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:12,038 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,039 copying ./include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,043 copying ./include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,045 copying ./include/flashinfer/trtllm/fused_moe/runner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,048 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,051 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,054 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,057 copying ./include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:12,061 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,062 copying ./include/flashinfer/trtllm/fmha/kernelUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,065 copying ./include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,068 copying ./include/flashinfer/trtllm/fmha/kernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,073 copying ./include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,076 copying ./include/flashinfer/trtllm/fmha/decoder_params.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,078 copying ./include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,083 copying ./include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,086 copying ./include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,089 copying ./include/flashinfer/trtllm/fmha/lse.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:12,091 creating build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,092 copying ./include/flashinfer/gemm/group_gemv.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,095 copying ./include/flashinfer/gemm/tgv_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,103 copying ./include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,105 copying ./include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,108 copying ./include/flashinfer/gemm/bmm_fp8.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,111 copying ./include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,114 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,116 copying ./include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,119 copying ./include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,122 copying ./include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,125 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,128 copying ./include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,131 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,133 copying ./include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,137 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,139 copying ./include/flashinfer/gemm/tgv_gemm_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,142 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,145 copying ./include/flashinfer/gemm/tgv_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,148 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,151 copying ./include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,154 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,156 copying ./include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,159 copying ./include/flashinfer/gemm/cutlass_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,162 copying ./include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,165 copying ./include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,168 copying ./include/flashinfer/gemm/group_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,171 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,174 copying ./include/flashinfer/gemm/group_gemm_lora.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,176 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,179 copying ./include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,183 copying ./include/flashinfer/gemm/group_gemm_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:12,186 copying ./include/flashinfer/norm.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:12,191 copying ./include/flashinfer/layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:12,195 copying ./include/flashinfer/vec_dtypes.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-06T02:38:12,202 creating build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,203 copying ./include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,207 copying ./include/flashinfer/attention/mask.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,209 copying ./include/flashinfer/attention/cascade.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,213 copying ./include/flashinfer/attention/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,215 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,216 copying ./include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,219 copying ./include/flashinfer/attention/hopper/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,222 copying ./include/flashinfer/attention/hopper/mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,225 copying ./include/flashinfer/attention/hopper/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,227 copying ./include/flashinfer/attention/hopper/default_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,229 copying ./include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,232 copying ./include/flashinfer/attention/hopper/utils.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,235 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,236 copying ./include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,239 copying ./include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,242 copying ./include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,245 copying ./include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,249 copying ./include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,252 copying ./include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:12,255 copying ./include/flashinfer/attention/hopper/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,258 copying ./include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,261 copying ./include/flashinfer/attention/hopper/named_barrier.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,263 copying ./include/flashinfer/attention/hopper/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,266 copying ./include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,270 copying ./include/flashinfer/attention/hopper/attention_updater.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:12,273 copying ./include/flashinfer/attention/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,276 copying ./include/flashinfer/attention/prefill.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,284 copying ./include/flashinfer/attention/mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,288 copying ./include/flashinfer/attention/cutlass_mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,291 copying ./include/flashinfer/attention/persistent_template.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,293 copying ./include/flashinfer/attention/heap.h -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,295 copying ./include/flashinfer/attention/mla_hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,300 copying ./include/flashinfer/attention/default_prefill_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,302 copying ./include/flashinfer/attention/default_decode_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,305 copying ./include/flashinfer/attention/scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,309 copying ./include/flashinfer/attention/persistent.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,312 copying ./include/flashinfer/attention/pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,315 copying ./include/flashinfer/attention/decode.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,318 copying ./include/flashinfer/attention/hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,321 copying ./include/flashinfer/attention/mla_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,323 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-06T02:38:12,324 copying ./include/flashinfer/attention/blackwell/plan.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-06T02:38:12,326 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-06T02:38:12,327 copying ./include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-06T02:38:12,330 copying ./include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-06T02:38:12,333 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-06T02:38:12,334 copying ./include/flashinfer/attention/blackwell/device/fmha.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-06T02:38:12,337 copying ./include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-06T02:38:12,341 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,342 copying ./include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,345 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,348 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,351 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,355 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,358 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,361 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,365 copying ./include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:12,368 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,370 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,375 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,378 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,381 copying ./include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,384 copying ./include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,387 copying ./include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,390 copying ./include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,393 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:12,396 copying ./include/flashinfer/attention/state.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,399 copying ./include/flashinfer/attention/batch_pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:12,402 creating build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,404 copying 3rdparty/spdlog/include/spdlog/async_logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,407 copying 3rdparty/spdlog/include/spdlog/stopwatch.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,410 copying 3rdparty/spdlog/include/spdlog/common-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,412 copying 3rdparty/spdlog/include/spdlog/formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,414 copying 3rdparty/spdlog/include/spdlog/spdlog.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,417 copying 3rdparty/spdlog/include/spdlog/common.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,420 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,422 copying 3rdparty/spdlog/include/spdlog/fmt/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,424 copying 3rdparty/spdlog/include/spdlog/fmt/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,427 copying 3rdparty/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,430 copying 3rdparty/spdlog/include/spdlog/fmt/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,432 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,433 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,437 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/core.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,441 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,445 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,447 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,452 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,455 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,457 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/locale.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,459 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,463 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,465 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/printf.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,468 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,470 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,473 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/color.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,476 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/args.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:12,478 copying 3rdparty/spdlog/include/spdlog/fmt/fmt.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,480 copying 3rdparty/spdlog/include/spdlog/fmt/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,482 copying 3rdparty/spdlog/include/spdlog/fmt/ostr.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,484 copying 3rdparty/spdlog/include/spdlog/fmt/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:12,487 creating build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,488 copying 3rdparty/spdlog/include/spdlog/details/os-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,491 copying 3rdparty/spdlog/include/spdlog/details/thread_pool.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,493 copying 3rdparty/spdlog/include/spdlog/details/thread_pool-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,495 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,497 copying 3rdparty/spdlog/include/spdlog/details/console_globals.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,499 copying 3rdparty/spdlog/include/spdlog/details/udp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,501 copying 3rdparty/spdlog/include/spdlog/details/null_mutex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,503 copying 3rdparty/spdlog/include/spdlog/details/tcp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,505 copying 3rdparty/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,508 copying 3rdparty/spdlog/include/spdlog/details/tcp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,510 copying 3rdparty/spdlog/include/spdlog/details/circular_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,513 copying 3rdparty/spdlog/include/spdlog/details/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,515 copying 3rdparty/spdlog/include/spdlog/details/backtracer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,517 copying 3rdparty/spdlog/include/spdlog/details/log_msg-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,519 copying 3rdparty/spdlog/include/spdlog/details/windows_include.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,521 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,523 copying 3rdparty/spdlog/include/spdlog/details/backtracer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,524 copying 3rdparty/spdlog/include/spdlog/details/registry-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,527 copying 3rdparty/spdlog/include/spdlog/details/udp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,529 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,531 copying 3rdparty/spdlog/include/spdlog/details/synchronous_factory.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,533 copying 3rdparty/spdlog/include/spdlog/details/fmt_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,536 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,540 copying 3rdparty/spdlog/include/spdlog/details/file_helper-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,542 copying 3rdparty/spdlog/include/spdlog/details/file_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,544 copying 3rdparty/spdlog/include/spdlog/details/log_msg.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,546 copying 3rdparty/spdlog/include/spdlog/details/registry.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:12,548 copying 3rdparty/spdlog/include/spdlog/logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,551 copying 3rdparty/spdlog/include/spdlog/logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,554 copying 3rdparty/spdlog/include/spdlog/mdc.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,557 copying 3rdparty/spdlog/include/spdlog/async_logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,559 copying 3rdparty/spdlog/include/spdlog/pattern_formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,561 copying 3rdparty/spdlog/include/spdlog/version.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,563 copying 3rdparty/spdlog/include/spdlog/async.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,565 creating build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:12,566 copying 3rdparty/spdlog/include/spdlog/cfg/helpers-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:12,568 copying 3rdparty/spdlog/include/spdlog/cfg/helpers.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:12,570 copying 3rdparty/spdlog/include/spdlog/cfg/env.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:12,572 copying 3rdparty/spdlog/include/spdlog/cfg/argv.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:12,574 copying 3rdparty/spdlog/include/spdlog/spdlog-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,576 copying 3rdparty/spdlog/include/spdlog/tweakme.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,578 copying 3rdparty/spdlog/include/spdlog/pattern_formatter-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,582 copying 3rdparty/spdlog/include/spdlog/fwd.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:12,584 creating build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,585 copying 3rdparty/spdlog/include/spdlog/sinks/udp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,587 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,590 copying 3rdparty/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,591 copying 3rdparty/spdlog/include/spdlog/sinks/systemd_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,594 copying 3rdparty/spdlog/include/spdlog/sinks/tcp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,596 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,598 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,600 copying 3rdparty/spdlog/include/spdlog/sinks/sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,602 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,604 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,607 copying 3rdparty/spdlog/include/spdlog/sinks/null_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,609 copying 3rdparty/spdlog/include/spdlog/sinks/callback_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,611 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,613 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,615 copying 3rdparty/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,618 copying 3rdparty/spdlog/include/spdlog/sinks/mongo_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,620 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,622 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,624 copying 3rdparty/spdlog/include/spdlog/sinks/android_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,626 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,628 copying 3rdparty/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,631 copying 3rdparty/spdlog/include/spdlog/sinks/syslog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,633 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,635 copying 3rdparty/spdlog/include/spdlog/sinks/ostream_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,637 copying 3rdparty/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,639 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,641 copying 3rdparty/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,643 copying 3rdparty/spdlog/include/spdlog/sinks/kafka_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,645 copying 3rdparty/spdlog/include/spdlog/sinks/msvc_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,647 copying 3rdparty/spdlog/include/spdlog/sinks/dist_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,649 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,651 copying 3rdparty/spdlog/include/spdlog/sinks/sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,653 copying 3rdparty/spdlog/include/spdlog/sinks/qt_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,655 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:12,658 creating build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,660 copying 3rdparty/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,663 copying 3rdparty/cutlass/include/cute/numeric/integral_constant.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,666 copying 3rdparty/cutlass/include/cute/numeric/real.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,668 copying 3rdparty/cutlass/include/cute/numeric/integral_ratio.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,670 copying 3rdparty/cutlass/include/cute/numeric/complex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,672 copying 3rdparty/cutlass/include/cute/numeric/numeric_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,675 copying 3rdparty/cutlass/include/cute/numeric/integer_sequence.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,677 copying 3rdparty/cutlass/include/cute/numeric/int.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,679 copying 3rdparty/cutlass/include/cute/numeric/math.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:12,682 copying 3rdparty/cutlass/include/cute/swizzle_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:12,685 creating build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,685 copying 3rdparty/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,689 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,692 copying 3rdparty/cutlass/include/cute/arch/copy_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,695 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,722 copying 3rdparty/cutlass/include/cute/arch/util.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,724 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,728 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,730 copying 3rdparty/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,734 copying 3rdparty/cutlass/include/cute/arch/mma_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,738 copying 3rdparty/cutlass/include/cute/arch/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,740 copying 3rdparty/cutlass/include/cute/arch/mma_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,742 copying 3rdparty/cutlass/include/cute/arch/mma_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,746 copying 3rdparty/cutlass/include/cute/arch/copy_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,748 copying 3rdparty/cutlass/include/cute/arch/mma_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,756 copying 3rdparty/cutlass/include/cute/arch/simd_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,758 copying 3rdparty/cutlass/include/cute/arch/copy_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,760 copying 3rdparty/cutlass/include/cute/arch/cluster_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,763 copying 3rdparty/cutlass/include/cute/arch/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,765 copying 3rdparty/cutlass/include/cute/arch/mma_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,767 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,814 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,856 copying 3rdparty/cutlass/include/cute/arch/mma_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,859 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,862 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,880 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,883 copying 3rdparty/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,886 copying 3rdparty/cutlass/include/cute/arch/mma_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,889 copying 3rdparty/cutlass/include/cute/arch/mma_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,891 copying 3rdparty/cutlass/include/cute/arch/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,894 copying 3rdparty/cutlass/include/cute/arch/copy_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,896 copying 3rdparty/cutlass/include/cute/arch/cluster_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:12,899 copying 3rdparty/cutlass/include/cute/arch/copy_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:13,120 copying 3rdparty/cutlass/include/cute/pointer_flagged.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,123 copying 3rdparty/cutlass/include/cute/layout_composed.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,126 creating build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,127 copying 3rdparty/cutlass/include/cute/algorithm/fill.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,129 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,132 copying 3rdparty/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,134 copying 3rdparty/cutlass/include/cute/algorithm/prefetch.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,137 copying 3rdparty/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,140 copying 3rdparty/cutlass/include/cute/algorithm/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,144 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,148 copying 3rdparty/cutlass/include/cute/algorithm/clear.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,150 copying 3rdparty/cutlass/include/cute/algorithm/axpby.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,153 copying 3rdparty/cutlass/include/cute/algorithm/gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,157 copying 3rdparty/cutlass/include/cute/algorithm/prefer.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,160 copying 3rdparty/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,163 copying 3rdparty/cutlass/include/cute/algorithm/functional.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:13,167 copying 3rdparty/cutlass/include/cute/stride.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,171 copying 3rdparty/cutlass/include/cute/swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,174 creating build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,175 copying 3rdparty/cutlass/include/cute/util/print_svg.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,178 copying 3rdparty/cutlass/include/cute/util/debug.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,181 copying 3rdparty/cutlass/include/cute/util/print.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,184 copying 3rdparty/cutlass/include/cute/util/print_latex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,188 copying 3rdparty/cutlass/include/cute/util/type_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,191 copying 3rdparty/cutlass/include/cute/util/print_tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:13,194 copying 3rdparty/cutlass/include/cute/tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,196 creating build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,197 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,201 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,204 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,210 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,213 copying 3rdparty/cutlass/include/cute/atom/copy_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,216 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,219 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,221 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,226 copying 3rdparty/cutlass/include/cute/atom/partitioner.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,228 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,232 copying 3rdparty/cutlass/include/cute/atom/mma_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,235 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,244 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,247 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,262 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,265 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,268 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,271 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,274 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,291 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,300 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,303 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,307 copying 3rdparty/cutlass/include/cute/atom/mma_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,310 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,314 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,317 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,353 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,355 copying 3rdparty/cutlass/include/cute/atom/copy_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:13,357 copying 3rdparty/cutlass/include/cute/underscore.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,360 copying 3rdparty/cutlass/include/cute/int_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,362 copying 3rdparty/cutlass/include/cute/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,365 copying 3rdparty/cutlass/include/cute/pointer_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,367 copying 3rdparty/cutlass/include/cute/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,370 creating build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,371 copying 3rdparty/cutlass/include/cute/container/alignment.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,374 copying 3rdparty/cutlass/include/cute/container/array.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,376 copying 3rdparty/cutlass/include/cute/container/bit_field.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,378 copying 3rdparty/cutlass/include/cute/container/tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,381 copying 3rdparty/cutlass/include/cute/container/array_aligned.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,383 copying 3rdparty/cutlass/include/cute/container/array_subbyte.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,386 copying 3rdparty/cutlass/include/cute/container/cuda_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,388 copying 3rdparty/cutlass/include/cute/container/type_list.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:13,390 copying 3rdparty/cutlass/include/cute/tensor_impl.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,393 copying 3rdparty/cutlass/include/cute/pointer_base.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,396 copying 3rdparty/cutlass/include/cute/tensor_zip.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,398 copying 3rdparty/cutlass/include/cute/pointer.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,401 copying 3rdparty/cutlass/include/cute/pointer_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:13,403 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-06T02:38:13,405 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-06T02:38:13,409 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-06T02:38:13,411 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:13,412 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:13,416 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:13,420 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:13,422 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:13,423 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:13,426 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:13,428 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:13,430 creating build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-06T02:38:13,432 copying 3rdparty/cutlass/include/cutlass/thread/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-06T02:38:13,435 copying 3rdparty/cutlass/include/cutlass/constants.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,438 copying 3rdparty/cutlass/include/cutlass/fast_math.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,442 copying 3rdparty/cutlass/include/cutlass/wmma_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,444 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,446 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,449 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,452 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,455 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,458 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,460 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,463 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,466 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,469 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,475 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,478 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,482 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,485 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,488 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,491 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,494 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,497 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,501 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/activation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,504 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,507 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,510 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,513 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,516 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,519 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,521 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:13,524 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,525 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,530 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,535 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,542 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,546 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,549 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,552 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,555 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,558 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,561 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,566 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,570 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,574 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:13,577 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,578 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,580 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,583 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,585 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,588 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,590 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,592 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,595 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,598 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,600 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,603 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,605 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,608 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,610 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,612 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:13,615 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,616 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,619 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,622 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,625 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,628 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,631 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,635 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,638 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,640 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,643 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,647 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,650 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,653 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,656 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,658 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,661 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,664 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,666 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:13,669 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,670 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,672 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,675 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,679 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,682 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,684 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:13,687 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,688 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,691 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,694 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,697 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,700 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,703 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,706 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,708 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,711 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,713 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,716 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,719 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,721 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:13,722 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:13,725 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:13,727 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:13,730 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:13,733 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:13,735 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,738 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,740 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,743 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,745 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,748 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,750 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,753 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,755 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,758 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,761 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,763 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,766 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,768 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,771 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,773 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,776 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,779 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,781 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,784 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,786 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,789 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,792 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,794 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,796 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,799 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,802 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,805 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,807 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,809 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,812 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,815 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,817 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,820 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,823 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,825 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,828 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:13,831 copying 3rdparty/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-06T02:38:13,833 copying 3rdparty/cutlass/include/cutlass/gemm_coord.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,835 copying 3rdparty/cutlass/include/cutlass/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,842 copying 3rdparty/cutlass/include/cutlass/real.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,845 copying 3rdparty/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,847 creating build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:13,848 copying 3rdparty/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:13,851 copying 3rdparty/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:13,854 copying 3rdparty/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:13,857 copying 3rdparty/cutlass/include/cutlass/float_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,860 copying 3rdparty/cutlass/include/cutlass/gemm_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:13,862 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-06T02:38:13,864 copying 3rdparty/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-06T02:38:13,867 copying 3rdparty/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:13,870 copying 3rdparty/cutlass/include/cutlass/conv/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:13,872 copying 3rdparty/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:13,874 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:13,875 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:13,878 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:13,882 copying 3rdparty/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:13,884 copying 3rdparty/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:13,888 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:13,889 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:13,891 copying 3rdparty/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:13,894 copying 3rdparty/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:13,897 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:13,899 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:13,900 copying 3rdparty/cutlass/include/cutlass/conv/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:13,903 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:13,906 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:13,908 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:13,910 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:13,912 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:13,914 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:13,917 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:13,919 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:13,921 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:13,924 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,925 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,927 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,930 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,932 copying 3rdparty/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,935 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,937 copying 3rdparty/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,939 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,942 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,944 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,947 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,950 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,952 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,955 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,958 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,961 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,964 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,967 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,969 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,972 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,975 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,977 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,980 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,983 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,985 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,988 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,992 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,995 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:13,997 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:14,000 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:14,003 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,004 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,007 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,009 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,012 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,015 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,017 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,020 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,023 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,025 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,028 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,030 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,033 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,036 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,038 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,041 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,044 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,048 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,052 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,055 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,059 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,062 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,065 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,069 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,072 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,075 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,078 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,083 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,086 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,089 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,092 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,095 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,098 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,102 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,105 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,108 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,110 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,114 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,117 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,120 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,123 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,126 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,130 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,133 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,136 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,139 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,142 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:14,145 copying 3rdparty/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:14,147 copying 3rdparty/cutlass/include/cutlass/conv/convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:14,150 copying 3rdparty/cutlass/include/cutlass/uint256.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,153 copying 3rdparty/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,155 copying 3rdparty/cutlass/include/cutlass/half.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,158 copying 3rdparty/cutlass/include/cutlass/bfloat16.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,162 copying 3rdparty/cutlass/include/cutlass/integer_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,165 creating build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,166 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,171 copying 3rdparty/cutlass/include/cutlass/arch/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,174 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,177 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm90.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,180 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,182 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,185 copying 3rdparty/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,188 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,191 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,195 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm72.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,198 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,202 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,205 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,208 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,210 copying 3rdparty/cutlass/include/cutlass/arch/config.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,213 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,217 copying 3rdparty/cutlass/include/cutlass/arch/wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,220 copying 3rdparty/cutlass/include/cutlass/arch/cache_operation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,222 copying 3rdparty/cutlass/include/cutlass/arch/memory.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,226 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,229 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm100.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,232 copying 3rdparty/cutlass/include/cutlass/arch/reg_reconfig.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,234 copying 3rdparty/cutlass/include/cutlass/arch/synclog.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,238 copying 3rdparty/cutlass/include/cutlass/arch/arch.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,241 copying 3rdparty/cutlass/include/cutlass/arch/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,244 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,246 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,249 copying 3rdparty/cutlass/include/cutlass/arch/simd.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,251 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:14,253 copying 3rdparty/cutlass/include/cutlass/coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,256 copying 3rdparty/cutlass/include/cutlass/floating_point_nvrtc.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,258 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,260 copying 3rdparty/cutlass/include/cutlass/tensor_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,262 copying 3rdparty/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,265 copying 3rdparty/cutlass/include/cutlass/array_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,267 copying 3rdparty/cutlass/include/cutlass/blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,269 creating build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,270 copying 3rdparty/cutlass/include/cutlass/layout/pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,273 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,276 copying 3rdparty/cutlass/include/cutlass/layout/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,278 copying 3rdparty/cutlass/include/cutlass/layout/permute.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,281 copying 3rdparty/cutlass/include/cutlass/layout/tensor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,284 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,287 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,289 copying 3rdparty/cutlass/include/cutlass/layout/vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,291 copying 3rdparty/cutlass/include/cutlass/layout/layout.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:14,293 copying 3rdparty/cutlass/include/cutlass/block_striped.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,296 copying 3rdparty/cutlass/include/cutlass/kernel_launch.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,298 copying 3rdparty/cutlass/include/cutlass/tensor_ref.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,301 copying 3rdparty/cutlass/include/cutlass/aligned_buffer.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,303 copying 3rdparty/cutlass/include/cutlass/blas3_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,304 copying 3rdparty/cutlass/include/cutlass/uint128.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,307 copying 3rdparty/cutlass/include/cutlass/numeric_conversion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,313 copying 3rdparty/cutlass/include/cutlass/workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,316 copying 3rdparty/cutlass/include/cutlass/predicate_vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,318 copying 3rdparty/cutlass/include/cutlass/version.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,320 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-06T02:38:14,322 copying 3rdparty/cutlass/include/cutlass/transform/thread/transpose.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-06T02:38:14,324 copying 3rdparty/cutlass/include/cutlass/transform/thread/unary_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-06T02:38:14,326 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-06T02:38:14,327 copying 3rdparty/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-06T02:38:14,330 copying 3rdparty/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform 2026-04-06T02:38:14,334 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-06T02:38:14,335 copying 3rdparty/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-06T02:38:14,338 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-06T02:38:14,339 copying 3rdparty/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-06T02:38:14,342 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:14,343 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:14,347 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:14,349 copying 3rdparty/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:14,352 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,353 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,356 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,359 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,362 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,369 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,375 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,382 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,389 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,393 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,397 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,402 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,409 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,416 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,454 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,499 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,502 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,505 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,518 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,520 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,523 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,525 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,527 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,531 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,553 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,555 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:14,558 creating build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-06T02:38:14,559 copying 3rdparty/cutlass/include/cutlass/platform/platform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-06T02:38:14,562 copying 3rdparty/cutlass/include/cutlass/device_kernel.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,565 copying 3rdparty/cutlass/include/cutlass/numeric_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,567 copying 3rdparty/cutlass/include/cutlass/trace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,569 copying 3rdparty/cutlass/include/cutlass/array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,575 copying 3rdparty/cutlass/include/cutlass/core_io.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,578 copying 3rdparty/cutlass/include/cutlass/matrix_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,580 copying 3rdparty/cutlass/include/cutlass/cutlass.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,582 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-06T02:38:14,584 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-06T02:38:14,587 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-06T02:38:14,589 copying 3rdparty/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-06T02:38:14,591 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:14,592 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:14,595 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:14,597 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:14,600 copying 3rdparty/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:14,603 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:14,604 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:14,606 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:14,609 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:14,611 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:14,614 copying 3rdparty/cutlass/include/cutlass/quaternion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,617 copying 3rdparty/cutlass/include/cutlass/tfloat32.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,619 copying 3rdparty/cutlass/include/cutlass/numeric_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,621 copying 3rdparty/cutlass/include/cutlass/tensor_view.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,624 copying 3rdparty/cutlass/include/cutlass/semaphore.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,626 copying 3rdparty/cutlass/include/cutlass/subbyte_reference.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,629 copying 3rdparty/cutlass/include/cutlass/exmy_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,632 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,633 copying 3rdparty/cutlass/include/cutlass/detail/cluster.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,635 copying 3rdparty/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,637 copying 3rdparty/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,640 copying 3rdparty/cutlass/include/cutlass/detail/helper_macros.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,642 copying 3rdparty/cutlass/include/cutlass/detail/dependent_false.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,644 copying 3rdparty/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,647 copying 3rdparty/cutlass/include/cutlass/detail/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,649 copying 3rdparty/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,652 copying 3rdparty/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,654 copying 3rdparty/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,656 copying 3rdparty/cutlass/include/cutlass/detail/collective.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,659 copying 3rdparty/cutlass/include/cutlass/detail/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:14,661 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:14,662 copying 3rdparty/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:14,665 copying 3rdparty/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:14,668 copying 3rdparty/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:14,670 copying 3rdparty/cutlass/include/cutlass/pitch_linear_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,672 copying 3rdparty/cutlass/include/cutlass/float8.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:14,675 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:14,677 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:14,679 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:14,681 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:14,684 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:14,687 copying 3rdparty/cutlass/include/cutlass/gemm/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:14,690 copying 3rdparty/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:14,691 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,692 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,695 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,698 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,701 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,703 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,705 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,708 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,711 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,713 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,716 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,718 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,721 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,723 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,726 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,732 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,734 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,737 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,740 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,744 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,746 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,749 copying 3rdparty/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,751 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,753 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,756 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,758 copying 3rdparty/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,761 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,763 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,766 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,769 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,772 copying 3rdparty/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,775 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,777 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,780 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,784 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,787 copying 3rdparty/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:14,789 copying 3rdparty/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:14,791 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,792 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,795 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,798 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,801 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,803 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,806 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,808 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,811 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,814 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,817 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,819 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,821 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,824 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,827 copying 3rdparty/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,830 copying 3rdparty/cutlass/include/cutlass/gemm/device/symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,832 copying 3rdparty/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,835 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,838 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,840 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,842 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,845 copying 3rdparty/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,848 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,850 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,853 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,855 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,857 copying 3rdparty/cutlass/include/cutlass/gemm/device/trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,860 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,862 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,864 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,867 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:14,869 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,870 copying 3rdparty/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,873 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,875 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,878 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,881 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,884 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,887 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,891 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,894 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,896 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,900 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,903 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,906 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,909 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,912 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,915 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,919 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,921 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,925 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,928 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,932 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,935 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,938 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,942 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,946 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,949 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,952 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,958 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,962 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,971 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,977 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,983 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,988 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:14,994 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,000 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,005 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,011 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,018 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,023 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,028 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,031 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,036 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,041 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,047 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,051 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,054 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,058 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,061 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,065 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,069 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,072 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,074 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,078 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:15,081 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,083 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,086 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,089 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,092 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,096 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,100 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,103 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,106 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,109 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,112 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,115 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,118 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,122 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,159 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,163 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,166 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,170 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,175 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,180 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,186 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,190 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,407 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,410 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,413 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,416 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,419 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,423 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,426 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,429 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:15,432 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,433 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,436 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,439 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,442 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,445 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,448 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,451 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,454 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,457 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,460 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,464 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,467 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,470 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,473 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,476 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,480 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,483 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,486 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,489 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,492 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,494 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,497 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,499 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,501 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,504 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,507 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,509 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,512 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,514 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,517 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,519 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,521 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,524 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,526 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,529 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,531 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,535 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,537 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,539 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,542 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,545 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,548 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,550 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,553 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,555 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,559 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,561 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,566 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,569 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,571 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,574 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,576 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,579 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,581 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,585 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,587 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,591 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,594 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,598 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,601 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,604 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,607 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,609 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,612 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,615 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,617 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,620 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,623 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,625 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,628 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,631 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,633 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,636 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,639 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,641 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,643 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,646 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,649 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,652 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,654 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,656 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,660 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,663 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,666 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,669 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,673 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,675 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,678 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,681 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,684 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,688 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,691 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,693 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,696 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,699 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,702 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,705 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,708 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,711 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,713 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,716 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,718 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,721 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,724 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,727 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,730 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,733 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,737 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,742 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,745 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,748 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,752 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,755 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,759 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,764 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:15,773 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,775 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,779 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,786 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,791 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,800 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,805 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,810 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,814 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,818 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,823 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,827 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,832 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,838 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,843 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,850 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,854 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,858 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,862 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,865 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,868 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,876 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,882 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,886 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,889 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,892 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,895 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,898 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,901 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,904 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,907 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,911 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:15,913 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,128 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,130 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,133 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,135 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,138 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,140 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,143 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,146 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,148 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,151 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,154 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,157 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:16,161 copying 3rdparty/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:16,165 copying 3rdparty/cutlass/include/cutlass/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,167 copying 3rdparty/cutlass/include/cutlass/cluster_launch.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,170 copying 3rdparty/cutlass/include/cutlass/functional.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,173 copying 3rdparty/cutlass/include/cutlass/relatively_equal.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,175 copying 3rdparty/cutlass/include/cutlass/array_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,177 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,180 copying 3rdparty/cutlass/include/cutlass/matrix_shape.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,182 copying 3rdparty/cutlass/include/cutlass/complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:16,184 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,186 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,189 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-06T02:38:16,191 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-06T02:38:16,193 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,196 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,198 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,201 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,204 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,206 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,208 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,211 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,213 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,216 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,219 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:16,220 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:16,223 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:16,225 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:16,228 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:16,231 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-06T02:38:16,232 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-06T02:38:16,234 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-06T02:38:16,236 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,237 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,240 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,242 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,244 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,247 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,249 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,252 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,254 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,257 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,259 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,261 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,264 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,266 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,269 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,271 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,274 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,277 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,279 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,282 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,284 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,287 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,290 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,293 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,295 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:16,298 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,300 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,303 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,306 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,308 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,311 copying 3rdparty/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,314 copying 3rdparty/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,317 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,319 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,321 copying 3rdparty/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,324 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,327 copying 3rdparty/cutlass/tools/util/include/cutlass/util/distribution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,329 copying 3rdparty/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,332 copying 3rdparty/cutlass/tools/util/include/cutlass/util/debug.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,334 copying 3rdparty/cutlass/tools/util/include/cutlass/util/command_line.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,337 copying 3rdparty/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,339 copying 3rdparty/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,341 copying 3rdparty/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,344 copying 3rdparty/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,346 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,348 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,351 copying 3rdparty/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,353 copying 3rdparty/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,355 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,357 copying 3rdparty/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,359 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,362 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:16,490 installing to build/bdist.linux-armv7l/wheel 2026-04-06T02:38:16,491 running install 2026-04-06T02:38:16,514 running install_lib 2026-04-06T02:38:16,521 creating build/bdist.linux-armv7l/wheel 2026-04-06T02:38:16,524 creating build/bdist.linux-armv7l/wheel/flashinfer 2026-04-06T02:38:16,525 copying build/lib/flashinfer/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,530 copying build/lib/flashinfer/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,532 copying build/lib/flashinfer/_build_meta.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,534 copying build/lib/flashinfer/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,538 copying build/lib/flashinfer/__main__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,541 copying build/lib/flashinfer/autotuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,544 copying build/lib/flashinfer/trtllm_low_latency_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,547 copying build/lib/flashinfer/gdn_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,550 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl 2026-04-06T02:38:16,551 copying build/lib/flashinfer/cute_dsl/fp4_common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,554 copying build/lib/flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,557 copying build/lib/flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,562 copying build/lib/flashinfer/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,565 copying build/lib/flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,569 copying build/lib/flashinfer/cute_dsl/blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,571 copying build/lib/flashinfer/cute_dsl/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-06T02:38:16,575 creating build/bdist.linux-armv7l/wheel/flashinfer/mamba 2026-04-06T02:38:16,576 copying build/lib/flashinfer/mamba/ssd_combined.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-06T02:38:16,580 copying build/lib/flashinfer/mamba/ssd_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-06T02:38:16,586 copying build/lib/flashinfer/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-06T02:38:16,589 copying build/lib/flashinfer/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-06T02:38:16,592 copying build/lib/flashinfer/mamba/ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-06T02:38:16,595 copying build/lib/flashinfer/artifacts.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,598 copying build/lib/flashinfer/tllm_enums.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,601 copying build/lib/flashinfer/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,603 copying build/lib/flashinfer/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,607 copying build/lib/flashinfer/version.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,609 copying build/lib/flashinfer/attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,613 creating build/bdist.linux-armv7l/wheel/flashinfer/testing 2026-04-06T02:38:16,614 copying build/lib/flashinfer/testing/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-06T02:38:16,617 copying build/lib/flashinfer/testing/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-06T02:38:16,621 copying build/lib/flashinfer/aot.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,626 creating build/bdist.linux-armv7l/wheel/flashinfer/tuning_configs 2026-04-06T02:38:16,627 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-06T02:38:16,630 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-06T02:38:16,633 copying build/lib/flashinfer/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,636 creating build/bdist.linux-armv7l/wheel/flashinfer/comm 2026-04-06T02:38:16,638 copying build/lib/flashinfer/comm/mapping.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,641 copying build/lib/flashinfer/comm/trtllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,645 copying build/lib/flashinfer/comm/nvshmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,648 copying build/lib/flashinfer/comm/workspace_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,650 copying build/lib/flashinfer/comm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,653 copying build/lib/flashinfer/comm/mnnvl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,658 copying build/lib/flashinfer/comm/vllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,660 copying build/lib/flashinfer/comm/dlpack_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,662 copying build/lib/flashinfer/comm/trtllm_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,665 copying build/lib/flashinfer/comm/allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,667 copying build/lib/flashinfer/comm/trtllm_mnnvl_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,670 copying build/lib/flashinfer/comm/trtllm_moe_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,672 copying build/lib/flashinfer/comm/cuda_ipc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,674 copying build/lib/flashinfer/comm/nvshmem_allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-06T02:38:16,677 copying build/lib/flashinfer/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,682 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels 2026-04-06T02:38:16,683 copying build/lib/flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-06T02:38:16,686 copying build/lib/flashinfer/gdn_kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-06T02:38:16,688 copying build/lib/flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-06T02:38:16,692 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:16,693 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:16,698 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/gdn_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:16,700 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:16,702 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/gdn_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-06T02:38:16,705 copying build/lib/flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-06T02:38:16,708 copying build/lib/flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-06T02:38:16,713 creating build/bdist.linux-armv7l/wheel/flashinfer/norm 2026-04-06T02:38:16,715 copying build/lib/flashinfer/norm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-06T02:38:16,718 creating build/bdist.linux-armv7l/wheel/flashinfer/norm/kernels 2026-04-06T02:38:16,720 copying build/lib/flashinfer/norm/kernels/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-06T02:38:16,723 copying build/lib/flashinfer/norm/kernels/layernorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-06T02:38:16,726 copying build/lib/flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-06T02:38:16,729 copying build/lib/flashinfer/norm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-06T02:38:16,731 copying build/lib/flashinfer/norm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-06T02:38:16,734 copying build/lib/flashinfer/pod.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,739 creating build/bdist.linux-armv7l/wheel/flashinfer/triton 2026-04-06T02:38:16,740 copying build/lib/flashinfer/triton/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,743 copying build/lib/flashinfer/triton/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,745 copying build/lib/flashinfer/triton/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,747 copying build/lib/flashinfer/triton/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,750 copying build/lib/flashinfer/triton/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,752 copying build/lib/flashinfer/triton/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,755 creating build/bdist.linux-armv7l/wheel/flashinfer/triton/kernels 2026-04-06T02:38:16,756 copying build/lib/flashinfer/triton/kernels/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,759 copying build/lib/flashinfer/triton/kernels/quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,761 copying build/lib/flashinfer/triton/kernels/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,763 copying build/lib/flashinfer/triton/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,765 copying build/lib/flashinfer/triton/kernels/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,768 copying build/lib/flashinfer/triton/kernels/ssd_chunk_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,770 copying build/lib/flashinfer/triton/kernels/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-06T02:38:16,772 copying build/lib/flashinfer/triton/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,775 copying build/lib/flashinfer/triton/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-06T02:38:16,777 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization 2026-04-06T02:38:16,779 copying build/lib/flashinfer/quantization/quantization_cute_dsl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-06T02:38:16,782 copying build/lib/flashinfer/quantization/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-06T02:38:16,785 copying build/lib/flashinfer/quantization/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-06T02:38:16,787 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization/kernels 2026-04-06T02:38:16,789 copying build/lib/flashinfer/quantization/kernels/mxfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-06T02:38:16,792 copying build/lib/flashinfer/quantization/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-06T02:38:16,794 copying build/lib/flashinfer/quantization/kernels/mxfp8_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-06T02:38:16,797 copying build/lib/flashinfer/quantization/packbits.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-06T02:38:16,799 copying build/lib/flashinfer/quantization/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-06T02:38:16,803 copying build/lib/flashinfer/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,806 copying build/lib/flashinfer/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,811 creating build/bdist.linux-armv7l/wheel/flashinfer/logits_processor 2026-04-06T02:38:16,812 copying build/lib/flashinfer/logits_processor/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,814 copying build/lib/flashinfer/logits_processor/fusion_rules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,817 copying build/lib/flashinfer/logits_processor/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,819 copying build/lib/flashinfer/logits_processor/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,822 copying build/lib/flashinfer/logits_processor/legalization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,824 copying build/lib/flashinfer/logits_processor/operators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,826 copying build/lib/flashinfer/logits_processor/processors.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,829 copying build/lib/flashinfer/logits_processor/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,831 copying build/lib/flashinfer/logits_processor/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,833 copying build/lib/flashinfer/logits_processor/validators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-06T02:38:16,835 copying build/lib/flashinfer/green_ctx.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:16,838 creating build/bdist.linux-armv7l/wheel/flashinfer/jit 2026-04-06T02:38:16,839 copying build/lib/flashinfer/jit/spdlog.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,841 copying build/lib/flashinfer/jit/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,842 copying build/lib/flashinfer/jit/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,844 copying build/lib/flashinfer/jit/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,846 copying build/lib/flashinfer/jit/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,848 copying build/lib/flashinfer/jit/dsv3_optimizations.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,850 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/mamba 2026-04-06T02:38:16,851 copying build/lib/flashinfer/jit/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-06T02:38:16,853 copying build/lib/flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-06T02:38:16,855 copying build/lib/flashinfer/jit/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-06T02:38:16,857 copying build/lib/flashinfer/jit/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,859 copying build/lib/flashinfer/jit/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,861 copying build/lib/flashinfer/jit/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,863 copying build/lib/flashinfer/jit/fp4_kv_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,865 copying build/lib/flashinfer/jit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,867 copying build/lib/flashinfer/jit/env.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,869 copying build/lib/flashinfer/jit/tinygemm2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,871 copying build/lib/flashinfer/jit/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,873 copying build/lib/flashinfer/jit/fp4_kv_dequantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,875 copying build/lib/flashinfer/jit/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,877 copying build/lib/flashinfer/jit/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,879 copying build/lib/flashinfer/jit/cubin_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,881 copying build/lib/flashinfer/jit/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,884 copying build/lib/flashinfer/jit/comm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,886 copying build/lib/flashinfer/jit/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,888 copying build/lib/flashinfer/jit/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,892 copying build/lib/flashinfer/jit/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,894 copying build/lib/flashinfer/jit/cpp_ext.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,897 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm 2026-04-06T02:38:16,899 copying build/lib/flashinfer/jit/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-06T02:38:16,901 copying build/lib/flashinfer/jit/gemm/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-06T02:38:16,905 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm/cutlass 2026-04-06T02:38:16,906 copying build/lib/flashinfer/jit/gemm/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-06T02:38:16,908 copying build/lib/flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-06T02:38:16,912 copying build/lib/flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-06T02:38:16,916 copying build/lib/flashinfer/jit/gemm/fp8_blockscale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-06T02:38:16,918 copying build/lib/flashinfer/jit/gemm/deepgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-06T02:38:16,921 copying build/lib/flashinfer/jit/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,923 copying build/lib/flashinfer/jit/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,925 copying build/lib/flashinfer/jit/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,927 copying build/lib/flashinfer/jit/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-06T02:38:16,930 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention 2026-04-06T02:38:16,931 copying build/lib/flashinfer/jit/attention/modules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-06T02:38:16,936 copying build/lib/flashinfer/jit/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-06T02:38:16,938 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:16,940 copying build/lib/flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:16,946 copying build/lib/flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:16,952 copying build/lib/flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:17,001 copying build/lib/flashinfer/jit/attention/fmha_v2/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-06T02:38:17,004 copying build/lib/flashinfer/jit/attention/variants.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-06T02:38:17,006 copying build/lib/flashinfer/jit/attention/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-06T02:38:17,009 copying build/lib/flashinfer/sparse.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,012 copying build/lib/flashinfer/py.typed -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,015 creating build/bdist.linux-armv7l/wheel/flashinfer/dsv3_ops 2026-04-06T02:38:17,016 copying build/lib/flashinfer/dsv3_ops/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/dsv3_ops 2026-04-06T02:38:17,019 copying build/lib/flashinfer/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,021 copying build/lib/flashinfer/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,024 copying build/lib/flashinfer/compilation_context.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,026 copying build/lib/flashinfer/deep_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,029 copying build/lib/flashinfer/api_logging.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,033 copying build/lib/flashinfer/cuda_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,034 copying build/lib/flashinfer/concat_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,037 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm 2026-04-06T02:38:17,038 copying build/lib/flashinfer/gemm/routergemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-06T02:38:17,040 copying build/lib/flashinfer/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-06T02:38:17,043 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm/kernels 2026-04-06T02:38:17,044 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-06T02:38:17,048 copying build/lib/flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-06T02:38:17,052 copying build/lib/flashinfer/gemm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-06T02:38:17,054 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-06T02:38:17,057 copying build/lib/flashinfer/gemm/gemm_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-06T02:38:17,063 copying build/lib/flashinfer/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,065 copying build/lib/flashinfer/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:17,067 creating build/bdist.linux-armv7l/wheel/flashinfer/cudnn 2026-04-06T02:38:17,068 copying build/lib/flashinfer/cudnn/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-06T02:38:17,071 copying build/lib/flashinfer/cudnn/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-06T02:38:17,072 copying build/lib/flashinfer/cudnn/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-06T02:38:17,075 copying build/lib/flashinfer/cudnn/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-06T02:38:17,078 creating build/bdist.linux-armv7l/wheel/flashinfer/data 2026-04-06T02:38:17,079 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog 2026-04-06T02:38:17,081 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/scripts 2026-04-06T02:38:17,082 copying build/lib/flashinfer/data/spdlog/scripts/extract_version.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/scripts 2026-04-06T02:38:17,085 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include 2026-04-06T02:38:17,086 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,088 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,090 copying build/lib/flashinfer/data/spdlog/include/spdlog/stopwatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,092 copying build/lib/flashinfer/data/spdlog/include/spdlog/common-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,094 copying build/lib/flashinfer/data/spdlog/include/spdlog/formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,096 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,098 copying build/lib/flashinfer/data/spdlog/include/spdlog/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,101 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,102 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,104 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,106 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,108 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,111 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,112 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,115 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,120 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,123 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,125 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,131 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,134 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,135 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,137 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,141 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,143 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,146 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,149 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,151 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,154 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-06T02:38:17,157 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/fmt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,158 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,160 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ostr.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,162 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-06T02:38:17,165 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,166 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,169 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,170 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,173 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,175 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/console_globals.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,177 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,179 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/null_mutex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,181 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,183 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,185 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,187 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/circular_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,189 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,191 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,193 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,195 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/windows_include.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,197 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,199 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,201 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,203 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,205 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,207 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,209 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,211 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,213 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,215 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,216 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,218 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-06T02:38:17,220 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,223 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,225 copying build/lib/flashinfer/data/spdlog/include/spdlog/mdc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,227 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,229 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,231 copying build/lib/flashinfer/data/spdlog/include/spdlog/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,233 copying build/lib/flashinfer/data/spdlog/include/spdlog/async.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,235 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:17,237 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:17,239 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:17,241 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/env.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:17,242 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/argv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-06T02:38:17,244 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,246 copying build/lib/flashinfer/data/spdlog/include/spdlog/tweakme.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,249 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,252 copying build/lib/flashinfer/data/spdlog/include/spdlog/fwd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-06T02:38:17,255 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,256 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,258 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,260 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,262 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,265 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,267 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,268 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,270 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,272 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,274 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,277 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,279 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,280 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,282 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,284 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,287 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,289 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,291 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,293 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,295 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,297 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,300 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,302 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,304 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,305 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,307 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,309 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,312 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,314 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,316 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,318 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,320 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,321 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,324 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-06T02:38:17,329 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc 2026-04-06T02:38:17,330 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,332 copying build/lib/flashinfer/data/csrc/single_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,334 copying build/lib/flashinfer/data/csrc/logging.cc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,336 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,338 copying build/lib/flashinfer/data/csrc/batch_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,340 copying build/lib/flashinfer/data/csrc/runtime_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,343 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,346 copying build/lib/flashinfer/data/csrc/batch_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,349 copying build/lib/flashinfer/data/csrc/single_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,351 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,353 copying build/lib/flashinfer/data/csrc/fmhaReduction.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,356 copying build/lib/flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,357 copying build/lib/flashinfer/data/csrc/fmha_v2_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,359 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,362 copying build/lib/flashinfer/data/csrc/trtllm_alltoall_prepare.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,365 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,367 copying build/lib/flashinfer/data/csrc/cutlass_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,370 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,373 copying build/lib/flashinfer/data/csrc/tgv_gemm.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,376 copying build/lib/flashinfer/data/csrc/single_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,378 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,380 copying build/lib/flashinfer/data/csrc/batch_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,383 copying build/lib/flashinfer/data/csrc/batch_attention.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,386 copying build/lib/flashinfer/data/csrc/page.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,389 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,392 copying build/lib/flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,394 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,398 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,400 copying build/lib/flashinfer/data/csrc/fp4_kv_dequantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,403 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,406 copying build/lib/flashinfer/data/csrc/selective_state_update_kernel_inst.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,408 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,411 copying build/lib/flashinfer/data/csrc/pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,414 copying build/lib/flashinfer/data/csrc/vllm_custom_all_reduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,417 copying build/lib/flashinfer/data/csrc/single_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,419 copying build/lib/flashinfer/data/csrc/group_gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,422 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,424 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,427 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,429 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,432 copying build/lib/flashinfer/data/csrc/gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,434 copying build/lib/flashinfer/data/csrc/flashinfer_mamba_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,436 copying build/lib/flashinfer/data/csrc/batch_pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,439 copying build/lib/flashinfer/data/csrc/norm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,442 copying build/lib/flashinfer/data/csrc/topk.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,445 copying build/lib/flashinfer/data/csrc/single_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,447 copying build/lib/flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,451 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,453 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,456 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,458 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,461 copying build/lib/flashinfer/data/csrc/batch_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,464 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,466 copying build/lib/flashinfer/data/csrc/fp4_kv_quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:17,469 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal 2026-04-06T02:38:17,471 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp 2026-04-06T02:38:17,472 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:17,474 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:17,476 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:17,478 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:17,481 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:17,483 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-06T02:38:17,486 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-06T02:38:17,487 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-06T02:38:17,490 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include 2026-04-06T02:38:17,492 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm 2026-04-06T02:38:17,493 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,495 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,497 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,499 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,501 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,503 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,506 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,508 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,510 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,512 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,514 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,517 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-06T02:38:17,519 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm 2026-04-06T02:38:17,521 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,522 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,526 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,529 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,532 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,535 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,537 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,540 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,542 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,545 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,547 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,550 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-06T02:38:17,553 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,554 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,556 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,559 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,561 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,564 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,566 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,568 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-06T02:38:17,570 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,571 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,574 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,576 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,578 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,580 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,582 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,584 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,587 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,589 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-06T02:38:17,592 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,593 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,595 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,599 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-06T02:38:17,600 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-06T02:38:17,603 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-06T02:38:17,606 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-06T02:38:17,607 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-06T02:38:17,610 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-06T02:38:17,613 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-06T02:38:17,614 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-06T02:38:17,616 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-06T02:38:17,618 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,620 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,622 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,626 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:17,627 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:17,629 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:17,631 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:17,633 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:17,637 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-06T02:38:17,638 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-06T02:38:17,640 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-06T02:38:17,643 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:17,647 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-06T02:38:17,649 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:17,652 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,653 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,655 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,657 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,659 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,662 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,664 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,666 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,668 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,670 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,672 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,674 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,675 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,677 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,679 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,681 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,683 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,685 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,687 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,689 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,690 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-06T02:38:17,693 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-06T02:38:17,694 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-06T02:38:17,697 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-06T02:38:17,699 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:17,702 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:17,704 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:17,706 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:17,709 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:17,711 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:17,714 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-06T02:38:17,717 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,718 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,721 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,723 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,726 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,728 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,731 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,734 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,737 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,740 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,742 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,744 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,746 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,749 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,751 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,754 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,756 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,758 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,760 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,763 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,765 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-06T02:38:17,768 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,770 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,772 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,775 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,778 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,783 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,786 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-06T02:38:17,788 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-06T02:38:17,791 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,793 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-06T02:38:17,796 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions 2026-04-06T02:38:17,798 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include 2026-04-06T02:38:17,800 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,802 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue 2026-04-06T02:38:17,804 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-06T02:38:17,805 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-06T02:38:17,808 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-06T02:38:17,810 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-06T02:38:17,813 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-06T02:38:17,817 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-06T02:38:17,818 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-06T02:38:17,822 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:17,823 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:17,826 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:17,828 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:17,831 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:17,833 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-06T02:38:17,836 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,839 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,841 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,844 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,848 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-06T02:38:17,849 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-06T02:38:17,853 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform 2026-04-06T02:38:17,855 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-06T02:38:17,856 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-06T02:38:17,860 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication 2026-04-06T02:38:17,862 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-06T02:38:17,863 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-06T02:38:17,867 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail 2026-04-06T02:38:17,868 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-06T02:38:17,870 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-06T02:38:17,873 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,875 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:17,878 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm 2026-04-06T02:38:17,880 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:17,882 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:17,885 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:17,887 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-06T02:38:17,891 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,892 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,895 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,898 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,902 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,904 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,908 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,910 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,914 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,916 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,919 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:17,920 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:17,923 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:17,926 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-06T02:38:17,928 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-06T02:38:17,931 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,933 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,935 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,938 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,940 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,943 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,945 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,948 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,951 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,953 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,956 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,959 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,962 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-06T02:38:17,965 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,967 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,970 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,973 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,975 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,978 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,981 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,983 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,986 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,989 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,992 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,994 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:17,997 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-06T02:38:18,000 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-06T02:38:18,003 copying build/lib/flashinfer/data/csrc/batch_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,005 copying build/lib/flashinfer/data/csrc/trtllm_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,008 copying build/lib/flashinfer/data/csrc/moe_utils_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,011 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,015 copying build/lib/flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,017 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,021 copying build/lib/flashinfer/data/csrc/batch_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,023 copying build/lib/flashinfer/data/csrc/batch_decode_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,025 copying build/lib/flashinfer/data/csrc/flashinfer_cascade_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,027 copying build/lib/flashinfer/data/csrc/group_gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,029 copying build/lib/flashinfer/data/csrc/flashinfer_quantization_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,031 copying build/lib/flashinfer/data/csrc/pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,033 copying build/lib/flashinfer/data/csrc/batch_decode_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,035 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,038 copying build/lib/flashinfer/data/csrc/concat_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,040 copying build/lib/flashinfer/data/csrc/blackwell_fmha_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,042 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,044 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,047 copying build/lib/flashinfer/data/csrc/group_gemm_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,049 copying build/lib/flashinfer/data/csrc/batch_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,051 copying build/lib/flashinfer/data/csrc/selective_state_update_dtype_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,054 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,056 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,058 copying build/lib/flashinfer/data/csrc/flashinfer_sampling_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,061 copying build/lib/flashinfer/data/csrc/batch_pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,063 copying build/lib/flashinfer/data/csrc/flashinfer_topk_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,066 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,068 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,070 copying build/lib/flashinfer/data/csrc/flashinfer_norm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,072 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,075 copying build/lib/flashinfer/data/csrc/batch_pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,078 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,079 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,082 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,086 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,088 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,091 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,094 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,097 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,099 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,102 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,104 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,107 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,110 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,113 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,116 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,118 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,121 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:18,123 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:18,126 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:18,129 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:18,132 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-06T02:38:18,135 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,138 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,141 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-06T02:38:18,145 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,146 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,152 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:18,153 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:18,155 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:18,158 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:18,161 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:18,164 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-06T02:38:18,167 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,169 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,172 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,178 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,179 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,181 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,184 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,187 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,189 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,192 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,197 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,200 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,203 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,206 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,209 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,212 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,215 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,218 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,222 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,225 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,228 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,230 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-06T02:38:18,235 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,238 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,241 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,244 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,248 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/mask.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,252 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,254 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,259 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,261 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,265 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,271 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,275 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,277 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,501 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,504 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-06T02:38:18,509 copying build/lib/flashinfer/data/csrc/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,511 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,513 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,515 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,517 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,519 copying build/lib/flashinfer/data/csrc/batch_pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,522 copying build/lib/flashinfer/data/csrc/tinygemm2.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,524 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,526 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,529 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,531 copying build/lib/flashinfer/data/csrc/trtllm_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,533 copying build/lib/flashinfer/data/csrc/trtllm_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,536 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,539 copying build/lib/flashinfer/data/csrc/batch_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,541 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,543 copying build/lib/flashinfer/data/csrc/trtllm_moe_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,546 copying build/lib/flashinfer/data/csrc/single_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,548 copying build/lib/flashinfer/data/csrc/renorm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,551 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,553 copying build/lib/flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,556 copying build/lib/flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,558 copying build/lib/flashinfer/data/csrc/single_prefill_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,560 copying build/lib/flashinfer/data/csrc/nvshmem_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,563 copying build/lib/flashinfer/data/csrc/batch_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,565 copying build/lib/flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,568 copying build/lib/flashinfer/data/csrc/selective_state_update.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,571 copying build/lib/flashinfer/data/csrc/fmha_v2_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,574 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,576 copying build/lib/flashinfer/data/csrc/batch_decode_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,579 copying build/lib/flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,582 copying build/lib/flashinfer/data/csrc/batch_decode_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,584 copying build/lib/flashinfer/data/csrc/gdn_prefill_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,587 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,590 copying build/lib/flashinfer/data/csrc/trtllm_fmha_v2_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,592 copying build/lib/flashinfer/data/csrc/flashinfer_page_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,594 copying build/lib/flashinfer/data/csrc/trtllm_batched_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,597 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,600 copying build/lib/flashinfer/data/csrc/dsv3_router_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,603 copying build/lib/flashinfer/data/csrc/group_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,605 copying build/lib/flashinfer/data/csrc/bmm_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,608 copying build/lib/flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,611 copying build/lib/flashinfer/data/csrc/single_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,613 copying build/lib/flashinfer/data/csrc/batch_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,615 copying build/lib/flashinfer/data/csrc/batch_attention_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,617 copying build/lib/flashinfer/data/csrc/tgv_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,620 copying build/lib/flashinfer/data/csrc/sampling_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,623 copying build/lib/flashinfer/data/csrc/rope.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,625 copying build/lib/flashinfer/data/csrc/tvm_ffi_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,628 copying build/lib/flashinfer/data/csrc/flashinfer_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,630 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,633 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,635 copying build/lib/flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,637 copying build/lib/flashinfer/data/csrc/selective_state_update_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,640 copying build/lib/flashinfer/data/csrc/pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,642 copying build/lib/flashinfer/data/csrc/pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,645 copying build/lib/flashinfer/data/csrc/flashinfer_rope_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,647 copying build/lib/flashinfer/data/csrc/batch_attention_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,649 copying build/lib/flashinfer/data/csrc/trtllm_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,653 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe 2026-04-06T02:38:18,655 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:18,656 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:18,658 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:18,660 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:18,665 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-06T02:38:18,675 copying build/lib/flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-06T02:38:18,678 copying build/lib/flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-06T02:38:18,681 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:18,683 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_renormalize.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:18,685 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:18,689 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,691 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/RoutingRenormalizeCommon.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,694 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchInitExpertCounts.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,696 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchOffsetsKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,698 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchBlockKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,701 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchClusterKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,703 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramScoresKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,706 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-06T02:38:18,708 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,710 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/RoutingDeepSeekCommon.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,712 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchInitExpertCounts.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,715 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchOffsetsKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,717 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchMainKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,720 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchClusterKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,722 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchCoopKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,724 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchHistogramKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-06T02:38:18,726 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:18,729 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-06T02:38:18,732 copying build/lib/flashinfer/data/csrc/single_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,734 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,736 copying build/lib/flashinfer/data/csrc/gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,739 copying build/lib/flashinfer/data/csrc/flashinfer_xqa_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,741 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,743 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/xqa 2026-04-06T02:38:18,744 copying build/lib/flashinfer/data/csrc/xqa/mha.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,749 copying build/lib/flashinfer/data/csrc/xqa/tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,751 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,754 copying build/lib/flashinfer/data/csrc/xqa/ldgsts.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,756 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,758 copying build/lib/flashinfer/data/csrc/xqa/mha.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,761 copying build/lib/flashinfer/data/csrc/xqa/hostUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,763 copying build/lib/flashinfer/data/csrc/xqa/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,765 copying build/lib/flashinfer/data/csrc/xqa/gmma_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,773 copying build/lib/flashinfer/data/csrc/xqa/mha_components.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,775 copying build/lib/flashinfer/data/csrc/xqa/mha_stdheaders.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,779 copying build/lib/flashinfer/data/csrc/xqa/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,781 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,783 copying build/lib/flashinfer/data/csrc/xqa/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,786 copying build/lib/flashinfer/data/csrc/xqa/defines.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,788 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,790 copying build/lib/flashinfer/data/csrc/xqa/gmma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,792 copying build/lib/flashinfer/data/csrc/xqa/cuda_hint.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,794 copying build/lib/flashinfer/data/csrc/xqa/mhaUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,796 copying build/lib/flashinfer/data/csrc/xqa/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,799 copying build/lib/flashinfer/data/csrc/xqa/barriers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,801 copying build/lib/flashinfer/data/csrc/xqa/specDec.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,803 copying build/lib/flashinfer/data/csrc/xqa/xqa_wrapper.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,805 copying build/lib/flashinfer/data/csrc/xqa/mha_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-06T02:38:18,810 copying build/lib/flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,812 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,814 copying build/lib/flashinfer/data/csrc/cascade.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,816 copying build/lib/flashinfer/data/csrc/sampling.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,819 copying build/lib/flashinfer/data/csrc/batch_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-06T02:38:18,822 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass 2026-04-06T02:38:18,824 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples 2026-04-06T02:38:18,825 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-06T02:38:18,827 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-06T02:38:18,829 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-06T02:38:18,832 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python 2026-04-06T02:38:18,833 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL 2026-04-06T02:38:18,835 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental 2026-04-06T02:38:18,837 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-06T02:38:18,838 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-06T02:38:18,841 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:18,842 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:18,845 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:18,849 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:18,851 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:18,855 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-06T02:38:18,858 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-06T02:38:18,859 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-06T02:38:18,860 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-06T02:38:18,864 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:18,865 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:18,869 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:18,872 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:18,875 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-06T02:38:18,878 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,879 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,881 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,886 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,889 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,892 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,896 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,900 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-06T02:38:18,902 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-06T02:38:18,903 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-06T02:38:18,908 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,909 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,912 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,914 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,916 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,918 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,921 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,924 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,927 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,929 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,933 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,936 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,938 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,941 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-06T02:38:18,944 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:18,945 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:18,947 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:18,950 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:18,952 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-06T02:38:18,955 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-06T02:38:18,956 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-06T02:38:18,957 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-06T02:38:18,960 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,962 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,963 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,965 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,967 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,969 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,971 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,973 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,975 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-06T02:38:18,977 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-06T02:38:18,980 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-06T02:38:18,983 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-06T02:38:18,985 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-06T02:38:18,988 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-06T02:38:18,991 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:18,993 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:18,996 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:18,998 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:19,002 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-06T02:38:19,006 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,008 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:19,009 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:19,015 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:19,021 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-06T02:38:19,026 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,033 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,039 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:19,041 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:19,044 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:19,048 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:19,053 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:19,057 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-06T02:38:19,062 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,068 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,072 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:19,074 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:19,077 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:19,079 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:19,082 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:19,085 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-06T02:38:19,088 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,091 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,097 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,102 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,108 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,113 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,117 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,125 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:19,126 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:19,134 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:19,139 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:19,142 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-06T02:38:19,150 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:19,151 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:19,194 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:19,201 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-06T02:38:19,241 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,248 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,251 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,289 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,295 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,303 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,349 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:19,350 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:19,354 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:19,363 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:19,373 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-06T02:38:19,424 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-06T02:38:19,439 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:19,440 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:19,443 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:19,446 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-06T02:38:19,456 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:19,457 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:19,460 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:19,461 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:19,464 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:19,467 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-06T02:38:19,472 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:19,477 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-06T02:38:19,482 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen 2026-04-06T02:38:19,485 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,487 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,490 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,492 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,497 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,503 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,507 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,510 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,512 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,514 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,516 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,519 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,521 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-06T02:38:19,524 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python 2026-04-06T02:38:19,525 copying build/lib/flashinfer/data/cutlass/python/setup_pycute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-06T02:38:19,546 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:19,550 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:19,551 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:19,553 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:19,555 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-06T02:38:19,558 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:19,559 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:19,561 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:19,563 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-06T02:38:19,566 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:19,568 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:19,571 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:19,572 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:19,575 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:19,577 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:19,579 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:19,584 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-06T02:38:19,587 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:19,590 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,591 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,593 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,596 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,598 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,600 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,602 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,606 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-06T02:38:19,607 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,609 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,612 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,614 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,616 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,618 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,621 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,623 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,625 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,628 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-06T02:38:19,631 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:19,632 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:19,634 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:19,636 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-06T02:38:19,639 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-06T02:38:19,641 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,642 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,644 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,646 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,648 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,650 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,653 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,655 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,657 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-06T02:38:19,659 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,660 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,662 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,665 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,667 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,669 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,671 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,673 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,676 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,679 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,681 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,683 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,685 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,688 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-06T02:38:19,690 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-06T02:38:19,693 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-06T02:38:19,694 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-06T02:38:19,697 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-06T02:38:19,699 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,703 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,705 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,708 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,711 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,713 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,715 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-06T02:38:19,718 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:19,720 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:19,723 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:19,725 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:19,728 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:19,730 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-06T02:38:19,732 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/shape.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-06T02:38:19,735 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL 2026-04-06T02:38:19,737 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:19,739 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,740 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,742 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,744 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,747 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,750 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,755 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-06T02:38:19,758 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:19,759 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:19,766 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:19,770 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:19,775 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-06T02:38:19,778 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,779 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,784 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:19,785 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:19,790 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:19,793 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:19,797 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:19,804 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-06T02:38:19,806 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,812 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,813 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,816 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,820 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,823 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,826 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,829 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,833 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-06T02:38:19,836 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,839 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,843 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,848 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,851 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,853 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,857 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,861 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,865 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,866 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,871 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,874 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,876 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,878 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,881 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-06T02:38:19,885 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,893 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:19,894 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:19,896 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:19,900 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:19,902 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:19,904 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-06T02:38:19,908 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:19,909 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:19,913 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:19,916 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:19,918 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-06T02:38:19,921 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-06T02:38:19,923 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:19,926 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:19,929 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,930 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,933 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,936 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,938 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,941 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,943 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-06T02:38:19,947 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:19,949 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,951 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,953 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,956 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,958 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,961 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,963 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,967 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-06T02:38:19,971 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:19,975 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:19,980 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:19,983 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:19,987 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:19,988 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:19,990 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:19,992 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:19,994 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:19,997 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:20,000 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:20,008 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:20,012 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-06T02:38:20,014 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,017 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,020 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,023 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,025 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,027 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,030 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,036 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-06T02:38:20,040 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:20,041 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:20,044 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:20,046 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:20,048 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:20,050 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-06T02:38:20,053 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:20,056 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:20,057 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:20,059 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:20,062 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:20,064 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-06T02:38:20,066 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-06T02:38:20,069 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:20,070 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:20,072 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:20,074 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-06T02:38:20,077 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:20,078 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:20,080 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:20,083 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-06T02:38:20,085 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:20,086 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:20,089 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:20,091 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:20,092 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:20,094 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-06T02:38:20,097 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-06T02:38:20,099 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,100 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,103 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,106 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,108 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,111 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,114 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,116 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,118 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,121 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,124 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,126 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,130 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,132 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,135 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,138 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-06T02:38:20,139 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-06T02:38:20,141 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-06T02:38:20,143 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,145 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,148 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-06T02:38:20,151 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL 2026-04-06T02:38:20,153 copying build/lib/flashinfer/data/cutlass/python/setup_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-06T02:38:20,155 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,156 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,160 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,162 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,165 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,167 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,170 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,172 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,175 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,177 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,180 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,182 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,184 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/symm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,187 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,211 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,215 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,218 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/manifest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,221 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,224 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,226 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-06T02:38:20,228 copying build/lib/flashinfer/data/cutlass/python/setup_cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-06T02:38:20,230 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:20,231 copying build/lib/flashinfer/data/cutlass/python/pycute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:20,233 copying build/lib/flashinfer/data/cutlass/python/pycute/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:20,235 copying build/lib/flashinfer/data/cutlass/python/pycute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:20,238 copying build/lib/flashinfer/data/cutlass/python/pycute/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:20,240 copying build/lib/flashinfer/data/cutlass/python/pycute/int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-06T02:38:20,243 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src 2026-04-06T02:38:20,244 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src/source 2026-04-06T02:38:20,245 copying build/lib/flashinfer/data/cutlass/python/docs_src/source/conf.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/docs_src/source 2026-04-06T02:38:20,248 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test 2026-04-06T02:38:20,250 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples 2026-04-06T02:38:20,251 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-06T02:38:20,252 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-06T02:38:20,255 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:20,256 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:20,259 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:20,261 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:20,263 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:20,265 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-06T02:38:20,267 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python 2026-04-06T02:38:20,269 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass 2026-04-06T02:38:20,270 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:20,272 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:20,273 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:20,277 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:20,279 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-06T02:38:20,282 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-06T02:38:20,283 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-06T02:38:20,285 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/installation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass 2026-04-06T02:38:20,288 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:20,289 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:20,291 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:20,294 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:20,296 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-06T02:38:20,299 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,300 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,302 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,304 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,306 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,309 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-06T02:38:20,310 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-06T02:38:20,313 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,315 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-06T02:38:20,318 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,319 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,321 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,323 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,325 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,326 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,329 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,331 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,333 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,335 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,337 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,340 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,342 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,344 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-06T02:38:20,347 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,348 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,350 copying build/lib/flashinfer/data/cutlass/test/python/pycute/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,351 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_composition.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,354 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_complement.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,356 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,358 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_coalesce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,360 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,361 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-06T02:38:20,363 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/utils 2026-04-06T02:38:20,364 copying build/lib/flashinfer/data/cutlass/test/utils/test_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/utils 2026-04-06T02:38:20,368 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit 2026-04-06T02:38:20,369 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm 2026-04-06T02:38:20,370 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-06T02:38:20,372 copying build/lib/flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/unit/gemm/device 2026-04-06T02:38:20,375 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include 2026-04-06T02:38:20,376 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,378 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,379 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,382 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,385 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/real.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,387 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,389 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,391 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,393 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,395 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/int.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,397 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-06T02:38:20,400 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,403 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,404 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,407 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,410 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,412 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,452 copying build/lib/flashinfer/data/cutlass/include/cute/arch/util.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,454 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,458 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,460 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,464 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,469 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,471 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,473 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,477 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,479 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,487 copying build/lib/flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,488 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,491 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,493 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,495 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,497 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,566 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,620 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,622 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,624 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,643 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,646 copying build/lib/flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,649 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,651 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,653 copying build/lib/flashinfer/data/cutlass/include/cute/arch/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,656 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,659 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,661 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-06T02:38:20,674 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_flagged.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,676 copying build/lib/flashinfer/data/cutlass/include/cute/layout_composed.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,679 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,680 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,682 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,684 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,686 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,689 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,691 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,693 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,696 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/clear.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,698 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,700 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,702 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,704 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,707 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/functional.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-06T02:38:20,709 copying build/lib/flashinfer/data/cutlass/include/cute/stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,712 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,715 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,716 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_svg.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,718 copying build/lib/flashinfer/data/cutlass/include/cute/util/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,720 copying build/lib/flashinfer/data/cutlass/include/cute/util/print.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,722 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_latex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,725 copying build/lib/flashinfer/data/cutlass/include/cute/util/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,727 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-06T02:38:20,729 copying build/lib/flashinfer/data/cutlass/include/cute/tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,732 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,733 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,735 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,738 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,742 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,744 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,747 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,749 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,751 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,754 copying build/lib/flashinfer/data/cutlass/include/cute/atom/partitioner.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,756 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,759 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,761 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,767 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,769 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,781 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,783 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,785 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,787 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,789 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,796 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,801 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,804 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,806 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,809 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,811 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,817 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,828 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,830 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-06T02:38:20,832 copying build/lib/flashinfer/data/cutlass/include/cute/underscore.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,834 copying build/lib/flashinfer/data/cutlass/include/cute/int_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,837 copying build/lib/flashinfer/data/cutlass/include/cute/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,839 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,841 copying build/lib/flashinfer/data/cutlass/include/cute/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,845 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,846 copying build/lib/flashinfer/data/cutlass/include/cute/container/alignment.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,848 copying build/lib/flashinfer/data/cutlass/include/cute/container/array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,850 copying build/lib/flashinfer/data/cutlass/include/cute/container/bit_field.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,852 copying build/lib/flashinfer/data/cutlass/include/cute/container/tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,855 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_aligned.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,857 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,859 copying build/lib/flashinfer/data/cutlass/include/cute/container/cuda_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,861 copying build/lib/flashinfer/data/cutlass/include/cute/container/type_list.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-06T02:38:20,863 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_impl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,866 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_base.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,868 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_zip.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,871 copying build/lib/flashinfer/data/cutlass/include/cute/pointer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,873 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-06T02:38:20,877 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:20,878 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental 2026-04-06T02:38:20,880 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed 2026-04-06T02:38:20,881 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-06T02:38:20,883 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-06T02:38:20,885 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-06T02:38:20,888 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:20,889 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:20,892 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:20,894 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-06T02:38:20,897 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:20,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:20,900 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:20,902 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-06T02:38:20,905 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/thread 2026-04-06T02:38:20,906 copying build/lib/flashinfer/data/cutlass/include/cutlass/thread/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/thread 2026-04-06T02:38:20,908 copying build/lib/flashinfer/data/cutlass/include/cutlass/constants.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:20,911 copying build/lib/flashinfer/data/cutlass/include/cutlass/fast_math.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:20,913 copying build/lib/flashinfer/data/cutlass/include/cutlass/wmma_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:20,916 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-06T02:38:20,918 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,923 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,926 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,928 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,930 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,933 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,935 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,937 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,939 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,941 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,944 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,946 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,948 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,950 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,952 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,957 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,960 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,962 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,964 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,967 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,969 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,971 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,973 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-06T02:38:20,976 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,977 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,980 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,984 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,988 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,990 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,995 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:20,998 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:21,001 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:21,003 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:21,006 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:21,010 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:21,013 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-06T02:38:21,017 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,018 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,020 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,023 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,025 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,028 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,030 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,032 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,037 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,039 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,046 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,049 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-06T02:38:21,053 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,054 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,057 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,060 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,063 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,066 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,069 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,072 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,075 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,080 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,084 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,090 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,092 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,094 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,097 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,100 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-06T02:38:21,105 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,107 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,111 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,116 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,119 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,121 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-06T02:38:21,124 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,125 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,130 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,133 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,135 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,138 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,141 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,147 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,149 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,151 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,154 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:21,156 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:21,158 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:21,161 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:21,164 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:21,167 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-06T02:38:21,170 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,172 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,175 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,178 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,181 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,183 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,186 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,188 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,194 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,197 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,199 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,202 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,205 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,208 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,210 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,213 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,216 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,218 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,221 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,224 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,230 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,232 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,234 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,237 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,239 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,242 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,244 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,246 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,252 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,258 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,261 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,263 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-06T02:38:21,269 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-06T02:38:21,272 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,274 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,283 copying build/lib/flashinfer/data/cutlass/include/cutlass/real.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,285 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,289 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:21,290 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:21,293 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:21,296 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-06T02:38:21,300 copying build/lib/flashinfer/data/cutlass/include/cutlass/float_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,303 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,307 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,309 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-06T02:38:21,310 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-06T02:38:21,313 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,316 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,319 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,322 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:21,324 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:21,326 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:21,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-06T02:38:21,332 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,336 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:21,337 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:21,340 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:21,343 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:21,345 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-06T02:38:21,349 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:21,350 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:21,354 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:21,357 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:21,359 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:21,363 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:21,364 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:21,367 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:21,370 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:21,373 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-06T02:38:21,376 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-06T02:38:21,379 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,380 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,385 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,388 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,396 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,402 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,405 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,413 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,430 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,436 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,439 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,443 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,446 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,450 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,453 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,457 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,460 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,464 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,468 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,471 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,475 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-06T02:38:21,480 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,481 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,488 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,491 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,494 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,497 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,500 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,503 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,506 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,510 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,513 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,515 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,519 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,522 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,525 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,528 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,531 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,541 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,544 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,547 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,551 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,554 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,558 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,561 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,564 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,566 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,569 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,575 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,578 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,583 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,585 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,588 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,591 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,593 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,596 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,599 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,601 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,604 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,607 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,610 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,614 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,616 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-06T02:38:21,619 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,621 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-06T02:38:21,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint256.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,625 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,628 copying build/lib/flashinfer/data/cutlass/include/cutlass/half.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/bfloat16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,636 copying build/lib/flashinfer/data/cutlass/include/cutlass/integer_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,640 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,641 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,645 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,650 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,661 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,663 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,666 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,669 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,671 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,677 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,680 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,682 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,686 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,689 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,693 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,699 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,702 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,705 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/arch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,707 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,710 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,712 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,715 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,717 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-06T02:38:21,720 copying build/lib/flashinfer/data/cutlass/include/cutlass/coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,723 copying build/lib/flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,725 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,732 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,735 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,738 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,739 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,742 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,745 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,748 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/permute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,751 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,754 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,756 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,759 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,762 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-06T02:38:21,764 copying build/lib/flashinfer/data/cutlass/include/cutlass/block_striped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,766 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_launch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,769 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/aligned_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint128.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,785 copying build/lib/flashinfer/data/cutlass/include/cutlass/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,787 copying build/lib/flashinfer/data/cutlass/include/cutlass/predicate_vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,790 copying build/lib/flashinfer/data/cutlass/include/cutlass/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,792 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform 2026-04-06T02:38:21,794 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-06T02:38:21,795 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-06T02:38:21,797 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-06T02:38:21,799 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-06T02:38:21,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-06T02:38:21,803 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform 2026-04-06T02:38:21,807 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-06T02:38:21,808 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-06T02:38:21,811 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-06T02:38:21,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-06T02:38:21,815 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:21,816 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:21,819 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:21,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-06T02:38:21,825 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,826 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,829 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,831 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,834 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,842 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,847 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,850 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,852 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,855 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,857 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,860 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,865 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,872 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,874 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,876 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,878 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,880 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,883 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,886 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,888 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-06T02:38:21,891 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/platform 2026-04-06T02:38:21,892 copying build/lib/flashinfer/data/cutlass/include/cutlass/platform/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/platform 2026-04-06T02:38:21,895 copying build/lib/flashinfer/data/cutlass/include/cutlass/device_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,900 copying build/lib/flashinfer/data/cutlass/include/cutlass/trace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,904 copying build/lib/flashinfer/data/cutlass/include/cutlass/core_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,907 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,909 copying build/lib/flashinfer/data/cutlass/include/cutlass/cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,912 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-06T02:38:21,913 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-06T02:38:21,914 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-06T02:38:21,917 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-06T02:38:21,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction 2026-04-06T02:38:21,921 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:21,922 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:21,925 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:21,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:21,930 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-06T02:38:21,932 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:21,933 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:21,936 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:21,939 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:21,941 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-06T02:38:21,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/quaternion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,946 copying build/lib/flashinfer/data/cutlass/include/cutlass/tfloat32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,948 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,950 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,952 copying build/lib/flashinfer/data/cutlass/include/cutlass/semaphore.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,954 copying build/lib/flashinfer/data/cutlass/include/cutlass/subbyte_reference.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,957 copying build/lib/flashinfer/data/cutlass/include/cutlass/exmy_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:21,961 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,962 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,964 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,966 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,969 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,971 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,973 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,975 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,977 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,979 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,981 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,983 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,985 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-06T02:38:21,988 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:21,990 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:21,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:21,995 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-06T02:38:21,997 copying build/lib/flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,000 copying build/lib/flashinfer/data/cutlass/include/cutlass/float8.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,003 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:22,004 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:22,006 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:22,008 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:22,010 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:22,012 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-06T02:38:22,015 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:22,017 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:22,020 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,024 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,030 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,032 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,034 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,037 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,039 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,046 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,048 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,053 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,059 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,061 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,064 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,066 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,070 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,072 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,075 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,079 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,084 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,086 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,089 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,093 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,097 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,103 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,107 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,110 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,112 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,115 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-06T02:38:22,120 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:22,123 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,124 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,127 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,130 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,132 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,134 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,139 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,142 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,149 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,152 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,158 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,162 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,167 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,170 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,173 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,175 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,178 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,180 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,182 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,185 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,187 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,189 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,193 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,196 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,198 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,200 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,203 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-06T02:38:22,207 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,208 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,210 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,213 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,216 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,219 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,221 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,224 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,230 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,232 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,235 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,238 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,241 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,244 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,247 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,250 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,257 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,260 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,264 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,267 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,270 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,273 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,276 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,279 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,282 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,284 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,288 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,291 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,294 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,299 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,308 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,312 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,316 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,321 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,325 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,330 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,334 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,338 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,341 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,358 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,362 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,367 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,374 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,378 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,382 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,386 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,390 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,397 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,401 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-06T02:38:22,405 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,406 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,410 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,413 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,419 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,422 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,425 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,428 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,431 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,433 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,440 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,442 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,446 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,449 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,452 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,455 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,458 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,461 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,464 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,467 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,470 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,473 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,476 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,479 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,482 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,488 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,494 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-06T02:38:22,500 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,501 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,504 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,508 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,511 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,514 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,517 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,520 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,523 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,526 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,530 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,534 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,537 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,540 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,543 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,545 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,549 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,552 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,555 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,558 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,560 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,563 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,566 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,569 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,571 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,574 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,582 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,585 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,588 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,590 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,593 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,595 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,598 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,601 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,603 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,607 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,609 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,612 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,615 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,617 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,620 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,638 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,642 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,645 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,648 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,651 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,654 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,663 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,666 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,669 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,673 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,676 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,679 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,682 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,685 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,687 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,690 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,692 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,694 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,699 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,702 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,704 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,706 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,709 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,711 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,714 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,716 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,718 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,721 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,723 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,726 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,728 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,731 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,735 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,738 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,741 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,747 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,752 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,754 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,757 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,759 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,763 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,765 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,768 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,773 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,782 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,784 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,787 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,789 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,791 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,794 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,799 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,804 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,807 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,810 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,813 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,815 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,818 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-06T02:38:22,829 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,832 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,834 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,842 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,847 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,849 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,852 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,854 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,856 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,859 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,864 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,866 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,869 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,871 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,874 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,876 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,880 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,883 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,885 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,888 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,891 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,899 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,905 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,908 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,911 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,914 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,917 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,920 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,923 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,926 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,929 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,932 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,934 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,937 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,940 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,946 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-06T02:38:22,951 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-06T02:38:22,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/functional.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,964 copying build/lib/flashinfer/data/cutlass/include/cutlass/relatively_equal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,967 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,969 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,971 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_shape.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,974 copying build/lib/flashinfer/data/cutlass/include/cutlass/complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-06T02:38:22,978 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools 2026-04-06T02:38:22,979 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util 2026-04-06T02:38:22,981 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/scripts 2026-04-06T02:38:22,983 copying build/lib/flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/scripts 2026-04-06T02:38:22,986 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include 2026-04-06T02:38:22,988 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass 2026-04-06T02:38:22,991 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:22,993 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:22,996 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference 2026-04-06T02:38:22,998 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,001 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-06T02:38:23,002 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-06T02:38:23,005 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,008 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,011 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,013 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,017 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,020 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,022 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,025 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,027 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,030 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,033 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:23,035 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:23,038 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:23,040 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-06T02:38:23,043 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-06T02:38:23,047 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-06T02:38:23,048 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-06T02:38:23,051 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-06T02:38:23,054 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,055 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,058 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,061 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,063 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,066 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,069 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,071 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,073 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,076 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,078 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,081 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,083 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,086 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,089 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,091 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,094 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,097 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,100 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,102 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,106 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,110 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,113 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,118 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,121 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-06T02:38:23,125 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,129 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,132 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,137 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,141 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,144 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,149 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,153 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,156 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,159 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,162 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,165 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,168 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,171 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,174 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,177 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,179 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,181 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,185 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,188 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,190 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,193 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,196 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,198 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,201 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,204 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,206 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-06T02:38:23,210 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include 2026-04-06T02:38:23,212 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer 2026-04-06T02:38:23,213 copying build/lib/flashinfer/data/include/flashinfer/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,216 copying build/lib/flashinfer/data/include/flashinfer/logging.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,218 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,219 copying build/lib/flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,221 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,224 copying build/lib/flashinfer/data/include/flashinfer/mamba/conversion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,227 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,229 copying build/lib/flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,231 copying build/lib/flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,233 copying build/lib/flashinfer/data/include/flashinfer/mamba/common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-06T02:38:23,235 copying build/lib/flashinfer/data/include/flashinfer/sampling.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,239 copying build/lib/flashinfer/data/include/flashinfer/math.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,241 copying build/lib/flashinfer/data/include/flashinfer/allocator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,243 copying build/lib/flashinfer/data/include/flashinfer/exception.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,244 copying build/lib/flashinfer/data/include/flashinfer/concat_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,247 copying build/lib/flashinfer/data/include/flashinfer/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,249 copying build/lib/flashinfer/data/include/flashinfer/air_top_p.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,252 copying build/lib/flashinfer/data/include/flashinfer/fp16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,254 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,255 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,259 copying build/lib/flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,261 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,264 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,267 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,269 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,272 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-06T02:38:23,275 copying build/lib/flashinfer/data/include/flashinfer/attention_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,277 copying build/lib/flashinfer/data/include/flashinfer/topk.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,282 copying build/lib/flashinfer/data/include/flashinfer/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,284 copying build/lib/flashinfer/data/include/flashinfer/cp_async.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,286 copying build/lib/flashinfer/data/include/flashinfer/arch_condition.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,289 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,290 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper 2026-04-06T02:38:23,292 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-06T02:38:23,293 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-06T02:38:23,296 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:23,297 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:23,299 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:23,301 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:23,303 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:23,308 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-06T02:38:23,312 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:23,314 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:23,317 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:23,319 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:23,323 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-06T02:38:23,331 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere 2026-04-06T02:38:23,333 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-06T02:38:23,334 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-06T02:38:23,337 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-06T02:38:23,340 copying build/lib/flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,342 copying build/lib/flashinfer/data/include/flashinfer/flat/cute_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,345 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/prefill 2026-04-06T02:38:23,346 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-06T02:38:23,349 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-06T02:38:23,351 copying build/lib/flashinfer/data/include/flashinfer/flat/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,353 copying build/lib/flashinfer/data/include/flashinfer/flat/unused.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,355 copying build/lib/flashinfer/data/include/flashinfer/flat/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,358 copying build/lib/flashinfer/data/include/flashinfer/flat/common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,361 copying build/lib/flashinfer/data/include/flashinfer/flat/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-06T02:38:23,363 copying build/lib/flashinfer/data/include/flashinfer/pos_enc.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,367 copying build/lib/flashinfer/data/include/flashinfer/cutlass_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,369 copying build/lib/flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,371 copying build/lib/flashinfer/data/include/flashinfer/fastdiv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,374 copying build/lib/flashinfer/data/include/flashinfer/activation.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,376 copying build/lib/flashinfer/data/include/flashinfer/profiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,378 copying build/lib/flashinfer/data/include/flashinfer/cubin_loader.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,380 copying build/lib/flashinfer/data/include/flashinfer/permuted_smem.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,383 copying build/lib/flashinfer/data/include/flashinfer/fp4_layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,385 copying build/lib/flashinfer/data/include/flashinfer/page.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,389 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm 2026-04-06T02:38:23,390 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm 2026-04-06T02:38:23,393 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,395 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,398 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,400 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,402 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,405 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,408 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-06T02:38:23,411 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-06T02:38:23,412 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-06T02:38:23,416 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm 2026-04-06T02:38:23,418 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,419 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,425 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,430 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,433 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,436 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,439 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,442 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-06T02:38:23,445 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm 2026-04-06T02:38:23,447 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,448 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,451 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,454 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,456 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,458 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaArchDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,460 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,463 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SparsityDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-06T02:38:23,465 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,467 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,470 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,472 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,475 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,477 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,480 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,483 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-06T02:38:23,486 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,487 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,490 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,492 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,495 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,498 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,499 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,503 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,505 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,508 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-06T02:38:23,511 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,512 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,514 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,518 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,520 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,522 copying build/lib/flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,524 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,527 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,530 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,532 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,535 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,537 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,540 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,542 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,545 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,548 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,551 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,553 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,555 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,558 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,561 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,563 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,565 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,568 copying build/lib/flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,571 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,574 copying build/lib/flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,577 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,579 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,582 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,584 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,587 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,590 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-06T02:38:23,592 copying build/lib/flashinfer/data/include/flashinfer/norm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,595 copying build/lib/flashinfer/data/include/flashinfer/layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,598 copying build/lib/flashinfer/data/include/flashinfer/vec_dtypes.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-06T02:38:23,602 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,603 copying build/lib/flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,606 copying build/lib/flashinfer/data/include/flashinfer/attention/mask.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,608 copying build/lib/flashinfer/data/include/flashinfer/attention/cascade.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,611 copying build/lib/flashinfer/data/include/flashinfer/attention/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,614 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,615 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,618 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,620 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,623 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,625 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,627 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,630 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,633 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,635 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,637 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,640 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,643 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,646 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,649 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-06T02:38:23,652 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,654 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,657 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,659 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,662 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,665 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-06T02:38:23,667 copying build/lib/flashinfer/data/include/flashinfer/attention/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,670 copying build/lib/flashinfer/data/include/flashinfer/attention/prefill.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,674 copying build/lib/flashinfer/data/include/flashinfer/attention/mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,678 copying build/lib/flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,681 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent_template.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,683 copying build/lib/flashinfer/data/include/flashinfer/attention/heap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,685 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,689 copying build/lib/flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,692 copying build/lib/flashinfer/data/include/flashinfer/attention/default_decode_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,694 copying build/lib/flashinfer/data/include/flashinfer/attention/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,699 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,702 copying build/lib/flashinfer/data/include/flashinfer/attention/pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,705 copying build/lib/flashinfer/data/include/flashinfer/attention/decode.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,710 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,713 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,716 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-06T02:38:23,717 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-06T02:38:23,721 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-06T02:38:23,722 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-06T02:38:23,725 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-06T02:38:23,728 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-06T02:38:23,729 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-06T02:38:23,732 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-06T02:38:23,735 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,737 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,740 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,742 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,745 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,749 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,751 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,754 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,758 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-06T02:38:23,761 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,762 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,767 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,770 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,773 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,775 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,777 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,780 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,782 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-06T02:38:23,785 copying build/lib/flashinfer/data/include/flashinfer/attention/state.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,788 copying build/lib/flashinfer/data/include/flashinfer/attention/batch_pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-06T02:38:23,791 copying build/lib/flashinfer/data/build_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-06T02:38:23,793 copying build/lib/flashinfer/data/build_backend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-06T02:38:23,797 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe 2026-04-06T02:38:23,799 copying build/lib/flashinfer/fused_moe/fused_routing_dsv3.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-06T02:38:23,802 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,804 copying build/lib/flashinfer/fused_moe/cute_dsl/tuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,807 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,810 copying build/lib/flashinfer/fused_moe/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,812 copying build/lib/flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,815 copying build/lib/flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,818 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-06T02:38:23,822 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:23,823 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:23,826 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:23,831 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:23,834 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:23,839 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-06T02:38:23,842 copying build/lib/flashinfer/fused_moe/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-06T02:38:23,844 copying build/lib/flashinfer/fused_moe/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-06T02:38:23,849 copying build/lib/flashinfer/fused_moe/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-06T02:38:23,852 copying build/lib/flashinfer/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:23,858 copying build/lib/flashinfer/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:23,864 copying build/lib/flashinfer/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-06T02:38:23,867 creating build/bdist.linux-armv7l/wheel/flashinfer/profiler 2026-04-06T02:38:23,868 copying build/lib/flashinfer/profiler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/profiler 2026-04-06T02:38:23,871 copying build/lib/build_utils.py -> build/bdist.linux-armv7l/wheel/. 2026-04-06T02:38:23,873 copying build/lib/build_backend.py -> build/bdist.linux-armv7l/wheel/. 2026-04-06T02:38:23,875 running install_egg_info 2026-04-06T02:38:23,887 running egg_info 2026-04-06T02:38:23,893 writing flashinfer_python.egg-info/PKG-INFO 2026-04-06T02:38:23,897 writing dependency_links to flashinfer_python.egg-info/dependency_links.txt 2026-04-06T02:38:23,898 writing entry points to flashinfer_python.egg-info/entry_points.txt 2026-04-06T02:38:23,900 writing requirements to flashinfer_python.egg-info/requires.txt 2026-04-06T02:38:23,902 writing top-level names to flashinfer_python.egg-info/top_level.txt 2026-04-06T02:38:24,703 reading manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-06T02:38:24,820 adding license file 'LICENSE' 2026-04-06T02:38:24,944 writing manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-06T02:38:24,960 Copying flashinfer_python.egg-info to build/bdist.linux-armv7l/wheel/./flashinfer_python-0.6.7.post3-py3.11.egg-info 2026-04-06T02:38:25,019 running install_scripts 2026-04-06T02:38:25,030 creating build/bdist.linux-armv7l/wheel/flashinfer_python-0.6.7.post3.dist-info/WHEEL 2026-04-06T02:38:25,033 creating '/tmp/pip-wheel-sbyrc2da/.tmp-dea5u5yk/flashinfer_python-0.6.7.post3-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-06T02:38:25,035 adding 'build_backend.py' 2026-04-06T02:38:25,036 adding 'build_utils.py' 2026-04-06T02:38:25,039 adding 'flashinfer/__init__.py' 2026-04-06T02:38:25,042 adding 'flashinfer/__main__.py' 2026-04-06T02:38:25,043 adding 'flashinfer/_build_meta.py' 2026-04-06T02:38:25,045 adding 'flashinfer/activation.py' 2026-04-06T02:38:25,048 adding 'flashinfer/aot.py' 2026-04-06T02:38:25,055 adding 'flashinfer/api_logging.py' 2026-04-06T02:38:25,057 adding 'flashinfer/artifacts.py' 2026-04-06T02:38:25,059 adding 'flashinfer/attention.py' 2026-04-06T02:38:25,066 adding 'flashinfer/autotuner.py' 2026-04-06T02:38:25,070 adding 'flashinfer/cascade.py' 2026-04-06T02:38:25,071 adding 'flashinfer/compilation_context.py' 2026-04-06T02:38:25,073 adding 'flashinfer/concat_ops.py' 2026-04-06T02:38:25,074 adding 'flashinfer/cuda_utils.py' 2026-04-06T02:38:25,085 adding 'flashinfer/decode.py' 2026-04-06T02:38:25,090 adding 'flashinfer/deep_gemm.py' 2026-04-06T02:38:25,091 adding 'flashinfer/fp4_quantization.py' 2026-04-06T02:38:25,093 adding 'flashinfer/fp8_quantization.py' 2026-04-06T02:38:25,095 adding 'flashinfer/gdn_decode.py' 2026-04-06T02:38:25,097 adding 'flashinfer/gdn_prefill.py' 2026-04-06T02:38:25,099 adding 'flashinfer/green_ctx.py' 2026-04-06T02:38:25,103 adding 'flashinfer/mla.py' 2026-04-06T02:38:25,105 adding 'flashinfer/page.py' 2026-04-06T02:38:25,109 adding 'flashinfer/pod.py' 2026-04-06T02:38:25,126 adding 'flashinfer/prefill.py' 2026-04-06T02:38:25,129 adding 'flashinfer/py.typed' 2026-04-06T02:38:25,133 adding 'flashinfer/rope.py' 2026-04-06T02:38:25,139 adding 'flashinfer/sampling.py' 2026-04-06T02:38:25,144 adding 'flashinfer/sparse.py' 2026-04-06T02:38:25,146 adding 'flashinfer/tllm_enums.py' 2026-04-06T02:38:25,147 adding 'flashinfer/tllm_utils.py' 2026-04-06T02:38:25,149 adding 'flashinfer/topk.py' 2026-04-06T02:38:25,151 adding 'flashinfer/trtllm_low_latency_gemm.py' 2026-04-06T02:38:25,156 adding 'flashinfer/utils.py' 2026-04-06T02:38:25,158 adding 'flashinfer/version.py' 2026-04-06T02:38:25,160 adding 'flashinfer/xqa.py' 2026-04-06T02:38:25,162 adding 'flashinfer/comm/__init__.py' 2026-04-06T02:38:25,165 adding 'flashinfer/comm/allreduce.py' 2026-04-06T02:38:25,167 adding 'flashinfer/comm/cuda_ipc.py' 2026-04-06T02:38:25,169 adding 'flashinfer/comm/dlpack_utils.py' 2026-04-06T02:38:25,171 adding 'flashinfer/comm/mapping.py' 2026-04-06T02:38:25,177 adding 'flashinfer/comm/mnnvl.py' 2026-04-06T02:38:25,179 adding 'flashinfer/comm/nvshmem.py' 2026-04-06T02:38:25,180 adding 'flashinfer/comm/nvshmem_allreduce.py' 2026-04-06T02:38:25,183 adding 'flashinfer/comm/trtllm_alltoall.py' 2026-04-06T02:38:25,187 adding 'flashinfer/comm/trtllm_ar.py' 2026-04-06T02:38:25,190 adding 'flashinfer/comm/trtllm_mnnvl_ar.py' 2026-04-06T02:38:25,194 adding 'flashinfer/comm/trtllm_moe_alltoall.py' 2026-04-06T02:38:25,195 adding 'flashinfer/comm/vllm_ar.py' 2026-04-06T02:38:25,197 adding 'flashinfer/comm/workspace_base.py' 2026-04-06T02:38:25,199 adding 'flashinfer/cudnn/__init__.py' 2026-04-06T02:38:25,201 adding 'flashinfer/cudnn/decode.py' 2026-04-06T02:38:25,204 adding 'flashinfer/cudnn/prefill.py' 2026-04-06T02:38:25,205 adding 'flashinfer/cudnn/utils.py' 2026-04-06T02:38:25,207 adding 'flashinfer/cute_dsl/__init__.py' 2026-04-06T02:38:25,212 adding 'flashinfer/cute_dsl/add_rmsnorm_fp4quant.py' 2026-04-06T02:38:25,214 adding 'flashinfer/cute_dsl/blockscaled_gemm.py' 2026-04-06T02:38:25,218 adding 'flashinfer/cute_dsl/fp4_common.py' 2026-04-06T02:38:25,226 adding 'flashinfer/cute_dsl/gemm_allreduce_two_shot.py' 2026-04-06T02:38:25,231 adding 'flashinfer/cute_dsl/rmsnorm_fp4quant.py' 2026-04-06T02:38:25,233 adding 'flashinfer/cute_dsl/utils.py' 2026-04-06T02:38:25,235 adding 'flashinfer/data/build_backend.py' 2026-04-06T02:38:25,236 adding 'flashinfer/data/build_utils.py' 2026-04-06T02:38:25,241 adding 'flashinfer/data/csrc/batch_attention.cu' 2026-04-06T02:38:25,243 adding 'flashinfer/data/csrc/batch_attention_customize_config.jinja' 2026-04-06T02:38:25,244 adding 'flashinfer/data/csrc/batch_attention_jit_binding.cu' 2026-04-06T02:38:25,245 adding 'flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja' 2026-04-06T02:38:25,247 adding 'flashinfer/data/csrc/batch_decode.cu' 2026-04-06T02:38:25,248 adding 'flashinfer/data/csrc/batch_decode_customize_config.jinja' 2026-04-06T02:38:25,249 adding 'flashinfer/data/csrc/batch_decode_jit_binding.cu' 2026-04-06T02:38:25,251 adding 'flashinfer/data/csrc/batch_decode_kernel_inst.jinja' 2026-04-06T02:38:25,252 adding 'flashinfer/data/csrc/batch_decode_mla_binding.cu' 2026-04-06T02:38:25,253 adding 'flashinfer/data/csrc/batch_decode_mla_config.jinja' 2026-04-06T02:38:25,255 adding 'flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu' 2026-04-06T02:38:25,256 adding 'flashinfer/data/csrc/batch_decode_mla_plan.cu' 2026-04-06T02:38:25,257 adding 'flashinfer/data/csrc/batch_decode_mla_run.cu' 2026-04-06T02:38:25,259 adding 'flashinfer/data/csrc/batch_mla_binding.cu' 2026-04-06T02:38:25,260 adding 'flashinfer/data/csrc/batch_mla_config.jinja' 2026-04-06T02:38:25,261 adding 'flashinfer/data/csrc/batch_mla_plan.cu' 2026-04-06T02:38:25,263 adding 'flashinfer/data/csrc/batch_mla_run.cu' 2026-04-06T02:38:25,264 adding 'flashinfer/data/csrc/batch_mla_sm90_binding.cu' 2026-04-06T02:38:25,265 adding 'flashinfer/data/csrc/batch_mla_sm90_plan.cu' 2026-04-06T02:38:25,267 adding 'flashinfer/data/csrc/batch_mla_sm90_run.cu' 2026-04-06T02:38:25,269 adding 'flashinfer/data/csrc/batch_pod.cu' 2026-04-06T02:38:25,271 adding 'flashinfer/data/csrc/batch_pod_customize_config.jinja' 2026-04-06T02:38:25,272 adding 'flashinfer/data/csrc/batch_pod_jit_binding.cu' 2026-04-06T02:38:25,273 adding 'flashinfer/data/csrc/batch_pod_kernel_inst.jinja' 2026-04-06T02:38:25,275 adding 'flashinfer/data/csrc/batch_prefill.cu' 2026-04-06T02:38:25,277 adding 'flashinfer/data/csrc/batch_prefill_customize_config.jinja' 2026-04-06T02:38:25,278 adding 'flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,279 adding 'flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,281 adding 'flashinfer/data/csrc/batch_prefill_fp8_sm90.cu' 2026-04-06T02:38:25,282 adding 'flashinfer/data/csrc/batch_prefill_jit_binding.cu' 2026-04-06T02:38:25,284 adding 'flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja' 2026-04-06T02:38:25,285 adding 'flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,286 adding 'flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja' 2026-04-06T02:38:25,287 adding 'flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,289 adding 'flashinfer/data/csrc/batch_prefill_sm90.cu' 2026-04-06T02:38:25,291 adding 'flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja' 2026-04-06T02:38:25,292 adding 'flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu' 2026-04-06T02:38:25,294 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.cu' 2026-04-06T02:38:25,295 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.jinja' 2026-04-06T02:38:25,297 adding 'flashinfer/data/csrc/blackwell_fmha_plan.cu' 2026-04-06T02:38:25,298 adding 'flashinfer/data/csrc/bmm_fp8.cu' 2026-04-06T02:38:25,299 adding 'flashinfer/data/csrc/cascade.cu' 2026-04-06T02:38:25,301 adding 'flashinfer/data/csrc/concat_mla.cu' 2026-04-06T02:38:25,306 adding 'flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu' 2026-04-06T02:38:25,308 adding 'flashinfer/data/csrc/cudnn_sdpa_utils.h' 2026-04-06T02:38:25,310 adding 'flashinfer/data/csrc/cutlass_mla.cu' 2026-04-06T02:38:25,311 adding 'flashinfer/data/csrc/dsv3_router_gemm.cu' 2026-04-06T02:38:25,313 adding 'flashinfer/data/csrc/flashinfer_cascade_binding.cu' 2026-04-06T02:38:25,314 adding 'flashinfer/data/csrc/flashinfer_gemm_binding.cu' 2026-04-06T02:38:25,315 adding 'flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu' 2026-04-06T02:38:25,316 adding 'flashinfer/data/csrc/flashinfer_mamba_binding.cu' 2026-04-06T02:38:25,318 adding 'flashinfer/data/csrc/flashinfer_mla_binding.cu' 2026-04-06T02:38:25,319 adding 'flashinfer/data/csrc/flashinfer_norm_binding.cu' 2026-04-06T02:38:25,320 adding 'flashinfer/data/csrc/flashinfer_page_binding.cu' 2026-04-06T02:38:25,321 adding 'flashinfer/data/csrc/flashinfer_quantization_binding.cu' 2026-04-06T02:38:25,323 adding 'flashinfer/data/csrc/flashinfer_rope_binding.cu' 2026-04-06T02:38:25,324 adding 'flashinfer/data/csrc/flashinfer_sampling_binding.cu' 2026-04-06T02:38:25,325 adding 'flashinfer/data/csrc/flashinfer_topk_binding.cu' 2026-04-06T02:38:25,327 adding 'flashinfer/data/csrc/flashinfer_xqa_binding.cu' 2026-04-06T02:38:25,328 adding 'flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc' 2026-04-06T02:38:25,331 adding 'flashinfer/data/csrc/fmhaReduction.cu' 2026-04-06T02:38:25,332 adding 'flashinfer/data/csrc/fmha_cutlass_sm100.cu' 2026-04-06T02:38:25,333 adding 'flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu' 2026-04-06T02:38:25,335 adding 'flashinfer/data/csrc/fmha_v2_jit_binding.cu' 2026-04-06T02:38:25,338 adding 'flashinfer/data/csrc/fmha_v2_run.cu' 2026-04-06T02:38:25,340 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.cu' 2026-04-06T02:38:25,341 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.jinja' 2026-04-06T02:38:25,343 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu' 2026-04-06T02:38:25,344 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja' 2026-04-06T02:38:25,346 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu' 2026-04-06T02:38:25,347 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja' 2026-04-06T02:38:25,349 adding 'flashinfer/data/csrc/fp4_kv_dequantization.cu' 2026-04-06T02:38:25,351 adding 'flashinfer/data/csrc/fp4_kv_quantization.cu' 2026-04-06T02:38:25,353 adding 'flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu' 2026-04-06T02:38:25,354 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.cu' 2026-04-06T02:38:25,355 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.jinja' 2026-04-06T02:38:25,357 adding 'flashinfer/data/csrc/gdn_prefill_launcher.cu' 2026-04-06T02:38:25,358 adding 'flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,360 adding 'flashinfer/data/csrc/gemm_groupwise_sm100.cu' 2026-04-06T02:38:25,361 adding 'flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja' 2026-04-06T02:38:25,363 adding 'flashinfer/data/csrc/gemm_groupwise_sm120.cu' 2026-04-06T02:38:25,364 adding 'flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja' 2026-04-06T02:38:25,365 adding 'flashinfer/data/csrc/gemm_sm100_binding.cu' 2026-04-06T02:38:25,367 adding 'flashinfer/data/csrc/gemm_sm120_binding.cu' 2026-04-06T02:38:25,368 adding 'flashinfer/data/csrc/group_gemm.cu' 2026-04-06T02:38:25,370 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu' 2026-04-06T02:38:25,371 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja' 2026-04-06T02:38:25,373 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu' 2026-04-06T02:38:25,374 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja' 2026-04-06T02:38:25,376 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu' 2026-04-06T02:38:25,377 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja' 2026-04-06T02:38:25,379 adding 'flashinfer/data/csrc/group_gemm_sm100_binding.cu' 2026-04-06T02:38:25,380 adding 'flashinfer/data/csrc/group_gemm_sm120_binding.cu' 2026-04-06T02:38:25,382 adding 'flashinfer/data/csrc/group_gemm_sm90.cu' 2026-04-06T02:38:25,383 adding 'flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,384 adding 'flashinfer/data/csrc/logging.cc' 2026-04-06T02:38:25,386 adding 'flashinfer/data/csrc/moe_utils_binding.cu' 2026-04-06T02:38:25,388 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.cu' 2026-04-06T02:38:25,389 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja' 2026-04-06T02:38:25,391 adding 'flashinfer/data/csrc/norm.cu' 2026-04-06T02:38:25,392 adding 'flashinfer/data/csrc/nvshmem_binding.cu' 2026-04-06T02:38:25,394 adding 'flashinfer/data/csrc/page.cu' 2026-04-06T02:38:25,395 adding 'flashinfer/data/csrc/pod.cu' 2026-04-06T02:38:25,397 adding 'flashinfer/data/csrc/pod_customize_config.jinja' 2026-04-06T02:38:25,398 adding 'flashinfer/data/csrc/pod_jit_binding.cu' 2026-04-06T02:38:25,399 adding 'flashinfer/data/csrc/pod_kernel_inst.jinja' 2026-04-06T02:38:25,400 adding 'flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu' 2026-04-06T02:38:25,402 adding 'flashinfer/data/csrc/quantization.cu' 2026-04-06T02:38:25,403 adding 'flashinfer/data/csrc/renorm.cu' 2026-04-06T02:38:25,406 adding 'flashinfer/data/csrc/rope.cu' 2026-04-06T02:38:25,407 adding 'flashinfer/data/csrc/runtime_utils.h' 2026-04-06T02:38:25,409 adding 'flashinfer/data/csrc/sampling.cu' 2026-04-06T02:38:25,410 adding 'flashinfer/data/csrc/sampling_utils.h' 2026-04-06T02:38:25,413 adding 'flashinfer/data/csrc/selective_state_update.cu' 2026-04-06T02:38:25,414 adding 'flashinfer/data/csrc/selective_state_update_customize_config.jinja' 2026-04-06T02:38:25,416 adding 'flashinfer/data/csrc/selective_state_update_dtype_inst.jinja' 2026-04-06T02:38:25,417 adding 'flashinfer/data/csrc/selective_state_update_kernel_inst.cu' 2026-04-06T02:38:25,418 adding 'flashinfer/data/csrc/seq_chunk_cumsum.cu' 2026-04-06T02:38:25,419 adding 'flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu' 2026-04-06T02:38:25,421 adding 'flashinfer/data/csrc/single_decode.cu' 2026-04-06T02:38:25,422 adding 'flashinfer/data/csrc/single_decode_customize_config.jinja' 2026-04-06T02:38:25,423 adding 'flashinfer/data/csrc/single_decode_jit_binding.cu' 2026-04-06T02:38:25,425 adding 'flashinfer/data/csrc/single_decode_kernel_inst.jinja' 2026-04-06T02:38:25,426 adding 'flashinfer/data/csrc/single_prefill.cu' 2026-04-06T02:38:25,427 adding 'flashinfer/data/csrc/single_prefill_customize_config.jinja' 2026-04-06T02:38:25,429 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90.cu' 2026-04-06T02:38:25,430 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,431 adding 'flashinfer/data/csrc/single_prefill_jit_binding.cu' 2026-04-06T02:38:25,433 adding 'flashinfer/data/csrc/single_prefill_kernel_inst.jinja' 2026-04-06T02:38:25,434 adding 'flashinfer/data/csrc/single_prefill_sm90.cu' 2026-04-06T02:38:25,435 adding 'flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja' 2026-04-06T02:38:25,437 adding 'flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu' 2026-04-06T02:38:25,438 adding 'flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja' 2026-04-06T02:38:25,440 adding 'flashinfer/data/csrc/tgv_gemm.cu' 2026-04-06T02:38:25,441 adding 'flashinfer/data/csrc/tgv_gemm.jinja' 2026-04-06T02:38:25,444 adding 'flashinfer/data/csrc/tinygemm2.cu' 2026-04-06T02:38:25,446 adding 'flashinfer/data/csrc/topk.cu' 2026-04-06T02:38:25,447 adding 'flashinfer/data/csrc/trtllm_allreduce.cu' 2026-04-06T02:38:25,449 adding 'flashinfer/data/csrc/trtllm_allreduce_fusion.cu' 2026-04-06T02:38:25,451 adding 'flashinfer/data/csrc/trtllm_alltoall.cu' 2026-04-06T02:38:25,454 adding 'flashinfer/data/csrc/trtllm_alltoall_prepare.cu' 2026-04-06T02:38:25,456 adding 'flashinfer/data/csrc/trtllm_batched_gemm_runner.cu' 2026-04-06T02:38:25,460 adding 'flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu' 2026-04-06T02:38:25,462 adding 'flashinfer/data/csrc/trtllm_fmha_v2_binding.cu' 2026-04-06T02:38:25,471 adding 'flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu' 2026-04-06T02:38:25,475 adding 'flashinfer/data/csrc/trtllm_fused_moe_runner.cu' 2026-04-06T02:38:25,477 adding 'flashinfer/data/csrc/trtllm_gemm_runner.cu' 2026-04-06T02:38:25,479 adding 'flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu' 2026-04-06T02:38:25,480 adding 'flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu' 2026-04-06T02:38:25,482 adding 'flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu' 2026-04-06T02:38:25,484 adding 'flashinfer/data/csrc/trtllm_moe_alltoall.cu' 2026-04-06T02:38:25,486 adding 'flashinfer/data/csrc/tvm_ffi_utils.h' 2026-04-06T02:38:25,487 adding 'flashinfer/data/csrc/vllm_custom_all_reduce.cu' 2026-04-06T02:38:25,490 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h' 2026-04-06T02:38:25,491 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h' 2026-04-06T02:38:25,493 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h' 2026-04-06T02:38:25,495 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h' 2026-04-06T02:38:25,497 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h' 2026-04-06T02:38:25,499 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h' 2026-04-06T02:38:25,501 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h' 2026-04-06T02:38:25,504 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h' 2026-04-06T02:38:25,506 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h' 2026-04-06T02:38:25,508 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h' 2026-04-06T02:38:25,510 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h' 2026-04-06T02:38:25,515 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h' 2026-04-06T02:38:25,516 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h' 2026-04-06T02:38:25,518 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h' 2026-04-06T02:38:25,520 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h' 2026-04-06T02:38:25,523 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h' 2026-04-06T02:38:25,525 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h' 2026-04-06T02:38:25,528 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h' 2026-04-06T02:38:25,530 adding 'flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h' 2026-04-06T02:38:25,535 adding 'flashinfer/data/csrc/fmha_v2/fmha/fragment.h' 2026-04-06T02:38:25,536 adding 'flashinfer/data/csrc/fmha_v2/fmha/gemm.h' 2026-04-06T02:38:25,538 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h' 2026-04-06T02:38:25,542 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h' 2026-04-06T02:38:25,545 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h' 2026-04-06T02:38:25,547 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h' 2026-04-06T02:38:25,551 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h' 2026-04-06T02:38:25,562 adding 'flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h' 2026-04-06T02:38:25,565 adding 'flashinfer/data/csrc/fmha_v2/fmha/mask.h' 2026-04-06T02:38:25,566 adding 'flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h' 2026-04-06T02:38:25,568 adding 'flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h' 2026-04-06T02:38:25,572 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h' 2026-04-06T02:38:25,577 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h' 2026-04-06T02:38:25,580 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h' 2026-04-06T02:38:25,583 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h' 2026-04-06T02:38:25,594 adding 'flashinfer/data/csrc/fmha_v2/fmha/softmax.h' 2026-04-06T02:38:25,597 adding 'flashinfer/data/csrc/fmha_v2/fmha/traits.h' 2026-04-06T02:38:25,605 adding 'flashinfer/data/csrc/fmha_v2/fmha/utils.h' 2026-04-06T02:38:25,608 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h' 2026-04-06T02:38:25,611 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h' 2026-04-06T02:38:25,613 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h' 2026-04-06T02:38:25,618 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h' 2026-04-06T02:38:25,620 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h' 2026-04-06T02:38:25,622 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h' 2026-04-06T02:38:25,625 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h' 2026-04-06T02:38:25,632 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h' 2026-04-06T02:38:25,634 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h' 2026-04-06T02:38:25,636 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h' 2026-04-06T02:38:25,638 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h' 2026-04-06T02:38:25,640 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h' 2026-04-06T02:38:25,642 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h' 2026-04-06T02:38:25,645 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h' 2026-04-06T02:38:25,647 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h' 2026-04-06T02:38:25,652 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h' 2026-04-06T02:38:25,654 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h' 2026-04-06T02:38:25,656 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h' 2026-04-06T02:38:25,658 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h' 2026-04-06T02:38:25,662 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h' 2026-04-06T02:38:25,666 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h' 2026-04-06T02:38:25,671 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h' 2026-04-06T02:38:25,674 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h' 2026-04-06T02:38:25,677 adding 'flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja' 2026-04-06T02:38:25,679 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel.jinja' 2026-04-06T02:38:25,681 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja' 2026-04-06T02:38:25,684 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja' 2026-04-06T02:38:25,686 adding 'flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh' 2026-04-06T02:38:25,689 adding 'flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu' 2026-04-06T02:38:25,691 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu' 2026-04-06T02:38:25,715 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh' 2026-04-06T02:38:25,718 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu' 2026-04-06T02:38:25,723 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu' 2026-04-06T02:38:25,729 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu' 2026-04-06T02:38:25,731 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu' 2026-04-06T02:38:25,733 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu' 2026-04-06T02:38:25,735 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_renormalize.cu' 2026-04-06T02:38:25,738 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/RoutingDeepSeekCommon.cuh' 2026-04-06T02:38:25,739 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchClusterKernel.cu' 2026-04-06T02:38:25,742 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchCoopKernel.cu' 2026-04-06T02:38:25,743 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchHistogramKernel.cu' 2026-04-06T02:38:25,745 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchInitExpertCounts.cu' 2026-04-06T02:38:25,747 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchMainKernel.cu' 2026-04-06T02:38:25,749 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchOffsetsKernel.cu' 2026-04-06T02:38:25,751 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/RoutingRenormalizeCommon.cuh' 2026-04-06T02:38:25,753 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchBlockKernel.cu' 2026-04-06T02:38:25,755 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchClusterKernel.cu' 2026-04-06T02:38:25,757 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramKernel.cu' 2026-04-06T02:38:25,758 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramScoresKernel.cu' 2026-04-06T02:38:25,760 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchInitExpertCounts.cu' 2026-04-06T02:38:25,762 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchOffsetsKernel.cu' 2026-04-06T02:38:25,765 adding 'flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp' 2026-04-06T02:38:25,767 adding 'flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp' 2026-04-06T02:38:25,771 adding 'flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu' 2026-04-06T02:38:25,773 adding 'flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp' 2026-04-06T02:38:25,774 adding 'flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp' 2026-04-06T02:38:25,778 adding 'flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu' 2026-04-06T02:38:25,781 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h' 2026-04-06T02:38:25,783 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h' 2026-04-06T02:38:25,784 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h' 2026-04-06T02:38:25,786 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h' 2026-04-06T02:38:25,788 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h' 2026-04-06T02:38:25,792 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h' 2026-04-06T02:38:25,794 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h' 2026-04-06T02:38:25,796 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h' 2026-04-06T02:38:25,798 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h' 2026-04-06T02:38:25,800 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h' 2026-04-06T02:38:25,801 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h' 2026-04-06T02:38:25,804 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h' 2026-04-06T02:38:25,806 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh' 2026-04-06T02:38:25,807 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h' 2026-04-06T02:38:25,809 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh' 2026-04-06T02:38:25,811 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h' 2026-04-06T02:38:25,813 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h' 2026-04-06T02:38:25,814 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh' 2026-04-06T02:38:25,817 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh' 2026-04-06T02:38:25,818 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h' 2026-04-06T02:38:25,821 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h' 2026-04-06T02:38:25,823 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h' 2026-04-06T02:38:25,825 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h' 2026-04-06T02:38:25,827 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h' 2026-04-06T02:38:25,828 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h' 2026-04-06T02:38:25,830 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h' 2026-04-06T02:38:25,831 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h' 2026-04-06T02:38:25,833 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp' 2026-04-06T02:38:25,834 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp' 2026-04-06T02:38:25,836 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp' 2026-04-06T02:38:25,837 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h' 2026-04-06T02:38:25,839 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h' 2026-04-06T02:38:25,842 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp' 2026-04-06T02:38:25,846 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp' 2026-04-06T02:38:25,849 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp' 2026-04-06T02:38:25,852 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp' 2026-04-06T02:38:25,856 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp' 2026-04-06T02:38:25,858 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h' 2026-04-06T02:38:25,860 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp' 2026-04-06T02:38:25,861 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp' 2026-04-06T02:38:25,863 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp' 2026-04-06T02:38:25,864 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp' 2026-04-06T02:38:25,865 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp' 2026-04-06T02:38:25,867 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp' 2026-04-06T02:38:25,874 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp' 2026-04-06T02:38:25,877 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp' 2026-04-06T02:38:25,881 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-06T02:38:25,888 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-06T02:38:25,891 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl' 2026-04-06T02:38:25,893 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl' 2026-04-06T02:38:25,894 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl' 2026-04-06T02:38:25,897 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h' 2026-04-06T02:38:25,898 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh' 2026-04-06T02:38:25,901 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh' 2026-04-06T02:38:25,903 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh' 2026-04-06T02:38:25,905 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h' 2026-04-06T02:38:25,906 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp' 2026-04-06T02:38:25,907 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h' 2026-04-06T02:38:25,909 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh' 2026-04-06T02:38:25,912 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h' 2026-04-06T02:38:25,914 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h' 2026-04-06T02:38:25,917 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp' 2026-04-06T02:38:25,920 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp' 2026-04-06T02:38:25,923 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h' 2026-04-06T02:38:25,925 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h' 2026-04-06T02:38:25,926 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h' 2026-04-06T02:38:25,928 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h' 2026-04-06T02:38:25,930 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h' 2026-04-06T02:38:25,932 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h' 2026-04-06T02:38:25,934 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h' 2026-04-06T02:38:25,936 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h' 2026-04-06T02:38:25,939 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h' 2026-04-06T02:38:25,941 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h' 2026-04-06T02:38:25,943 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h' 2026-04-06T02:38:25,945 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h' 2026-04-06T02:38:25,947 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h' 2026-04-06T02:38:25,949 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h' 2026-04-06T02:38:25,951 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h' 2026-04-06T02:38:25,954 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h' 2026-04-06T02:38:25,956 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp' 2026-04-06T02:38:25,959 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh' 2026-04-06T02:38:25,961 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh' 2026-04-06T02:38:25,965 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh' 2026-04-06T02:38:25,966 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh' 2026-04-06T02:38:25,970 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh' 2026-04-06T02:38:25,976 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh' 2026-04-06T02:38:25,978 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh' 2026-04-06T02:38:25,980 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh' 2026-04-06T02:38:25,982 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh' 2026-04-06T02:38:25,984 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh' 2026-04-06T02:38:25,985 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh' 2026-04-06T02:38:25,987 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu' 2026-04-06T02:38:25,989 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h' 2026-04-06T02:38:25,990 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu' 2026-04-06T02:38:25,991 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h' 2026-04-06T02:38:25,994 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh' 2026-04-06T02:38:25,996 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h' 2026-04-06T02:38:25,999 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh' 2026-04-06T02:38:26,004 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu' 2026-04-06T02:38:26,006 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h' 2026-04-06T02:38:26,009 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu' 2026-04-06T02:38:26,010 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h' 2026-04-06T02:38:26,014 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp' 2026-04-06T02:38:26,016 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h' 2026-04-06T02:38:26,017 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h' 2026-04-06T02:38:26,020 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu' 2026-04-06T02:38:26,021 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h' 2026-04-06T02:38:26,028 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh' 2026-04-06T02:38:26,031 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh' 2026-04-06T02:38:26,032 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh' 2026-04-06T02:38:26,035 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh' 2026-04-06T02:38:26,037 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh' 2026-04-06T02:38:26,040 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu' 2026-04-06T02:38:26,041 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu' 2026-04-06T02:38:26,042 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu' 2026-04-06T02:38:26,044 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu' 2026-04-06T02:38:26,045 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu' 2026-04-06T02:38:26,046 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu' 2026-04-06T02:38:26,048 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu' 2026-04-06T02:38:26,049 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu' 2026-04-06T02:38:26,050 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu' 2026-04-06T02:38:26,052 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu' 2026-04-06T02:38:26,053 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu' 2026-04-06T02:38:26,054 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu' 2026-04-06T02:38:26,055 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu' 2026-04-06T02:38:26,057 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu' 2026-04-06T02:38:26,058 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu' 2026-04-06T02:38:26,059 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu' 2026-04-06T02:38:26,061 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu' 2026-04-06T02:38:26,062 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h' 2026-04-06T02:38:26,065 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h' 2026-04-06T02:38:26,067 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h' 2026-04-06T02:38:26,069 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h' 2026-04-06T02:38:26,072 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl' 2026-04-06T02:38:26,074 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h' 2026-04-06T02:38:26,075 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h' 2026-04-06T02:38:26,077 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h' 2026-04-06T02:38:26,083 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h' 2026-04-06T02:38:26,085 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h' 2026-04-06T02:38:26,087 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu' 2026-04-06T02:38:26,088 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu' 2026-04-06T02:38:26,090 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu' 2026-04-06T02:38:26,091 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu' 2026-04-06T02:38:26,092 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu' 2026-04-06T02:38:26,094 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu' 2026-04-06T02:38:26,095 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu' 2026-04-06T02:38:26,096 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu' 2026-04-06T02:38:26,097 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu' 2026-04-06T02:38:26,099 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu' 2026-04-06T02:38:26,100 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu' 2026-04-06T02:38:26,101 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu' 2026-04-06T02:38:26,102 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu' 2026-04-06T02:38:26,104 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu' 2026-04-06T02:38:26,108 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h' 2026-04-06T02:38:26,111 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h' 2026-04-06T02:38:26,113 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h' 2026-04-06T02:38:26,115 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu' 2026-04-06T02:38:26,116 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh' 2026-04-06T02:38:26,118 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h' 2026-04-06T02:38:26,120 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h' 2026-04-06T02:38:26,122 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl' 2026-04-06T02:38:26,123 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h' 2026-04-06T02:38:26,131 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl' 2026-04-06T02:38:26,134 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h' 2026-04-06T02:38:26,136 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl' 2026-04-06T02:38:26,138 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp' 2026-04-06T02:38:26,139 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h' 2026-04-06T02:38:26,142 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp' 2026-04-06T02:38:26,144 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp' 2026-04-06T02:38:26,145 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h' 2026-04-06T02:38:26,147 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp' 2026-04-06T02:38:26,149 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h' 2026-04-06T02:38:26,150 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h' 2026-04-06T02:38:26,151 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h' 2026-04-06T02:38:26,154 adding 'flashinfer/data/csrc/xqa/barriers.cuh' 2026-04-06T02:38:26,156 adding 'flashinfer/data/csrc/xqa/cuda_hint.cuh' 2026-04-06T02:38:26,157 adding 'flashinfer/data/csrc/xqa/defines.h' 2026-04-06T02:38:26,159 adding 'flashinfer/data/csrc/xqa/gmma.cuh' 2026-04-06T02:38:26,169 adding 'flashinfer/data/csrc/xqa/gmma_impl.cuh' 2026-04-06T02:38:26,172 adding 'flashinfer/data/csrc/xqa/hostUtils.h' 2026-04-06T02:38:26,174 adding 'flashinfer/data/csrc/xqa/ldgsts.cuh' 2026-04-06T02:38:26,187 adding 'flashinfer/data/csrc/xqa/mha.cu' 2026-04-06T02:38:26,190 adding 'flashinfer/data/csrc/xqa/mha.h' 2026-04-06T02:38:26,192 adding 'flashinfer/data/csrc/xqa/mhaUtils.cuh' 2026-04-06T02:38:26,194 adding 'flashinfer/data/csrc/xqa/mha_components.cuh' 2026-04-06T02:38:26,208 adding 'flashinfer/data/csrc/xqa/mha_sm90.cu' 2026-04-06T02:38:26,211 adding 'flashinfer/data/csrc/xqa/mha_stdheaders.cuh' 2026-04-06T02:38:26,219 adding 'flashinfer/data/csrc/xqa/mla_sm120.cu' 2026-04-06T02:38:26,221 adding 'flashinfer/data/csrc/xqa/mla_sm120.cuh' 2026-04-06T02:38:26,222 adding 'flashinfer/data/csrc/xqa/mma.cuh' 2026-04-06T02:38:26,224 adding 'flashinfer/data/csrc/xqa/platform.h' 2026-04-06T02:38:26,225 adding 'flashinfer/data/csrc/xqa/specDec.h' 2026-04-06T02:38:26,227 adding 'flashinfer/data/csrc/xqa/tensorMap.cpp' 2026-04-06T02:38:26,228 adding 'flashinfer/data/csrc/xqa/tensorMap.h' 2026-04-06T02:38:26,230 adding 'flashinfer/data/csrc/xqa/tma.h' 2026-04-06T02:38:26,233 adding 'flashinfer/data/csrc/xqa/utils.cuh' 2026-04-06T02:38:26,235 adding 'flashinfer/data/csrc/xqa/utils.h' 2026-04-06T02:38:26,237 adding 'flashinfer/data/csrc/xqa/xqa_wrapper.cu' 2026-04-06T02:38:26,240 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py' 2026-04-06T02:38:26,241 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py' 2026-04-06T02:38:26,243 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py' 2026-04-06T02:38:26,246 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py' 2026-04-06T02:38:26,248 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py' 2026-04-06T02:38:26,250 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py' 2026-04-06T02:38:26,252 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py' 2026-04-06T02:38:26,254 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py' 2026-04-06T02:38:26,256 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py' 2026-04-06T02:38:26,258 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py' 2026-04-06T02:38:26,259 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py' 2026-04-06T02:38:26,261 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py' 2026-04-06T02:38:26,263 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py' 2026-04-06T02:38:26,266 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py' 2026-04-06T02:38:26,268 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py' 2026-04-06T02:38:26,272 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py' 2026-04-06T02:38:26,274 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py' 2026-04-06T02:38:26,276 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py' 2026-04-06T02:38:26,277 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py' 2026-04-06T02:38:26,279 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py' 2026-04-06T02:38:26,282 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py' 2026-04-06T02:38:26,284 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py' 2026-04-06T02:38:26,287 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py' 2026-04-06T02:38:26,289 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py' 2026-04-06T02:38:26,291 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py' 2026-04-06T02:38:26,293 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py' 2026-04-06T02:38:26,296 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py' 2026-04-06T02:38:26,301 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py' 2026-04-06T02:38:26,305 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py' 2026-04-06T02:38:26,307 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py' 2026-04-06T02:38:26,311 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py' 2026-04-06T02:38:26,313 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py' 2026-04-06T02:38:26,318 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py' 2026-04-06T02:38:26,330 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py' 2026-04-06T02:38:26,340 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py' 2026-04-06T02:38:26,350 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-06T02:38:26,358 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py' 2026-04-06T02:38:26,366 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py' 2026-04-06T02:38:26,374 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py' 2026-04-06T02:38:26,382 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py' 2026-04-06T02:38:26,391 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py' 2026-04-06T02:38:26,398 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py' 2026-04-06T02:38:26,410 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py' 2026-04-06T02:38:26,421 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py' 2026-04-06T02:38:26,433 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py' 2026-04-06T02:38:26,443 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py' 2026-04-06T02:38:26,460 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py' 2026-04-06T02:38:26,464 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py' 2026-04-06T02:38:26,467 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py' 2026-04-06T02:38:26,470 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py' 2026-04-06T02:38:26,482 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py' 2026-04-06T02:38:26,492 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py' 2026-04-06T02:38:26,503 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py' 2026-04-06T02:38:26,514 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py' 2026-04-06T02:38:26,518 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py' 2026-04-06T02:38:26,526 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py' 2026-04-06T02:38:26,533 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py' 2026-04-06T02:38:26,536 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py' 2026-04-06T02:38:26,539 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py' 2026-04-06T02:38:26,551 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py' 2026-04-06T02:38:26,554 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py' 2026-04-06T02:38:26,556 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py' 2026-04-06T02:38:26,565 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py' 2026-04-06T02:38:26,572 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py' 2026-04-06T02:38:26,580 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py' 2026-04-06T02:38:26,583 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py' 2026-04-06T02:38:26,593 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py' 2026-04-06T02:38:26,603 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py' 2026-04-06T02:38:26,613 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py' 2026-04-06T02:38:26,616 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py' 2026-04-06T02:38:26,631 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py' 2026-04-06T02:38:26,646 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py' 2026-04-06T02:38:26,649 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py' 2026-04-06T02:38:26,652 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py' 2026-04-06T02:38:26,655 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py' 2026-04-06T02:38:26,658 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py' 2026-04-06T02:38:26,662 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py' 2026-04-06T02:38:26,664 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py' 2026-04-06T02:38:26,670 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py' 2026-04-06T02:38:26,672 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py' 2026-04-06T02:38:26,674 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py' 2026-04-06T02:38:26,675 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py' 2026-04-06T02:38:26,677 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py' 2026-04-06T02:38:26,679 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py' 2026-04-06T02:38:26,682 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py' 2026-04-06T02:38:26,683 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py' 2026-04-06T02:38:26,685 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py' 2026-04-06T02:38:26,686 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py' 2026-04-06T02:38:26,687 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py' 2026-04-06T02:38:26,689 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py' 2026-04-06T02:38:26,690 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py' 2026-04-06T02:38:26,691 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py' 2026-04-06T02:38:26,694 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py' 2026-04-06T02:38:26,697 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py' 2026-04-06T02:38:26,700 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py' 2026-04-06T02:38:26,702 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py' 2026-04-06T02:38:26,712 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py' 2026-04-06T02:38:26,721 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py' 2026-04-06T02:38:26,732 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py' 2026-04-06T02:38:26,735 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py' 2026-04-06T02:38:26,740 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py' 2026-04-06T02:38:26,748 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py' 2026-04-06T02:38:26,750 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py' 2026-04-06T02:38:26,759 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py' 2026-04-06T02:38:26,763 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py' 2026-04-06T02:38:26,767 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py' 2026-04-06T02:38:26,989 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py' 2026-04-06T02:38:26,992 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py' 2026-04-06T02:38:26,998 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py' 2026-04-06T02:38:27,005 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py' 2026-04-06T02:38:27,014 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py' 2026-04-06T02:38:27,016 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py' 2026-04-06T02:38:27,018 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py' 2026-04-06T02:38:27,020 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py' 2026-04-06T02:38:27,022 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py' 2026-04-06T02:38:27,023 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py' 2026-04-06T02:38:27,026 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py' 2026-04-06T02:38:27,029 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py' 2026-04-06T02:38:27,030 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py' 2026-04-06T02:38:27,033 adding 'flashinfer/data/cutlass/include/cute/config.hpp' 2026-04-06T02:38:27,036 adding 'flashinfer/data/cutlass/include/cute/int_tuple.hpp' 2026-04-06T02:38:27,043 adding 'flashinfer/data/cutlass/include/cute/layout.hpp' 2026-04-06T02:38:27,046 adding 'flashinfer/data/cutlass/include/cute/layout_composed.hpp' 2026-04-06T02:38:27,048 adding 'flashinfer/data/cutlass/include/cute/pointer.hpp' 2026-04-06T02:38:27,049 adding 'flashinfer/data/cutlass/include/cute/pointer_base.hpp' 2026-04-06T02:38:27,051 adding 'flashinfer/data/cutlass/include/cute/pointer_flagged.hpp' 2026-04-06T02:38:27,053 adding 'flashinfer/data/cutlass/include/cute/pointer_sparse.hpp' 2026-04-06T02:38:27,054 adding 'flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp' 2026-04-06T02:38:27,057 adding 'flashinfer/data/cutlass/include/cute/stride.hpp' 2026-04-06T02:38:27,059 adding 'flashinfer/data/cutlass/include/cute/swizzle.hpp' 2026-04-06T02:38:27,062 adding 'flashinfer/data/cutlass/include/cute/swizzle_layout.hpp' 2026-04-06T02:38:27,064 adding 'flashinfer/data/cutlass/include/cute/tensor.hpp' 2026-04-06T02:38:27,067 adding 'flashinfer/data/cutlass/include/cute/tensor_impl.hpp' 2026-04-06T02:38:27,069 adding 'flashinfer/data/cutlass/include/cute/tensor_zip.hpp' 2026-04-06T02:38:27,071 adding 'flashinfer/data/cutlass/include/cute/underscore.hpp' 2026-04-06T02:38:27,073 adding 'flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp' 2026-04-06T02:38:27,074 adding 'flashinfer/data/cutlass/include/cute/algorithm/clear.hpp' 2026-04-06T02:38:27,076 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp' 2026-04-06T02:38:27,079 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp' 2026-04-06T02:38:27,081 adding 'flashinfer/data/cutlass/include/cute/algorithm/copy.hpp' 2026-04-06T02:38:27,083 adding 'flashinfer/data/cutlass/include/cute/algorithm/fill.hpp' 2026-04-06T02:38:27,085 adding 'flashinfer/data/cutlass/include/cute/algorithm/functional.hpp' 2026-04-06T02:38:27,087 adding 'flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp' 2026-04-06T02:38:27,088 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp' 2026-04-06T02:38:27,090 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp' 2026-04-06T02:38:27,091 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp' 2026-04-06T02:38:27,093 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp' 2026-04-06T02:38:27,095 adding 'flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp' 2026-04-06T02:38:27,098 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp' 2026-04-06T02:38:27,100 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp' 2026-04-06T02:38:27,102 adding 'flashinfer/data/cutlass/include/cute/arch/config.hpp' 2026-04-06T02:38:27,104 adding 'flashinfer/data/cutlass/include/cute/arch/copy.hpp' 2026-04-06T02:38:27,115 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp' 2026-04-06T02:38:27,119 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp' 2026-04-06T02:38:27,121 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp' 2026-04-06T02:38:27,122 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp' 2026-04-06T02:38:27,124 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp' 2026-04-06T02:38:27,125 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp' 2026-04-06T02:38:27,127 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp' 2026-04-06T02:38:27,131 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp' 2026-04-06T02:38:27,132 adding 'flashinfer/data/cutlass/include/cute/arch/mma.hpp' 2026-04-06T02:38:27,134 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp' 2026-04-06T02:38:27,136 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp' 2026-04-06T02:38:27,140 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp' 2026-04-06T02:38:27,145 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp' 2026-04-06T02:38:27,150 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp' 2026-04-06T02:38:27,152 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp' 2026-04-06T02:38:27,154 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp' 2026-04-06T02:38:27,155 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp' 2026-04-06T02:38:27,159 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp' 2026-04-06T02:38:27,161 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp' 2026-04-06T02:38:27,173 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp' 2026-04-06T02:38:27,177 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp' 2026-04-06T02:38:27,221 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp' 2026-04-06T02:38:27,312 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp' 2026-04-06T02:38:27,368 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp' 2026-04-06T02:38:27,463 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp' 2026-04-06T02:38:27,482 adding 'flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp' 2026-04-06T02:38:27,484 adding 'flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp' 2026-04-06T02:38:27,486 adding 'flashinfer/data/cutlass/include/cute/arch/util.hpp' 2026-04-06T02:38:27,490 adding 'flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp' 2026-04-06T02:38:27,492 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp' 2026-04-06T02:38:27,499 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp' 2026-04-06T02:38:27,502 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp' 2026-04-06T02:38:27,504 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp' 2026-04-06T02:38:27,506 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp' 2026-04-06T02:38:27,507 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp' 2026-04-06T02:38:27,509 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp' 2026-04-06T02:38:27,510 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp' 2026-04-06T02:38:27,514 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp' 2026-04-06T02:38:27,522 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp' 2026-04-06T02:38:27,524 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp' 2026-04-06T02:38:27,526 adding 'flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp' 2026-04-06T02:38:27,528 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp' 2026-04-06T02:38:27,538 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp' 2026-04-06T02:38:27,542 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp' 2026-04-06T02:38:27,544 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp' 2026-04-06T02:38:27,545 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp' 2026-04-06T02:38:27,546 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp' 2026-04-06T02:38:27,548 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp' 2026-04-06T02:38:27,550 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp' 2026-04-06T02:38:27,551 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp' 2026-04-06T02:38:27,553 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp' 2026-04-06T02:38:27,564 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp' 2026-04-06T02:38:27,586 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp' 2026-04-06T02:38:27,598 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp' 2026-04-06T02:38:27,618 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp' 2026-04-06T02:38:27,623 adding 'flashinfer/data/cutlass/include/cute/atom/partitioner.hpp' 2026-04-06T02:38:27,625 adding 'flashinfer/data/cutlass/include/cute/container/alignment.hpp' 2026-04-06T02:38:27,627 adding 'flashinfer/data/cutlass/include/cute/container/array.hpp' 2026-04-06T02:38:27,628 adding 'flashinfer/data/cutlass/include/cute/container/array_aligned.hpp' 2026-04-06T02:38:27,631 adding 'flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp' 2026-04-06T02:38:27,632 adding 'flashinfer/data/cutlass/include/cute/container/bit_field.hpp' 2026-04-06T02:38:27,634 adding 'flashinfer/data/cutlass/include/cute/container/cuda_types.hpp' 2026-04-06T02:38:27,636 adding 'flashinfer/data/cutlass/include/cute/container/tuple.hpp' 2026-04-06T02:38:27,638 adding 'flashinfer/data/cutlass/include/cute/container/type_list.hpp' 2026-04-06T02:38:27,640 adding 'flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp' 2026-04-06T02:38:27,642 adding 'flashinfer/data/cutlass/include/cute/numeric/complex.hpp' 2026-04-06T02:38:27,643 adding 'flashinfer/data/cutlass/include/cute/numeric/int.hpp' 2026-04-06T02:38:27,645 adding 'flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp' 2026-04-06T02:38:27,647 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp' 2026-04-06T02:38:27,649 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp' 2026-04-06T02:38:27,651 adding 'flashinfer/data/cutlass/include/cute/numeric/math.hpp' 2026-04-06T02:38:27,653 adding 'flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp' 2026-04-06T02:38:27,655 adding 'flashinfer/data/cutlass/include/cute/numeric/real.hpp' 2026-04-06T02:38:27,657 adding 'flashinfer/data/cutlass/include/cute/util/debug.hpp' 2026-04-06T02:38:27,659 adding 'flashinfer/data/cutlass/include/cute/util/print.hpp' 2026-04-06T02:38:27,661 adding 'flashinfer/data/cutlass/include/cute/util/print_latex.hpp' 2026-04-06T02:38:27,664 adding 'flashinfer/data/cutlass/include/cute/util/print_svg.hpp' 2026-04-06T02:38:27,666 adding 'flashinfer/data/cutlass/include/cute/util/print_tensor.hpp' 2026-04-06T02:38:27,668 adding 'flashinfer/data/cutlass/include/cute/util/type_traits.hpp' 2026-04-06T02:38:27,673 adding 'flashinfer/data/cutlass/include/cutlass/aligned_buffer.h' 2026-04-06T02:38:27,678 adding 'flashinfer/data/cutlass/include/cutlass/array.h' 2026-04-06T02:38:27,680 adding 'flashinfer/data/cutlass/include/cutlass/array_planar_complex.h' 2026-04-06T02:38:27,682 adding 'flashinfer/data/cutlass/include/cutlass/array_subbyte.h' 2026-04-06T02:38:27,684 adding 'flashinfer/data/cutlass/include/cutlass/barrier.h' 2026-04-06T02:38:27,686 adding 'flashinfer/data/cutlass/include/cutlass/bfloat16.h' 2026-04-06T02:38:27,688 adding 'flashinfer/data/cutlass/include/cutlass/blas3.h' 2026-04-06T02:38:27,689 adding 'flashinfer/data/cutlass/include/cutlass/blas3_types.h' 2026-04-06T02:38:27,691 adding 'flashinfer/data/cutlass/include/cutlass/block_striped.h' 2026-04-06T02:38:27,693 adding 'flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp' 2026-04-06T02:38:27,696 adding 'flashinfer/data/cutlass/include/cutlass/complex.h' 2026-04-06T02:38:27,699 adding 'flashinfer/data/cutlass/include/cutlass/constants.h' 2026-04-06T02:38:27,701 adding 'flashinfer/data/cutlass/include/cutlass/coord.h' 2026-04-06T02:38:27,703 adding 'flashinfer/data/cutlass/include/cutlass/core_io.h' 2026-04-06T02:38:27,705 adding 'flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp' 2026-04-06T02:38:27,707 adding 'flashinfer/data/cutlass/include/cutlass/cutlass.h' 2026-04-06T02:38:27,708 adding 'flashinfer/data/cutlass/include/cutlass/device_kernel.h' 2026-04-06T02:38:27,713 adding 'flashinfer/data/cutlass/include/cutlass/exmy_base.h' 2026-04-06T02:38:27,716 adding 'flashinfer/data/cutlass/include/cutlass/fast_math.h' 2026-04-06T02:38:27,720 adding 'flashinfer/data/cutlass/include/cutlass/float8.h' 2026-04-06T02:38:27,723 adding 'flashinfer/data/cutlass/include/cutlass/float_subbyte.h' 2026-04-06T02:38:27,725 adding 'flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h' 2026-04-06T02:38:27,728 adding 'flashinfer/data/cutlass/include/cutlass/functional.h' 2026-04-06T02:38:27,730 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.h' 2026-04-06T02:38:27,731 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp' 2026-04-06T02:38:27,734 adding 'flashinfer/data/cutlass/include/cutlass/half.h' 2026-04-06T02:38:27,736 adding 'flashinfer/data/cutlass/include/cutlass/integer_subbyte.h' 2026-04-06T02:38:27,737 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h' 2026-04-06T02:38:27,739 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp' 2026-04-06T02:38:27,740 adding 'flashinfer/data/cutlass/include/cutlass/kernel_launch.h' 2026-04-06T02:38:27,761 adding 'flashinfer/data/cutlass/include/cutlass/matrix.h' 2026-04-06T02:38:27,765 adding 'flashinfer/data/cutlass/include/cutlass/matrix_coord.h' 2026-04-06T02:38:27,768 adding 'flashinfer/data/cutlass/include/cutlass/matrix_shape.h' 2026-04-06T02:38:27,822 adding 'flashinfer/data/cutlass/include/cutlass/numeric_conversion.h' 2026-04-06T02:38:27,826 adding 'flashinfer/data/cutlass/include/cutlass/numeric_size.h' 2026-04-06T02:38:27,827 adding 'flashinfer/data/cutlass/include/cutlass/numeric_types.h' 2026-04-06T02:38:27,829 adding 'flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h' 2026-04-06T02:38:27,831 adding 'flashinfer/data/cutlass/include/cutlass/predicate_vector.h' 2026-04-06T02:38:27,833 adding 'flashinfer/data/cutlass/include/cutlass/quaternion.h' 2026-04-06T02:38:27,835 adding 'flashinfer/data/cutlass/include/cutlass/real.h' 2026-04-06T02:38:27,836 adding 'flashinfer/data/cutlass/include/cutlass/relatively_equal.h' 2026-04-06T02:38:27,838 adding 'flashinfer/data/cutlass/include/cutlass/semaphore.h' 2026-04-06T02:38:27,841 adding 'flashinfer/data/cutlass/include/cutlass/subbyte_reference.h' 2026-04-06T02:38:27,843 adding 'flashinfer/data/cutlass/include/cutlass/tensor_coord.h' 2026-04-06T02:38:27,845 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref.h' 2026-04-06T02:38:27,847 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h' 2026-04-06T02:38:27,849 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view.h' 2026-04-06T02:38:27,851 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h' 2026-04-06T02:38:27,853 adding 'flashinfer/data/cutlass/include/cutlass/tfloat32.h' 2026-04-06T02:38:27,854 adding 'flashinfer/data/cutlass/include/cutlass/trace.h' 2026-04-06T02:38:27,856 adding 'flashinfer/data/cutlass/include/cutlass/uint128.h' 2026-04-06T02:38:27,857 adding 'flashinfer/data/cutlass/include/cutlass/uint256.h' 2026-04-06T02:38:27,859 adding 'flashinfer/data/cutlass/include/cutlass/version.h' 2026-04-06T02:38:27,860 adding 'flashinfer/data/cutlass/include/cutlass/wmma_array.h' 2026-04-06T02:38:27,861 adding 'flashinfer/data/cutlass/include/cutlass/workspace.h' 2026-04-06T02:38:27,864 adding 'flashinfer/data/cutlass/include/cutlass/arch/arch.h' 2026-04-06T02:38:27,866 adding 'flashinfer/data/cutlass/include/cutlass/arch/barrier.h' 2026-04-06T02:38:27,868 adding 'flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h' 2026-04-06T02:38:27,870 adding 'flashinfer/data/cutlass/include/cutlass/arch/config.h' 2026-04-06T02:38:27,871 adding 'flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h' 2026-04-06T02:38:27,873 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory.h' 2026-04-06T02:38:27,875 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h' 2026-04-06T02:38:27,876 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h' 2026-04-06T02:38:27,878 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma.h' 2026-04-06T02:38:27,880 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h' 2026-04-06T02:38:27,881 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h' 2026-04-06T02:38:27,883 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h' 2026-04-06T02:38:27,884 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h' 2026-04-06T02:38:27,886 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h' 2026-04-06T02:38:27,888 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h' 2026-04-06T02:38:27,890 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h' 2026-04-06T02:38:27,892 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h' 2026-04-06T02:38:27,894 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h' 2026-04-06T02:38:27,896 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h' 2026-04-06T02:38:27,898 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h' 2026-04-06T02:38:27,900 adding 'flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h' 2026-04-06T02:38:27,901 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd.h' 2026-04-06T02:38:27,902 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h' 2026-04-06T02:38:27,904 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h' 2026-04-06T02:38:27,907 adding 'flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp' 2026-04-06T02:38:27,909 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma.h' 2026-04-06T02:38:27,911 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h' 2026-04-06T02:38:27,912 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h' 2026-04-06T02:38:27,914 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h' 2026-04-06T02:38:27,917 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h' 2026-04-06T02:38:27,919 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h' 2026-04-06T02:38:27,922 adding 'flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp' 2026-04-06T02:38:27,924 adding 'flashinfer/data/cutlass/include/cutlass/conv/convolution.h' 2026-04-06T02:38:27,925 adding 'flashinfer/data/cutlass/include/cutlass/conv/detail.hpp' 2026-04-06T02:38:27,927 adding 'flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp' 2026-04-06T02:38:27,929 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp' 2026-04-06T02:38:27,930 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp' 2026-04-06T02:38:27,932 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp' 2026-04-06T02:38:27,937 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp' 2026-04-06T02:38:27,941 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp' 2026-04-06T02:38:27,943 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl' 2026-04-06T02:38:27,945 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl' 2026-04-06T02:38:27,946 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl' 2026-04-06T02:38:27,948 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl' 2026-04-06T02:38:27,951 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp' 2026-04-06T02:38:27,953 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h' 2026-04-06T02:38:27,955 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h' 2026-04-06T02:38:27,956 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h' 2026-04-06T02:38:27,959 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp' 2026-04-06T02:38:27,960 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h' 2026-04-06T02:38:27,963 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h' 2026-04-06T02:38:27,966 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h' 2026-04-06T02:38:27,968 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h' 2026-04-06T02:38:27,969 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h' 2026-04-06T02:38:27,971 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h' 2026-04-06T02:38:27,972 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h' 2026-04-06T02:38:27,974 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h' 2026-04-06T02:38:27,976 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h' 2026-04-06T02:38:27,978 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h' 2026-04-06T02:38:27,980 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h' 2026-04-06T02:38:27,982 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h' 2026-04-06T02:38:27,984 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h' 2026-04-06T02:38:27,986 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h' 2026-04-06T02:38:27,988 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h' 2026-04-06T02:38:27,990 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h' 2026-04-06T02:38:27,992 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h' 2026-04-06T02:38:27,994 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h' 2026-04-06T02:38:27,996 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h' 2026-04-06T02:38:27,998 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h' 2026-04-06T02:38:28,000 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h' 2026-04-06T02:38:28,003 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h' 2026-04-06T02:38:28,005 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h' 2026-04-06T02:38:28,008 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h' 2026-04-06T02:38:28,011 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h' 2026-04-06T02:38:28,013 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h' 2026-04-06T02:38:28,017 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp' 2026-04-06T02:38:28,019 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp' 2026-04-06T02:38:28,021 adding 'flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h' 2026-04-06T02:38:28,024 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,027 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,029 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,032 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,034 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,036 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h' 2026-04-06T02:38:28,038 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h' 2026-04-06T02:38:28,040 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,042 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,044 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h' 2026-04-06T02:38:28,046 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h' 2026-04-06T02:38:28,048 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,050 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h' 2026-04-06T02:38:28,052 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h' 2026-04-06T02:38:28,054 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,056 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,058 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,060 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,062 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,064 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,066 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,068 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,070 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,073 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,074 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,076 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,078 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h' 2026-04-06T02:38:28,080 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,082 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,084 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-06T02:38:28,085 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-06T02:38:28,087 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h' 2026-04-06T02:38:28,089 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h' 2026-04-06T02:38:28,091 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h' 2026-04-06T02:38:28,093 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h' 2026-04-06T02:38:28,095 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h' 2026-04-06T02:38:28,097 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h' 2026-04-06T02:38:28,099 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h' 2026-04-06T02:38:28,102 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h' 2026-04-06T02:38:28,105 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h' 2026-04-06T02:38:28,108 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h' 2026-04-06T02:38:28,110 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h' 2026-04-06T02:38:28,113 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h' 2026-04-06T02:38:28,115 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-06T02:38:28,117 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-06T02:38:28,119 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h' 2026-04-06T02:38:28,122 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h' 2026-04-06T02:38:28,124 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h' 2026-04-06T02:38:28,126 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h' 2026-04-06T02:38:28,129 adding 'flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp' 2026-04-06T02:38:28,130 adding 'flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp' 2026-04-06T02:38:28,132 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective.hpp' 2026-04-06T02:38:28,133 adding 'flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp' 2026-04-06T02:38:28,135 adding 'flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp' 2026-04-06T02:38:28,137 adding 'flashinfer/data/cutlass/include/cutlass/detail/layout.hpp' 2026-04-06T02:38:28,139 adding 'flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp' 2026-04-06T02:38:28,140 adding 'flashinfer/data/cutlass/include/cutlass/detail/mma.hpp' 2026-04-06T02:38:28,142 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp' 2026-04-06T02:38:28,143 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp' 2026-04-06T02:38:28,145 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp' 2026-04-06T02:38:28,146 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp' 2026-04-06T02:38:28,151 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp' 2026-04-06T02:38:28,153 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp' 2026-04-06T02:38:28,154 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp' 2026-04-06T02:38:28,156 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp' 2026-04-06T02:38:28,158 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp' 2026-04-06T02:38:28,160 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp' 2026-04-06T02:38:28,162 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp' 2026-04-06T02:38:28,164 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp' 2026-04-06T02:38:28,166 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp' 2026-04-06T02:38:28,168 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp' 2026-04-06T02:38:28,172 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp' 2026-04-06T02:38:28,175 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp' 2026-04-06T02:38:28,180 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp' 2026-04-06T02:38:28,188 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp' 2026-04-06T02:38:28,192 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp' 2026-04-06T02:38:28,197 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp' 2026-04-06T02:38:28,203 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp' 2026-04-06T02:38:28,206 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp' 2026-04-06T02:38:28,208 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp' 2026-04-06T02:38:28,214 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp' 2026-04-06T02:38:28,219 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp' 2026-04-06T02:38:28,221 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp' 2026-04-06T02:38:28,228 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl' 2026-04-06T02:38:28,230 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl' 2026-04-06T02:38:28,233 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl' 2026-04-06T02:38:28,234 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl' 2026-04-06T02:38:28,238 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl' 2026-04-06T02:38:28,239 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl' 2026-04-06T02:38:28,241 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp' 2026-04-06T02:38:28,243 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp' 2026-04-06T02:38:28,246 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp' 2026-04-06T02:38:28,250 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp' 2026-04-06T02:38:28,253 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp' 2026-04-06T02:38:28,256 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp' 2026-04-06T02:38:28,260 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp' 2026-04-06T02:38:28,266 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp' 2026-04-06T02:38:28,270 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp' 2026-04-06T02:38:28,274 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp' 2026-04-06T02:38:28,281 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp' 2026-04-06T02:38:28,285 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp' 2026-04-06T02:38:28,288 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp' 2026-04-06T02:38:28,292 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h' 2026-04-06T02:38:28,293 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h' 2026-04-06T02:38:28,295 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp' 2026-04-06T02:38:28,297 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h' 2026-04-06T02:38:28,299 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h' 2026-04-06T02:38:28,302 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h' 2026-04-06T02:38:28,304 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h' 2026-04-06T02:38:28,306 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h' 2026-04-06T02:38:28,308 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h' 2026-04-06T02:38:28,310 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h' 2026-04-06T02:38:28,312 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h' 2026-04-06T02:38:28,313 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h' 2026-04-06T02:38:28,315 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h' 2026-04-06T02:38:28,317 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h' 2026-04-06T02:38:28,318 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h' 2026-04-06T02:38:28,320 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h' 2026-04-06T02:38:28,322 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h' 2026-04-06T02:38:28,324 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h' 2026-04-06T02:38:28,326 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h' 2026-04-06T02:38:28,328 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h' 2026-04-06T02:38:28,329 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h' 2026-04-06T02:38:28,331 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp' 2026-04-06T02:38:28,333 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h' 2026-04-06T02:38:28,334 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h' 2026-04-06T02:38:28,335 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h' 2026-04-06T02:38:28,338 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h' 2026-04-06T02:38:28,340 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h' 2026-04-06T02:38:28,341 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h' 2026-04-06T02:38:28,343 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h' 2026-04-06T02:38:28,345 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h' 2026-04-06T02:38:28,347 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h' 2026-04-06T02:38:28,349 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h' 2026-04-06T02:38:28,350 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h' 2026-04-06T02:38:28,352 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h' 2026-04-06T02:38:28,353 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h' 2026-04-06T02:38:28,355 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h' 2026-04-06T02:38:28,356 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h' 2026-04-06T02:38:28,358 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h' 2026-04-06T02:38:28,359 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h' 2026-04-06T02:38:28,361 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h' 2026-04-06T02:38:28,362 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h' 2026-04-06T02:38:28,364 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h' 2026-04-06T02:38:28,367 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h' 2026-04-06T02:38:28,369 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h' 2026-04-06T02:38:28,370 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h' 2026-04-06T02:38:28,372 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h' 2026-04-06T02:38:28,374 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h' 2026-04-06T02:38:28,376 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h' 2026-04-06T02:38:28,379 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h' 2026-04-06T02:38:28,380 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h' 2026-04-06T02:38:28,382 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h' 2026-04-06T02:38:28,385 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h' 2026-04-06T02:38:28,388 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h' 2026-04-06T02:38:28,393 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h' 2026-04-06T02:38:28,396 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h' 2026-04-06T02:38:28,398 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h' 2026-04-06T02:38:28,400 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h' 2026-04-06T02:38:28,403 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h' 2026-04-06T02:38:28,404 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h' 2026-04-06T02:38:28,406 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h' 2026-04-06T02:38:28,408 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h' 2026-04-06T02:38:28,411 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h' 2026-04-06T02:38:28,414 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h' 2026-04-06T02:38:28,417 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h' 2026-04-06T02:38:28,419 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h' 2026-04-06T02:38:28,421 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h' 2026-04-06T02:38:28,424 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h' 2026-04-06T02:38:28,426 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h' 2026-04-06T02:38:28,428 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h' 2026-04-06T02:38:28,430 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h' 2026-04-06T02:38:28,432 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h' 2026-04-06T02:38:28,434 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h' 2026-04-06T02:38:28,436 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h' 2026-04-06T02:38:28,437 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h' 2026-04-06T02:38:28,440 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp' 2026-04-06T02:38:28,442 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp' 2026-04-06T02:38:28,445 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp' 2026-04-06T02:38:28,448 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp' 2026-04-06T02:38:28,449 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp' 2026-04-06T02:38:28,452 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h' 2026-04-06T02:38:28,453 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h' 2026-04-06T02:38:28,455 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h' 2026-04-06T02:38:28,457 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h' 2026-04-06T02:38:28,459 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h' 2026-04-06T02:38:28,460 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h' 2026-04-06T02:38:28,462 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h' 2026-04-06T02:38:28,463 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h' 2026-04-06T02:38:28,466 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h' 2026-04-06T02:38:28,468 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h' 2026-04-06T02:38:28,471 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h' 2026-04-06T02:38:28,473 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h' 2026-04-06T02:38:28,474 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h' 2026-04-06T02:38:28,476 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h' 2026-04-06T02:38:28,477 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h' 2026-04-06T02:38:28,480 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp' 2026-04-06T02:38:28,484 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp' 2026-04-06T02:38:28,486 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp' 2026-04-06T02:38:28,488 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp' 2026-04-06T02:38:28,490 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp' 2026-04-06T02:38:28,491 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp' 2026-04-06T02:38:28,494 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp' 2026-04-06T02:38:28,496 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp' 2026-04-06T02:38:28,502 adding 'flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp' 2026-04-06T02:38:28,503 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm.h' 2026-04-06T02:38:28,505 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h' 2026-04-06T02:38:28,506 adding 'flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp' 2026-04-06T02:38:28,509 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp' 2026-04-06T02:38:28,511 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp' 2026-04-06T02:38:28,512 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp' 2026-04-06T02:38:28,514 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp' 2026-04-06T02:38:28,515 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp' 2026-04-06T02:38:28,522 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp' 2026-04-06T02:38:28,530 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp' 2026-04-06T02:38:28,536 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-06T02:38:28,541 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp' 2026-04-06T02:38:28,548 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp' 2026-04-06T02:38:28,553 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp' 2026-04-06T02:38:28,560 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp' 2026-04-06T02:38:28,566 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp' 2026-04-06T02:38:28,573 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-06T02:38:28,578 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-06T02:38:28,583 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp' 2026-04-06T02:38:28,588 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp' 2026-04-06T02:38:28,592 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp' 2026-04-06T02:38:28,595 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-06T02:38:28,599 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp' 2026-04-06T02:38:28,605 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp' 2026-04-06T02:38:28,611 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp' 2026-04-06T02:38:28,617 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-06T02:38:28,622 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-06T02:38:28,628 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp' 2026-04-06T02:38:28,632 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp' 2026-04-06T02:38:28,637 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp' 2026-04-06T02:38:28,646 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp' 2026-04-06T02:38:28,653 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp' 2026-04-06T02:38:28,658 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp' 2026-04-06T02:38:28,663 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp' 2026-04-06T02:38:28,670 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp' 2026-04-06T02:38:28,675 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp' 2026-04-06T02:38:28,678 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp' 2026-04-06T02:38:28,682 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp' 2026-04-06T02:38:28,687 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp' 2026-04-06T02:38:28,690 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp' 2026-04-06T02:38:28,692 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp' 2026-04-06T02:38:28,695 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp' 2026-04-06T02:38:28,702 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-06T02:38:28,706 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp' 2026-04-06T02:38:28,711 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-06T02:38:28,717 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-06T02:38:28,721 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp' 2026-04-06T02:38:28,724 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp' 2026-04-06T02:38:28,727 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp' 2026-04-06T02:38:28,733 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-06T02:38:28,736 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp' 2026-04-06T02:38:28,742 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-06T02:38:28,745 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-06T02:38:28,750 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-06T02:38:28,754 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-06T02:38:28,758 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-06T02:38:28,761 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl' 2026-04-06T02:38:28,763 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl' 2026-04-06T02:38:28,765 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl' 2026-04-06T02:38:28,768 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl' 2026-04-06T02:38:28,770 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl' 2026-04-06T02:38:28,773 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl' 2026-04-06T02:38:28,776 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl' 2026-04-06T02:38:28,778 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl' 2026-04-06T02:38:28,780 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl' 2026-04-06T02:38:28,782 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl' 2026-04-06T02:38:28,784 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl' 2026-04-06T02:38:28,786 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl' 2026-04-06T02:38:28,787 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl' 2026-04-06T02:38:28,789 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl' 2026-04-06T02:38:28,792 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl' 2026-04-06T02:38:28,794 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl' 2026-04-06T02:38:28,797 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl' 2026-04-06T02:38:28,799 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl' 2026-04-06T02:38:28,802 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl' 2026-04-06T02:38:28,804 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl' 2026-04-06T02:38:28,806 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl' 2026-04-06T02:38:28,808 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl' 2026-04-06T02:38:28,810 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl' 2026-04-06T02:38:28,813 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl' 2026-04-06T02:38:28,816 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl' 2026-04-06T02:38:28,818 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl' 2026-04-06T02:38:28,822 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl' 2026-04-06T02:38:28,824 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl' 2026-04-06T02:38:28,826 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl' 2026-04-06T02:38:28,829 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h' 2026-04-06T02:38:28,831 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h' 2026-04-06T02:38:28,834 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h' 2026-04-06T02:38:28,837 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h' 2026-04-06T02:38:28,840 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h' 2026-04-06T02:38:28,842 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h' 2026-04-06T02:38:28,845 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h' 2026-04-06T02:38:28,848 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h' 2026-04-06T02:38:28,849 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h' 2026-04-06T02:38:28,851 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h' 2026-04-06T02:38:28,854 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h' 2026-04-06T02:38:28,855 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h' 2026-04-06T02:38:28,857 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h' 2026-04-06T02:38:28,859 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h' 2026-04-06T02:38:28,861 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h' 2026-04-06T02:38:28,863 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h' 2026-04-06T02:38:28,865 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h' 2026-04-06T02:38:28,869 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h' 2026-04-06T02:38:28,871 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h' 2026-04-06T02:38:28,873 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h' 2026-04-06T02:38:28,875 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h' 2026-04-06T02:38:28,878 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h' 2026-04-06T02:38:28,880 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h' 2026-04-06T02:38:28,882 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h' 2026-04-06T02:38:28,883 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h' 2026-04-06T02:38:28,885 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h' 2026-04-06T02:38:28,887 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h' 2026-04-06T02:38:28,889 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h' 2026-04-06T02:38:28,892 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h' 2026-04-06T02:38:28,895 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h' 2026-04-06T02:38:28,900 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h' 2026-04-06T02:38:28,903 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h' 2026-04-06T02:38:28,905 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h' 2026-04-06T02:38:28,906 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h' 2026-04-06T02:38:28,908 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h' 2026-04-06T02:38:28,910 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h' 2026-04-06T02:38:28,911 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h' 2026-04-06T02:38:28,913 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h' 2026-04-06T02:38:28,915 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h' 2026-04-06T02:38:28,916 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h' 2026-04-06T02:38:28,918 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h' 2026-04-06T02:38:28,919 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h' 2026-04-06T02:38:28,921 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h' 2026-04-06T02:38:28,922 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h' 2026-04-06T02:38:28,924 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h' 2026-04-06T02:38:28,926 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h' 2026-04-06T02:38:28,927 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h' 2026-04-06T02:38:28,929 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h' 2026-04-06T02:38:28,930 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h' 2026-04-06T02:38:28,932 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h' 2026-04-06T02:38:28,933 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h' 2026-04-06T02:38:28,935 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h' 2026-04-06T02:38:28,937 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h' 2026-04-06T02:38:28,939 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h' 2026-04-06T02:38:28,941 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h' 2026-04-06T02:38:28,943 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h' 2026-04-06T02:38:28,945 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h' 2026-04-06T02:38:28,946 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h' 2026-04-06T02:38:28,948 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h' 2026-04-06T02:38:28,950 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h' 2026-04-06T02:38:28,952 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h' 2026-04-06T02:38:28,954 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h' 2026-04-06T02:38:28,955 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h' 2026-04-06T02:38:28,957 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h' 2026-04-06T02:38:28,959 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h' 2026-04-06T02:38:28,962 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h' 2026-04-06T02:38:28,964 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h' 2026-04-06T02:38:28,965 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h' 2026-04-06T02:38:28,967 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h' 2026-04-06T02:38:28,969 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h' 2026-04-06T02:38:28,971 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h' 2026-04-06T02:38:28,973 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h' 2026-04-06T02:38:28,974 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h' 2026-04-06T02:38:28,976 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h' 2026-04-06T02:38:28,979 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h' 2026-04-06T02:38:28,981 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h' 2026-04-06T02:38:28,982 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h' 2026-04-06T02:38:28,985 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h' 2026-04-06T02:38:28,988 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h' 2026-04-06T02:38:28,991 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h' 2026-04-06T02:38:28,993 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h' 2026-04-06T02:38:28,995 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h' 2026-04-06T02:38:29,004 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h' 2026-04-06T02:38:29,006 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h' 2026-04-06T02:38:29,008 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h' 2026-04-06T02:38:29,010 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp' 2026-04-06T02:38:29,012 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h' 2026-04-06T02:38:29,013 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h' 2026-04-06T02:38:29,018 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h' 2026-04-06T02:38:29,020 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h' 2026-04-06T02:38:29,023 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h' 2026-04-06T02:38:29,026 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h' 2026-04-06T02:38:29,030 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h' 2026-04-06T02:38:29,033 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h' 2026-04-06T02:38:29,035 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h' 2026-04-06T02:38:29,037 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h' 2026-04-06T02:38:29,041 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h' 2026-04-06T02:38:29,043 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h' 2026-04-06T02:38:29,045 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h' 2026-04-06T02:38:29,047 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h' 2026-04-06T02:38:29,049 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h' 2026-04-06T02:38:29,052 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h' 2026-04-06T02:38:29,054 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h' 2026-04-06T02:38:29,056 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h' 2026-04-06T02:38:29,059 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h' 2026-04-06T02:38:29,066 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp' 2026-04-06T02:38:29,072 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp' 2026-04-06T02:38:29,078 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp' 2026-04-06T02:38:29,082 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp' 2026-04-06T02:38:29,086 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-06T02:38:29,091 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp' 2026-04-06T02:38:29,096 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp' 2026-04-06T02:38:29,102 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp' 2026-04-06T02:38:29,107 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp' 2026-04-06T02:38:29,111 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp' 2026-04-06T02:38:29,113 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp' 2026-04-06T02:38:29,117 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp' 2026-04-06T02:38:29,119 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp' 2026-04-06T02:38:29,123 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp' 2026-04-06T02:38:29,129 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp' 2026-04-06T02:38:29,135 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp' 2026-04-06T02:38:29,139 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp' 2026-04-06T02:38:29,142 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp' 2026-04-06T02:38:29,144 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp' 2026-04-06T02:38:29,149 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp' 2026-04-06T02:38:29,154 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp' 2026-04-06T02:38:29,157 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp' 2026-04-06T02:38:29,159 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp' 2026-04-06T02:38:29,164 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp' 2026-04-06T02:38:29,169 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp' 2026-04-06T02:38:29,171 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp' 2026-04-06T02:38:29,174 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp' 2026-04-06T02:38:29,177 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp' 2026-04-06T02:38:29,179 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp' 2026-04-06T02:38:29,182 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp' 2026-04-06T02:38:29,188 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp' 2026-04-06T02:38:29,190 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h' 2026-04-06T02:38:29,192 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h' 2026-04-06T02:38:29,194 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h' 2026-04-06T02:38:29,197 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp' 2026-04-06T02:38:29,199 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h' 2026-04-06T02:38:29,201 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp' 2026-04-06T02:38:29,202 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp' 2026-04-06T02:38:29,211 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h' 2026-04-06T02:38:29,214 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h' 2026-04-06T02:38:29,216 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h' 2026-04-06T02:38:29,218 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h' 2026-04-06T02:38:29,221 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h' 2026-04-06T02:38:29,222 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h' 2026-04-06T02:38:29,226 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h' 2026-04-06T02:38:29,227 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h' 2026-04-06T02:38:29,230 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h' 2026-04-06T02:38:29,232 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h' 2026-04-06T02:38:29,234 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h' 2026-04-06T02:38:29,237 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h' 2026-04-06T02:38:29,240 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h' 2026-04-06T02:38:29,245 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h' 2026-04-06T02:38:29,248 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h' 2026-04-06T02:38:29,250 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h' 2026-04-06T02:38:29,252 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h' 2026-04-06T02:38:29,254 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h' 2026-04-06T02:38:29,255 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h' 2026-04-06T02:38:29,257 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h' 2026-04-06T02:38:29,259 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h' 2026-04-06T02:38:29,260 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h' 2026-04-06T02:38:29,262 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h' 2026-04-06T02:38:29,263 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h' 2026-04-06T02:38:29,265 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h' 2026-04-06T02:38:29,267 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h' 2026-04-06T02:38:29,270 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h' 2026-04-06T02:38:29,272 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h' 2026-04-06T02:38:29,274 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h' 2026-04-06T02:38:29,276 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h' 2026-04-06T02:38:29,279 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h' 2026-04-06T02:38:29,281 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h' 2026-04-06T02:38:29,282 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h' 2026-04-06T02:38:29,284 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h' 2026-04-06T02:38:29,285 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h' 2026-04-06T02:38:29,288 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h' 2026-04-06T02:38:29,291 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h' 2026-04-06T02:38:29,295 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h' 2026-04-06T02:38:29,297 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h' 2026-04-06T02:38:29,299 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h' 2026-04-06T02:38:29,301 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h' 2026-04-06T02:38:29,304 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h' 2026-04-06T02:38:29,306 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h' 2026-04-06T02:38:29,308 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h' 2026-04-06T02:38:29,311 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h' 2026-04-06T02:38:29,313 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h' 2026-04-06T02:38:29,315 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h' 2026-04-06T02:38:29,318 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h' 2026-04-06T02:38:29,320 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h' 2026-04-06T02:38:29,323 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h' 2026-04-06T02:38:29,326 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h' 2026-04-06T02:38:29,328 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h' 2026-04-06T02:38:29,330 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h' 2026-04-06T02:38:29,331 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h' 2026-04-06T02:38:29,333 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h' 2026-04-06T02:38:29,335 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h' 2026-04-06T02:38:29,336 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h' 2026-04-06T02:38:29,338 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h' 2026-04-06T02:38:29,341 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h' 2026-04-06T02:38:29,344 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h' 2026-04-06T02:38:29,349 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h' 2026-04-06T02:38:29,352 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h' 2026-04-06T02:38:29,354 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h' 2026-04-06T02:38:29,356 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h' 2026-04-06T02:38:29,358 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h' 2026-04-06T02:38:29,360 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h' 2026-04-06T02:38:29,361 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h' 2026-04-06T02:38:29,364 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h' 2026-04-06T02:38:29,367 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h' 2026-04-06T02:38:29,369 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h' 2026-04-06T02:38:29,371 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h' 2026-04-06T02:38:29,373 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h' 2026-04-06T02:38:29,375 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h' 2026-04-06T02:38:29,376 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h' 2026-04-06T02:38:29,378 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h' 2026-04-06T02:38:29,387 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h' 2026-04-06T02:38:29,394 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h' 2026-04-06T02:38:29,399 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h' 2026-04-06T02:38:29,402 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h' 2026-04-06T02:38:29,404 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h' 2026-04-06T02:38:29,406 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h' 2026-04-06T02:38:29,408 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h' 2026-04-06T02:38:29,411 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h' 2026-04-06T02:38:29,412 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h' 2026-04-06T02:38:29,414 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h' 2026-04-06T02:38:29,416 adding 'flashinfer/data/cutlass/include/cutlass/layout/layout.h' 2026-04-06T02:38:29,419 adding 'flashinfer/data/cutlass/include/cutlass/layout/matrix.h' 2026-04-06T02:38:29,421 adding 'flashinfer/data/cutlass/include/cutlass/layout/permute.h' 2026-04-06T02:38:29,423 adding 'flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h' 2026-04-06T02:38:29,425 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor.h' 2026-04-06T02:38:29,427 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h' 2026-04-06T02:38:29,430 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h' 2026-04-06T02:38:29,432 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h' 2026-04-06T02:38:29,434 adding 'flashinfer/data/cutlass/include/cutlass/layout/vector.h' 2026-04-06T02:38:29,436 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp' 2026-04-06T02:38:29,440 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp' 2026-04-06T02:38:29,444 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp' 2026-04-06T02:38:29,448 adding 'flashinfer/data/cutlass/include/cutlass/platform/platform.h' 2026-04-06T02:38:29,450 adding 'flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h' 2026-04-06T02:38:29,452 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h' 2026-04-06T02:38:29,454 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h' 2026-04-06T02:38:29,456 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h' 2026-04-06T02:38:29,458 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h' 2026-04-06T02:38:29,460 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h' 2026-04-06T02:38:29,462 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h' 2026-04-06T02:38:29,464 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h' 2026-04-06T02:38:29,467 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h' 2026-04-06T02:38:29,469 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h' 2026-04-06T02:38:29,470 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h' 2026-04-06T02:38:29,472 adding 'flashinfer/data/cutlass/include/cutlass/thread/matrix.h' 2026-04-06T02:38:29,476 adding 'flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h' 2026-04-06T02:38:29,479 adding 'flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp' 2026-04-06T02:38:29,482 adding 'flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp' 2026-04-06T02:38:29,484 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp' 2026-04-06T02:38:29,487 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp' 2026-04-06T02:38:29,489 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp' 2026-04-06T02:38:29,491 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h' 2026-04-06T02:38:29,493 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h' 2026-04-06T02:38:29,495 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h' 2026-04-06T02:38:29,499 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h' 2026-04-06T02:38:29,502 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h' 2026-04-06T02:38:29,504 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-06T02:38:29,506 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-06T02:38:29,511 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h' 2026-04-06T02:38:29,514 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h' 2026-04-06T02:38:29,516 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h' 2026-04-06T02:38:29,519 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h' 2026-04-06T02:38:29,522 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h' 2026-04-06T02:38:29,525 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h' 2026-04-06T02:38:29,528 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h' 2026-04-06T02:38:29,530 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h' 2026-04-06T02:38:29,532 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h' 2026-04-06T02:38:29,533 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h' 2026-04-06T02:38:29,535 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h' 2026-04-06T02:38:29,538 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h' 2026-04-06T02:38:29,540 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h' 2026-04-06T02:38:29,543 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h' 2026-04-06T02:38:29,544 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h' 2026-04-06T02:38:29,546 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h' 2026-04-06T02:38:29,548 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h' 2026-04-06T02:38:29,551 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h' 2026-04-06T02:38:29,554 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h' 2026-04-06T02:38:29,555 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h' 2026-04-06T02:38:29,558 adding 'flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h' 2026-04-06T02:38:29,560 adding 'flashinfer/data/cutlass/python/setup_cutlass.py' 2026-04-06T02:38:29,561 adding 'flashinfer/data/cutlass/python/setup_library.py' 2026-04-06T02:38:29,562 adding 'flashinfer/data/cutlass/python/setup_pycute.py' 2026-04-06T02:38:29,565 adding 'flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py' 2026-04-06T02:38:29,567 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py' 2026-04-06T02:38:29,568 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py' 2026-04-06T02:38:29,570 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py' 2026-04-06T02:38:29,572 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py' 2026-04-06T02:38:29,574 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py' 2026-04-06T02:38:29,577 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py' 2026-04-06T02:38:29,588 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py' 2026-04-06T02:38:29,591 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py' 2026-04-06T02:38:29,593 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py' 2026-04-06T02:38:29,596 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py' 2026-04-06T02:38:29,605 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py' 2026-04-06T02:38:29,607 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py' 2026-04-06T02:38:29,613 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py' 2026-04-06T02:38:29,621 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py' 2026-04-06T02:38:29,622 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py' 2026-04-06T02:38:29,624 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py' 2026-04-06T02:38:29,627 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py' 2026-04-06T02:38:29,629 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py' 2026-04-06T02:38:29,630 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py' 2026-04-06T02:38:29,631 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py' 2026-04-06T02:38:29,633 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py' 2026-04-06T02:38:29,635 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py' 2026-04-06T02:38:29,638 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py' 2026-04-06T02:38:29,639 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py' 2026-04-06T02:38:29,641 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py' 2026-04-06T02:38:29,644 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py' 2026-04-06T02:38:29,646 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py' 2026-04-06T02:38:29,647 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py' 2026-04-06T02:38:29,649 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py' 2026-04-06T02:38:29,650 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py' 2026-04-06T02:38:29,652 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py' 2026-04-06T02:38:29,654 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py' 2026-04-06T02:38:29,656 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py' 2026-04-06T02:38:29,659 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py' 2026-04-06T02:38:29,661 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py' 2026-04-06T02:38:29,669 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py' 2026-04-06T02:38:29,671 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py' 2026-04-06T02:38:29,672 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py' 2026-04-06T02:38:29,674 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py' 2026-04-06T02:38:29,677 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py' 2026-04-06T02:38:29,678 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py' 2026-04-06T02:38:29,681 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py' 2026-04-06T02:38:29,684 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py' 2026-04-06T02:38:29,687 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py' 2026-04-06T02:38:29,690 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py' 2026-04-06T02:38:29,695 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py' 2026-04-06T02:38:29,715 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py' 2026-04-06T02:38:29,718 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py' 2026-04-06T02:38:29,719 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py' 2026-04-06T02:38:29,724 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py' 2026-04-06T02:38:29,734 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py' 2026-04-06T02:38:29,740 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py' 2026-04-06T02:38:29,743 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py' 2026-04-06T02:38:29,745 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py' 2026-04-06T02:38:29,747 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py' 2026-04-06T02:38:29,749 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py' 2026-04-06T02:38:29,750 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py' 2026-04-06T02:38:29,752 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py' 2026-04-06T02:38:29,754 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py' 2026-04-06T02:38:29,761 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py' 2026-04-06T02:38:29,764 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py' 2026-04-06T02:38:29,765 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py' 2026-04-06T02:38:29,767 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py' 2026-04-06T02:38:29,768 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py' 2026-04-06T02:38:29,770 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py' 2026-04-06T02:38:29,771 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py' 2026-04-06T02:38:29,773 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py' 2026-04-06T02:38:29,775 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py' 2026-04-06T02:38:29,777 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py' 2026-04-06T02:38:29,779 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py' 2026-04-06T02:38:29,780 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py' 2026-04-06T02:38:29,782 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py' 2026-04-06T02:38:29,784 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py' 2026-04-06T02:38:29,785 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py' 2026-04-06T02:38:29,787 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py' 2026-04-06T02:38:29,788 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py' 2026-04-06T02:38:29,790 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py' 2026-04-06T02:38:29,792 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py' 2026-04-06T02:38:29,795 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py' 2026-04-06T02:38:29,798 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py' 2026-04-06T02:38:29,800 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py' 2026-04-06T02:38:29,802 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py' 2026-04-06T02:38:29,804 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py' 2026-04-06T02:38:29,807 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py' 2026-04-06T02:38:29,809 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py' 2026-04-06T02:38:29,811 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py' 2026-04-06T02:38:29,813 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py' 2026-04-06T02:38:29,815 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py' 2026-04-06T02:38:29,816 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py' 2026-04-06T02:38:29,818 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py' 2026-04-06T02:38:29,820 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py' 2026-04-06T02:38:29,822 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py' 2026-04-06T02:38:29,824 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py' 2026-04-06T02:38:29,833 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py' 2026-04-06T02:38:29,836 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py' 2026-04-06T02:38:29,839 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py' 2026-04-06T02:38:29,842 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py' 2026-04-06T02:38:29,843 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py' 2026-04-06T02:38:29,845 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py' 2026-04-06T02:38:29,847 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py' 2026-04-06T02:38:29,848 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py' 2026-04-06T02:38:29,851 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py' 2026-04-06T02:38:29,853 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py' 2026-04-06T02:38:29,855 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py' 2026-04-06T02:38:29,859 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py' 2026-04-06T02:38:29,863 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py' 2026-04-06T02:38:29,866 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py' 2026-04-06T02:38:29,870 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py' 2026-04-06T02:38:29,872 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py' 2026-04-06T02:38:29,874 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py' 2026-04-06T02:38:29,875 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py' 2026-04-06T02:38:29,879 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py' 2026-04-06T02:38:29,882 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py' 2026-04-06T02:38:29,884 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py' 2026-04-06T02:38:29,886 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py' 2026-04-06T02:38:29,887 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py' 2026-04-06T02:38:29,891 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py' 2026-04-06T02:38:29,893 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py' 2026-04-06T02:38:29,895 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py' 2026-04-06T02:38:29,899 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py' 2026-04-06T02:38:29,900 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py' 2026-04-06T02:38:29,902 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py' 2026-04-06T02:38:29,904 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py' 2026-04-06T02:38:29,906 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py' 2026-04-06T02:38:29,909 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py' 2026-04-06T02:38:29,912 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py' 2026-04-06T02:38:29,915 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py' 2026-04-06T02:38:29,917 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/shape.py' 2026-04-06T02:38:29,918 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py' 2026-04-06T02:38:29,920 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py' 2026-04-06T02:38:29,922 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py' 2026-04-06T02:38:29,925 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py' 2026-04-06T02:38:29,927 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py' 2026-04-06T02:38:29,931 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py' 2026-04-06T02:38:29,933 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py' 2026-04-06T02:38:29,934 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py' 2026-04-06T02:38:29,942 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py' 2026-04-06T02:38:29,945 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py' 2026-04-06T02:38:29,947 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py' 2026-04-06T02:38:29,949 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py' 2026-04-06T02:38:29,951 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py' 2026-04-06T02:38:29,953 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py' 2026-04-06T02:38:29,955 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py' 2026-04-06T02:38:29,956 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py' 2026-04-06T02:38:29,958 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py' 2026-04-06T02:38:29,960 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py' 2026-04-06T02:38:29,961 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py' 2026-04-06T02:38:29,963 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py' 2026-04-06T02:38:29,964 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py' 2026-04-06T02:38:29,966 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py' 2026-04-06T02:38:29,967 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py' 2026-04-06T02:38:29,969 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py' 2026-04-06T02:38:29,971 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py' 2026-04-06T02:38:29,973 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py' 2026-04-06T02:38:29,975 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py' 2026-04-06T02:38:29,977 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py' 2026-04-06T02:38:29,979 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py' 2026-04-06T02:38:29,980 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py' 2026-04-06T02:38:29,982 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py' 2026-04-06T02:38:29,984 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py' 2026-04-06T02:38:29,986 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py' 2026-04-06T02:38:29,988 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py' 2026-04-06T02:38:29,990 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py' 2026-04-06T02:38:29,991 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py' 2026-04-06T02:38:29,994 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py' 2026-04-06T02:38:29,995 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py' 2026-04-06T02:38:29,997 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py' 2026-04-06T02:38:29,998 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py' 2026-04-06T02:38:30,000 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py' 2026-04-06T02:38:30,001 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py' 2026-04-06T02:38:30,003 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py' 2026-04-06T02:38:30,004 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py' 2026-04-06T02:38:30,006 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py' 2026-04-06T02:38:30,007 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py' 2026-04-06T02:38:30,008 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py' 2026-04-06T02:38:30,010 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py' 2026-04-06T02:38:30,012 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py' 2026-04-06T02:38:30,013 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py' 2026-04-06T02:38:30,015 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py' 2026-04-06T02:38:30,017 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py' 2026-04-06T02:38:30,018 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py' 2026-04-06T02:38:30,022 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py' 2026-04-06T02:38:30,024 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py' 2026-04-06T02:38:30,025 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py' 2026-04-06T02:38:30,027 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py' 2026-04-06T02:38:30,029 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py' 2026-04-06T02:38:30,033 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py' 2026-04-06T02:38:30,037 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py' 2026-04-06T02:38:30,040 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py' 2026-04-06T02:38:30,042 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py' 2026-04-06T02:38:30,044 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py' 2026-04-06T02:38:30,046 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py' 2026-04-06T02:38:30,048 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py' 2026-04-06T02:38:30,049 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py' 2026-04-06T02:38:30,051 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py' 2026-04-06T02:38:30,053 adding 'flashinfer/data/cutlass/python/cutlass_library/__init__.py' 2026-04-06T02:38:30,056 adding 'flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py' 2026-04-06T02:38:30,058 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py' 2026-04-06T02:38:30,060 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py' 2026-04-06T02:38:30,065 adding 'flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py' 2026-04-06T02:38:30,071 adding 'flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py' 2026-04-06T02:38:30,102 adding 'flashinfer/data/cutlass/python/cutlass_library/generator.py' 2026-04-06T02:38:30,108 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics.py' 2026-04-06T02:38:30,110 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py' 2026-04-06T02:38:30,115 adding 'flashinfer/data/cutlass/python/cutlass_library/library.py' 2026-04-06T02:38:30,120 adding 'flashinfer/data/cutlass/python/cutlass_library/manifest.py' 2026-04-06T02:38:30,122 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py' 2026-04-06T02:38:30,124 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py' 2026-04-06T02:38:30,126 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py' 2026-04-06T02:38:30,128 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py' 2026-04-06T02:38:30,130 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py' 2026-04-06T02:38:30,133 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py' 2026-04-06T02:38:30,135 adding 'flashinfer/data/cutlass/python/cutlass_library/symm_operation.py' 2026-04-06T02:38:30,138 adding 'flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py' 2026-04-06T02:38:30,140 adding 'flashinfer/data/cutlass/python/docs_src/source/conf.py' 2026-04-06T02:38:30,142 adding 'flashinfer/data/cutlass/python/pycute/__init__.py' 2026-04-06T02:38:30,144 adding 'flashinfer/data/cutlass/python/pycute/int_tuple.py' 2026-04-06T02:38:30,146 adding 'flashinfer/data/cutlass/python/pycute/layout.py' 2026-04-06T02:38:30,147 adding 'flashinfer/data/cutlass/python/pycute/swizzle.py' 2026-04-06T02:38:30,149 adding 'flashinfer/data/cutlass/python/pycute/typing.py' 2026-04-06T02:38:30,151 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py' 2026-04-06T02:38:30,153 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py' 2026-04-06T02:38:30,155 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-06T02:38:30,156 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py' 2026-04-06T02:38:30,158 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py' 2026-04-06T02:38:30,159 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py' 2026-04-06T02:38:30,161 adding 'flashinfer/data/cutlass/test/python/cutlass/installation.py' 2026-04-06T02:38:30,164 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py' 2026-04-06T02:38:30,165 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py' 2026-04-06T02:38:30,167 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py' 2026-04-06T02:38:30,169 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py' 2026-04-06T02:38:30,171 adding 'flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py' 2026-04-06T02:38:30,173 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py' 2026-04-06T02:38:30,175 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py' 2026-04-06T02:38:30,176 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py' 2026-04-06T02:38:30,178 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py' 2026-04-06T02:38:30,180 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py' 2026-04-06T02:38:30,181 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py' 2026-04-06T02:38:30,183 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py' 2026-04-06T02:38:30,185 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py' 2026-04-06T02:38:30,187 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py' 2026-04-06T02:38:30,188 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py' 2026-04-06T02:38:30,190 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py' 2026-04-06T02:38:30,191 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py' 2026-04-06T02:38:30,193 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py' 2026-04-06T02:38:30,194 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py' 2026-04-06T02:38:30,196 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py' 2026-04-06T02:38:30,197 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py' 2026-04-06T02:38:30,199 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py' 2026-04-06T02:38:30,201 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py' 2026-04-06T02:38:30,202 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py' 2026-04-06T02:38:30,204 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py' 2026-04-06T02:38:30,207 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py' 2026-04-06T02:38:30,209 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py' 2026-04-06T02:38:30,211 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py' 2026-04-06T02:38:30,212 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/utils.py' 2026-04-06T02:38:30,214 adding 'flashinfer/data/cutlass/test/python/pycute/run_all_tests.py' 2026-04-06T02:38:30,216 adding 'flashinfer/data/cutlass/test/python/pycute/test_coalesce.py' 2026-04-06T02:38:30,217 adding 'flashinfer/data/cutlass/test/python/pycute/test_complement.py' 2026-04-06T02:38:30,218 adding 'flashinfer/data/cutlass/test/python/pycute/test_composition.py' 2026-04-06T02:38:30,219 adding 'flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py' 2026-04-06T02:38:30,221 adding 'flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py' 2026-04-06T02:38:30,222 adding 'flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py' 2026-04-06T02:38:30,223 adding 'flashinfer/data/cutlass/test/python/pycute/test_typing.py' 2026-04-06T02:38:30,227 adding 'flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py' 2026-04-06T02:38:30,230 adding 'flashinfer/data/cutlass/test/utils/test_sharding.py' 2026-04-06T02:38:30,233 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp' 2026-04-06T02:38:30,235 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h' 2026-04-06T02:38:30,237 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp' 2026-04-06T02:38:30,239 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h' 2026-04-06T02:38:30,240 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h' 2026-04-06T02:38:30,242 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h' 2026-04-06T02:38:30,244 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h' 2026-04-06T02:38:30,246 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h' 2026-04-06T02:38:30,248 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h' 2026-04-06T02:38:30,249 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h' 2026-04-06T02:38:30,251 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h' 2026-04-06T02:38:30,253 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h' 2026-04-06T02:38:30,254 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h' 2026-04-06T02:38:30,256 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h' 2026-04-06T02:38:30,257 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h' 2026-04-06T02:38:30,259 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h' 2026-04-06T02:38:30,261 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp' 2026-04-06T02:38:30,262 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp' 2026-04-06T02:38:30,264 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h' 2026-04-06T02:38:30,266 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h' 2026-04-06T02:38:30,269 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h' 2026-04-06T02:38:30,270 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h' 2026-04-06T02:38:30,271 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h' 2026-04-06T02:38:30,274 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp' 2026-04-06T02:38:30,277 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp' 2026-04-06T02:38:30,279 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp' 2026-04-06T02:38:30,281 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h' 2026-04-06T02:38:30,282 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h' 2026-04-06T02:38:30,284 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h' 2026-04-06T02:38:30,286 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h' 2026-04-06T02:38:30,290 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h' 2026-04-06T02:38:30,292 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h' 2026-04-06T02:38:30,293 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h' 2026-04-06T02:38:30,295 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h' 2026-04-06T02:38:30,297 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp' 2026-04-06T02:38:30,298 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h' 2026-04-06T02:38:30,300 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h' 2026-04-06T02:38:30,304 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h' 2026-04-06T02:38:30,306 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h' 2026-04-06T02:38:30,308 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h' 2026-04-06T02:38:30,309 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h' 2026-04-06T02:38:30,311 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h' 2026-04-06T02:38:30,313 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h' 2026-04-06T02:38:30,314 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h' 2026-04-06T02:38:30,316 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h' 2026-04-06T02:38:30,320 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp' 2026-04-06T02:38:30,323 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h' 2026-04-06T02:38:30,324 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h' 2026-04-06T02:38:30,326 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h' 2026-04-06T02:38:30,328 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h' 2026-04-06T02:38:30,329 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h' 2026-04-06T02:38:30,333 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp' 2026-04-06T02:38:30,335 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h' 2026-04-06T02:38:30,337 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h' 2026-04-06T02:38:30,339 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h' 2026-04-06T02:38:30,340 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h' 2026-04-06T02:38:30,342 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h' 2026-04-06T02:38:30,344 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h' 2026-04-06T02:38:30,346 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp' 2026-04-06T02:38:30,347 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h' 2026-04-06T02:38:30,349 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h' 2026-04-06T02:38:30,353 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h' 2026-04-06T02:38:30,355 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp' 2026-04-06T02:38:30,356 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h' 2026-04-06T02:38:30,358 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h' 2026-04-06T02:38:30,359 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h' 2026-04-06T02:38:30,361 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp' 2026-04-06T02:38:30,362 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h' 2026-04-06T02:38:30,364 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h' 2026-04-06T02:38:30,367 adding 'flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py' 2026-04-06T02:38:30,369 adding 'flashinfer/data/include/flashinfer/activation.cuh' 2026-04-06T02:38:30,372 adding 'flashinfer/data/include/flashinfer/air_top_p.cuh' 2026-04-06T02:38:30,373 adding 'flashinfer/data/include/flashinfer/allocator.h' 2026-04-06T02:38:30,375 adding 'flashinfer/data/include/flashinfer/arch_condition.h' 2026-04-06T02:38:30,376 adding 'flashinfer/data/include/flashinfer/attention_impl.cuh' 2026-04-06T02:38:30,377 adding 'flashinfer/data/include/flashinfer/concat_mla.cuh' 2026-04-06T02:38:30,379 adding 'flashinfer/data/include/flashinfer/cp_async.cuh' 2026-04-06T02:38:30,380 adding 'flashinfer/data/include/flashinfer/cubin_loader.h' 2026-04-06T02:38:30,382 adding 'flashinfer/data/include/flashinfer/cutlass_utils.cuh' 2026-04-06T02:38:30,383 adding 'flashinfer/data/include/flashinfer/exception.h' 2026-04-06T02:38:30,385 adding 'flashinfer/data/include/flashinfer/fastdiv.cuh' 2026-04-06T02:38:30,386 adding 'flashinfer/data/include/flashinfer/fp16.h' 2026-04-06T02:38:30,388 adding 'flashinfer/data/include/flashinfer/fp4_layout.cuh' 2026-04-06T02:38:30,389 adding 'flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh' 2026-04-06T02:38:30,390 adding 'flashinfer/data/include/flashinfer/layout.cuh' 2026-04-06T02:38:30,392 adding 'flashinfer/data/include/flashinfer/logging.h' 2026-04-06T02:38:30,393 adding 'flashinfer/data/include/flashinfer/math.cuh' 2026-04-06T02:38:30,395 adding 'flashinfer/data/include/flashinfer/mma.cuh' 2026-04-06T02:38:30,399 adding 'flashinfer/data/include/flashinfer/norm.cuh' 2026-04-06T02:38:30,401 adding 'flashinfer/data/include/flashinfer/page.cuh' 2026-04-06T02:38:30,403 adding 'flashinfer/data/include/flashinfer/permuted_smem.cuh' 2026-04-06T02:38:30,409 adding 'flashinfer/data/include/flashinfer/pos_enc.cuh' 2026-04-06T02:38:30,411 adding 'flashinfer/data/include/flashinfer/profiler.cuh' 2026-04-06T02:38:30,412 adding 'flashinfer/data/include/flashinfer/quantization.cuh' 2026-04-06T02:38:30,418 adding 'flashinfer/data/include/flashinfer/sampling.cuh' 2026-04-06T02:38:30,428 adding 'flashinfer/data/include/flashinfer/topk.cuh' 2026-04-06T02:38:30,431 adding 'flashinfer/data/include/flashinfer/utils.cuh' 2026-04-06T02:38:30,436 adding 'flashinfer/data/include/flashinfer/vec_dtypes.cuh' 2026-04-06T02:38:30,439 adding 'flashinfer/data/include/flashinfer/attention/batch_pod.cuh' 2026-04-06T02:38:30,443 adding 'flashinfer/data/include/flashinfer/attention/cascade.cuh' 2026-04-06T02:38:30,444 adding 'flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh' 2026-04-06T02:38:30,449 adding 'flashinfer/data/include/flashinfer/attention/decode.cuh' 2026-04-06T02:38:30,452 adding 'flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh' 2026-04-06T02:38:30,454 adding 'flashinfer/data/include/flashinfer/attention/default_decode_params.cuh' 2026-04-06T02:38:30,456 adding 'flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh' 2026-04-06T02:38:30,457 adding 'flashinfer/data/include/flashinfer/attention/heap.h' 2026-04-06T02:38:30,459 adding 'flashinfer/data/include/flashinfer/attention/hopper.cuh' 2026-04-06T02:38:30,460 adding 'flashinfer/data/include/flashinfer/attention/mask.cuh' 2026-04-06T02:38:30,464 adding 'flashinfer/data/include/flashinfer/attention/mla.cuh' 2026-04-06T02:38:30,468 adding 'flashinfer/data/include/flashinfer/attention/mla_hopper.cuh' 2026-04-06T02:38:30,470 adding 'flashinfer/data/include/flashinfer/attention/mla_params.cuh' 2026-04-06T02:38:30,473 adding 'flashinfer/data/include/flashinfer/attention/persistent.cuh' 2026-04-06T02:38:30,475 adding 'flashinfer/data/include/flashinfer/attention/persistent_template.cuh' 2026-04-06T02:38:30,477 adding 'flashinfer/data/include/flashinfer/attention/pod.cuh' 2026-04-06T02:38:30,486 adding 'flashinfer/data/include/flashinfer/attention/prefill.cuh' 2026-04-06T02:38:30,494 adding 'flashinfer/data/include/flashinfer/attention/scheduler.cuh' 2026-04-06T02:38:30,496 adding 'flashinfer/data/include/flashinfer/attention/state.cuh' 2026-04-06T02:38:30,497 adding 'flashinfer/data/include/flashinfer/attention/variant_helper.cuh' 2026-04-06T02:38:30,499 adding 'flashinfer/data/include/flashinfer/attention/variants.cuh' 2026-04-06T02:38:30,501 adding 'flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh' 2026-04-06T02:38:30,503 adding 'flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh' 2026-04-06T02:38:30,505 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp' 2026-04-06T02:38:30,506 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp' 2026-04-06T02:38:30,508 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp' 2026-04-06T02:38:30,512 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp' 2026-04-06T02:38:30,514 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp' 2026-04-06T02:38:30,518 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp' 2026-04-06T02:38:30,521 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp' 2026-04-06T02:38:30,522 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp' 2026-04-06T02:38:30,524 adding 'flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp' 2026-04-06T02:38:30,527 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp' 2026-04-06T02:38:30,529 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp' 2026-04-06T02:38:30,531 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp' 2026-04-06T02:38:30,532 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp' 2026-04-06T02:38:30,534 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp' 2026-04-06T02:38:30,536 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp' 2026-04-06T02:38:30,539 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp' 2026-04-06T02:38:30,541 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp' 2026-04-06T02:38:30,548 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp' 2026-04-06T02:38:30,550 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp' 2026-04-06T02:38:30,553 adding 'flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh' 2026-04-06T02:38:30,555 adding 'flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh' 2026-04-06T02:38:30,556 adding 'flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh' 2026-04-06T02:38:30,558 adding 'flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh' 2026-04-06T02:38:30,560 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh' 2026-04-06T02:38:30,562 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh' 2026-04-06T02:38:30,563 adding 'flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh' 2026-04-06T02:38:30,566 adding 'flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh' 2026-04-06T02:38:30,568 adding 'flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh' 2026-04-06T02:38:30,570 adding 'flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh' 2026-04-06T02:38:30,572 adding 'flashinfer/data/include/flashinfer/attention/hopper/utils.cuh' 2026-04-06T02:38:30,573 adding 'flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh' 2026-04-06T02:38:30,575 adding 'flashinfer/data/include/flashinfer/attention/hopper/variants.cuh' 2026-04-06T02:38:30,577 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh' 2026-04-06T02:38:30,579 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh' 2026-04-06T02:38:30,581 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh' 2026-04-06T02:38:30,583 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh' 2026-04-06T02:38:30,586 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh' 2026-04-06T02:38:30,588 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh' 2026-04-06T02:38:30,595 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh' 2026-04-06T02:38:30,601 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh' 2026-04-06T02:38:30,605 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh' 2026-04-06T02:38:30,607 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh' 2026-04-06T02:38:30,612 adding 'flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh' 2026-04-06T02:38:30,617 adding 'flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh' 2026-04-06T02:38:30,620 adding 'flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh' 2026-04-06T02:38:30,622 adding 'flashinfer/data/include/flashinfer/flat/common.hpp' 2026-04-06T02:38:30,624 adding 'flashinfer/data/include/flashinfer/flat/cute_ext.hpp' 2026-04-06T02:38:30,625 adding 'flashinfer/data/include/flashinfer/flat/debug.hpp' 2026-04-06T02:38:30,626 adding 'flashinfer/data/include/flashinfer/flat/math.hpp' 2026-04-06T02:38:30,628 adding 'flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp' 2026-04-06T02:38:30,629 adding 'flashinfer/data/include/flashinfer/flat/type_traits.hpp' 2026-04-06T02:38:30,630 adding 'flashinfer/data/include/flashinfer/flat/unused.hpp' 2026-04-06T02:38:30,634 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp' 2026-04-06T02:38:30,635 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp' 2026-04-06T02:38:30,638 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp' 2026-04-06T02:38:30,640 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp' 2026-04-06T02:38:30,646 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp' 2026-04-06T02:38:30,648 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp' 2026-04-06T02:38:30,649 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp' 2026-04-06T02:38:30,651 adding 'flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp' 2026-04-06T02:38:30,653 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp' 2026-04-06T02:38:30,655 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp' 2026-04-06T02:38:30,656 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp' 2026-04-06T02:38:30,658 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp' 2026-04-06T02:38:30,660 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp' 2026-04-06T02:38:30,661 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh' 2026-04-06T02:38:30,663 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h' 2026-04-06T02:38:30,665 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h' 2026-04-06T02:38:30,666 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h' 2026-04-06T02:38:30,668 adding 'flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh' 2026-04-06T02:38:30,670 adding 'flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h' 2026-04-06T02:38:30,672 adding 'flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh' 2026-04-06T02:38:30,673 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h' 2026-04-06T02:38:30,675 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h' 2026-04-06T02:38:30,677 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h' 2026-04-06T02:38:30,679 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h' 2026-04-06T02:38:30,681 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h' 2026-04-06T02:38:30,684 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h' 2026-04-06T02:38:30,686 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h' 2026-04-06T02:38:30,687 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h' 2026-04-06T02:38:30,689 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h' 2026-04-06T02:38:30,691 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h' 2026-04-06T02:38:30,693 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh' 2026-04-06T02:38:30,694 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh' 2026-04-06T02:38:30,696 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm.cuh' 2026-04-06T02:38:30,698 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh' 2026-04-06T02:38:30,700 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh' 2026-04-06T02:38:30,704 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh' 2026-04-06T02:38:30,706 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh' 2026-04-06T02:38:30,708 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh' 2026-04-06T02:38:30,709 adding 'flashinfer/data/include/flashinfer/gemm/group_gemv.cuh' 2026-04-06T02:38:30,711 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h' 2026-04-06T02:38:30,713 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h' 2026-04-06T02:38:30,716 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h' 2026-04-06T02:38:30,725 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh' 2026-04-06T02:38:30,727 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h' 2026-04-06T02:38:30,728 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h' 2026-04-06T02:38:30,730 adding 'flashinfer/data/include/flashinfer/mamba/common.cuh' 2026-04-06T02:38:30,732 adding 'flashinfer/data/include/flashinfer/mamba/conversion.cuh' 2026-04-06T02:38:30,734 adding 'flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh' 2026-04-06T02:38:30,737 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp.cuh' 2026-04-06T02:38:30,742 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh' 2026-04-06T02:38:30,744 adding 'flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh' 2026-04-06T02:38:30,745 adding 'flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh' 2026-04-06T02:38:30,748 adding 'flashinfer/data/include/flashinfer/trtllm/common.h' 2026-04-06T02:38:30,750 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h' 2026-04-06T02:38:30,752 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh' 2026-04-06T02:38:30,753 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h' 2026-04-06T02:38:30,754 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h' 2026-04-06T02:38:30,756 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh' 2026-04-06T02:38:30,758 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h' 2026-04-06T02:38:30,760 adding 'flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh' 2026-04-06T02:38:30,762 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h' 2026-04-06T02:38:30,763 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h' 2026-04-06T02:38:30,768 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh' 2026-04-06T02:38:30,770 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h' 2026-04-06T02:38:30,771 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh' 2026-04-06T02:38:30,773 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h' 2026-04-06T02:38:30,778 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h' 2026-04-06T02:38:30,780 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h' 2026-04-06T02:38:30,781 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh' 2026-04-06T02:38:30,784 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h' 2026-04-06T02:38:30,786 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h' 2026-04-06T02:38:30,789 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh' 2026-04-06T02:38:30,790 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h' 2026-04-06T02:38:30,792 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh' 2026-04-06T02:38:30,793 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h' 2026-04-06T02:38:30,796 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h' 2026-04-06T02:38:30,798 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h' 2026-04-06T02:38:30,802 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h' 2026-04-06T02:38:30,810 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h' 2026-04-06T02:38:30,813 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h' 2026-04-06T02:38:30,815 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h' 2026-04-06T02:38:30,819 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h' 2026-04-06T02:38:30,820 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h' 2026-04-06T02:38:30,823 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h' 2026-04-06T02:38:30,824 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaArchDecl.h' 2026-04-06T02:38:30,825 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h' 2026-04-06T02:38:30,827 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h' 2026-04-06T02:38:30,829 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h' 2026-04-06T02:38:30,830 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h' 2026-04-06T02:38:30,831 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SparsityDecl.h' 2026-04-06T02:38:30,834 adding 'flashinfer/data/spdlog/include/spdlog/async.h' 2026-04-06T02:38:30,835 adding 'flashinfer/data/spdlog/include/spdlog/async_logger-inl.h' 2026-04-06T02:38:30,837 adding 'flashinfer/data/spdlog/include/spdlog/async_logger.h' 2026-04-06T02:38:30,838 adding 'flashinfer/data/spdlog/include/spdlog/common-inl.h' 2026-04-06T02:38:30,840 adding 'flashinfer/data/spdlog/include/spdlog/common.h' 2026-04-06T02:38:30,841 adding 'flashinfer/data/spdlog/include/spdlog/formatter.h' 2026-04-06T02:38:30,842 adding 'flashinfer/data/spdlog/include/spdlog/fwd.h' 2026-04-06T02:38:30,844 adding 'flashinfer/data/spdlog/include/spdlog/logger-inl.h' 2026-04-06T02:38:30,846 adding 'flashinfer/data/spdlog/include/spdlog/logger.h' 2026-04-06T02:38:30,847 adding 'flashinfer/data/spdlog/include/spdlog/mdc.h' 2026-04-06T02:38:30,851 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h' 2026-04-06T02:38:30,853 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter.h' 2026-04-06T02:38:30,854 adding 'flashinfer/data/spdlog/include/spdlog/spdlog-inl.h' 2026-04-06T02:38:30,856 adding 'flashinfer/data/spdlog/include/spdlog/spdlog.h' 2026-04-06T02:38:30,857 adding 'flashinfer/data/spdlog/include/spdlog/stopwatch.h' 2026-04-06T02:38:30,859 adding 'flashinfer/data/spdlog/include/spdlog/tweakme.h' 2026-04-06T02:38:30,860 adding 'flashinfer/data/spdlog/include/spdlog/version.h' 2026-04-06T02:38:30,862 adding 'flashinfer/data/spdlog/include/spdlog/cfg/argv.h' 2026-04-06T02:38:30,863 adding 'flashinfer/data/spdlog/include/spdlog/cfg/env.h' 2026-04-06T02:38:30,864 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h' 2026-04-06T02:38:30,865 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers.h' 2026-04-06T02:38:30,868 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h' 2026-04-06T02:38:30,869 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer.h' 2026-04-06T02:38:30,870 adding 'flashinfer/data/spdlog/include/spdlog/details/circular_q.h' 2026-04-06T02:38:30,872 adding 'flashinfer/data/spdlog/include/spdlog/details/console_globals.h' 2026-04-06T02:38:30,873 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h' 2026-04-06T02:38:30,874 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper.h' 2026-04-06T02:38:30,876 adding 'flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h' 2026-04-06T02:38:30,877 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h' 2026-04-06T02:38:30,878 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg.h' 2026-04-06T02:38:30,880 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h' 2026-04-06T02:38:30,881 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h' 2026-04-06T02:38:30,882 adding 'flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h' 2026-04-06T02:38:30,884 adding 'flashinfer/data/spdlog/include/spdlog/details/null_mutex.h' 2026-04-06T02:38:30,886 adding 'flashinfer/data/spdlog/include/spdlog/details/os-inl.h' 2026-04-06T02:38:30,888 adding 'flashinfer/data/spdlog/include/spdlog/details/os.h' 2026-04-06T02:38:30,889 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h' 2026-04-06T02:38:30,891 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h' 2026-04-06T02:38:30,892 adding 'flashinfer/data/spdlog/include/spdlog/details/registry-inl.h' 2026-04-06T02:38:30,894 adding 'flashinfer/data/spdlog/include/spdlog/details/registry.h' 2026-04-06T02:38:30,895 adding 'flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h' 2026-04-06T02:38:30,896 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h' 2026-04-06T02:38:30,898 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client.h' 2026-04-06T02:38:30,899 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h' 2026-04-06T02:38:30,900 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool.h' 2026-04-06T02:38:30,902 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h' 2026-04-06T02:38:30,903 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client.h' 2026-04-06T02:38:30,904 adding 'flashinfer/data/spdlog/include/spdlog/details/windows_include.h' 2026-04-06T02:38:30,906 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h' 2026-04-06T02:38:30,907 adding 'flashinfer/data/spdlog/include/spdlog/fmt/chrono.h' 2026-04-06T02:38:30,909 adding 'flashinfer/data/spdlog/include/spdlog/fmt/compile.h' 2026-04-06T02:38:30,910 adding 'flashinfer/data/spdlog/include/spdlog/fmt/fmt.h' 2026-04-06T02:38:30,911 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ostr.h' 2026-04-06T02:38:30,913 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ranges.h' 2026-04-06T02:38:30,914 adding 'flashinfer/data/spdlog/include/spdlog/fmt/std.h' 2026-04-06T02:38:30,915 adding 'flashinfer/data/spdlog/include/spdlog/fmt/xchar.h' 2026-04-06T02:38:30,917 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h' 2026-04-06T02:38:30,925 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h' 2026-04-06T02:38:30,928 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h' 2026-04-06T02:38:30,931 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h' 2026-04-06T02:38:30,944 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h' 2026-04-06T02:38:30,945 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst' 2026-04-06T02:38:30,955 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h' 2026-04-06T02:38:30,976 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h' 2026-04-06T02:38:30,978 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h' 2026-04-06T02:38:30,980 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h' 2026-04-06T02:38:30,982 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h' 2026-04-06T02:38:30,985 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h' 2026-04-06T02:38:30,988 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h' 2026-04-06T02:38:30,990 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h' 2026-04-06T02:38:30,992 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h' 2026-04-06T02:38:30,994 adding 'flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h' 2026-04-06T02:38:30,996 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h' 2026-04-06T02:38:30,997 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h' 2026-04-06T02:38:30,998 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h' 2026-04-06T02:38:31,000 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h' 2026-04-06T02:38:31,001 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h' 2026-04-06T02:38:31,002 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h' 2026-04-06T02:38:31,004 adding 'flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h' 2026-04-06T02:38:31,005 adding 'flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h' 2026-04-06T02:38:31,007 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h' 2026-04-06T02:38:31,008 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h' 2026-04-06T02:38:31,010 adding 'flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h' 2026-04-06T02:38:31,011 adding 'flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h' 2026-04-06T02:38:31,012 adding 'flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h' 2026-04-06T02:38:31,013 adding 'flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h' 2026-04-06T02:38:31,015 adding 'flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h' 2026-04-06T02:38:31,016 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h' 2026-04-06T02:38:31,017 adding 'flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h' 2026-04-06T02:38:31,019 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h' 2026-04-06T02:38:31,020 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h' 2026-04-06T02:38:31,021 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h' 2026-04-06T02:38:31,022 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h' 2026-04-06T02:38:31,024 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink.h' 2026-04-06T02:38:31,025 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h' 2026-04-06T02:38:31,026 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h' 2026-04-06T02:38:31,028 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h' 2026-04-06T02:38:31,029 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h' 2026-04-06T02:38:31,030 adding 'flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h' 2026-04-06T02:38:31,032 adding 'flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h' 2026-04-06T02:38:31,033 adding 'flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h' 2026-04-06T02:38:31,034 adding 'flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h' 2026-04-06T02:38:31,036 adding 'flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h' 2026-04-06T02:38:31,038 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h' 2026-04-06T02:38:31,039 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h' 2026-04-06T02:38:31,041 adding 'flashinfer/data/spdlog/scripts/extract_version.py' 2026-04-06T02:38:31,042 adding 'flashinfer/dsv3_ops/__init__.py' 2026-04-06T02:38:31,044 adding 'flashinfer/fused_moe/__init__.py' 2026-04-06T02:38:31,052 adding 'flashinfer/fused_moe/core.py' 2026-04-06T02:38:31,054 adding 'flashinfer/fused_moe/fused_routing_dsv3.py' 2026-04-06T02:38:31,056 adding 'flashinfer/fused_moe/utils.py' 2026-04-06T02:38:31,058 adding 'flashinfer/fused_moe/cute_dsl/__init__.py' 2026-04-06T02:38:31,061 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-06T02:38:31,064 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-06T02:38:31,067 adding 'flashinfer/fused_moe/cute_dsl/fused_moe.py' 2026-04-06T02:38:31,070 adding 'flashinfer/fused_moe/cute_dsl/moe_utils.py' 2026-04-06T02:38:31,073 adding 'flashinfer/fused_moe/cute_dsl/tuner.py' 2026-04-06T02:38:31,075 adding 'flashinfer/fused_moe/cute_dsl/blackwell/__init__.py' 2026-04-06T02:38:31,088 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-06T02:38:31,099 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-06T02:38:31,101 adding 'flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py' 2026-04-06T02:38:31,103 adding 'flashinfer/fused_moe/cute_dsl/blackwell/utils.py' 2026-04-06T02:38:31,105 adding 'flashinfer/gdn_kernels/__init__.py' 2026-04-06T02:38:31,110 adding 'flashinfer/gdn_kernels/gdn_decode_bf16_state.py' 2026-04-06T02:38:31,118 adding 'flashinfer/gdn_kernels/gdn_decode_mtp.py' 2026-04-06T02:38:31,121 adding 'flashinfer/gdn_kernels/gdn_decode_nontranspose.py' 2026-04-06T02:38:31,125 adding 'flashinfer/gdn_kernels/gdn_decode_pretranspose.py' 2026-04-06T02:38:31,127 adding 'flashinfer/gdn_kernels/blackwell_prefill/__init__.py' 2026-04-06T02:38:31,139 adding 'flashinfer/gdn_kernels/blackwell_prefill/gdn.py' 2026-04-06T02:38:31,141 adding 'flashinfer/gdn_kernels/blackwell_prefill/gdn_helpers.py' 2026-04-06T02:38:31,142 adding 'flashinfer/gdn_kernels/blackwell_prefill/gdn_tile_scheduler.py' 2026-04-06T02:38:31,144 adding 'flashinfer/gemm/__init__.py' 2026-04-06T02:38:31,165 adding 'flashinfer/gemm/gemm_base.py' 2026-04-06T02:38:31,168 adding 'flashinfer/gemm/routergemm.py' 2026-04-06T02:38:31,170 adding 'flashinfer/gemm/kernels/__init__.py' 2026-04-06T02:38:31,178 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py' 2026-04-06T02:38:31,187 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py' 2026-04-06T02:38:31,198 adding 'flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py' 2026-04-06T02:38:31,201 adding 'flashinfer/jit/__init__.py' 2026-04-06T02:38:31,203 adding 'flashinfer/jit/activation.py' 2026-04-06T02:38:31,204 adding 'flashinfer/jit/cascade.py' 2026-04-06T02:38:31,205 adding 'flashinfer/jit/comm.py' 2026-04-06T02:38:31,208 adding 'flashinfer/jit/core.py' 2026-04-06T02:38:31,210 adding 'flashinfer/jit/cpp_ext.py' 2026-04-06T02:38:31,212 adding 'flashinfer/jit/cubin_loader.py' 2026-04-06T02:38:31,213 adding 'flashinfer/jit/dsv3_optimizations.py' 2026-04-06T02:38:31,214 adding 'flashinfer/jit/env.py' 2026-04-06T02:38:31,216 adding 'flashinfer/jit/fp4_kv_dequantization.py' 2026-04-06T02:38:31,217 adding 'flashinfer/jit/fp4_kv_quantization.py' 2026-04-06T02:38:31,218 adding 'flashinfer/jit/fp4_quantization.py' 2026-04-06T02:38:31,219 adding 'flashinfer/jit/fp8_quantization.py' 2026-04-06T02:38:31,221 adding 'flashinfer/jit/fused_moe.py' 2026-04-06T02:38:31,222 adding 'flashinfer/jit/gdn.py' 2026-04-06T02:38:31,224 adding 'flashinfer/jit/mla.py' 2026-04-06T02:38:31,225 adding 'flashinfer/jit/moe_utils.py' 2026-04-06T02:38:31,226 adding 'flashinfer/jit/norm.py' 2026-04-06T02:38:31,228 adding 'flashinfer/jit/page.py' 2026-04-06T02:38:31,229 adding 'flashinfer/jit/quantization.py' 2026-04-06T02:38:31,230 adding 'flashinfer/jit/rope.py' 2026-04-06T02:38:31,232 adding 'flashinfer/jit/sampling.py' 2026-04-06T02:38:31,233 adding 'flashinfer/jit/spdlog.py' 2026-04-06T02:38:31,234 adding 'flashinfer/jit/tinygemm2.py' 2026-04-06T02:38:31,235 adding 'flashinfer/jit/tllm_utils.py' 2026-04-06T02:38:31,237 adding 'flashinfer/jit/topk.py' 2026-04-06T02:38:31,238 adding 'flashinfer/jit/utils.py' 2026-04-06T02:38:31,240 adding 'flashinfer/jit/xqa.py' 2026-04-06T02:38:31,241 adding 'flashinfer/jit/attention/__init__.py' 2026-04-06T02:38:31,246 adding 'flashinfer/jit/attention/modules.py' 2026-04-06T02:38:31,248 adding 'flashinfer/jit/attention/utils.py' 2026-04-06T02:38:31,249 adding 'flashinfer/jit/attention/variants.py' 2026-04-06T02:38:31,255 adding 'flashinfer/jit/attention/fmha_v2/fmha_library.py' 2026-04-06T02:38:31,256 adding 'flashinfer/jit/attention/fmha_v2/generate_kernels.py' 2026-04-06T02:38:31,272 adding 'flashinfer/jit/attention/fmha_v2/generator_utils.py' 2026-04-06T02:38:31,277 adding 'flashinfer/jit/attention/fmha_v2/utils.py' 2026-04-06T02:38:31,279 adding 'flashinfer/jit/gemm/__init__.py' 2026-04-06T02:38:31,281 adding 'flashinfer/jit/gemm/core.py' 2026-04-06T02:38:31,283 adding 'flashinfer/jit/gemm/deepgemm.py' 2026-04-06T02:38:31,284 adding 'flashinfer/jit/gemm/fp8_blockscale.py' 2026-04-06T02:38:31,286 adding 'flashinfer/jit/gemm/cutlass/__init__.py' 2026-04-06T02:38:31,290 adding 'flashinfer/jit/gemm/cutlass/cutlass_library.py' 2026-04-06T02:38:31,294 adding 'flashinfer/jit/gemm/cutlass/generate_kernels.py' 2026-04-06T02:38:31,297 adding 'flashinfer/jit/mamba/__init__.py' 2026-04-06T02:38:31,298 adding 'flashinfer/jit/mamba/selective_state_update.py' 2026-04-06T02:38:31,300 adding 'flashinfer/jit/mamba/seq_chunk_cumsum.py' 2026-04-06T02:38:31,301 adding 'flashinfer/logits_processor/__init__.py' 2026-04-06T02:38:31,303 adding 'flashinfer/logits_processor/compiler.py' 2026-04-06T02:38:31,305 adding 'flashinfer/logits_processor/fusion_rules.py' 2026-04-06T02:38:31,306 adding 'flashinfer/logits_processor/legalization.py' 2026-04-06T02:38:31,307 adding 'flashinfer/logits_processor/op.py' 2026-04-06T02:38:31,309 adding 'flashinfer/logits_processor/operators.py' 2026-04-06T02:38:31,311 adding 'flashinfer/logits_processor/pipeline.py' 2026-04-06T02:38:31,312 adding 'flashinfer/logits_processor/processors.py' 2026-04-06T02:38:31,314 adding 'flashinfer/logits_processor/types.py' 2026-04-06T02:38:31,315 adding 'flashinfer/logits_processor/validators.py' 2026-04-06T02:38:31,317 adding 'flashinfer/mamba/__init__.py' 2026-04-06T02:38:31,319 adding 'flashinfer/mamba/selective_state_update.py' 2026-04-06T02:38:31,322 adding 'flashinfer/mamba/ssd_combined.py' 2026-04-06T02:38:31,336 adding 'flashinfer/mamba/ssd_kernel.py' 2026-04-06T02:38:31,338 adding 'flashinfer/mamba/ssd_tile_scheduler.py' 2026-04-06T02:38:31,341 adding 'flashinfer/norm/__init__.py' 2026-04-06T02:38:31,343 adding 'flashinfer/norm/utils.py' 2026-04-06T02:38:31,345 adding 'flashinfer/norm/kernels/__init__.py' 2026-04-06T02:38:31,349 adding 'flashinfer/norm/kernels/fused_add_rmsnorm.py' 2026-04-06T02:38:31,351 adding 'flashinfer/norm/kernels/layernorm.py' 2026-04-06T02:38:31,355 adding 'flashinfer/norm/kernels/rmsnorm.py' 2026-04-06T02:38:31,357 adding 'flashinfer/profiler/__init__.py' 2026-04-06T02:38:31,358 adding 'flashinfer/quantization/__init__.py' 2026-04-06T02:38:31,363 adding 'flashinfer/quantization/fp4_quantization.py' 2026-04-06T02:38:31,365 adding 'flashinfer/quantization/fp8_quantization.py' 2026-04-06T02:38:31,367 adding 'flashinfer/quantization/packbits.py' 2026-04-06T02:38:31,370 adding 'flashinfer/quantization/quantization_cute_dsl_utils.py' 2026-04-06T02:38:31,371 adding 'flashinfer/quantization/kernels/__init__.py' 2026-04-06T02:38:31,374 adding 'flashinfer/quantization/kernels/mxfp4_quantize.py' 2026-04-06T02:38:31,377 adding 'flashinfer/quantization/kernels/mxfp8_quantize.py' 2026-04-06T02:38:31,379 adding 'flashinfer/testing/__init__.py' 2026-04-06T02:38:31,385 adding 'flashinfer/testing/utils.py' 2026-04-06T02:38:31,387 adding 'flashinfer/triton/__init__.py' 2026-04-06T02:38:31,389 adding 'flashinfer/triton/activation.py' 2026-04-06T02:38:31,390 adding 'flashinfer/triton/cascade.py' 2026-04-06T02:38:31,391 adding 'flashinfer/triton/gemm.py' 2026-04-06T02:38:31,393 adding 'flashinfer/triton/norm.py' 2026-04-06T02:38:31,394 adding 'flashinfer/triton/page.py' 2026-04-06T02:38:31,395 adding 'flashinfer/triton/sm_constraint_gemm.py' 2026-04-06T02:38:31,397 adding 'flashinfer/triton/utils.py' 2026-04-06T02:38:31,399 adding 'flashinfer/triton/kernels/__init__.py' 2026-04-06T02:38:31,400 adding 'flashinfer/triton/kernels/activation.py' 2026-04-06T02:38:31,401 adding 'flashinfer/triton/kernels/cascade.py' 2026-04-06T02:38:31,403 adding 'flashinfer/triton/kernels/norm.py' 2026-04-06T02:38:31,404 adding 'flashinfer/triton/kernels/quant.py' 2026-04-06T02:38:31,406 adding 'flashinfer/triton/kernels/sm_constraint_gemm.py' 2026-04-06T02:38:31,407 adding 'flashinfer/triton/kernels/ssd_chunk_state.py' 2026-04-06T02:38:31,409 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py' 2026-04-06T02:38:31,411 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py' 2026-04-06T02:38:31,414 adding 'flashinfer_python-0.6.7.post3.dist-info/licenses/LICENSE' 2026-04-06T02:38:31,416 adding 'flashinfer_python-0.6.7.post3.dist-info/METADATA' 2026-04-06T02:38:31,417 adding 'flashinfer_python-0.6.7.post3.dist-info/WHEEL' 2026-04-06T02:38:31,418 adding 'flashinfer_python-0.6.7.post3.dist-info/entry_points.txt' 2026-04-06T02:38:31,419 adding 'flashinfer_python-0.6.7.post3.dist-info/top_level.txt' 2026-04-06T02:38:31,461 adding 'flashinfer_python-0.6.7.post3.dist-info/RECORD' 2026-04-06T02:38:31,602 removing build/bdist.linux-armv7l/wheel 2026-04-06T02:38:32,316 Building wheel for flashinfer-python (pyproject.toml): finished with status 'done' 2026-04-06T02:38:32,470 Created wheel for flashinfer-python: filename=flashinfer_python-0.6.7.post3-py3-none-any.whl size=9187530 sha256=7a81720af5bdc04efcb67207f3867adb1b068f961d2e048e55baf32fb8e2cfc5 2026-04-06T02:38:32,471 Stored in directory: /tmp/pip-ephem-wheel-cache-6go0d428/wheels/8d/46/c0/2b972ef11ae388949bbb54d7e293729862686b79edfff7e3b7 2026-04-06T02:38:33,056 Successfully built flashinfer-python 2026-04-06T02:38:33,374 Removed build tracker: '/tmp/pip-build-tracker-lo4oc_78'