2026-04-03T10:32:48,404 Created temporary directory: /tmp/pip-ephem-wheel-cache-xdunoq92 2026-04-03T10:32:48,405 Created temporary directory: /tmp/pip-build-tracker-ni7c15lb 2026-04-03T10:32:48,406 Initialized build tracking at /tmp/pip-build-tracker-ni7c15lb 2026-04-03T10:32:48,406 Created build tracker: /tmp/pip-build-tracker-ni7c15lb 2026-04-03T10:32:48,407 Entered build tracker: /tmp/pip-build-tracker-ni7c15lb 2026-04-03T10:32:48,408 Created temporary directory: /tmp/pip-wheel-ubjctduw 2026-04-03T10:32:48,410 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-03T10:32:48,413 Created temporary directory: /tmp/pip-ephem-wheel-cache-v9jb6c71 2026-04-03T10:32:48,434 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-03T10:32:48,438 2 location(s) to search for versions of flashinfer-python: 2026-04-03T10:32:48,438 * https://pypi.org/simple/flashinfer-python/ 2026-04-03T10:32:48,438 * https://www.piwheels.org/simple/flashinfer-python/ 2026-04-03T10:32:48,439 Fetching project page and analyzing links: https://pypi.org/simple/flashinfer-python/ 2026-04-03T10:32:48,440 Getting page https://pypi.org/simple/flashinfer-python/ 2026-04-03T10:32:48,441 Found index url https://pypi.org/simple 2026-04-03T10:32:48,590 Fetched page https://pypi.org/simple/flashinfer-python/ as application/vnd.pypi.simple.v1+json 2026-04-03T10:32:48,605 Found link https://files.pythonhosted.org/packages/6c/e9/5d6adcf888922a17c6fc52a0e5bed78785239af1219f41e1073b063a07ff/flashinfer_python-0.2.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post1 2026-04-03T10:32:48,606 Found link https://files.pythonhosted.org/packages/c8/39/bac839234a3beaab4292e489b4d8941cc97ba4f76474aff0407d7b05a84f/flashinfer_python-0.2.0.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post2 2026-04-03T10:32:48,607 Found link https://files.pythonhosted.org/packages/94/74/4dda2a7a7aa08bcfb8039faf2202bf0fea6b378d0d4968864737400fc329/flashinfer_python-0.2.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1 2026-04-03T10:32:48,609 Found link https://files.pythonhosted.org/packages/7f/3d/aab500609825108d3f6a4b440a7eeb6436d578d3e781e97ea015fd49a530/flashinfer_python-0.2.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post1 2026-04-03T10:32:48,610 Found link https://files.pythonhosted.org/packages/30/ac/afd1d2c472857be8f83389eb506e1413a2ac3a603889bea3cf24d5ab5be5/flashinfer_python-0.2.1.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post2 2026-04-03T10:32:48,612 Found link https://files.pythonhosted.org/packages/90/00/833dd50745bc15bb7a7451b77589d444ce963d48c0cb730b4760bfebffad/flashinfer_python-0.2.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2 2026-04-03T10:32:48,613 Found link https://files.pythonhosted.org/packages/02/cc/db9635c56653d3fa5a28f14ac858e0801de621aa33d3b528e4781aee906f/flashinfer_python-0.2.2.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2.post1 2026-04-03T10:32:48,614 Found link https://files.pythonhosted.org/packages/b6/10/2a63f1d09c5b337705236005dc9ccce513dcc08b7fd037cb40426f1695b1/flashinfer_python-0.2.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.3 2026-04-03T10:32:48,616 Found link https://files.pythonhosted.org/packages/a4/e5/8d193ccf65b92c009c4be50fdffa88fa0edc8fd6e6169bacaca6bab84d89/flashinfer_python-0.2.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.4 2026-04-03T10:32:48,617 Found link https://files.pythonhosted.org/packages/b2/c4/9ec0f79e2480fc5c93307c4a1ac903e5cf33c551c0eaeb648196234b55af/flashinfer_python-0.2.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.5 2026-04-03T10:32:48,619 Found link https://files.pythonhosted.org/packages/95/4a/a3109d57463d25a153b16c0d0f06495e4d18b727c81f8e08e42e97faaf45/flashinfer_python-0.2.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6 2026-04-03T10:32:48,621 Found link https://files.pythonhosted.org/packages/34/26/3c6f12ffaefbfa0c453030d6e15941269b3a4ffcd267daec32d1a10dda96/flashinfer_python-0.2.6.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6.post1 2026-04-03T10:32:48,622 Found link https://files.pythonhosted.org/packages/f9/a0/5e700751f2393a504bc5eb2879e77d783a5b70778a254289711323126abc/flashinfer_python-0.2.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7 2026-04-03T10:32:48,623 Found link https://files.pythonhosted.org/packages/c0/10/43cf1ea7a03ca8e75a185190708e48286e1583d781e93d1de130e5d450ca/flashinfer_python-0.2.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7.post1 2026-04-03T10:32:48,625 Found link https://files.pythonhosted.org/packages/f1/80/8dfae62d04af4597d7615b892f346ace68bcb07dfbef2a9e614219d96a8a/flashinfer_python-0.2.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8rc1 2026-04-03T10:32:48,626 Found link https://files.pythonhosted.org/packages/72/0e/827624993516e80f62ba88dd368ad5e180c41324f063c00d27fa638a430e/flashinfer_python-0.2.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8 2026-04-03T10:32:48,628 Found link https://files.pythonhosted.org/packages/17/50/42afc9a81031939140fcbfd93e5a3652dc4995e338b4e6d007b0dda04f93/flashinfer_python-0.2.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc1 2026-04-03T10:32:48,629 Found link https://files.pythonhosted.org/packages/ed/1a/9f30eda3178ed2f5f7e311ae0011d02c4542d087f84c9247e4b30668b767/flashinfer_python-0.2.9rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc2 2026-04-03T10:32:48,631 Found link https://files.pythonhosted.org/packages/45/fc/4deff13f1420cc6e5871b7505a6c0d9031eb49cd09571ae576aec59bed61/flashinfer_python-0.2.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9 2026-04-03T10:32:48,632 Found link https://files.pythonhosted.org/packages/74/e4/2c6d6a19d13ed13d4863f6900febe72b502334e43292d5fe9a1ac2f6c5be/flashinfer_python-0.2.10.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.10 2026-04-03T10:32:48,633 Found link https://files.pythonhosted.org/packages/72/8b/f315dda5993d1c018ca5ecfef0775c6a3c7a8f59ac426fabb7f3f6b93482/flashinfer_python-0.2.11.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11 2026-04-03T10:32:48,634 Found link https://files.pythonhosted.org/packages/37/e3/2e8e31f7f7ee26f39968264e4fcf74f9810d90e940859016d974106ed5c6/flashinfer_python-0.2.11.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post1 2026-04-03T10:32:48,635 Found link https://files.pythonhosted.org/packages/b6/01/fa069f076cfe5bed34ddc3b7f772aa09c70e03e572dd9d3569ff887f33b1/flashinfer_python-0.2.11.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post2 2026-04-03T10:32:48,636 Found link https://files.pythonhosted.org/packages/a3/09/5d89ef0bc2d19d3ebcf3b9fa621c945909f681818c9d55aa3181921db874/flashinfer_python-0.2.11.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post3 2026-04-03T10:32:48,637 Found link https://files.pythonhosted.org/packages/b9/5a/7a839afb07af313549b9d9f1057b02aaf067f020267d5a9d128e50596bf4/flashinfer_python-0.2.12.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.12 2026-04-03T10:32:48,639 Found link https://files.pythonhosted.org/packages/f2/20/e79142a9f26aab61b17e2c906a49e9a3d3c656d97608c8773785c3b13140/flashinfer_python-0.2.13.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.13 2026-04-03T10:32:48,640 Found link https://files.pythonhosted.org/packages/ed/26/d1eac56b37d225cb3f84495bd897829dece21f62463487f3c1d9cafe78a0/flashinfer_python-0.2.14.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14 2026-04-03T10:32:48,641 Found link https://files.pythonhosted.org/packages/94/d4/4a2bf3d49f84b2d975925c1c024790b4e4768bdefbc5e27529d68368355a/flashinfer_python-0.2.14.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14.post1 2026-04-03T10:32:48,642 Found link https://files.pythonhosted.org/packages/56/e3/7c0a4df2640a97ecfed45fe9110ecc6a67d4967278723abf8e6531b6bc1f/flashinfer_python-0.3.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0rc1 2026-04-03T10:32:48,643 Found link https://files.pythonhosted.org/packages/1f/b4/5c4cbb0f3cbc5e8d4c19b3f163c048eed959a0ac0c603cfb3939a3079c52/flashinfer_python-0.3.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0 2026-04-03T10:32:48,644 Found link https://files.pythonhosted.org/packages/59/1b/83a9c58432b4a5d6ff04b97d4873bedfb5e35d38972ca8946b3acdbffeb4/flashinfer_python-0.3.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0.post1 2026-04-03T10:32:48,645 Found link https://files.pythonhosted.org/packages/ba/71/dd3001b8be8174d90561764a5f3be4ca219517bde2841189ea6973a3873f/flashinfer_python-0.3.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1 2026-04-03T10:32:48,646 Found link https://files.pythonhosted.org/packages/49/a7/f5bd3878f94fc47e25ecc0828f910233022366f7e832dfa02f3617fad41f/flashinfer_python-0.3.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1.post1 2026-04-03T10:32:48,647 Found link https://files.pythonhosted.org/packages/df/b4/f113bb950e5244d1c72c3d73c03fac0db939f085670e3a45a41fe92ffde0/flashinfer_python-0.4.0rc0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc0 2026-04-03T10:32:48,648 Found link https://files.pythonhosted.org/packages/2e/a8/adceccda3aae01b7bdb5f99c68a2b401c58600f34a6386d9489ff736cdbc/flashinfer_python-0.4.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc1 2026-04-03T10:32:48,649 Found link https://files.pythonhosted.org/packages/15/c0/5fb88fc273fed23dbf3b0ef0bffa7db26e2df24e016202df1b4e98b95879/flashinfer_python-0.4.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc2 2026-04-03T10:32:48,650 Found link https://files.pythonhosted.org/packages/65/91/cf9e3a0a2626711bfab18ea4a4c739e0eb823e9513addc0e9e1b8f929538/flashinfer_python-0.4.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc3 2026-04-03T10:32:48,651 Found link https://files.pythonhosted.org/packages/94/ec/bdcc0ec502994d544cbe69763d999458ae2deda67e58c1cb2d85867677c4/flashinfer_python-0.4.0rc4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc4 2026-04-03T10:32:48,652 Found link https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0 2026-04-03T10:32:48,654 Found link https://files.pythonhosted.org/packages/64/cf/f82142abd7c819fb84a53f18fe1ac9e7cf1af8790b93c06dbf430001473b/flashinfer_python-0.4.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.1 2026-04-03T10:32:48,655 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/c7/92/126dacc3476fab07478bdfc9944abd22aafa1000088d93bf86fb9ec78a29/flashinfer_python-0.5.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,656 Found link https://files.pythonhosted.org/packages/53/47/a759f1ae9ef4ceb4e12895665b65dfacea2085494626e764627dd3548fa8/flashinfer_python-0.5.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc1 2026-04-03T10:32:48,656 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/fb/aa/7b5d28c2aec11acfce18f2655d0b4614c7e34547fab218b4f2fd0d57bdce/flashinfer_python-0.5.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,657 Found link https://files.pythonhosted.org/packages/3d/5a/58a7b60f79a1ac9c652b4055b06e88b5f57e8ef4c7dd4830ef48fa4cc265/flashinfer_python-0.5.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc2 2026-04-03T10:32:48,658 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/5f/8f/7077cf0a44056a65045a793d6d55845d95818fb6455bfebb44ddea7e1f12/flashinfer_python-0.5.0rc3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,659 Found link https://files.pythonhosted.org/packages/60/d1/8c90d6dfc95ab609028e9d541a6cdb3483f5c1475b07d97465ff3f0db14c/flashinfer_python-0.5.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc3 2026-04-03T10:32:48,660 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/eb/8a/425b75b44ce5eeefe01dd61d4ee260b8e5f9dcf1a500d5f08d6cd4095d3a/flashinfer_python-0.5.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,661 Found link https://files.pythonhosted.org/packages/e3/1d/b82cd2606f4f0033e2fb28194dc3b04fd8101643e4ceb1d13fb1466cfd28/flashinfer_python-0.5.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0 2026-04-03T10:32:48,661 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f4/f1/33dedad087a2bc3d66244126bd5d1c79721ea22d1f2124299f9e5bdaf3b1/flashinfer_python-0.5.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,662 Found link https://files.pythonhosted.org/packages/6c/bb/897c3b9d683dcf6490f70e468efb585eebcd673970b13a04ed947b491982/flashinfer_python-0.5.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.1 2026-04-03T10:32:48,663 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/8d/0c/4a8ffbbc0d85e314f534cf5c32711f2af5d5e6e49225a5a414400a67b684/flashinfer_python-0.5.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,664 Found link https://files.pythonhosted.org/packages/d8/04/e357eaa50238e12c49e66fcf47f83e066e741ef19a117c136782b32eafbb/flashinfer_python-0.5.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.2 2026-04-03T10:32:48,665 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/78/6dc7e7da8cb87c9965644ea0d2439457a1bc9256c45ceda0044595be4143/flashinfer_python-0.5.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,666 Found link https://files.pythonhosted.org/packages/b4/91/cca69baeff24bb3efd12c7479a026432c8717ee47193694010494c528b22/flashinfer_python-0.5.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.5.3 2026-04-03T10:32:48,667 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/b2/0c/cb2d60eb86f0171451d676f17b90484ab66baf73c54cefe15c9a7c800739/flashinfer_python-0.6.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,667 Found link https://files.pythonhosted.org/packages/53/2a/e855be4851ad6bfcebed929807fb541715f9a3a7d7b239b696e635b49d0e/flashinfer_python-0.6.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc1 2026-04-03T10:32:48,668 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/05/22/9193f1da2468acec8ba99c4bee8aeacbda489777acf00b5871a73209acf7/flashinfer_python-0.6.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,669 Found link https://files.pythonhosted.org/packages/1b/71/dd1bb86ea531e5c1a34f8ad851901bf2e2ce500618b5a4da19bd69f7de11/flashinfer_python-0.6.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc2 2026-04-03T10:32:48,670 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/90/5834597488f5ea62b1cc874338125c79ce21c11d777ac6f7b47f12cf2bb3/flashinfer_python-0.6.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,671 Found link https://files.pythonhosted.org/packages/ad/8d/c7330f27f09b9110af2f6c44c6f68d7b536f525f8ac539210073bfcdb965/flashinfer_python-0.6.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0 2026-04-03T10:32:48,671 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/d5/bca632bb5781689415186421bbee2ad39ae8a39b0996d579c76901e5c66f/flashinfer_python-0.6.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,672 Found link https://files.pythonhosted.org/packages/68/81/5a84e14df7358d2c2903b18c6f2779bd4b4a6739076d01a847d4c18fb102/flashinfer_python-0.6.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.1 2026-04-03T10:32:48,673 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/aa/c0/ee819d16f6b40e287727bb3db471f4eaa9e0372e233bf2f7343faaa3009f/flashinfer_python-0.6.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,674 Found link https://files.pythonhosted.org/packages/89/86/b25115177606ae3b6cec373d290798c28e185d033b66f6b80a89589e7786/flashinfer_python-0.6.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.2 2026-04-03T10:32:48,675 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/13/2d95248101d8cb978db9000a4dceafb5b122484a694b53e84df1ac2a7b3d/flashinfer_python-0.6.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,676 Found link https://files.pythonhosted.org/packages/d6/aa/c564313b42dee7573da4ed0e441844f0c2bd827aecc9f29ea02c3838ffae/flashinfer_python-0.6.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.3 2026-04-03T10:32:48,676 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/17/9a/d2bab76d2bb15062c6a2329614653e4f8bec9c78eec9069856ef0c7c0a79/flashinfer_python-0.6.4-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,677 Found link https://files.pythonhosted.org/packages/77/45/15645d2a4ee81d08206f3e132a77323e48312f510462415d7cd1122eba43/flashinfer_python-0.6.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.4 2026-04-03T10:32:48,678 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/4f/83/eea2a74700b5fcae36ee2b748db9c3554a83a3f9e2dc4f3816369c5cb653/flashinfer_python-0.6.5-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,679 Found link https://files.pythonhosted.org/packages/e2/2f/5c52276af3cc40ac1f6eaf823ccd8e257f77e2fe5d465fa641ad3dba4d1b/flashinfer_python-0.6.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.5 2026-04-03T10:32:48,680 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/e0/61/385d06755f3ab66333018285657adf0daf8a90a129448231fd09e315bd2e/flashinfer_python-0.6.6-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,681 Found link https://files.pythonhosted.org/packages/03/70/c5a235297351021f5d3d3233523a85f5a6468495587489ad2f257e8eafe2/flashinfer_python-0.6.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.6 2026-04-03T10:32:48,681 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f1/e8/91361a5f07667f36181cfd08e2d7d28be4cae2aa5a24016339174b308c38/flashinfer_python-0.6.7-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,682 Found link https://files.pythonhosted.org/packages/d9/2d/aa36fa1fee744c46fef99436baea5cda4a34244846c1df0fea97eaa9a856/flashinfer_python-0.6.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7 2026-04-03T10:32:48,683 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/16/92/516c79e5d8d1f0b41793e499c37a9299115ac8bc05171661b30d4a94beb8/flashinfer_python-0.6.7.post1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,684 Found link https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post1 2026-04-03T10:32:48,685 Fetching project page and analyzing links: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-03T10:32:48,685 Getting page https://www.piwheels.org/simple/flashinfer-python/ 2026-04-03T10:32:48,687 Found index url https://www.piwheels.org/simple 2026-04-03T10:32:48,858 Fetched page https://www.piwheels.org/simple/flashinfer-python/ as text/html 2026-04-03T10:32:48,867 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7-py3-none-any.whl#sha256=9b349825a2d26c3e4653c594d7a1d7b2126a43b29a4a70a6d48f3aaac23b96f3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,868 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.6-py3-none-any.whl#sha256=94791e01c31510c057b4decabff24cbc62466682667867e84214c62c45d9b343 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,869 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.5-py3-none-any.whl#sha256=4b0a6c246959ca2dbc232fa1fe2f17ff857fd258de5dfacfa45033f21b6b7b93 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,869 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.4-py3-none-any.whl#sha256=22ee7972266bb31ce1583330769efc0ecd001fb70371531ce4c77f2d6eda0d59 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,870 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.3-py3-none-any.whl#sha256=ed3282188580afd663819924a772b2b531ac5bb88438bbe89d0baf67fe8c9fa5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,870 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.1-py3-none-any.whl#sha256=9e0e308062a81d4e4c462313bfe33edce7712309e8c89aed722065249e644833 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,871 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0-py3-none-any.whl#sha256=7ebc0582df714a933fc4c58ed4d12f4e61b4ad30b22b9155f290e96ee3eee3a0 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,871 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc2-py3-none-any.whl#sha256=63057b7ee43a4f6764c6ed8fe4c4c6de5a94da058fe0975bf279db0567c26204 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,872 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc1-py3-none-any.whl#sha256=e30a125bf89f8155f83aca80e5fb88a3d81224225485ce70f0f4c4c3a27da92c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,872 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.3-py3-none-any.whl#sha256=1de562233dfbd8de835c2eb757275a7759eda034460093c1eb9ff3c7d5c0845d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-03T10:32:48,873 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.2-py3-none-any.whl#sha256=bd3d206d1243bee523cf6cda27e0219e8fdf9026ade2e32045c8d9d4b7f7bf7a (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,873 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.1-py3-none-any.whl#sha256=8d73e4b66b7eb7fc4500f7f7e61aa194efebc769e7da1635a86506c97bf6fa0d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,874 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0-py3-none-any.whl#sha256=ac991d1911cff4a7453f02d88922803e7ca794a0af1dceaa920e33b81c78f5c8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,875 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc3-py3-none-any.whl#sha256=8799f4a93afc14042ac6f521f6fb682e4d62d738dc18a1e8798b7a2ba5b2e4ec (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,875 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc2-py3-none-any.whl#sha256=4ee4d438c8c7fdc242a917c3f97076562f3c44411dcaceb4f7d29082c41c0f8c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,876 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc1-py3-none-any.whl#sha256=a9d675075f3cb79ac1b5cba9e8430496d3983127609dc780a117b2b44bdb025d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,876 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.1-py3-none-any.whl#sha256=8fc8fc3233781e384689c5f202124ae7d266cb8dee14055cbb3c90fca530bf7f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,877 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.0-py3-none-any.whl#sha256=da0141b2163f9703e49972728eeb502d45eda60c25529a460d0d0d61963eedb2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-03T10:32:48,877 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.5-py3-none-any.whl#sha256=cb2a17c3ea5f47f8129f6410e2892f30051e15665f2ae54db540c8677c187d31 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,878 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.4-py3-none-any.whl#sha256=4a85bd6ac785f106f0ad9fe213abf42f96ab84ccd04aec3ab9acf76d47d2aa3f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,878 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.3-py3-none-any.whl#sha256=b8ead688a4857a2b360c992fb46ae2930fc4c43b50a092b7e42a13b40ee195da (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,879 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2.post1-py3-none-any.whl#sha256=0097a08376ae147084ea6bd0848fc2ea1764f524c510a48755aa8c63259b4466 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,879 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2-py3-none-any.whl#sha256=c109a340b7e60cb57d8c9ccec2c10e303a36b82a56ba8dcaaa0efbee2a48b97f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,880 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post2-py3-none-any.whl#sha256=dc91f387ba09e4df899238705ec37bbe3648395d828240b77db84378d1b91e9e (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,881 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post1-py3-none-any.whl#sha256=a44b9d872cf2ba6812d3c0750d98ad01b73e9ccbede933c7eade01b6c27b6232 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,881 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1-py3-none-any.whl#sha256=e07427d9eff1b8d091b5837c3ffc4fe7885dbf01d271d7225f7a89a2e3925f27 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,882 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post2-py3-none-any.whl#sha256=52c20b84ef1e848dd49c726ffc27801df8acccb4038aea61a2d73fa685bf75f8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,882 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post1-py3-none-any.whl#sha256=783c1039e0a7db0478a579d5cc54894def70ae601b1e5b90a3c3de2209334bf3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-03T10:32:48,883 Skipping link: not a file: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-03T10:32:48,884 Skipping link: not a file: https://pypi.org/simple/flashinfer-python/ 2026-04-03T10:32:48,909 Given no hashes to check 1 links for project 'flashinfer-python': discarding no candidates 2026-04-03T10:32:48,927 Collecting flashinfer-python==0.6.7.post1 2026-04-03T10:32:48,929 Created temporary directory: /tmp/pip-unpack-z_lu3lyn 2026-04-03T10:32:49,171 Downloading flashinfer_python-0.6.7.post1.tar.gz (6.5 MB) 2026-04-03T10:32:56,043 Added flashinfer-python==0.6.7.post1 from https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz to build tracker '/tmp/pip-build-tracker-ni7c15lb' 2026-04-03T10:32:56,052 Created temporary directory: /tmp/pip-build-env-jhndp_za 2026-04-03T10:32:56,056 Installing build dependencies: started 2026-04-03T10:32:56,057 Running command pip subprocess to install build dependencies 2026-04-03T10:32:57,229 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-03T10:32:57,672 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-03T10:32:57,696 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-03T10:32:59,461 Collecting setuptools>=77 2026-04-03T10:32:59,559 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-03T10:32:59,823 Collecting packaging>=24 2026-04-03T10:32:59,840 Using cached https://www.piwheels.org/simple/packaging/packaging-26.0-py3-none-any.whl (74 kB) 2026-04-03T10:33:00,474 Collecting apache-tvm-ffi!=0.1.8,!=0.1.8.post0,<0.2,>=0.1.6 2026-04-03T10:33:00,488 Downloading https://www.piwheels.org/simple/apache-tvm-ffi/apache_tvm_ffi-0.1.9-cp311-cp311-linux_armv7l.whl (2.2 MB) 2026-04-03T10:33:00,666 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.2/2.2 MB 13.2 MB/s eta 0:00:00 2026-04-03T10:33:00,881 Collecting typing-extensions>=4.5 2026-04-03T10:33:00,896 Using cached https://www.piwheels.org/simple/typing-extensions/typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2026-04-03T10:33:03,885 Installing collected packages: typing-extensions, setuptools, packaging, apache-tvm-ffi 2026-04-03T10:33:08,126 Creating /tmp/pip-build-env-jhndp_za/overlay/local/bin 2026-04-03T10:33:08,129 changing mode of /tmp/pip-build-env-jhndp_za/overlay/local/bin/tvm-ffi-config to 755 2026-04-03T10:33:08,131 changing mode of /tmp/pip-build-env-jhndp_za/overlay/local/bin/tvm-ffi-stubgen to 755 2026-04-03T10:33:08,164 Successfully installed apache-tvm-ffi-0.1.9 packaging-26.0 setuptools-82.0.1 typing-extensions-4.15.0 2026-04-03T10:33:08,466 Installing build dependencies: finished with status 'done' 2026-04-03T10:33:08,473 Getting requirements to build wheel: started 2026-04-03T10:33:08,474 Running command Getting requirements to build wheel 2026-04-03T10:33:14,150 Build metadata file already exists (not in git repo), keeping it 2026-04-03T10:33:14,218 Getting requirements to build wheel: finished with status 'done' 2026-04-03T10:33:14,221 Created temporary directory: /tmp/pip-modern-metadata-ui_nwunz 2026-04-03T10:33:14,224 Preparing metadata (pyproject.toml): started 2026-04-03T10:33:14,225 Running command Preparing metadata (pyproject.toml) 2026-04-03T10:33:21,247 /tmp/pip-build-env-jhndp_za/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-03T10:33:21,247 !! 2026-04-03T10:33:21,248 ******************************************************************************** 2026-04-03T10:33:21,249 Pattern 'LICENSE*.txt' did not match any files. 2026-04-03T10:33:21,250 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-03T10:33:21,250 or your builds will no longer be supported. 2026-04-03T10:33:21,251 ******************************************************************************** 2026-04-03T10:33:21,252 !! 2026-04-03T10:33:21,252 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-03T10:33:21,256 Build metadata file already exists (not in git repo), keeping it 2026-04-03T10:33:21,256 running dist_info 2026-04-03T10:33:21,269 creating /tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info 2026-04-03T10:33:21,270 writing /tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/PKG-INFO 2026-04-03T10:33:21,274 writing dependency_links to /tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/dependency_links.txt 2026-04-03T10:33:21,276 writing entry points to /tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/entry_points.txt 2026-04-03T10:33:21,278 writing requirements to /tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/requires.txt 2026-04-03T10:33:21,279 writing top-level names to /tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/top_level.txt 2026-04-03T10:33:21,280 writing manifest file '/tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/SOURCES.txt' 2026-04-03T10:33:22,123 reading manifest file '/tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/SOURCES.txt' 2026-04-03T10:33:22,125 adding license file 'LICENSE' 2026-04-03T10:33:22,201 writing manifest file '/tmp/pip-modern-metadata-ui_nwunz/flashinfer_python.egg-info/SOURCES.txt' 2026-04-03T10:33:22,205 creating '/tmp/pip-modern-metadata-ui_nwunz/flashinfer_python-0.6.7.post1.dist-info' 2026-04-03T10:33:22,335 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-03T10:33:22,341 Source in /tmp/pip-wheel-ubjctduw/flashinfer-python_b194e0587f4a47829f713906a09be464 has version 0.6.7.post1, which satisfies requirement flashinfer-python==0.6.7.post1 from https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz 2026-04-03T10:33:22,342 Removed flashinfer-python==0.6.7.post1 from https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz from build tracker '/tmp/pip-build-tracker-ni7c15lb' 2026-04-03T10:33:22,349 Created temporary directory: /tmp/pip-unpack-01rlhy_m 2026-04-03T10:33:22,349 Building wheels for collected packages: flashinfer-python 2026-04-03T10:33:22,354 Created temporary directory: /tmp/pip-wheel-6s_9v3jz 2026-04-03T10:33:22,354 Destination directory: /tmp/pip-wheel-6s_9v3jz 2026-04-03T10:33:22,357 Building wheel for flashinfer-python (pyproject.toml): started 2026-04-03T10:33:22,358 Running command Building wheel for flashinfer-python (pyproject.toml) 2026-04-03T10:33:28,532 /tmp/pip-build-env-jhndp_za/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-03T10:33:28,532 !! 2026-04-03T10:33:28,534 ******************************************************************************** 2026-04-03T10:33:28,535 Pattern 'LICENSE*.txt' did not match any files. 2026-04-03T10:33:28,537 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-03T10:33:28,537 or your builds will no longer be supported. 2026-04-03T10:33:28,538 ******************************************************************************** 2026-04-03T10:33:28,539 !! 2026-04-03T10:33:28,539 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-03T10:33:28,540 Build metadata file already exists (not in git repo), keeping it 2026-04-03T10:33:28,540 running bdist_wheel 2026-04-03T10:33:28,559 running build 2026-04-03T10:33:28,560 running build_py 2026-04-03T10:33:28,566 creating build/lib 2026-04-03T10:33:28,569 copying build_backend.py -> build/lib 2026-04-03T10:33:28,572 copying build_utils.py -> build/lib 2026-04-03T10:33:28,576 creating build/lib/flashinfer 2026-04-03T10:33:28,577 copying flashinfer/pod.py -> build/lib/flashinfer 2026-04-03T10:33:28,580 copying flashinfer/sampling.py -> build/lib/flashinfer 2026-04-03T10:33:28,584 copying flashinfer/concat_ops.py -> build/lib/flashinfer 2026-04-03T10:33:28,586 copying flashinfer/tllm_utils.py -> build/lib/flashinfer 2026-04-03T10:33:28,588 copying flashinfer/green_ctx.py -> build/lib/flashinfer 2026-04-03T10:33:28,591 copying flashinfer/autotuner.py -> build/lib/flashinfer 2026-04-03T10:33:28,594 copying flashinfer/gdn_prefill.py -> build/lib/flashinfer 2026-04-03T10:33:28,597 copying flashinfer/page.py -> build/lib/flashinfer 2026-04-03T10:33:28,600 copying flashinfer/cuda_utils.py -> build/lib/flashinfer 2026-04-03T10:33:28,602 copying flashinfer/activation.py -> build/lib/flashinfer 2026-04-03T10:33:28,604 copying flashinfer/attention.py -> build/lib/flashinfer 2026-04-03T10:33:28,607 copying flashinfer/api_logging.py -> build/lib/flashinfer 2026-04-03T10:33:28,611 copying flashinfer/decode.py -> build/lib/flashinfer 2026-04-03T10:33:28,616 copying flashinfer/cascade.py -> build/lib/flashinfer 2026-04-03T10:33:28,619 copying flashinfer/__init__.py -> build/lib/flashinfer 2026-04-03T10:33:28,621 copying flashinfer/fp8_quantization.py -> build/lib/flashinfer 2026-04-03T10:33:28,624 copying flashinfer/mla.py -> build/lib/flashinfer 2026-04-03T10:33:28,626 copying flashinfer/utils.py -> build/lib/flashinfer 2026-04-03T10:33:28,630 copying flashinfer/rope.py -> build/lib/flashinfer 2026-04-03T10:33:28,633 copying flashinfer/gdn_decode.py -> build/lib/flashinfer 2026-04-03T10:33:28,636 copying flashinfer/aot.py -> build/lib/flashinfer 2026-04-03T10:33:28,639 copying flashinfer/deep_gemm.py -> build/lib/flashinfer 2026-04-03T10:33:28,642 copying flashinfer/_build_meta.py -> build/lib/flashinfer 2026-04-03T10:33:28,644 copying flashinfer/trtllm_low_latency_gemm.py -> build/lib/flashinfer 2026-04-03T10:33:28,647 copying flashinfer/tllm_enums.py -> build/lib/flashinfer 2026-04-03T10:33:28,649 copying flashinfer/xqa.py -> build/lib/flashinfer 2026-04-03T10:33:28,652 copying flashinfer/compilation_context.py -> build/lib/flashinfer 2026-04-03T10:33:28,654 copying flashinfer/version.py -> build/lib/flashinfer 2026-04-03T10:33:28,656 copying flashinfer/prefill.py -> build/lib/flashinfer 2026-04-03T10:33:28,661 copying flashinfer/topk.py -> build/lib/flashinfer 2026-04-03T10:33:28,664 copying flashinfer/sparse.py -> build/lib/flashinfer 2026-04-03T10:33:28,667 copying flashinfer/__main__.py -> build/lib/flashinfer 2026-04-03T10:33:28,670 copying flashinfer/fp4_quantization.py -> build/lib/flashinfer 2026-04-03T10:33:28,672 copying flashinfer/artifacts.py -> build/lib/flashinfer 2026-04-03T10:33:28,675 creating build/lib/flashinfer/mamba 2026-04-03T10:33:28,677 copying flashinfer/mamba/ssd_tile_scheduler.py -> build/lib/flashinfer/mamba 2026-04-03T10:33:28,680 copying flashinfer/mamba/ssd_combined.py -> build/lib/flashinfer/mamba 2026-04-03T10:33:28,683 copying flashinfer/mamba/selective_state_update.py -> build/lib/flashinfer/mamba 2026-04-03T10:33:28,685 copying flashinfer/mamba/__init__.py -> build/lib/flashinfer/mamba 2026-04-03T10:33:28,688 copying flashinfer/mamba/ssd_kernel.py -> build/lib/flashinfer/mamba 2026-04-03T10:33:28,693 creating build/lib/flashinfer/quantization 2026-04-03T10:33:28,695 copying flashinfer/quantization/packbits.py -> build/lib/flashinfer/quantization 2026-04-03T10:33:28,697 copying flashinfer/quantization/quantization_cute_dsl_utils.py -> build/lib/flashinfer/quantization 2026-04-03T10:33:28,700 copying flashinfer/quantization/__init__.py -> build/lib/flashinfer/quantization 2026-04-03T10:33:28,703 copying flashinfer/quantization/fp8_quantization.py -> build/lib/flashinfer/quantization 2026-04-03T10:33:28,705 copying flashinfer/quantization/fp4_quantization.py -> build/lib/flashinfer/quantization 2026-04-03T10:33:28,709 creating build/lib/flashinfer/gemm 2026-04-03T10:33:28,711 copying flashinfer/gemm/routergemm.py -> build/lib/flashinfer/gemm 2026-04-03T10:33:28,714 copying flashinfer/gemm/gemm_base.py -> build/lib/flashinfer/gemm 2026-04-03T10:33:28,720 copying flashinfer/gemm/__init__.py -> build/lib/flashinfer/gemm 2026-04-03T10:33:28,722 creating build/lib/flashinfer/tuning_configs 2026-04-03T10:33:28,723 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/lib/flashinfer/tuning_configs 2026-04-03T10:33:28,726 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/lib/flashinfer/tuning_configs 2026-04-03T10:33:28,729 creating build/lib/flashinfer/cudnn 2026-04-03T10:33:28,731 copying flashinfer/cudnn/decode.py -> build/lib/flashinfer/cudnn 2026-04-03T10:33:28,734 copying flashinfer/cudnn/__init__.py -> build/lib/flashinfer/cudnn 2026-04-03T10:33:28,736 copying flashinfer/cudnn/utils.py -> build/lib/flashinfer/cudnn 2026-04-03T10:33:28,738 copying flashinfer/cudnn/prefill.py -> build/lib/flashinfer/cudnn 2026-04-03T10:33:28,742 creating build/lib/flashinfer/comm 2026-04-03T10:33:28,743 copying flashinfer/comm/trtllm_ar.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,747 copying flashinfer/comm/trtllm_mnnvl_ar.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,750 copying flashinfer/comm/trtllm_alltoall.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,752 copying flashinfer/comm/mapping.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,754 copying flashinfer/comm/vllm_ar.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,757 copying flashinfer/comm/trtllm_moe_alltoall.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,760 copying flashinfer/comm/__init__.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,762 copying flashinfer/comm/allreduce.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,764 copying flashinfer/comm/workspace_base.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,767 copying flashinfer/comm/mnnvl.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,770 copying flashinfer/comm/nvshmem_allreduce.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,772 copying flashinfer/comm/nvshmem.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,774 copying flashinfer/comm/dlpack_utils.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,776 copying flashinfer/comm/cuda_ipc.py -> build/lib/flashinfer/comm 2026-04-03T10:33:28,779 creating build/lib/flashinfer/profiler 2026-04-03T10:33:28,780 copying flashinfer/profiler/__init__.py -> build/lib/flashinfer/profiler 2026-04-03T10:33:28,783 creating build/lib/flashinfer/norm 2026-04-03T10:33:28,784 copying flashinfer/norm/__init__.py -> build/lib/flashinfer/norm 2026-04-03T10:33:28,786 copying flashinfer/norm/utils.py -> build/lib/flashinfer/norm 2026-04-03T10:33:28,789 creating build/lib/flashinfer/gdn_kernels 2026-04-03T10:33:28,791 copying flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/lib/flashinfer/gdn_kernels 2026-04-03T10:33:28,794 copying flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-03T10:33:28,797 copying flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-03T10:33:28,799 copying flashinfer/gdn_kernels/__init__.py -> build/lib/flashinfer/gdn_kernels 2026-04-03T10:33:28,801 copying flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/lib/flashinfer/gdn_kernels 2026-04-03T10:33:28,805 creating build/lib/flashinfer/dsv3_ops 2026-04-03T10:33:28,806 copying flashinfer/dsv3_ops/__init__.py -> build/lib/flashinfer/dsv3_ops 2026-04-03T10:33:28,809 creating build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,810 copying flashinfer/logits_processor/fusion_rules.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,812 copying flashinfer/logits_processor/compiler.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,815 copying flashinfer/logits_processor/op.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,816 copying flashinfer/logits_processor/types.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,818 copying flashinfer/logits_processor/__init__.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,820 copying flashinfer/logits_processor/operators.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,823 copying flashinfer/logits_processor/validators.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,825 copying flashinfer/logits_processor/legalization.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,827 copying flashinfer/logits_processor/pipeline.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,829 copying flashinfer/logits_processor/processors.py -> build/lib/flashinfer/logits_processor 2026-04-03T10:33:28,833 creating build/lib/flashinfer/jit 2026-04-03T10:33:28,834 copying flashinfer/jit/spdlog.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,836 copying flashinfer/jit/sampling.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,838 copying flashinfer/jit/comm.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,840 copying flashinfer/jit/quantization.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,842 copying flashinfer/jit/tllm_utils.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,844 copying flashinfer/jit/fused_moe.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,846 copying flashinfer/jit/dsv3_optimizations.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,848 copying flashinfer/jit/page.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,850 copying flashinfer/jit/env.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,853 copying flashinfer/jit/gdn.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,855 copying flashinfer/jit/activation.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,857 copying flashinfer/jit/norm.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,859 copying flashinfer/jit/cascade.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,860 copying flashinfer/jit/__init__.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,863 copying flashinfer/jit/fp8_quantization.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,865 copying flashinfer/jit/moe_utils.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,867 copying flashinfer/jit/fp4_kv_quantization.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,869 copying flashinfer/jit/mla.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,871 copying flashinfer/jit/utils.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,873 copying flashinfer/jit/rope.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,875 copying flashinfer/jit/fp4_kv_dequantization.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,877 copying flashinfer/jit/xqa.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,879 copying flashinfer/jit/topk.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,881 copying flashinfer/jit/cubin_loader.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,884 copying flashinfer/jit/tinygemm2.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,886 copying flashinfer/jit/core.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,889 copying flashinfer/jit/cpp_ext.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,891 copying flashinfer/jit/fp4_quantization.py -> build/lib/flashinfer/jit 2026-04-03T10:33:28,893 creating build/lib/flashinfer/triton 2026-04-03T10:33:28,895 copying flashinfer/triton/page.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,897 copying flashinfer/triton/sm_constraint_gemm.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,899 copying flashinfer/triton/activation.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,901 copying flashinfer/triton/norm.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,903 copying flashinfer/triton/cascade.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,904 copying flashinfer/triton/__init__.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,906 copying flashinfer/triton/utils.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,908 copying flashinfer/triton/gemm.py -> build/lib/flashinfer/triton 2026-04-03T10:33:28,911 creating build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,912 copying flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,915 copying flashinfer/cute_dsl/__init__.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,917 copying flashinfer/cute_dsl/utils.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,920 copying flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,923 copying flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,926 copying flashinfer/cute_dsl/fp4_common.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,929 copying flashinfer/cute_dsl/blockscaled_gemm.py -> build/lib/flashinfer/cute_dsl 2026-04-03T10:33:28,932 creating build/lib/flashinfer/data 2026-04-03T10:33:28,932 copying ./build_utils.py -> build/lib/flashinfer/data 2026-04-03T10:33:28,934 copying ./build_backend.py -> build/lib/flashinfer/data 2026-04-03T10:33:28,937 creating build/lib/flashinfer/fused_moe 2026-04-03T10:33:28,938 copying flashinfer/fused_moe/fused_routing_dsv3.py -> build/lib/flashinfer/fused_moe 2026-04-03T10:33:28,940 copying flashinfer/fused_moe/__init__.py -> build/lib/flashinfer/fused_moe 2026-04-03T10:33:28,942 copying flashinfer/fused_moe/utils.py -> build/lib/flashinfer/fused_moe 2026-04-03T10:33:28,945 copying flashinfer/fused_moe/core.py -> build/lib/flashinfer/fused_moe 2026-04-03T10:33:28,950 creating build/lib/flashinfer/testing 2026-04-03T10:33:28,950 copying flashinfer/testing/__init__.py -> build/lib/flashinfer/testing 2026-04-03T10:33:28,954 copying flashinfer/testing/utils.py -> build/lib/flashinfer/testing 2026-04-03T10:33:28,958 creating build/lib/flashinfer/quantization/kernels 2026-04-03T10:33:28,959 copying flashinfer/quantization/kernels/mxfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-03T10:33:28,962 copying flashinfer/quantization/kernels/mxfp8_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-03T10:33:28,964 copying flashinfer/quantization/kernels/__init__.py -> build/lib/flashinfer/quantization/kernels 2026-04-03T10:33:28,966 creating build/lib/flashinfer/gemm/kernels 2026-04-03T10:33:28,968 copying flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/lib/flashinfer/gemm/kernels 2026-04-03T10:33:28,972 copying flashinfer/gemm/kernels/__init__.py -> build/lib/flashinfer/gemm/kernels 2026-04-03T10:33:28,973 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/lib/flashinfer/gemm/kernels 2026-04-03T10:33:28,977 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/lib/flashinfer/gemm/kernels 2026-04-03T10:33:28,982 creating build/lib/flashinfer/norm/kernels 2026-04-03T10:33:28,983 copying flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-03T10:33:28,987 copying flashinfer/norm/kernels/rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-03T10:33:28,990 copying flashinfer/norm/kernels/__init__.py -> build/lib/flashinfer/norm/kernels 2026-04-03T10:33:28,992 copying flashinfer/norm/kernels/layernorm.py -> build/lib/flashinfer/norm/kernels 2026-04-03T10:33:28,995 creating build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:28,996 copying flashinfer/gdn_kernels/blackwell_prefill/gdn_tile_scheduler.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:28,999 copying flashinfer/gdn_kernels/blackwell_prefill/gdn.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:29,003 copying flashinfer/gdn_kernels/blackwell_prefill/__init__.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:29,005 copying flashinfer/gdn_kernels/blackwell_prefill/gdn_helpers.py -> build/lib/flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:29,008 creating build/lib/flashinfer/jit/attention 2026-04-03T10:33:29,009 copying flashinfer/jit/attention/variants.py -> build/lib/flashinfer/jit/attention 2026-04-03T10:33:29,011 copying flashinfer/jit/attention/modules.py -> build/lib/flashinfer/jit/attention 2026-04-03T10:33:29,014 copying flashinfer/jit/attention/__init__.py -> build/lib/flashinfer/jit/attention 2026-04-03T10:33:29,016 copying flashinfer/jit/attention/utils.py -> build/lib/flashinfer/jit/attention 2026-04-03T10:33:29,019 creating build/lib/flashinfer/jit/mamba 2026-04-03T10:33:29,020 copying flashinfer/jit/mamba/selective_state_update.py -> build/lib/flashinfer/jit/mamba 2026-04-03T10:33:29,023 copying flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/lib/flashinfer/jit/mamba 2026-04-03T10:33:29,024 copying flashinfer/jit/mamba/__init__.py -> build/lib/flashinfer/jit/mamba 2026-04-03T10:33:29,027 creating build/lib/flashinfer/jit/gemm 2026-04-03T10:33:29,028 copying flashinfer/jit/gemm/fp8_blockscale.py -> build/lib/flashinfer/jit/gemm 2026-04-03T10:33:29,030 copying flashinfer/jit/gemm/__init__.py -> build/lib/flashinfer/jit/gemm 2026-04-03T10:33:29,031 copying flashinfer/jit/gemm/deepgemm.py -> build/lib/flashinfer/jit/gemm 2026-04-03T10:33:29,033 copying flashinfer/jit/gemm/core.py -> build/lib/flashinfer/jit/gemm 2026-04-03T10:33:29,036 creating build/lib/flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:29,037 copying flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:29,044 copying flashinfer/jit/attention/fmha_v2/utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:29,047 copying flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:29,050 copying flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:29,053 creating build/lib/flashinfer/jit/gemm/cutlass 2026-04-03T10:33:29,054 copying flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-03T10:33:29,057 copying flashinfer/jit/gemm/cutlass/__init__.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-03T10:33:29,059 copying flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-03T10:33:29,062 creating build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,063 copying flashinfer/triton/kernels/ssd_chunk_state.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,066 copying flashinfer/triton/kernels/sm_constraint_gemm.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,068 copying flashinfer/triton/kernels/activation.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,070 copying flashinfer/triton/kernels/norm.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,072 copying flashinfer/triton/kernels/cascade.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,074 copying flashinfer/triton/kernels/__init__.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,076 copying flashinfer/triton/kernels/quant.py -> build/lib/flashinfer/triton/kernels 2026-04-03T10:33:29,080 creating build/lib/flashinfer/data/cutlass/python 2026-04-03T10:33:29,082 copying 3rdparty/cutlass/python/setup_cutlass.py -> build/lib/flashinfer/data/cutlass/python 2026-04-03T10:33:29,084 copying 3rdparty/cutlass/python/setup_library.py -> build/lib/flashinfer/data/cutlass/python 2026-04-03T10:33:29,085 copying 3rdparty/cutlass/python/setup_pycute.py -> build/lib/flashinfer/data/cutlass/python 2026-04-03T10:33:29,089 creating build/lib/flashinfer/data/cutlass/test/utils 2026-04-03T10:33:29,090 copying 3rdparty/cutlass/test/utils/test_sharding.py -> build/lib/flashinfer/data/cutlass/test/utils 2026-04-03T10:33:29,094 creating build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-03T10:33:29,095 copying 3rdparty/cutlass/test/python/cutlass/installation.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-03T10:33:29,098 creating build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,099 copying 3rdparty/cutlass/test/python/pycute/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,101 copying 3rdparty/cutlass/test/python/pycute/test_coalesce.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,102 copying 3rdparty/cutlass/test/python/pycute/test_complement.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,104 copying 3rdparty/cutlass/test/python/pycute/test_typing.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,105 copying 3rdparty/cutlass/test/python/pycute/test_right_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,107 copying 3rdparty/cutlass/test/python/pycute/test_composition.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,109 copying 3rdparty/cutlass/test/python/pycute/test_int_tuple.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,111 copying 3rdparty/cutlass/test/python/pycute/test_left_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:29,113 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,114 copying 3rdparty/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,116 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,118 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,120 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,122 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,124 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,126 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,128 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,130 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,132 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,134 copying 3rdparty/cutlass/test/python/cutlass/gemm/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,136 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,138 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:29,141 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:29,142 copying 3rdparty/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:29,144 copying 3rdparty/cutlass/test/python/cutlass/interface/evt_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:29,147 copying 3rdparty/cutlass/test/python/cutlass/interface/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:29,148 copying 3rdparty/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:29,151 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:29,152 copying 3rdparty/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:29,154 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:29,156 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:29,158 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:29,161 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,162 copying 3rdparty/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,164 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,166 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,168 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,170 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,172 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:29,175 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-03T10:33:29,176 copying 3rdparty/cutlass/test/python/cutlass/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-03T10:33:29,179 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-03T10:33:29,180 copying 3rdparty/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-03T10:33:29,183 creating build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-03T10:33:29,185 copying 3rdparty/cutlass/test/unit/gemm/device/simt_sm50.py -> build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-03T10:33:29,188 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-03T10:33:29,190 copying 3rdparty/cutlass/test/examples/CuTeDSL/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-03T10:33:29,193 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:29,194 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:29,196 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:29,199 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:29,201 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:29,203 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:29,206 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:29,206 copying 3rdparty/cutlass/python/cutlass_cppgen/shape.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:29,209 copying 3rdparty/cutlass/python/cutlass_cppgen/library_defaults.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:29,211 copying 3rdparty/cutlass/python/cutlass_cppgen/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:29,213 copying 3rdparty/cutlass/python/cutlass_cppgen/swizzle.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:29,216 creating build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,217 copying 3rdparty/cutlass/python/cutlass_library/sm90_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,220 copying 3rdparty/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,222 copying 3rdparty/cutlass/python/cutlass_library/sm100_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,224 copying 3rdparty/cutlass/python/cutlass_library/manifest.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,227 copying 3rdparty/cutlass/python/cutlass_library/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,229 copying 3rdparty/cutlass/python/cutlass_library/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,231 copying 3rdparty/cutlass/python/cutlass_library/sm100_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,234 copying 3rdparty/cutlass/python/cutlass_library/rank_k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,236 copying 3rdparty/cutlass/python/cutlass_library/rank_2k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,238 copying 3rdparty/cutlass/python/cutlass_library/heuristics.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,241 copying 3rdparty/cutlass/python/cutlass_library/generator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,251 copying 3rdparty/cutlass/python/cutlass_library/conv3d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,253 copying 3rdparty/cutlass/python/cutlass_library/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,257 copying 3rdparty/cutlass/python/cutlass_library/conv3x_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,259 copying 3rdparty/cutlass/python/cutlass_library/sm90_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,262 copying 3rdparty/cutlass/python/cutlass_library/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,266 copying 3rdparty/cutlass/python/cutlass_library/trmm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,268 copying 3rdparty/cutlass/python/cutlass_library/heuristics_provider.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,271 copying 3rdparty/cutlass/python/cutlass_library/symm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:29,274 creating build/lib/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:29,275 copying 3rdparty/cutlass/python/pycute/__init__.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:29,277 copying 3rdparty/cutlass/python/pycute/typing.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:29,279 copying 3rdparty/cutlass/python/pycute/int_tuple.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:29,281 copying 3rdparty/cutlass/python/pycute/layout.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:29,283 copying 3rdparty/cutlass/python/pycute/swizzle.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:29,285 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-03T10:33:29,286 copying 3rdparty/cutlass/python/CuTeDSL/prep_editable_install.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-03T10:33:29,289 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:29,290 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/check.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:29,292 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:29,294 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:29,296 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:29,298 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:29,300 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:29,301 copying 3rdparty/cutlass/python/cutlass_cppgen/op/op.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:29,304 copying 3rdparty/cutlass/python/cutlass_cppgen/op/conv.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:29,307 copying 3rdparty/cutlass/python/cutlass_cppgen/op/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:29,309 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:29,311 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:29,314 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:29,315 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:29,317 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:29,319 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:29,322 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,323 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,325 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,327 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,329 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,331 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,333 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,336 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,338 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,340 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,342 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,344 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,348 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,350 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:29,353 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:29,354 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/common.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:29,356 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:29,358 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:29,361 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-03T10:33:29,362 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-03T10:33:29,364 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-03T10:33:29,366 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-03T10:33:29,368 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-03T10:33:29,370 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-03T10:33:29,372 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,373 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,375 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,378 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,380 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,382 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,384 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,386 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,388 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,391 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:29,393 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:29,394 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:29,396 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:29,398 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:29,401 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,402 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,404 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,406 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,408 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,410 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,412 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,413 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,415 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:29,418 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,419 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,421 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,423 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,425 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,427 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,429 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,432 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,434 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,435 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,437 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,439 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,441 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,443 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:29,445 creating build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-03T10:33:29,447 copying 3rdparty/cutlass/python/docs_src/source/conf.py -> build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-03T10:33:29,449 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:29,450 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/torch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:29,453 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:29,454 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:29,457 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,458 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,461 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,464 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,466 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,468 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,470 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:29,473 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,474 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,476 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,478 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,480 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,482 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,484 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:29,487 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,488 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,491 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,493 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,496 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,498 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,501 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,502 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,505 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,508 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,510 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,513 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,515 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,517 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,519 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,521 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,524 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,526 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:29,529 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,530 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,535 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,538 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,540 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,542 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,545 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,548 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,550 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,552 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,555 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,557 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,562 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:29,565 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:29,566 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:29,568 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:29,571 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:29,572 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:29,576 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,577 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,579 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,581 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,583 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,587 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,589 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,592 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,595 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,599 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,601 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,604 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,606 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:29,609 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-03T10:33:29,610 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-03T10:33:29,612 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-03T10:33:29,615 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,616 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,618 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,620 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,622 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,624 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,626 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,628 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:29,631 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:29,632 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:29,635 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:29,637 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:29,639 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,640 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,642 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,644 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,650 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,652 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,654 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,656 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,657 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:29,660 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:29,661 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:29,663 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:29,665 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:29,667 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:29,669 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:29,671 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:29,672 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:29,674 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:29,677 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:29,679 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:29,683 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:29,684 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:29,686 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:29,689 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:29,691 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:29,693 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:29,695 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:29,697 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:29,699 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:29,700 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:29,703 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:29,704 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:29,708 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,709 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,711 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,713 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,715 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,717 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,719 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:29,722 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:29,723 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:29,726 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:29,728 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:29,732 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:29,733 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:29,736 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:29,737 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:29,739 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:29,742 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:29,743 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:29,745 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:29,747 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,748 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,751 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,753 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,755 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,757 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,759 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,761 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:29,763 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:29,764 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:29,767 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:29,769 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:29,771 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:29,775 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:29,776 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:29,778 copying 3rdparty/cutlass/examples/40_cutlass_py/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:29,781 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:29,783 creating build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-03T10:33:29,784 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-03T10:33:29,786 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-03T10:33:29,789 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:29,791 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:29,793 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:29,795 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:29,797 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:29,800 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:29,801 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:29,803 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:29,805 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:29,806 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:29,810 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,811 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,813 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,817 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,819 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,823 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,825 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,828 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:29,832 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,833 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,837 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,840 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,844 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,848 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,851 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,854 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,857 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,861 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,865 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,870 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,877 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,880 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,882 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,885 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,888 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,891 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,894 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:29,898 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-03T10:33:29,899 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-03T10:33:29,901 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-03T10:33:29,904 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-03T10:33:29,905 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-03T10:33:29,909 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,910 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,913 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,915 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,918 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,920 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,922 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,924 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,927 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,929 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,932 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,935 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,937 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,939 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:29,942 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:29,943 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:29,946 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:29,949 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:29,952 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:29,955 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-03T10:33:29,956 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-03T10:33:29,958 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-03T10:33:29,961 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:29,963 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:29,966 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:29,969 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:29,971 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:29,974 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:29,978 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-03T10:33:29,979 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-03T10:33:29,981 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:29,982 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:29,985 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:29,987 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:29,991 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:29,994 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:29,997 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:29,998 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:30,001 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:30,003 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:30,006 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:30,008 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:30,011 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:30,012 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:30,017 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:30,021 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:30,025 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:30,027 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:30,030 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:30,033 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:30,037 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:30,040 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:30,041 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:30,047 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:30,053 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:30,055 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:30,056 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:30,060 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:30,064 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:30,066 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:30,071 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:30,072 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:30,076 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:30,078 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:30,081 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,082 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,084 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,086 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,088 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,090 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,092 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,094 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,095 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:30,098 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-03T10:33:30,099 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-03T10:33:30,101 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-03T10:33:30,102 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-03T10:33:30,104 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-03T10:33:30,107 creating build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,109 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,111 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,113 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,115 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,117 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,119 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,122 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,124 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,126 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,129 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,131 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,133 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:30,136 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:30,137 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:30,139 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:30,142 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:30,145 creating build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-03T10:33:30,147 copying 3rdparty/cutlass/tools/util/scripts/split_test_cmake.py -> build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-03T10:33:30,175 creating build/lib/flashinfer/data/spdlog/scripts 2026-04-03T10:33:30,176 copying 3rdparty/spdlog/scripts/extract_version.py -> build/lib/flashinfer/data/spdlog/scripts 2026-04-03T10:33:30,211 creating build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,212 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,215 copying flashinfer/fused_moe/cute_dsl/tuner.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,217 copying flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,220 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,223 copying flashinfer/fused_moe/cute_dsl/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,225 copying flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:30,228 creating build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:30,229 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:30,235 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:30,239 copying flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:30,241 copying flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:30,243 copying flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:30,789 copying flashinfer/py.typed -> build/lib/flashinfer 2026-04-03T10:33:30,792 creating build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,793 copying ./csrc/single_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,795 copying ./csrc/trtllm_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,798 copying ./csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,800 copying ./csrc/mxfp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,802 copying ./csrc/trtllm_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,805 copying ./csrc/selective_state_update_dtype_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,807 copying ./csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,809 copying ./csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,811 copying ./csrc/flashinfer_rope_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,814 copying ./csrc/batch_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,816 copying ./csrc/fmha_v2_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,818 copying ./csrc/cudnn_sdpa_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,822 copying ./csrc/fmha_v2_run.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,825 copying ./csrc/page.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,827 copying ./csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,829 copying ./csrc/batch_prefill_ragged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,831 copying ./csrc/fp4_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,835 copying ./csrc/batch_decode_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,837 copying ./csrc/batch_decode_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,839 copying ./csrc/mxfp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,842 copying ./csrc/pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,844 copying ./csrc/flashinfer_page_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,846 copying ./csrc/fp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,848 copying ./csrc/gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,850 copying ./csrc/single_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,852 copying ./csrc/gemm_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,855 copying ./csrc/trtllm_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:30,857 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:30,859 copying ./csrc/nv_internal/cpp/common/logger.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:30,862 copying ./csrc/nv_internal/cpp/common/tllmException.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:30,864 copying ./csrc/nv_internal/cpp/common/memoryUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:30,867 copying ./csrc/nv_internal/cpp/common/stringUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:30,870 copying ./csrc/nv_internal/cpp/common/envUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:30,872 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-03T10:33:30,873 copying ./csrc/nv_internal/cpp/kernels/quantization.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-03T10:33:30,876 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,878 copying ./csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,881 copying ./csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,884 copying ./csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,886 copying ./csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,889 copying ./csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,891 copying ./csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,894 copying ./csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,896 copying ./csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,899 copying ./csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:30,901 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,902 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,905 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,908 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,911 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,917 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,921 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,924 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,926 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,929 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,931 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,933 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:30,935 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:30,936 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:30,938 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:30,940 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:30,942 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:30,944 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-03T10:33:30,945 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-03T10:33:30,949 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-03T10:33:30,951 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-03T10:33:30,952 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-03T10:33:30,955 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-03T10:33:30,957 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:30,959 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,961 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,963 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,966 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,968 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,970 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,972 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,974 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,976 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,978 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:30,980 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,981 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,983 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,988 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,991 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,994 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,996 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:30,998 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,001 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,003 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,005 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,007 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,009 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,011 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,014 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,016 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,018 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,021 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:31,024 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:31,026 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:31,027 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:31,028 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:31,031 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-03T10:33:31,032 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-03T10:33:31,035 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-03T10:33:31,038 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:31,041 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:31,043 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:31,047 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:31,049 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:31,052 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,053 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,055 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,057 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,059 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,061 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,063 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,065 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,067 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,069 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,071 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,072 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,074 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,077 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,079 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-03T10:33:31,080 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-03T10:33:31,084 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-03T10:33:31,086 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,087 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,089 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,092 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,094 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,095 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,098 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:31,100 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:31,101 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:31,103 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:31,106 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:31,109 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:31,111 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:31,112 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:31,115 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-03T10:33:31,116 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-03T10:33:31,118 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-03T10:33:31,120 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:31,124 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,125 copying ./csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,127 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,129 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,131 copying ./csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,133 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,136 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,139 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:31,140 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-03T10:33:31,143 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-03T10:33:31,145 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:31,146 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:31,149 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:31,151 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:31,153 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,154 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,156 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,159 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,161 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,163 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,166 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,169 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,171 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,173 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,176 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,178 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,181 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:31,183 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,184 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,187 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,191 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,193 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,195 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,198 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,200 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,202 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,205 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,207 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,210 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,212 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:31,215 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:31,216 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:31,219 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:31,221 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:31,224 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,227 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,230 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,232 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,234 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,236 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,239 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,244 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,246 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,248 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:31,250 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,252 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-03T10:33:31,254 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-03T10:33:31,257 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-03T10:33:31,258 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-03T10:33:31,260 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-03T10:33:31,261 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-03T10:33:31,264 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-03T10:33:31,267 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,270 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-03T10:33:31,272 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-03T10:33:31,275 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,277 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-03T10:33:31,279 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-03T10:33:31,282 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,284 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-03T10:33:31,286 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-03T10:33:31,289 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,291 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:31,292 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:31,294 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:31,297 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:31,299 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:31,302 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:31,304 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,306 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:31,309 creating build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,311 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,314 copying ./csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,316 copying ./csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,318 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,320 copying ./csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,322 copying ./csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,324 copying ./csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,326 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,329 copying ./csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,331 copying ./csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,334 copying ./csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:31,336 copying ./csrc/flashinfer_cascade_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,338 copying ./csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,340 copying ./csrc/flashinfer_mamba_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,342 copying ./csrc/batch_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,344 copying ./csrc/batch_pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,346 copying ./csrc/batch_prefill_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,348 copying ./csrc/batch_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,350 copying ./csrc/batch_attention_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,351 copying ./csrc/batch_decode_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,353 copying ./csrc/trtllm_alltoall_prepare.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,356 copying ./csrc/norm.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,359 copying ./csrc/selective_state_update.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,361 copying ./csrc/quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,363 copying ./csrc/batch_attention_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,365 copying ./csrc/batch_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,368 copying ./csrc/single_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,369 copying ./csrc/batch_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,372 copying ./csrc/group_gemm_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,374 copying ./csrc/flashinfer_quantization_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,376 copying ./csrc/bmm_fp8.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,378 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,379 copying ./csrc/fmha_v2/fmha/paged_kv_cache.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,381 copying ./csrc/fmha_v2/fmha/softmax.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,386 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,389 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,391 copying ./csrc/fmha_v2/fmha/utils.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,394 copying ./csrc/fmha_v2/fmha/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,398 copying ./csrc/fmha_v2/fmha/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,401 copying ./csrc/fmha_v2/fmha/alibi_params.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,403 copying ./csrc/fmha_v2/fmha/smem_tile_v.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,406 copying ./csrc/fmha_v2/fmha/gemm.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,408 copying ./csrc/fmha_v2/fmha/mask.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,411 copying ./csrc/fmha_v2/fmha/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,414 copying ./csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,417 copying ./csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,420 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:31,421 copying ./csrc/fmha_v2/fmha/warpspec/compute.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:31,424 copying ./csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:31,427 copying ./csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:31,429 copying ./csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:31,432 copying ./csrc/fmha_v2/fmha/warpspec/dma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:31,435 copying ./csrc/fmha_v2/fmha/traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,438 copying ./csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,441 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,442 copying ./csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,444 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,447 copying ./csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,449 copying ./csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,452 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,455 copying ./csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,457 copying ./csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,460 copying ./csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,462 copying ./csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,466 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,469 copying ./csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,472 copying ./csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,474 copying ./csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,479 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,482 copying ./csrc/fmha_v2/fmha/hopper/tma_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,484 copying ./csrc/fmha_v2/fmha/hopper/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,486 copying ./csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,489 copying ./csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:31,491 copying ./csrc/fmha_v2/fmha/numeric_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,493 copying ./csrc/fmha_v2/fmha/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,496 copying ./csrc/fmha_v2/fmha/gmem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:31,499 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,502 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,504 creating build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:31,505 copying ./csrc/fmha_v2/templates/kernel_hopper.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:31,508 copying ./csrc/fmha_v2/templates/fa_kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:31,511 copying ./csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:31,513 copying ./csrc/fmha_v2/templates/kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:31,516 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,518 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,520 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,523 copying ./csrc/fmha_v2/fused_multihead_cross_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,525 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,527 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,530 copying ./csrc/fmha_v2/fused_multihead_attention_utils.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,533 copying ./csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,535 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,538 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,540 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,543 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,545 copying ./csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,548 copying ./csrc/fmha_v2/fused_multihead_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,550 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,553 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:31,555 copying ./csrc/fp4_kv_quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,558 copying ./csrc/flashinfer_topk_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,560 copying ./csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,562 copying ./csrc/single_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,564 copying ./csrc/trtllm_batched_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,567 copying ./csrc/blackwell_fmha_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,569 copying ./csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,571 copying ./csrc/batch_pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,573 copying ./csrc/batch_mla_sm90_run.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,576 copying ./csrc/batch_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,578 copying ./csrc/single_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,580 copying ./csrc/logging.cc -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,582 copying ./csrc/trtllm_low_latency_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,585 copying ./csrc/flashinfer_xqa_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,587 copying ./csrc/fp4_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,589 copying ./csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,591 copying ./csrc/cutlass_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,593 copying ./csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,595 copying ./csrc/batch_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,597 copying ./csrc/single_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,599 copying ./csrc/group_gemm_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,601 copying ./csrc/batch_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,603 copying ./csrc/trtllm_moe_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,605 copying ./csrc/seq_chunk_cumsum.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,607 copying ./csrc/tvm_ffi_utils.h -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,609 copying ./csrc/renorm.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,611 copying ./csrc/trtllm_fused_moe_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,614 copying ./csrc/group_gemm_fp8_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,616 copying ./csrc/batch_mla_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,619 copying ./csrc/sampling_utils.h -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,621 copying ./csrc/batch_decode_mla_cute_sm80.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,623 copying ./csrc/nvshmem_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,625 copying ./csrc/runtime_utils.h -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,627 copying ./csrc/seq_chunk_cumsum_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,629 copying ./csrc/single_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,631 copying ./csrc/batch_pod.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,634 copying ./csrc/batch_mla_sm90_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,636 copying ./csrc/cascade.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,638 copying ./csrc/cudnn_sdpa_utils.h -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,640 copying ./csrc/fmhaReduction.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,643 copying ./csrc/moe_utils_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,646 copying ./csrc/fp4_kv_dequantization.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,648 copying ./csrc/group_gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,650 copying ./csrc/batch_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,652 copying ./csrc/batch_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,654 copying ./csrc/fp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,656 copying ./csrc/batch_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,659 copying ./csrc/selective_state_update_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,661 copying ./csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,663 copying ./csrc/bf16_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,665 copying ./csrc/trtllm_moe_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,667 creating build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,668 copying ./csrc/xqa/tma.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,671 copying ./csrc/xqa/barriers.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,674 copying ./csrc/xqa/mha.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,677 copying ./csrc/xqa/utils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,681 copying ./csrc/xqa/mla_sm120.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,686 copying ./csrc/xqa/tensorMap.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,689 copying ./csrc/xqa/specDec.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,691 copying ./csrc/xqa/mha.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,697 copying ./csrc/xqa/platform.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,700 copying ./csrc/xqa/tensorMap.cpp -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,703 copying ./csrc/xqa/gmma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,706 copying ./csrc/xqa/gmma_impl.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,717 copying ./csrc/xqa/mhaUtils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,720 copying ./csrc/xqa/hostUtils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,723 copying ./csrc/xqa/mha_components.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,726 copying ./csrc/xqa/ldgsts.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,728 copying ./csrc/xqa/mha_sm90.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,734 copying ./csrc/xqa/mma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,736 copying ./csrc/xqa/utils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,740 copying ./csrc/xqa/mha_stdheaders.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,744 copying ./csrc/xqa/mla_sm120.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,746 copying ./csrc/xqa/defines.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,750 copying ./csrc/xqa/cuda_hint.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,752 copying ./csrc/xqa/xqa_wrapper.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-03T10:33:31,755 copying ./csrc/bf16_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,758 copying ./csrc/fp4_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,761 copying ./csrc/vllm_custom_all_reduce.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,764 copying ./csrc/batch_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,767 copying ./csrc/single_prefill_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,770 copying ./csrc/group_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,773 copying ./csrc/group_gemm_fp8_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,776 copying ./csrc/single_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,779 copying ./csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,781 copying ./csrc/trtllm_fmha_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,785 copying ./csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,788 copying ./csrc/flashinfer_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,790 copying ./csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,793 copying ./csrc/batch_pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,796 copying ./csrc/batch_attention.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,799 copying ./csrc/batch_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,802 copying ./csrc/trtllm_fused_moe_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,808 copying ./csrc/group_gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,810 copying ./csrc/flashinfer_gemm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,814 copying ./csrc/sampling.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,817 copying ./csrc/single_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,819 copying ./csrc/gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,822 copying ./csrc/batch_attention_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,824 copying ./csrc/trtllm_fmha_v2_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,828 copying ./csrc/selective_state_update_kernel_inst.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,830 copying ./csrc/flashinfer_sampling_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,833 copying ./csrc/rope.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,837 copying ./csrc/pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,839 copying ./csrc/prefill_kernel_delta_rule_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,842 creating build/lib/flashinfer/data/csrc/fused_moe 2026-04-03T10:33:31,843 copying ./csrc/fused_moe/noAuxTcKernels.cu -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-03T10:33:31,846 copying ./csrc/fused_moe/moeTopKFuncs.cuh -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-03T10:33:31,850 creating build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:31,851 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:31,859 copying ./csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:31,863 copying ./csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:31,868 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:31,871 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,873 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/RoutingRenormalizeCommon.cuh -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,877 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramScoresKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,879 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchInitExpertCounts.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,882 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchClusterKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,884 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,887 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchBlockKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,890 copying ./csrc/fused_moe/trtllm_backend/routingRenormalize/launchOffsetsKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:31,892 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,894 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchMainKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,897 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/RoutingDeepSeekCommon.cuh -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,900 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchCoopKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,903 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchInitExpertCounts.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,905 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchClusterKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,908 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchHistogramKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,910 copying ./csrc/fused_moe/trtllm_backend/routingDeepSeek/launchOffsetsKernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:31,913 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_renormalize.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:31,916 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:31,918 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:31,921 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:31,925 copying ./csrc/fp4_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,927 copying ./csrc/trtllm_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,930 copying ./csrc/fp4_gemm_cutlass_sm103.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,933 copying ./csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,935 copying ./csrc/single_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,938 copying ./csrc/concat_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,940 copying ./csrc/fmha_cutlass_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,942 copying ./csrc/flashinfer_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,945 copying ./csrc/flashinfer_norm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,947 copying ./csrc/single_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,949 copying ./csrc/fmha_cutlass_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,952 copying ./csrc/tgv_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,955 copying ./csrc/batch_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,957 copying ./csrc/gemm_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,960 copying ./csrc/pod.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,963 copying ./csrc/pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,965 copying ./csrc/tgv_gemm.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,968 copying ./csrc/fp4_gemm_cutlass_sm103.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,970 copying ./csrc/gdn_prefill_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,972 copying ./csrc/tinygemm2.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,975 copying ./csrc/single_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,978 copying ./csrc/trtllm_mnnvl_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,980 copying ./csrc/topk.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,983 copying ./csrc/batch_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,985 copying ./csrc/dsv3_router_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,987 copying ./csrc/batch_decode_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-03T10:33:31,990 creating build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:31,992 copying ./include/flashinfer/attention/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:31,994 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-03T10:33:31,996 copying ./include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-03T10:33:31,998 copying ./include/flashinfer/attention/blackwell/plan.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-03T10:33:32,001 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,002 copying ./include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,004 copying ./include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,006 copying ./include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,008 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,011 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,014 copying ./include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,016 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,019 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:32,022 copying ./include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-03T10:33:32,024 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-03T10:33:32,025 copying ./include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-03T10:33:32,028 copying ./include/flashinfer/attention/blackwell/device/fmha.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-03T10:33:32,030 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,031 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,034 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,036 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,038 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,041 copying ./include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,044 copying ./include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,046 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,049 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:32,052 copying ./include/flashinfer/attention/persistent.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,055 copying ./include/flashinfer/attention/mask.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,057 copying ./include/flashinfer/attention/default_prefill_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,060 copying ./include/flashinfer/attention/heap.h -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,062 copying ./include/flashinfer/attention/pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,065 copying ./include/flashinfer/attention/cutlass_mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,067 copying ./include/flashinfer/attention/default_decode_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,070 copying ./include/flashinfer/attention/decode.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,076 copying ./include/flashinfer/attention/persistent_template.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,080 copying ./include/flashinfer/attention/batch_pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,083 copying ./include/flashinfer/attention/prefill.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,088 copying ./include/flashinfer/attention/cascade.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,091 copying ./include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,093 copying ./include/flashinfer/attention/scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,097 copying ./include/flashinfer/attention/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,099 copying ./include/flashinfer/attention/mla_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,102 copying ./include/flashinfer/attention/mla_hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,107 copying ./include/flashinfer/attention/state.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,111 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,113 copying ./include/flashinfer/attention/hopper/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,117 copying ./include/flashinfer/attention/hopper/named_barrier.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,122 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,124 copying ./include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,131 copying ./include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,350 copying ./include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,353 copying ./include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,355 copying ./include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,358 copying ./include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:32,361 copying ./include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,364 copying ./include/flashinfer/attention/hopper/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,366 copying ./include/flashinfer/attention/hopper/default_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,368 copying ./include/flashinfer/attention/hopper/attention_updater.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,371 copying ./include/flashinfer/attention/hopper/mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,373 copying ./include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,376 copying ./include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,379 copying ./include/flashinfer/attention/hopper/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,381 copying ./include/flashinfer/attention/hopper/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,383 copying ./include/flashinfer/attention/hopper/utils.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,386 copying ./include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:32,389 copying ./include/flashinfer/attention/hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,391 copying ./include/flashinfer/attention/mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:32,395 creating build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,396 copying ./include/flashinfer/mamba/common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,398 copying ./include/flashinfer/mamba/conversion.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,401 copying ./include/flashinfer/mamba/selective_state_update.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,403 copying ./include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,406 copying ./include/flashinfer/mamba/create_tensor_map.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,408 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,411 copying ./include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:32,415 copying ./include/flashinfer/fp4_layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,417 creating build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,418 copying ./include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,421 copying ./include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,424 copying ./include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,426 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,429 copying ./include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,432 copying ./include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,435 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,437 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,440 copying ./include/flashinfer/gemm/cutlass_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,443 copying ./include/flashinfer/gemm/group_gemv.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,445 copying ./include/flashinfer/gemm/tgv_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,449 copying ./include/flashinfer/gemm/group_gemm_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,452 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,454 copying ./include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,457 copying ./include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,459 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,462 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,465 copying ./include/flashinfer/gemm/tgv_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,467 copying ./include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,470 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,473 copying ./include/flashinfer/gemm/bmm_fp8.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,475 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,478 copying ./include/flashinfer/gemm/group_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,481 copying ./include/flashinfer/gemm/group_gemm_lora.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,483 copying ./include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,485 copying ./include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,487 copying ./include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,489 copying ./include/flashinfer/gemm/tgv_gemm_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,492 copying ./include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,494 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,497 copying ./include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:32,500 copying ./include/flashinfer/fp16.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,503 copying ./include/flashinfer/page.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,506 copying ./include/flashinfer/allocator.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,508 creating build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,509 copying ./include/flashinfer/flat/type_traits.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,511 copying ./include/flashinfer/flat/debug.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,513 copying ./include/flashinfer/flat/cute_ext.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,515 copying ./include/flashinfer/flat/unused.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,517 creating build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-03T10:33:32,519 copying ./include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-03T10:33:32,521 copying ./include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-03T10:33:32,524 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:32,525 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:32,528 copying ./include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:32,530 copying ./include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:32,532 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:32,535 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-03T10:33:32,535 copying ./include/flashinfer/flat/hopper/device/device_universal.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-03T10:33:32,538 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:32,539 copying ./include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:32,542 copying ./include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:32,544 copying ./include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:32,547 copying ./include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:32,549 copying ./include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:32,551 copying ./include/flashinfer/flat/math_order_barrier.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,553 creating build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-03T10:33:32,554 copying ./include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-03T10:33:32,557 copying ./include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-03T10:33:32,560 copying ./include/flashinfer/flat/math.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,562 copying ./include/flashinfer/flat/common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:32,564 copying ./include/flashinfer/layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,566 copying ./include/flashinfer/topk.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,570 copying ./include/flashinfer/permuted_smem.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,573 copying ./include/flashinfer/pos_enc.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,576 copying ./include/flashinfer/math.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,579 copying ./include/flashinfer/profiler.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,582 creating build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,583 copying ./include/flashinfer/comm/trtllm_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,590 copying ./include/flashinfer/comm/trtllm_alltoall.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,594 copying ./include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,598 copying ./include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,601 copying ./include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,605 copying ./include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,610 copying ./include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:32,615 copying ./include/flashinfer/cubin_loader.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,618 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,620 copying ./include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,624 copying ./include/flashinfer/trtllm/fmha/lse.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,627 copying ./include/flashinfer/trtllm/fmha/kernelUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,631 copying ./include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,635 copying ./include/flashinfer/trtllm/fmha/decoder_params.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,638 copying ./include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,641 copying ./include/flashinfer/trtllm/fmha/kernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,646 copying ./include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,650 copying ./include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:32,655 creating build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,658 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,663 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,670 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,674 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,678 creating build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,681 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,684 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,687 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SparsityDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,690 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,692 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaArchDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,695 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,697 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:32,700 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,703 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,706 copying ./include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:32,710 copying ./include/flashinfer/trtllm/common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm 2026-04-03T10:33:32,713 creating build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,714 copying ./include/flashinfer/trtllm/common/cudaUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,717 copying ./include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,720 copying ./include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,723 copying ./include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,725 copying ./include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,728 copying ./include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:32,731 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-03T10:33:32,733 copying ./include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-03T10:33:32,736 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,737 copying ./include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,741 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,744 copying ./include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,747 copying ./include/flashinfer/trtllm/fused_moe/runner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,750 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,753 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,756 copying ./include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:32,759 copying ./include/flashinfer/fastdiv.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,761 copying ./include/flashinfer/exception.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,764 copying ./include/flashinfer/attention_impl.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,766 copying ./include/flashinfer/concat_mla.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,769 copying ./include/flashinfer/cutlass_utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,772 copying ./include/flashinfer/frag_layout_swizzle.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,775 copying ./include/flashinfer/activation.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,778 copying ./include/flashinfer/mma.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,781 copying ./include/flashinfer/cp_async.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,784 copying ./include/flashinfer/air_top_p.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,787 copying ./include/flashinfer/norm.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,791 copying ./include/flashinfer/logging.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,793 copying ./include/flashinfer/utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,797 copying ./include/flashinfer/arch_condition.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,800 copying ./include/flashinfer/sampling.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,803 copying ./include/flashinfer/quantization.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,806 copying ./include/flashinfer/vec_dtypes.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-03T10:33:32,810 creating build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,812 copying 3rdparty/cutlass/include/cute/util/print_tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,815 copying 3rdparty/cutlass/include/cute/util/type_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,818 copying 3rdparty/cutlass/include/cute/util/print_svg.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,821 copying 3rdparty/cutlass/include/cute/util/debug.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,824 copying 3rdparty/cutlass/include/cute/util/print_latex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,827 copying 3rdparty/cutlass/include/cute/util/print.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:32,830 copying 3rdparty/cutlass/include/cute/swizzle_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,833 copying 3rdparty/cutlass/include/cute/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,836 copying 3rdparty/cutlass/include/cute/pointer_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,839 copying 3rdparty/cutlass/include/cute/pointer_base.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,842 copying 3rdparty/cutlass/include/cute/pointer_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,845 copying 3rdparty/cutlass/include/cute/swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,848 copying 3rdparty/cutlass/include/cute/pointer_flagged.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,851 copying 3rdparty/cutlass/include/cute/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,855 copying 3rdparty/cutlass/include/cute/tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,857 copying 3rdparty/cutlass/include/cute/underscore.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:32,860 creating build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,861 copying 3rdparty/cutlass/include/cute/container/tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,865 copying 3rdparty/cutlass/include/cute/container/bit_field.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,867 copying 3rdparty/cutlass/include/cute/container/alignment.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,869 copying 3rdparty/cutlass/include/cute/container/array_aligned.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,872 copying 3rdparty/cutlass/include/cute/container/type_list.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,874 copying 3rdparty/cutlass/include/cute/container/array.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,877 copying 3rdparty/cutlass/include/cute/container/array_subbyte.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,880 copying 3rdparty/cutlass/include/cute/container/cuda_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:32,883 creating build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,884 copying 3rdparty/cutlass/include/cute/atom/partitioner.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,886 copying 3rdparty/cutlass/include/cute/atom/mma_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,889 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,893 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,896 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,898 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,902 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,905 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,914 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,917 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,934 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,939 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,942 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,945 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,948 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,950 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,964 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,975 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,978 copying 3rdparty/cutlass/include/cute/atom/mma_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,981 copying 3rdparty/cutlass/include/cute/atom/copy_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,983 copying 3rdparty/cutlass/include/cute/atom/copy_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,987 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,989 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,991 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,994 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:32,996 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:33,004 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:33,006 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:33,009 creating build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,010 copying 3rdparty/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,012 copying 3rdparty/cutlass/include/cute/arch/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,015 copying 3rdparty/cutlass/include/cute/arch/mma_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,017 copying 3rdparty/cutlass/include/cute/arch/copy_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,024 copying 3rdparty/cutlass/include/cute/arch/copy_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,026 copying 3rdparty/cutlass/include/cute/arch/mma_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,029 copying 3rdparty/cutlass/include/cute/arch/mma_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,032 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,035 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,053 copying 3rdparty/cutlass/include/cute/arch/mma_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,057 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,103 copying 3rdparty/cutlass/include/cute/arch/copy_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,105 copying 3rdparty/cutlass/include/cute/arch/cluster_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,108 copying 3rdparty/cutlass/include/cute/arch/mma_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,110 copying 3rdparty/cutlass/include/cute/arch/mma_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,120 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,123 copying 3rdparty/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,128 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,131 copying 3rdparty/cutlass/include/cute/arch/mma_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,133 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,151 copying 3rdparty/cutlass/include/cute/arch/cluster_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,153 copying 3rdparty/cutlass/include/cute/arch/simd_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,156 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,158 copying 3rdparty/cutlass/include/cute/arch/copy_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,161 copying 3rdparty/cutlass/include/cute/arch/copy_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,163 copying 3rdparty/cutlass/include/cute/arch/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,165 copying 3rdparty/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,168 copying 3rdparty/cutlass/include/cute/arch/util.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,171 copying 3rdparty/cutlass/include/cute/arch/mma_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,172 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,218 copying 3rdparty/cutlass/include/cute/arch/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,220 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:33,223 copying 3rdparty/cutlass/include/cute/tensor_impl.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:33,226 copying 3rdparty/cutlass/include/cute/int_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:33,229 copying 3rdparty/cutlass/include/cute/stride.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:33,232 copying 3rdparty/cutlass/include/cute/tensor_zip.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:33,234 creating build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,235 copying 3rdparty/cutlass/include/cute/algorithm/gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,238 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,241 copying 3rdparty/cutlass/include/cute/algorithm/clear.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,243 copying 3rdparty/cutlass/include/cute/algorithm/prefetch.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,245 copying 3rdparty/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,248 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,251 copying 3rdparty/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,254 copying 3rdparty/cutlass/include/cute/algorithm/fill.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,256 copying 3rdparty/cutlass/include/cute/algorithm/prefer.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,258 copying 3rdparty/cutlass/include/cute/algorithm/axpby.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,261 copying 3rdparty/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,263 copying 3rdparty/cutlass/include/cute/algorithm/functional.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,266 copying 3rdparty/cutlass/include/cute/algorithm/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:33,269 copying 3rdparty/cutlass/include/cute/pointer.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:33,271 creating build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,272 copying 3rdparty/cutlass/include/cute/numeric/complex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,275 copying 3rdparty/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,277 copying 3rdparty/cutlass/include/cute/numeric/integral_constant.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,280 copying 3rdparty/cutlass/include/cute/numeric/integer_sequence.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,282 copying 3rdparty/cutlass/include/cute/numeric/real.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,284 copying 3rdparty/cutlass/include/cute/numeric/numeric_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,287 copying 3rdparty/cutlass/include/cute/numeric/integral_ratio.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,289 copying 3rdparty/cutlass/include/cute/numeric/int.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,292 copying 3rdparty/cutlass/include/cute/numeric/math.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:33,294 copying 3rdparty/cutlass/include/cute/layout_composed.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:33,297 creating build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,298 copying 3rdparty/cutlass/include/cutlass/predicate_vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,301 copying 3rdparty/cutlass/include/cutlass/fast_math.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,304 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-03T10:33:33,306 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-03T10:33:33,309 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-03T10:33:33,312 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:33,313 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:33,315 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:33,317 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:33,320 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:33,321 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:33,323 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:33,326 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:33,329 copying 3rdparty/cutlass/include/cutlass/tensor_ref.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,331 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,334 copying 3rdparty/cutlass/include/cutlass/bfloat16.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,336 copying 3rdparty/cutlass/include/cutlass/gemm_coord.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,338 copying 3rdparty/cutlass/include/cutlass/blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:33,341 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,343 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,345 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,347 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,349 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,352 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,356 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,358 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,362 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,365 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,368 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,370 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,374 copying 3rdparty/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,377 copying 3rdparty/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,380 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,383 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,386 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,389 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,392 copying 3rdparty/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,395 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,398 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,402 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,406 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,409 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,413 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,416 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,421 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,428 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,431 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,434 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,437 copying 3rdparty/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,440 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,443 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,446 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,450 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,452 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:33,455 copying 3rdparty/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:33,458 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,460 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,462 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,465 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,468 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,472 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,474 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,478 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,481 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,483 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,487 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,489 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,492 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,495 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,497 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,500 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,503 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,505 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,507 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,510 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,512 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,515 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,517 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,520 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,523 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,526 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,529 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,532 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,534 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,537 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,539 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,542 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,544 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,547 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,549 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,553 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,556 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,558 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,561 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,563 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,566 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,568 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,572 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,574 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,576 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:33,579 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,580 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,583 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,585 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,588 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,591 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,594 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,596 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,599 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,602 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,604 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,607 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,610 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,612 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,615 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,618 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,621 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,624 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,627 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,630 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,633 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,636 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,639 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,642 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,644 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,649 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,652 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,655 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,658 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,660 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,663 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,666 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,669 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,671 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,674 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,677 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,680 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,682 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,685 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,688 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,692 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,695 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,697 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,700 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,703 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,705 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,709 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,711 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,714 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,717 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,719 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,722 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,725 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,728 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,731 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,733 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,735 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,738 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,741 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,744 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,746 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,749 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,752 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,754 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,757 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,759 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,762 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,765 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,768 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,770 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,773 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,776 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,779 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,781 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,784 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,787 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,790 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,794 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,796 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,799 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,802 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,805 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,808 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,811 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,814 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,816 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,818 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,821 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,824 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,826 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,829 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,832 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,835 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,838 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,840 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,843 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,845 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,848 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,850 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,853 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,856 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,858 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,861 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,864 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,866 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,869 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,871 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,873 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,876 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,879 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,882 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,886 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,888 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,892 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,894 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,897 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:33,899 copying 3rdparty/cutlass/include/cutlass/gemm/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:33,902 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,903 copying 3rdparty/cutlass/include/cutlass/gemm/device/trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,905 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,908 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,910 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,913 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,915 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,917 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,920 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,923 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,925 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,928 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,931 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,933 copying 3rdparty/cutlass/include/cutlass/gemm/device/symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,936 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,938 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,941 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,943 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,945 copying 3rdparty/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,948 copying 3rdparty/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,951 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,953 copying 3rdparty/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,956 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,958 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,961 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,963 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,966 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,969 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,971 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,974 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,976 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:33,979 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:33,979 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:33,983 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:33,985 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:33,988 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:33,992 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:33,995 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:33,995 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:33,998 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,001 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,003 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,008 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,010 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,013 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,015 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,017 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,020 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,023 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,025 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,028 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,031 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,034 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,037 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,039 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,042 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,044 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,047 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,050 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,052 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,056 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,058 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,061 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,063 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,066 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,069 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,071 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:34,074 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,077 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,080 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,084 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,088 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,091 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,095 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,098 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,102 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,106 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,109 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,113 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,117 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,121 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,123 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,125 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,129 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,132 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,135 copying 3rdparty/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,137 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,140 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,143 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,147 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,150 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,153 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,157 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,160 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,163 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,166 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,170 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,174 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,177 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,180 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,183 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,186 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,189 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,192 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,195 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,199 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,203 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,207 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,210 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,214 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,218 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,220 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,224 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,228 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,232 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:34,235 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:34,236 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:34,239 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:34,242 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:34,245 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:34,248 copying 3rdparty/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:34,253 copying 3rdparty/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:34,255 copying 3rdparty/cutlass/include/cutlass/device_kernel.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,258 copying 3rdparty/cutlass/include/cutlass/numeric_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,260 copying 3rdparty/cutlass/include/cutlass/quaternion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,263 copying 3rdparty/cutlass/include/cutlass/relatively_equal.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,266 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,268 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,271 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,275 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,278 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,280 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,283 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,286 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,289 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,291 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,294 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,297 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,300 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,304 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,307 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,310 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:34,313 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,314 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,317 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,320 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,324 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,327 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,330 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,333 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,336 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,339 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,342 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,345 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,348 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,352 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,355 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,359 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,362 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,365 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,369 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,372 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,375 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,379 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,382 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,385 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,388 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,391 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,394 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,396 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,399 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,402 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,405 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,407 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,409 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,412 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,415 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,417 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,421 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,424 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,426 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:34,427 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:34,430 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:34,433 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:34,436 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:34,439 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:34,442 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,445 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,448 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,450 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,454 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,456 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,459 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,461 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,465 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,468 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,470 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,473 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:34,476 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,477 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,481 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,483 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,484 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,487 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,490 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,493 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,496 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,499 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:34,502 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,505 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,509 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,512 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,515 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,517 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,520 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,523 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,526 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,529 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,531 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,533 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,536 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,539 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,542 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,544 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:34,547 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,548 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,551 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,555 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,557 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,559 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,562 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,564 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,566 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,568 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,571 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,573 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,575 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,578 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,580 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,582 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,584 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,587 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,590 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,593 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,595 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,598 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,601 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,603 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,606 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,608 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/activation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:34,612 copying 3rdparty/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-03T10:33:34,615 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,616 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,619 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,622 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,626 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,629 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,632 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,636 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,640 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,644 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,648 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,651 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,654 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,657 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:34,661 copying 3rdparty/cutlass/include/cutlass/array_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,664 copying 3rdparty/cutlass/include/cutlass/exmy_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,667 copying 3rdparty/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,670 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,671 copying 3rdparty/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,673 copying 3rdparty/cutlass/include/cutlass/detail/helper_macros.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,676 copying 3rdparty/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,679 copying 3rdparty/cutlass/include/cutlass/detail/dependent_false.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,681 copying 3rdparty/cutlass/include/cutlass/detail/collective.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,683 copying 3rdparty/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,686 copying 3rdparty/cutlass/include/cutlass/detail/cluster.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,688 copying 3rdparty/cutlass/include/cutlass/detail/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,691 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:34,692 copying 3rdparty/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:34,695 copying 3rdparty/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:34,698 copying 3rdparty/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:34,701 copying 3rdparty/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,703 copying 3rdparty/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,705 copying 3rdparty/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,708 copying 3rdparty/cutlass/include/cutlass/detail/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:34,710 copying 3rdparty/cutlass/include/cutlass/floating_point_nvrtc.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,712 copying 3rdparty/cutlass/include/cutlass/array_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,714 creating build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:34,715 copying 3rdparty/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:34,718 copying 3rdparty/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:34,722 copying 3rdparty/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:34,724 copying 3rdparty/cutlass/include/cutlass/numeric_conversion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,730 copying 3rdparty/cutlass/include/cutlass/real.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,732 copying 3rdparty/cutlass/include/cutlass/float8.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,735 copying 3rdparty/cutlass/include/cutlass/coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,738 copying 3rdparty/cutlass/include/cutlass/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,746 copying 3rdparty/cutlass/include/cutlass/uint128.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,749 copying 3rdparty/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,751 copying 3rdparty/cutlass/include/cutlass/functional.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,754 copying 3rdparty/cutlass/include/cutlass/block_striped.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,757 copying 3rdparty/cutlass/include/cutlass/trace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,759 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-03T10:33:34,760 copying 3rdparty/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-03T10:33:34,763 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,764 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,767 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,769 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,772 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,775 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,777 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,780 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,782 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,785 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,787 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,790 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,793 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,797 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,799 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,802 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,804 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,808 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,810 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,813 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,816 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,819 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,822 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,824 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,828 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,831 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:34,833 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:34,834 copying 3rdparty/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:34,836 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:34,839 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:34,841 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-03T10:33:34,842 copying 3rdparty/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-03T10:33:34,845 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-03T10:33:34,846 copying 3rdparty/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-03T10:33:34,849 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-03T10:33:34,850 copying 3rdparty/cutlass/include/cutlass/transform/thread/unary_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-03T10:33:34,852 copying 3rdparty/cutlass/include/cutlass/transform/thread/transpose.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-03T10:33:34,854 copying 3rdparty/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform 2026-04-03T10:33:34,857 copying 3rdparty/cutlass/include/cutlass/blas3_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,859 copying 3rdparty/cutlass/include/cutlass/matrix_shape.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,861 copying 3rdparty/cutlass/include/cutlass/kernel_launch.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,863 creating build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-03T10:33:34,864 copying 3rdparty/cutlass/include/cutlass/thread/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-03T10:33:34,866 copying 3rdparty/cutlass/include/cutlass/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,869 copying 3rdparty/cutlass/include/cutlass/aligned_buffer.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,871 copying 3rdparty/cutlass/include/cutlass/half.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,874 copying 3rdparty/cutlass/include/cutlass/cluster_launch.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,876 creating build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,877 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm100.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,879 copying 3rdparty/cutlass/include/cutlass/arch/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,882 copying 3rdparty/cutlass/include/cutlass/arch/cache_operation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,884 copying 3rdparty/cutlass/include/cutlass/arch/memory.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,886 copying 3rdparty/cutlass/include/cutlass/arch/config.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,889 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,891 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,894 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,897 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,899 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,901 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,904 copying 3rdparty/cutlass/include/cutlass/arch/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,907 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,910 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,912 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,915 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm72.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,917 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,920 copying 3rdparty/cutlass/include/cutlass/arch/reg_reconfig.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,922 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,924 copying 3rdparty/cutlass/include/cutlass/arch/arch.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,926 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm90.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,929 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,931 copying 3rdparty/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,934 copying 3rdparty/cutlass/include/cutlass/arch/wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,936 copying 3rdparty/cutlass/include/cutlass/arch/simd.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,938 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,941 copying 3rdparty/cutlass/include/cutlass/arch/synclog.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,945 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,947 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:34,950 copying 3rdparty/cutlass/include/cutlass/uint256.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,952 copying 3rdparty/cutlass/include/cutlass/integer_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,955 copying 3rdparty/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,957 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:34,959 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:34,962 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:34,965 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:34,967 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:34,970 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:34,971 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:34,974 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:34,976 copying 3rdparty/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:34,979 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:34,981 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-03T10:33:34,982 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-03T10:33:34,985 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-03T10:33:34,987 copying 3rdparty/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-03T10:33:34,989 copying 3rdparty/cutlass/include/cutlass/subbyte_reference.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,992 copying 3rdparty/cutlass/include/cutlass/core_io.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,995 copying 3rdparty/cutlass/include/cutlass/float_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,997 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:34,999 copying 3rdparty/cutlass/include/cutlass/tensor_view.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,002 copying 3rdparty/cutlass/include/cutlass/semaphore.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,004 copying 3rdparty/cutlass/include/cutlass/constants.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,007 copying 3rdparty/cutlass/include/cutlass/wmma_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,009 creating build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,010 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,013 copying 3rdparty/cutlass/include/cutlass/layout/layout.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,015 copying 3rdparty/cutlass/include/cutlass/layout/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,017 copying 3rdparty/cutlass/include/cutlass/layout/permute.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,020 copying 3rdparty/cutlass/include/cutlass/layout/vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,022 copying 3rdparty/cutlass/include/cutlass/layout/pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,024 copying 3rdparty/cutlass/include/cutlass/layout/tensor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,027 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,029 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:35,032 copying 3rdparty/cutlass/include/cutlass/numeric_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,035 copying 3rdparty/cutlass/include/cutlass/workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,037 copying 3rdparty/cutlass/include/cutlass/matrix_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,039 copying 3rdparty/cutlass/include/cutlass/array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,042 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:35,043 copying 3rdparty/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:35,046 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:35,048 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:35,051 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,052 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,054 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,056 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,059 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,061 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,064 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,066 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,069 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,071 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,073 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,076 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,078 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,081 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,083 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,086 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,089 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,091 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,094 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,096 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,100 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,103 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,107 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,110 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,113 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,116 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,120 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,123 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,127 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,130 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,133 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,136 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,140 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,143 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,147 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,150 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,154 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,157 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,160 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,163 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,167 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,170 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,173 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,177 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,181 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,184 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,187 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:35,191 copying 3rdparty/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:35,194 copying 3rdparty/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:35,197 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,198 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,201 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,204 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,207 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,210 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,213 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,216 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,219 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,221 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,224 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,227 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,229 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,233 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,235 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,238 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,240 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,243 copying 3rdparty/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,246 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,249 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,252 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,255 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,257 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,260 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,262 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,265 copying 3rdparty/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,267 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,270 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,273 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,275 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:35,277 copying 3rdparty/cutlass/include/cutlass/conv/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:35,280 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:35,281 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:35,283 copying 3rdparty/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:35,285 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:35,288 copying 3rdparty/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:35,290 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:35,291 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:35,294 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:35,295 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:35,297 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:35,300 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:35,302 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:35,304 copying 3rdparty/cutlass/include/cutlass/conv/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:35,307 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:35,310 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:35,312 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:35,313 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-03T10:33:35,314 copying 3rdparty/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-03T10:33:35,317 copying 3rdparty/cutlass/include/cutlass/conv/convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:35,319 copying 3rdparty/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:35,322 copying 3rdparty/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:35,324 copying 3rdparty/cutlass/include/cutlass/tensor_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,326 copying 3rdparty/cutlass/include/cutlass/gemm_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,329 copying 3rdparty/cutlass/include/cutlass/version.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,331 copying 3rdparty/cutlass/include/cutlass/tfloat32.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,333 creating build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-03T10:33:35,334 copying 3rdparty/cutlass/include/cutlass/platform/platform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-03T10:33:35,337 copying 3rdparty/cutlass/include/cutlass/pitch_linear_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,340 copying 3rdparty/cutlass/include/cutlass/complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,342 copying 3rdparty/cutlass/include/cutlass/cutlass.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:35,344 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,346 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,349 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,351 copying 3rdparty/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,354 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,356 copying 3rdparty/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,359 copying 3rdparty/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,361 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,363 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,366 copying 3rdparty/cutlass/tools/util/include/cutlass/util/command_line.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,368 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,370 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-03T10:33:35,372 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-03T10:33:35,374 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-03T10:33:35,376 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,377 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,380 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,382 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,384 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,387 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,389 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,392 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,395 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,397 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,400 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,402 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,405 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,407 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,410 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,413 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,416 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,419 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,421 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,423 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,425 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,428 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,430 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,433 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,435 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:35,438 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,439 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,441 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,444 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,447 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,449 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,452 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:35,453 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:35,455 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:35,458 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:35,460 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,463 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,465 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-03T10:33:35,466 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-03T10:33:35,469 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,472 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,475 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,477 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:35,479 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,482 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,485 copying 3rdparty/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,487 copying 3rdparty/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,490 copying 3rdparty/cutlass/tools/util/include/cutlass/util/debug.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,492 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,495 copying 3rdparty/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,498 copying 3rdparty/cutlass/tools/util/include/cutlass/util/distribution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,500 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,502 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,505 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,508 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,511 copying 3rdparty/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,513 copying 3rdparty/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,516 copying 3rdparty/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,518 copying 3rdparty/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,520 copying 3rdparty/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,523 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:35,526 creating build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,527 copying 3rdparty/spdlog/include/spdlog/common-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,530 copying 3rdparty/spdlog/include/spdlog/spdlog.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,532 copying 3rdparty/spdlog/include/spdlog/common.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,535 copying 3rdparty/spdlog/include/spdlog/stopwatch.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,537 creating build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,538 copying 3rdparty/spdlog/include/spdlog/details/tcp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,540 copying 3rdparty/spdlog/include/spdlog/details/thread_pool.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,542 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,544 copying 3rdparty/spdlog/include/spdlog/details/file_helper-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,547 copying 3rdparty/spdlog/include/spdlog/details/file_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,549 copying 3rdparty/spdlog/include/spdlog/details/udp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,551 copying 3rdparty/spdlog/include/spdlog/details/log_msg-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,553 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,555 copying 3rdparty/spdlog/include/spdlog/details/os-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,559 copying 3rdparty/spdlog/include/spdlog/details/fmt_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,562 copying 3rdparty/spdlog/include/spdlog/details/circular_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,564 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,566 copying 3rdparty/spdlog/include/spdlog/details/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,569 copying 3rdparty/spdlog/include/spdlog/details/log_msg.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,570 copying 3rdparty/spdlog/include/spdlog/details/windows_include.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,572 copying 3rdparty/spdlog/include/spdlog/details/registry.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,575 copying 3rdparty/spdlog/include/spdlog/details/synchronous_factory.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,577 copying 3rdparty/spdlog/include/spdlog/details/backtracer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,579 copying 3rdparty/spdlog/include/spdlog/details/console_globals.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,580 copying 3rdparty/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,583 copying 3rdparty/spdlog/include/spdlog/details/backtracer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,585 copying 3rdparty/spdlog/include/spdlog/details/udp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,587 copying 3rdparty/spdlog/include/spdlog/details/registry-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,589 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,591 copying 3rdparty/spdlog/include/spdlog/details/tcp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,594 copying 3rdparty/spdlog/include/spdlog/details/null_mutex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,596 copying 3rdparty/spdlog/include/spdlog/details/thread_pool-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:35,598 copying 3rdparty/spdlog/include/spdlog/tweakme.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,601 copying 3rdparty/spdlog/include/spdlog/async.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,603 copying 3rdparty/spdlog/include/spdlog/async_logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,605 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,606 copying 3rdparty/spdlog/include/spdlog/fmt/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,608 copying 3rdparty/spdlog/include/spdlog/fmt/ostr.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,610 copying 3rdparty/spdlog/include/spdlog/fmt/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,612 copying 3rdparty/spdlog/include/spdlog/fmt/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,615 copying 3rdparty/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,617 copying 3rdparty/spdlog/include/spdlog/fmt/fmt.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,620 copying 3rdparty/spdlog/include/spdlog/fmt/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,622 copying 3rdparty/spdlog/include/spdlog/fmt/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:35,624 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,626 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/args.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,629 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,632 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/printf.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,635 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,638 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,641 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/locale.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,643 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,646 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,650 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/core.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,654 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,657 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/color.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,660 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,665 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,667 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,671 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:35,674 creating build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:35,675 copying 3rdparty/spdlog/include/spdlog/cfg/helpers.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:35,677 copying 3rdparty/spdlog/include/spdlog/cfg/helpers-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:35,679 copying 3rdparty/spdlog/include/spdlog/cfg/env.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:35,681 copying 3rdparty/spdlog/include/spdlog/cfg/argv.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:35,683 copying 3rdparty/spdlog/include/spdlog/pattern_formatter-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,687 copying 3rdparty/spdlog/include/spdlog/spdlog-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,690 copying 3rdparty/spdlog/include/spdlog/mdc.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,692 copying 3rdparty/spdlog/include/spdlog/logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,694 copying 3rdparty/spdlog/include/spdlog/logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,697 creating build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,697 copying 3rdparty/spdlog/include/spdlog/sinks/callback_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,700 copying 3rdparty/spdlog/include/spdlog/sinks/dist_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,702 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,704 copying 3rdparty/spdlog/include/spdlog/sinks/systemd_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,706 copying 3rdparty/spdlog/include/spdlog/sinks/kafka_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,709 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,711 copying 3rdparty/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,713 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,715 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,717 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,718 copying 3rdparty/spdlog/include/spdlog/sinks/udp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,720 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,723 copying 3rdparty/spdlog/include/spdlog/sinks/msvc_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,724 copying 3rdparty/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,727 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,729 copying 3rdparty/spdlog/include/spdlog/sinks/qt_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,732 copying 3rdparty/spdlog/include/spdlog/sinks/sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,734 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,736 copying 3rdparty/spdlog/include/spdlog/sinks/null_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,738 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,740 copying 3rdparty/spdlog/include/spdlog/sinks/android_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,742 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,744 copying 3rdparty/spdlog/include/spdlog/sinks/syslog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,746 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,749 copying 3rdparty/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,751 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,754 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,756 copying 3rdparty/spdlog/include/spdlog/sinks/sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,759 copying 3rdparty/spdlog/include/spdlog/sinks/ostream_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,760 copying 3rdparty/spdlog/include/spdlog/sinks/mongo_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,763 copying 3rdparty/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,766 copying 3rdparty/spdlog/include/spdlog/sinks/tcp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,768 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,771 copying 3rdparty/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:35,773 copying 3rdparty/spdlog/include/spdlog/async_logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,775 copying 3rdparty/spdlog/include/spdlog/pattern_formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,777 copying 3rdparty/spdlog/include/spdlog/version.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,779 copying 3rdparty/spdlog/include/spdlog/fwd.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,781 copying 3rdparty/spdlog/include/spdlog/formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:35,914 installing to build/bdist.linux-armv7l/wheel 2026-04-03T10:33:35,915 running install 2026-04-03T10:33:35,954 running install_lib 2026-04-03T10:33:35,963 creating build/bdist.linux-armv7l/wheel 2026-04-03T10:33:35,965 copying build/lib/build_utils.py -> build/bdist.linux-armv7l/wheel/. 2026-04-03T10:33:35,968 copying build/lib/build_backend.py -> build/bdist.linux-armv7l/wheel/. 2026-04-03T10:33:35,972 creating build/bdist.linux-armv7l/wheel/flashinfer 2026-04-03T10:33:35,974 creating build/bdist.linux-armv7l/wheel/flashinfer/mamba 2026-04-03T10:33:35,976 copying build/lib/flashinfer/mamba/ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-03T10:33:35,979 copying build/lib/flashinfer/mamba/ssd_combined.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-03T10:33:35,982 copying build/lib/flashinfer/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-03T10:33:35,985 copying build/lib/flashinfer/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-03T10:33:35,987 copying build/lib/flashinfer/mamba/ssd_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-03T10:33:35,995 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization 2026-04-03T10:33:35,997 copying build/lib/flashinfer/quantization/packbits.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-03T10:33:36,000 copying build/lib/flashinfer/quantization/quantization_cute_dsl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-03T10:33:36,003 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization/kernels 2026-04-03T10:33:36,004 copying build/lib/flashinfer/quantization/kernels/mxfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-03T10:33:36,007 copying build/lib/flashinfer/quantization/kernels/mxfp8_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-03T10:33:36,010 copying build/lib/flashinfer/quantization/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-03T10:33:36,012 copying build/lib/flashinfer/quantization/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-03T10:33:36,014 copying build/lib/flashinfer/quantization/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-03T10:33:36,016 copying build/lib/flashinfer/quantization/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-03T10:33:36,020 copying build/lib/flashinfer/pod.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,024 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm 2026-04-03T10:33:36,025 copying build/lib/flashinfer/gemm/routergemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-03T10:33:36,029 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm/kernels 2026-04-03T10:33:36,030 copying build/lib/flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-03T10:33:36,035 copying build/lib/flashinfer/gemm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-03T10:33:36,037 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-03T10:33:36,041 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-03T10:33:36,045 copying build/lib/flashinfer/gemm/gemm_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-03T10:33:36,053 copying build/lib/flashinfer/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-03T10:33:36,056 creating build/bdist.linux-armv7l/wheel/flashinfer/tuning_configs 2026-04-03T10:33:36,058 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-03T10:33:36,060 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-03T10:33:36,063 copying build/lib/flashinfer/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,067 copying build/lib/flashinfer/concat_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,070 creating build/bdist.linux-armv7l/wheel/flashinfer/cudnn 2026-04-03T10:33:36,071 copying build/lib/flashinfer/cudnn/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-03T10:33:36,074 copying build/lib/flashinfer/cudnn/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-03T10:33:36,076 copying build/lib/flashinfer/cudnn/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-03T10:33:36,078 copying build/lib/flashinfer/cudnn/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-03T10:33:36,081 copying build/lib/flashinfer/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,083 copying build/lib/flashinfer/green_ctx.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,085 copying build/lib/flashinfer/autotuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,088 copying build/lib/flashinfer/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,091 copying build/lib/flashinfer/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,093 copying build/lib/flashinfer/cuda_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,096 creating build/bdist.linux-armv7l/wheel/flashinfer/comm 2026-04-03T10:33:36,097 copying build/lib/flashinfer/comm/trtllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,100 copying build/lib/flashinfer/comm/trtllm_mnnvl_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,104 copying build/lib/flashinfer/comm/trtllm_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,107 copying build/lib/flashinfer/comm/mapping.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,109 copying build/lib/flashinfer/comm/vllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,112 copying build/lib/flashinfer/comm/trtllm_moe_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,115 copying build/lib/flashinfer/comm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,117 copying build/lib/flashinfer/comm/allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,120 copying build/lib/flashinfer/comm/workspace_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,122 copying build/lib/flashinfer/comm/mnnvl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,126 copying build/lib/flashinfer/comm/nvshmem_allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,128 copying build/lib/flashinfer/comm/nvshmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,130 copying build/lib/flashinfer/comm/dlpack_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,133 copying build/lib/flashinfer/comm/cuda_ipc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-03T10:33:36,135 creating build/bdist.linux-armv7l/wheel/flashinfer/profiler 2026-04-03T10:33:36,136 copying build/lib/flashinfer/profiler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/profiler 2026-04-03T10:33:36,139 creating build/bdist.linux-armv7l/wheel/flashinfer/norm 2026-04-03T10:33:36,141 creating build/bdist.linux-armv7l/wheel/flashinfer/norm/kernels 2026-04-03T10:33:36,142 copying build/lib/flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-03T10:33:36,145 copying build/lib/flashinfer/norm/kernels/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-03T10:33:36,148 copying build/lib/flashinfer/norm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-03T10:33:36,150 copying build/lib/flashinfer/norm/kernels/layernorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-03T10:33:36,153 copying build/lib/flashinfer/norm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-03T10:33:36,155 copying build/lib/flashinfer/norm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-03T10:33:36,158 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels 2026-04-03T10:33:36,159 copying build/lib/flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-03T10:33:36,163 copying build/lib/flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-03T10:33:36,166 copying build/lib/flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-03T10:33:36,170 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:36,171 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/gdn_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:36,173 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:36,186 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:36,188 copying build/lib/flashinfer/gdn_kernels/blackwell_prefill/gdn_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell_prefill 2026-04-03T10:33:36,190 copying build/lib/flashinfer/gdn_kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-03T10:33:36,192 copying build/lib/flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-03T10:33:36,197 creating build/bdist.linux-armv7l/wheel/flashinfer/dsv3_ops 2026-04-03T10:33:36,198 copying build/lib/flashinfer/dsv3_ops/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/dsv3_ops 2026-04-03T10:33:36,200 copying build/lib/flashinfer/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,202 copying build/lib/flashinfer/attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,205 creating build/bdist.linux-armv7l/wheel/flashinfer/logits_processor 2026-04-03T10:33:36,207 copying build/lib/flashinfer/logits_processor/fusion_rules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,209 copying build/lib/flashinfer/logits_processor/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,211 copying build/lib/flashinfer/logits_processor/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,213 copying build/lib/flashinfer/logits_processor/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,215 copying build/lib/flashinfer/logits_processor/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,217 copying build/lib/flashinfer/logits_processor/operators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,220 copying build/lib/flashinfer/logits_processor/validators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,222 copying build/lib/flashinfer/logits_processor/legalization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,224 copying build/lib/flashinfer/logits_processor/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,226 copying build/lib/flashinfer/logits_processor/processors.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-03T10:33:36,229 copying build/lib/flashinfer/api_logging.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,233 creating build/bdist.linux-armv7l/wheel/flashinfer/jit 2026-04-03T10:33:36,235 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention 2026-04-03T10:33:36,236 copying build/lib/flashinfer/jit/attention/variants.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-03T10:33:36,239 copying build/lib/flashinfer/jit/attention/modules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-03T10:33:36,243 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:36,244 copying build/lib/flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:36,252 copying build/lib/flashinfer/jit/attention/fmha_v2/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:36,255 copying build/lib/flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:36,258 copying build/lib/flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-03T10:33:36,261 copying build/lib/flashinfer/jit/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-03T10:33:36,263 copying build/lib/flashinfer/jit/attention/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-03T10:33:36,266 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/mamba 2026-04-03T10:33:36,267 copying build/lib/flashinfer/jit/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-03T10:33:36,269 copying build/lib/flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-03T10:33:36,271 copying build/lib/flashinfer/jit/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-03T10:33:36,273 copying build/lib/flashinfer/jit/spdlog.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,276 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm 2026-04-03T10:33:36,278 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm/cutlass 2026-04-03T10:33:36,279 copying build/lib/flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-03T10:33:36,282 copying build/lib/flashinfer/jit/gemm/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-03T10:33:36,284 copying build/lib/flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-03T10:33:36,287 copying build/lib/flashinfer/jit/gemm/fp8_blockscale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-03T10:33:36,289 copying build/lib/flashinfer/jit/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-03T10:33:36,291 copying build/lib/flashinfer/jit/gemm/deepgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-03T10:33:36,293 copying build/lib/flashinfer/jit/gemm/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-03T10:33:36,296 copying build/lib/flashinfer/jit/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,297 copying build/lib/flashinfer/jit/comm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,299 copying build/lib/flashinfer/jit/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,301 copying build/lib/flashinfer/jit/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,303 copying build/lib/flashinfer/jit/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,306 copying build/lib/flashinfer/jit/dsv3_optimizations.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,307 copying build/lib/flashinfer/jit/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,309 copying build/lib/flashinfer/jit/env.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,312 copying build/lib/flashinfer/jit/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,314 copying build/lib/flashinfer/jit/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,315 copying build/lib/flashinfer/jit/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,317 copying build/lib/flashinfer/jit/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,319 copying build/lib/flashinfer/jit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,322 copying build/lib/flashinfer/jit/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,324 copying build/lib/flashinfer/jit/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,326 copying build/lib/flashinfer/jit/fp4_kv_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,328 copying build/lib/flashinfer/jit/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,330 copying build/lib/flashinfer/jit/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,332 copying build/lib/flashinfer/jit/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,333 copying build/lib/flashinfer/jit/fp4_kv_dequantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,335 copying build/lib/flashinfer/jit/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,338 copying build/lib/flashinfer/jit/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,340 copying build/lib/flashinfer/jit/cubin_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,342 copying build/lib/flashinfer/jit/tinygemm2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,344 copying build/lib/flashinfer/jit/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,346 copying build/lib/flashinfer/jit/cpp_ext.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,349 copying build/lib/flashinfer/jit/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-03T10:33:36,351 copying build/lib/flashinfer/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,355 copying build/lib/flashinfer/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,358 copying build/lib/flashinfer/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,360 copying build/lib/flashinfer/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,362 copying build/lib/flashinfer/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,365 copying build/lib/flashinfer/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,368 copying build/lib/flashinfer/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,372 creating build/bdist.linux-armv7l/wheel/flashinfer/triton 2026-04-03T10:33:36,373 copying build/lib/flashinfer/triton/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,375 copying build/lib/flashinfer/triton/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,378 creating build/bdist.linux-armv7l/wheel/flashinfer/triton/kernels 2026-04-03T10:33:36,379 copying build/lib/flashinfer/triton/kernels/ssd_chunk_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,381 copying build/lib/flashinfer/triton/kernels/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,383 copying build/lib/flashinfer/triton/kernels/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,385 copying build/lib/flashinfer/triton/kernels/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,387 copying build/lib/flashinfer/triton/kernels/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,389 copying build/lib/flashinfer/triton/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,391 copying build/lib/flashinfer/triton/kernels/quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-03T10:33:36,392 copying build/lib/flashinfer/triton/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,394 copying build/lib/flashinfer/triton/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,396 copying build/lib/flashinfer/triton/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,398 copying build/lib/flashinfer/triton/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,400 copying build/lib/flashinfer/triton/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,402 copying build/lib/flashinfer/triton/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-03T10:33:36,403 copying build/lib/flashinfer/gdn_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,406 copying build/lib/flashinfer/aot.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,410 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl 2026-04-03T10:33:36,411 copying build/lib/flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,414 copying build/lib/flashinfer/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,416 copying build/lib/flashinfer/cute_dsl/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,419 copying build/lib/flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,422 copying build/lib/flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,425 copying build/lib/flashinfer/cute_dsl/fp4_common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,427 copying build/lib/flashinfer/cute_dsl/blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-03T10:33:36,429 copying build/lib/flashinfer/deep_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:36,433 creating build/bdist.linux-armv7l/wheel/flashinfer/data 2026-04-03T10:33:36,434 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass 2026-04-03T10:33:36,436 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test 2026-04-03T10:33:36,438 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/utils 2026-04-03T10:33:36,439 copying build/lib/flashinfer/data/cutlass/test/utils/test_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/utils 2026-04-03T10:33:36,443 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python 2026-04-03T10:33:36,444 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass 2026-04-03T10:33:36,446 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/installation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass 2026-04-03T10:33:36,449 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,450 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,452 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,454 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,456 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,458 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,460 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,462 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,464 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,467 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,469 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,471 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,473 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,476 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-03T10:33:36,478 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:36,479 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:36,482 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:36,484 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:36,486 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-03T10:33:36,489 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:36,490 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:36,492 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:36,494 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:36,496 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-03T10:33:36,500 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,501 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,503 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-03T10:33:36,504 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-03T10:33:36,507 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,509 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,512 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,514 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,517 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-03T10:33:36,520 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-03T10:33:36,521 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-03T10:33:36,524 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,525 copying build/lib/flashinfer/data/cutlass/test/python/pycute/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,528 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_coalesce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,530 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_complement.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,532 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,534 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,536 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_composition.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,539 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,541 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-03T10:33:36,543 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit 2026-04-03T10:33:36,545 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm 2026-04-03T10:33:36,547 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-03T10:33:36,548 copying build/lib/flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/unit/gemm/device 2026-04-03T10:33:36,551 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples 2026-04-03T10:33:36,553 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-03T10:33:36,555 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:36,556 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:36,558 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:36,561 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:36,563 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:36,565 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-03T10:33:36,567 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-03T10:33:36,570 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python 2026-04-03T10:33:36,572 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:36,574 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:36,575 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:36,577 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:36,580 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:36,582 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:36,584 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-03T10:33:36,587 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:36,588 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:36,591 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:36,594 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:36,596 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:36,599 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-03T10:33:36,601 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/shape.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:36,604 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:36,605 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:36,607 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:36,609 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-03T10:33:36,611 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:36,614 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:36,617 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,618 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-03T10:33:36,620 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-03T10:33:36,622 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-03T10:33:36,624 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,626 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,628 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,630 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,633 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,635 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,638 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,641 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,644 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,646 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,648 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-03T10:33:36,649 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-03T10:33:36,652 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,653 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,655 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,657 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,660 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,662 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,664 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,667 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,669 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,671 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-03T10:33:36,674 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-03T10:33:36,676 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:36,677 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:36,679 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:36,682 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-03T10:33:36,685 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,686 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,688 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,690 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,693 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,694 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,696 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,698 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,700 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-03T10:33:36,703 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,704 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,706 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,709 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,711 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,713 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,715 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,718 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,720 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,722 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,724 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,727 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,729 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,731 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-03T10:33:36,733 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,736 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,739 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-03T10:33:36,742 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-03T10:33:36,744 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:36,745 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:36,748 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:36,750 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-03T10:33:36,753 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src 2026-04-03T10:33:36,755 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src/source 2026-04-03T10:33:36,756 copying build/lib/flashinfer/data/cutlass/python/docs_src/source/conf.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/docs_src/source 2026-04-03T10:33:36,759 copying build/lib/flashinfer/data/cutlass/python/setup_cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-03T10:33:36,762 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,763 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,766 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,770 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,772 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/manifest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,775 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,778 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,780 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,782 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,785 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,788 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,791 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,805 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,807 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,811 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,813 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,816 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,819 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,821 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,823 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/symm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-03T10:33:36,827 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:36,828 copying build/lib/flashinfer/data/cutlass/python/pycute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:36,830 copying build/lib/flashinfer/data/cutlass/python/pycute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:36,832 copying build/lib/flashinfer/data/cutlass/python/pycute/int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:36,834 copying build/lib/flashinfer/data/cutlass/python/pycute/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:36,836 copying build/lib/flashinfer/data/cutlass/python/pycute/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-03T10:33:36,839 copying build/lib/flashinfer/data/cutlass/python/setup_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-03T10:33:36,841 copying build/lib/flashinfer/data/cutlass/python/setup_pycute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-03T10:33:36,843 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL 2026-04-03T10:33:36,845 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:36,847 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,848 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,852 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,855 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,856 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,859 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,861 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-03T10:33:36,864 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,865 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,868 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,870 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,872 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,874 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,876 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-03T10:33:36,879 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,880 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,883 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,886 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-03T10:33:36,887 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-03T10:33:36,889 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-03T10:33:36,891 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,895 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,898 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,901 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,902 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,906 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,909 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,911 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,914 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,917 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,919 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,921 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,923 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,927 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,930 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-03T10:33:36,933 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,934 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,937 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,938 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,940 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,942 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,944 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,946 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,949 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,951 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-03T10:33:36,954 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,956 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,958 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,961 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,964 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,967 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,970 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,972 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:36,975 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:36,976 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:36,979 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:36,980 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:36,982 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:36,985 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:36,987 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-03T10:33:36,991 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:36,992 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:36,994 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:36,996 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-03T10:33:36,999 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:37,000 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:37,002 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:37,005 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-03T10:33:37,007 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:37,009 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-03T10:33:37,011 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:37,012 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:37,015 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:37,017 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-03T10:33:37,020 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,021 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,024 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,026 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,030 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,032 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,034 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,036 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,038 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-03T10:33:37,040 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:37,042 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:37,049 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-03T10:33:37,052 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:37,053 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:37,055 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:37,057 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:37,059 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:37,061 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-03T10:33:37,064 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:37,065 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:37,068 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:37,072 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:37,074 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-03T10:33:37,077 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:37,079 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:37,081 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,083 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,084 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,086 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,089 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,091 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,093 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,095 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-03T10:33:37,097 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,098 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,100 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,103 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,107 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,109 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,112 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,115 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,119 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:37,120 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:37,123 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:37,125 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:37,131 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:37,132 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-03T10:33:37,135 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,137 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,140 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:37,142 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:37,144 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:37,146 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:37,148 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:37,150 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-03T10:33:37,152 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,153 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,156 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,158 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,160 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,162 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,164 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,166 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-03T10:33:37,168 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,170 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-03T10:33:37,173 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:37,174 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:37,177 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:37,179 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:37,181 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-03T10:33:37,183 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-03T10:33:37,185 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL 2026-04-03T10:33:37,188 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples 2026-04-03T10:33:37,189 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python 2026-04-03T10:33:37,191 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL 2026-04-03T10:33:37,192 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:37,193 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:37,196 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:37,198 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:37,201 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-03T10:33:37,203 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:37,204 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:37,207 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:37,209 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:37,210 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-03T10:33:37,213 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental 2026-04-03T10:33:37,215 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:37,216 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:37,219 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:37,221 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:37,225 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:37,228 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-03T10:33:37,232 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-03T10:33:37,233 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-03T10:33:37,236 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,237 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,241 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,245 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,248 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,252 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,255 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,257 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-03T10:33:37,261 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,262 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,266 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,270 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,276 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:37,277 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:37,280 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:37,283 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:37,286 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:37,291 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-03T10:33:37,295 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:37,296 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:37,298 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:37,301 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:37,304 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:37,306 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-03T10:33:37,309 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:37,310 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:37,315 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:37,320 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-03T10:33:37,324 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,332 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,336 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,340 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,343 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,349 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:37,350 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:37,356 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:37,360 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:37,365 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-03T10:33:37,368 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,381 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:37,382 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:37,397 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:37,406 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-03T10:33:37,408 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,419 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,432 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,439 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:37,441 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:37,447 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:37,452 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:37,455 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-03T10:33:37,462 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:37,463 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:37,469 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:37,473 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-03T10:33:37,476 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,479 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,484 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,487 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,491 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,496 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-03T10:33:37,503 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-03T10:33:37,504 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-03T10:33:37,508 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,509 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,512 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,515 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,517 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,520 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,523 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,525 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,528 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-03T10:33:37,531 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-03T10:33:37,533 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-03T10:33:37,536 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-03T10:33:37,539 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-03T10:33:37,541 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-03T10:33:37,543 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-03T10:33:37,547 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-03T10:33:37,548 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-03T10:33:37,553 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,555 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,558 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,562 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,565 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,568 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,571 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,574 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,578 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,582 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,585 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,589 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,593 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,596 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-03T10:33:37,600 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:37,602 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:37,607 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:37,612 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:37,618 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-03T10:33:37,623 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-03T10:33:37,625 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-03T10:33:37,628 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-03T10:33:37,635 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen 2026-04-03T10:33:37,638 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,641 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,646 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,650 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,653 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,656 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,660 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,665 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,668 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,671 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,675 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,678 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,682 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-03T10:33:37,686 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:37,688 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:37,692 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:37,694 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:37,697 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:37,701 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-03T10:33:37,704 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:37,707 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-03T10:33:37,710 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-03T10:33:37,711 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-03T10:33:37,714 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-03T10:33:37,717 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools 2026-04-03T10:33:37,719 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util 2026-04-03T10:33:37,720 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/scripts 2026-04-03T10:33:37,722 copying build/lib/flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/scripts 2026-04-03T10:33:37,725 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include 2026-04-03T10:33:37,726 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass 2026-04-03T10:33:37,728 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,730 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,733 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,735 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,739 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,741 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,744 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,746 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,748 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,751 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,753 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,757 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference 2026-04-03T10:33:37,758 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-03T10:33:37,759 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-03T10:33:37,762 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-03T10:33:37,764 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,765 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,768 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,770 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,773 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,775 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,777 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,781 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,784 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,786 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,789 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,791 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,794 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,796 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,798 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,801 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,804 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,807 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,809 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,811 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,813 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,816 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,818 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,820 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,823 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-03T10:33:37,826 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,827 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,830 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,832 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,835 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,837 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,840 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:37,842 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:37,844 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:37,846 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-03T10:33:37,848 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,851 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,854 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-03T10:33:37,855 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-03T10:33:37,857 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,860 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,862 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,864 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-03T10:33:37,867 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,869 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,871 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,873 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,876 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,878 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,880 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,882 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,884 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,886 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,889 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,891 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,894 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,896 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,899 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,901 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,903 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,906 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-03T10:33:37,910 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include 2026-04-03T10:33:37,912 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,914 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,915 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,918 copying build/lib/flashinfer/data/cutlass/include/cute/util/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,920 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_svg.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,923 copying build/lib/flashinfer/data/cutlass/include/cute/util/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,925 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_latex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,929 copying build/lib/flashinfer/data/cutlass/include/cute/util/print.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-03T10:33:37,931 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,934 copying build/lib/flashinfer/data/cutlass/include/cute/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,936 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,939 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_base.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,941 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,943 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,946 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_flagged.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,949 copying build/lib/flashinfer/data/cutlass/include/cute/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,952 copying build/lib/flashinfer/data/cutlass/include/cute/tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,955 copying build/lib/flashinfer/data/cutlass/include/cute/underscore.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:37,958 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,959 copying build/lib/flashinfer/data/cutlass/include/cute/container/tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,963 copying build/lib/flashinfer/data/cutlass/include/cute/container/bit_field.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,965 copying build/lib/flashinfer/data/cutlass/include/cute/container/alignment.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,967 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_aligned.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,970 copying build/lib/flashinfer/data/cutlass/include/cute/container/type_list.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,972 copying build/lib/flashinfer/data/cutlass/include/cute/container/array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,975 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,978 copying build/lib/flashinfer/data/cutlass/include/cute/container/cuda_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-03T10:33:37,982 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:37,983 copying build/lib/flashinfer/data/cutlass/include/cute/atom/partitioner.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:37,986 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:37,988 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:37,992 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:37,994 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:37,997 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,000 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,003 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,009 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,012 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,024 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,028 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,030 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,032 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,035 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,037 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,043 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,054 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,056 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,059 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,062 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,065 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,067 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,069 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,072 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,075 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,082 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,085 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-03T10:33:38,088 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,090 copying build/lib/flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,092 copying build/lib/flashinfer/data/cutlass/include/cute/arch/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,095 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,097 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,108 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,110 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,114 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,116 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,121 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,145 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,150 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,192 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,194 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,196 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,199 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,207 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,210 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,215 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,217 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,219 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,248 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,250 copying build/lib/flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,252 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,255 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,257 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,260 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,262 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,264 copying build/lib/flashinfer/data/cutlass/include/cute/arch/util.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,267 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,269 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,311 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,313 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-03T10:33:38,315 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_impl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:38,318 copying build/lib/flashinfer/data/cutlass/include/cute/int_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:38,321 copying build/lib/flashinfer/data/cutlass/include/cute/stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:38,324 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_zip.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:38,327 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,329 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,331 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,334 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/clear.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,336 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,338 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,340 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,343 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,346 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,348 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,350 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,351 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,354 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/functional.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,356 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-03T10:33:38,359 copying build/lib/flashinfer/data/cutlass/include/cute/pointer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:38,362 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,363 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,365 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,368 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,371 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,373 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/real.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,375 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,378 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,380 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/int.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,382 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-03T10:33:38,384 copying build/lib/flashinfer/data/cutlass/include/cute/layout_composed.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-03T10:33:38,388 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,390 copying build/lib/flashinfer/data/cutlass/include/cutlass/predicate_vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/fast_math.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,396 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental 2026-04-03T10:33:38,397 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed 2026-04-03T10:33:38,399 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-03T10:33:38,400 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-03T10:33:38,403 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-03T10:33:38,406 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:38,407 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:38,410 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:38,412 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-03T10:33:38,414 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:38,415 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:38,418 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:38,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-03T10:33:38,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,429 copying build/lib/flashinfer/data/cutlass/include/cutlass/bfloat16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,432 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,435 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:38,438 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:38,441 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,442 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,447 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,449 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,452 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,456 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,459 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,463 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,466 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,469 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,471 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,476 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,478 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,481 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,484 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,487 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,490 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,495 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,498 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,501 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,505 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,507 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,510 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,513 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,518 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,527 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,530 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,533 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,536 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,541 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,544 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,546 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-03T10:33:38,549 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:38,553 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,555 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,557 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,560 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,563 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,566 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,569 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,574 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,583 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,587 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,590 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,594 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,597 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,600 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,603 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,605 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,608 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,611 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,614 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,616 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,619 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,622 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,624 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,631 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,634 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,636 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,638 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,640 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,643 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,646 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,650 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,652 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,654 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,661 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,667 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,671 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,673 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-03T10:33:38,678 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,679 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,682 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,685 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,687 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,690 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,692 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,701 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,703 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,706 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,709 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,711 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,714 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,716 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,719 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,722 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,724 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,727 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,732 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,735 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,738 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,740 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,752 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,754 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,756 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,759 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,761 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,764 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,766 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,769 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,774 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,776 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,783 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,786 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,788 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,790 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,793 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,799 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,803 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,807 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,815 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,818 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,827 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,833 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,835 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,842 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,847 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,850 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,853 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,856 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,858 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,861 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,863 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,867 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,869 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,872 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,875 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,878 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,883 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,885 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,889 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,891 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,903 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,905 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,907 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,910 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,913 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,916 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,924 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,926 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,929 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,931 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,934 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,936 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,938 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,941 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,946 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,948 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,951 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,953 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,956 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,964 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,967 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,971 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,974 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,977 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,979 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,982 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-03T10:33:38,985 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:38,988 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:38,989 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:38,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:38,995 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:38,997 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,000 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,002 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,004 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,009 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,012 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,015 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,018 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,020 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,023 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,033 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,037 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,047 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,053 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,056 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,059 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,062 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,065 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,068 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,072 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,074 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-03T10:33:39,079 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,080 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,084 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,090 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,094 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,098 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,099 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,105 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,108 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,113 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,116 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,121 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,124 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,127 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,131 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,134 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,141 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,144 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,147 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,150 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,153 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,156 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,159 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,162 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,165 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,169 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,172 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,175 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,178 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,181 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,184 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,187 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-03T10:33:39,190 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,193 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,197 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,200 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,204 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,207 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,211 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,215 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,218 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,221 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,225 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,228 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,233 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,236 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,238 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,241 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,245 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,253 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,256 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,258 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,262 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,269 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,272 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,276 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,278 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,281 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,284 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,287 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,290 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,293 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,295 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,298 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,301 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,305 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,307 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,310 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,313 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,316 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,318 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,321 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,324 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,331 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,335 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,338 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,342 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-03T10:33:39,346 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:39,347 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:39,350 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:39,353 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:39,356 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-03T10:33:39,359 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:39,364 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-03T10:33:39,366 copying build/lib/flashinfer/data/cutlass/include/cutlass/device_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,369 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/quaternion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,374 copying build/lib/flashinfer/data/cutlass/include/cutlass/relatively_equal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,378 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-03T10:33:39,380 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,382 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,385 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,388 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,391 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,397 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,400 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,402 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,405 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,407 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,412 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,419 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,422 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-03T10:33:39,426 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,430 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,432 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,435 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,440 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,443 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,445 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,448 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,450 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,452 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,455 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,458 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,460 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,463 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,465 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,468 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,471 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,473 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,476 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,478 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,480 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,482 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,487 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,490 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,494 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,497 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,499 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,502 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,504 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,507 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,509 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,512 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,515 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,517 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,521 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:39,522 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:39,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:39,526 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:39,528 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:39,531 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-03T10:33:39,534 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,541 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,543 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,546 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,548 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,550 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,553 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,555 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,558 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,560 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-03T10:33:39,564 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,565 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,568 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,571 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,574 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,577 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,580 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,582 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,586 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-03T10:33:39,588 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,592 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,597 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,600 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,602 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,605 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,607 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,610 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,613 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,616 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,618 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,620 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,631 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-03T10:33:39,635 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,636 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,639 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,642 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,644 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,646 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,648 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,650 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,655 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,659 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,661 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,667 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,669 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,671 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,674 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,676 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,679 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,682 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,686 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,689 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,693 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-03T10:33:39,696 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-03T10:33:39,700 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,700 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,703 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,706 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,709 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,711 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,714 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,718 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,721 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,724 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,728 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,734 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,737 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-03T10:33:39,740 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,743 copying build/lib/flashinfer/data/cutlass/include/cutlass/exmy_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,749 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,750 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,752 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,755 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,757 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,759 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,761 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,763 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,766 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,769 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:39,770 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:39,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:39,775 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-03T10:33:39,778 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,780 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,782 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,784 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-03T10:33:39,786 copying build/lib/flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,788 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,791 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:39,792 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:39,795 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:39,799 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-03T10:33:39,801 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/real.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,814 copying build/lib/flashinfer/data/cutlass/include/cutlass/float8.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,817 copying build/lib/flashinfer/data/cutlass/include/cutlass/coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,828 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint128.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,831 copying build/lib/flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,834 copying build/lib/flashinfer/data/cutlass/include/cutlass/functional.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/block_striped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,839 copying build/lib/flashinfer/data/cutlass/include/cutlass/trace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,842 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform 2026-04-03T10:33:39,844 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-03T10:33:39,846 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-03T10:33:39,850 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,851 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,854 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,857 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,859 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,865 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,867 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,870 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,873 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,876 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,879 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,886 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,889 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,892 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,894 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,903 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,907 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,909 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,913 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,916 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,920 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,923 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-03T10:33:39,926 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:39,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:39,930 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:39,932 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-03T10:33:39,935 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-03T10:33:39,936 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-03T10:33:39,939 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-03T10:33:39,940 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-03T10:33:39,943 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-03T10:33:39,944 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-03T10:33:39,946 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-03T10:33:39,948 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform 2026-04-03T10:33:39,951 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,953 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_shape.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_launch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,957 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/thread 2026-04-03T10:33:39,958 copying build/lib/flashinfer/data/cutlass/include/cutlass/thread/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/thread 2026-04-03T10:33:39,961 copying build/lib/flashinfer/data/cutlass/include/cutlass/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,963 copying build/lib/flashinfer/data/cutlass/include/cutlass/aligned_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,965 copying build/lib/flashinfer/data/cutlass/include/cutlass/half.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,968 copying build/lib/flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:39,971 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,972 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,974 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,976 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,978 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,980 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,982 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,985 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,987 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,989 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,994 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,996 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:39,999 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,002 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,004 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,006 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,009 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,011 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,013 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,015 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/arch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,017 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,019 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,024 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,028 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,030 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,033 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-03T10:33:40,037 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint256.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,039 copying build/lib/flashinfer/data/cutlass/include/cutlass/integer_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,044 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-03T10:33:40,046 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:40,047 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:40,049 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:40,051 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:40,054 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-03T10:33:40,057 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:40,058 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:40,061 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:40,063 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:40,066 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-03T10:33:40,069 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-03T10:33:40,070 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-03T10:33:40,073 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-03T10:33:40,075 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction 2026-04-03T10:33:40,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/subbyte_reference.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,080 copying build/lib/flashinfer/data/cutlass/include/cutlass/core_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/float_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,085 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,087 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,089 copying build/lib/flashinfer/data/cutlass/include/cutlass/semaphore.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,091 copying build/lib/flashinfer/data/cutlass/include/cutlass/constants.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,094 copying build/lib/flashinfer/data/cutlass/include/cutlass/wmma_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,097 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,098 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,101 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,103 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,106 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/permute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,111 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,113 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,116 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,118 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-03T10:33:40,121 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,123 copying build/lib/flashinfer/data/cutlass/include/cutlass/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,126 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,128 copying build/lib/flashinfer/data/cutlass/include/cutlass/array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,133 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,136 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:40,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:40,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:40,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-03T10:33:40,149 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,151 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,157 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,161 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,164 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,168 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,171 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,174 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,177 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,181 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,184 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,187 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,194 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,198 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,201 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,204 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,207 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,211 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,214 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,217 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,221 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,224 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,228 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,231 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,233 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,236 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,239 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,241 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,243 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,246 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,252 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,257 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,259 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,262 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,264 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,267 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,270 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,273 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,275 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,278 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,281 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,283 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,286 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-03T10:33:40,288 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,291 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,295 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,296 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,299 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,302 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,307 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,309 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,312 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,315 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,318 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,321 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,323 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,326 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,331 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,334 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,336 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,339 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,342 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,344 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,347 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,350 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,352 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,355 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,357 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,359 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,361 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,364 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,366 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,369 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-03T10:33:40,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,373 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:40,375 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:40,377 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:40,380 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:40,382 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-03T10:33:40,386 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:40,387 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:40,391 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:40,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:40,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:40,397 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:40,400 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-03T10:33:40,402 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:40,405 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:40,408 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:40,410 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-03T10:33:40,413 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-03T10:33:40,414 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-03T10:33:40,416 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,419 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-03T10:33:40,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,427 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,429 copying build/lib/flashinfer/data/cutlass/include/cutlass/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,431 copying build/lib/flashinfer/data/cutlass/include/cutlass/tfloat32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,435 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/platform 2026-04-03T10:33:40,436 copying build/lib/flashinfer/data/cutlass/include/cutlass/platform/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/platform 2026-04-03T10:33:40,439 copying build/lib/flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,441 copying build/lib/flashinfer/data/cutlass/include/cutlass/complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-03T10:33:40,446 copying build/lib/flashinfer/data/build_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-03T10:33:40,448 copying build/lib/flashinfer/data/build_backend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-03T10:33:40,450 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog 2026-04-03T10:33:40,452 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/scripts 2026-04-03T10:33:40,453 copying build/lib/flashinfer/data/spdlog/scripts/extract_version.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/scripts 2026-04-03T10:33:40,455 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include 2026-04-03T10:33:40,457 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,458 copying build/lib/flashinfer/data/spdlog/include/spdlog/common-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,460 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,462 copying build/lib/flashinfer/data/spdlog/include/spdlog/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,465 copying build/lib/flashinfer/data/spdlog/include/spdlog/stopwatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,468 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,469 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,471 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,473 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,475 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,477 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,479 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,481 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,483 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,484 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,487 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,489 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/circular_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,491 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,493 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,495 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,496 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/windows_include.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,498 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,500 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,502 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,504 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/console_globals.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,506 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,509 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,511 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,513 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,515 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,518 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,520 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/null_mutex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,522 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-03T10:33:40,524 copying build/lib/flashinfer/data/spdlog/include/spdlog/tweakme.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,527 copying build/lib/flashinfer/data/spdlog/include/spdlog/async.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,529 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,532 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,533 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,536 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ostr.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,538 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,540 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,542 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,544 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/fmt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,546 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,549 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-03T10:33:40,551 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,553 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,555 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,558 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,561 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,564 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,566 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,569 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,571 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,575 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,579 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,582 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,585 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,590 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,592 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,596 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-03T10:33:40,600 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:40,601 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:40,604 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:40,606 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/env.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:40,608 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/argv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-03T10:33:40,611 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,614 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,616 copying build/lib/flashinfer/data/spdlog/include/spdlog/mdc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,619 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,621 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,625 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,626 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,629 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,631 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,633 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,636 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,639 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,641 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,643 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,646 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,648 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,650 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,653 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,655 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,657 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,660 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,662 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,665 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,667 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,670 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,672 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,674 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,677 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,679 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,682 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,684 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,687 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,690 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,692 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,694 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,696 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,698 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,701 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,703 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,705 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-03T10:33:40,707 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,710 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,712 copying build/lib/flashinfer/data/spdlog/include/spdlog/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,714 copying build/lib/flashinfer/data/spdlog/include/spdlog/fwd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,716 copying build/lib/flashinfer/data/spdlog/include/spdlog/formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-03T10:33:40,718 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include 2026-04-03T10:33:40,720 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer 2026-04-03T10:33:40,722 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,723 copying build/lib/flashinfer/data/include/flashinfer/attention/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,726 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-03T10:33:40,728 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-03T10:33:40,729 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-03T10:33:40,731 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-03T10:33:40,734 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,735 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,737 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,740 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,742 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,745 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,748 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,751 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,754 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-03T10:33:40,756 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-03T10:33:40,760 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-03T10:33:40,761 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-03T10:33:40,764 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-03T10:33:40,767 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,768 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,771 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,773 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,775 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,778 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,781 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,783 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,786 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-03T10:33:40,789 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,792 copying build/lib/flashinfer/data/include/flashinfer/attention/mask.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,794 copying build/lib/flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,797 copying build/lib/flashinfer/data/include/flashinfer/attention/heap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,799 copying build/lib/flashinfer/data/include/flashinfer/attention/pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,801 copying build/lib/flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,804 copying build/lib/flashinfer/data/include/flashinfer/attention/default_decode_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,807 copying build/lib/flashinfer/data/include/flashinfer/attention/decode.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,810 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent_template.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,812 copying build/lib/flashinfer/data/include/flashinfer/attention/batch_pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,815 copying build/lib/flashinfer/data/include/flashinfer/attention/prefill.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,819 copying build/lib/flashinfer/data/include/flashinfer/attention/cascade.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,822 copying build/lib/flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,825 copying build/lib/flashinfer/data/include/flashinfer/attention/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,828 copying build/lib/flashinfer/data/include/flashinfer/attention/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,831 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,833 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,836 copying build/lib/flashinfer/data/include/flashinfer/attention/state.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,839 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,840 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,842 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,845 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,846 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,849 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,851 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,854 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,856 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,859 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-03T10:33:40,862 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,865 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,867 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,870 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,872 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,875 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,878 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,880 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,882 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,885 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,887 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-03T10:33:40,890 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,893 copying build/lib/flashinfer/data/include/flashinfer/attention/mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-03T10:33:40,897 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,898 copying build/lib/flashinfer/data/include/flashinfer/mamba/common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,901 copying build/lib/flashinfer/data/include/flashinfer/mamba/conversion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,903 copying build/lib/flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,905 copying build/lib/flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,908 copying build/lib/flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,910 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,913 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-03T10:33:40,917 copying build/lib/flashinfer/data/include/flashinfer/fp4_layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:40,920 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,921 copying build/lib/flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,924 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,926 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,929 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,931 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,933 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,936 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,939 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,942 copying build/lib/flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,945 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,948 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,952 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,956 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,959 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,963 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,966 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,970 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,973 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,977 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,980 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,984 copying build/lib/flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,987 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,992 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,995 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:40,998 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,001 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,004 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,007 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,010 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,013 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,016 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-03T10:33:41,020 copying build/lib/flashinfer/data/include/flashinfer/fp16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,023 copying build/lib/flashinfer/data/include/flashinfer/page.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,027 copying build/lib/flashinfer/data/include/flashinfer/allocator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,030 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,032 copying build/lib/flashinfer/data/include/flashinfer/flat/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,035 copying build/lib/flashinfer/data/include/flashinfer/flat/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,037 copying build/lib/flashinfer/data/include/flashinfer/flat/cute_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,039 copying build/lib/flashinfer/data/include/flashinfer/flat/unused.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,042 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere 2026-04-03T10:33:41,043 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-03T10:33:41,045 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-03T10:33:41,047 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-03T10:33:41,051 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper 2026-04-03T10:33:41,053 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:41,054 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:41,056 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:41,059 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:41,061 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-03T10:33:41,065 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-03T10:33:41,066 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-03T10:33:41,069 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:41,071 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:41,073 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:41,076 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:41,080 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:41,082 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-03T10:33:41,084 copying build/lib/flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,087 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/prefill 2026-04-03T10:33:41,088 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-03T10:33:41,090 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-03T10:33:41,093 copying build/lib/flashinfer/data/include/flashinfer/flat/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,095 copying build/lib/flashinfer/data/include/flashinfer/flat/common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-03T10:33:41,097 copying build/lib/flashinfer/data/include/flashinfer/layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,099 copying build/lib/flashinfer/data/include/flashinfer/topk.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,104 copying build/lib/flashinfer/data/include/flashinfer/permuted_smem.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,106 copying build/lib/flashinfer/data/include/flashinfer/pos_enc.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,109 copying build/lib/flashinfer/data/include/flashinfer/math.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,111 copying build/lib/flashinfer/data/include/flashinfer/profiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,114 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,116 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,119 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,122 copying build/lib/flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,125 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,128 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,130 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,134 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-03T10:33:41,137 copying build/lib/flashinfer/data/include/flashinfer/cubin_loader.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,139 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm 2026-04-03T10:33:41,141 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,142 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,144 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,146 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,148 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,151 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,152 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,154 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,157 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,159 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-03T10:33:41,162 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm 2026-04-03T10:33:41,164 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,165 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,167 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,171 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,173 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,176 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm 2026-04-03T10:33:41,177 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,179 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,181 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,183 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SparsityDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,185 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,187 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaArchDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,188 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,190 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen 2026-04-03T10:33:41,192 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,195 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,197 copying build/lib/flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export 2026-04-03T10:33:41,200 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm 2026-04-03T10:33:41,203 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,204 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,206 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,208 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,210 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,212 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,215 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-03T10:33:41,217 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-03T10:33:41,218 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-03T10:33:41,221 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,222 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,225 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,228 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,230 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,233 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,235 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,238 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-03T10:33:41,240 copying build/lib/flashinfer/data/include/flashinfer/fastdiv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,242 copying build/lib/flashinfer/data/include/flashinfer/exception.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,244 copying build/lib/flashinfer/data/include/flashinfer/attention_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,247 copying build/lib/flashinfer/data/include/flashinfer/concat_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,249 copying build/lib/flashinfer/data/include/flashinfer/cutlass_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,251 copying build/lib/flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,253 copying build/lib/flashinfer/data/include/flashinfer/activation.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,256 copying build/lib/flashinfer/data/include/flashinfer/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,259 copying build/lib/flashinfer/data/include/flashinfer/cp_async.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,261 copying build/lib/flashinfer/data/include/flashinfer/air_top_p.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,264 copying build/lib/flashinfer/data/include/flashinfer/norm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,267 copying build/lib/flashinfer/data/include/flashinfer/logging.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,269 copying build/lib/flashinfer/data/include/flashinfer/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,272 copying build/lib/flashinfer/data/include/flashinfer/arch_condition.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,274 copying build/lib/flashinfer/data/include/flashinfer/sampling.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,277 copying build/lib/flashinfer/data/include/flashinfer/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,279 copying build/lib/flashinfer/data/include/flashinfer/vec_dtypes.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-03T10:33:41,286 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc 2026-04-03T10:33:41,288 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,290 copying build/lib/flashinfer/data/csrc/trtllm_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,292 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,294 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,296 copying build/lib/flashinfer/data/csrc/trtllm_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,298 copying build/lib/flashinfer/data/csrc/selective_state_update_dtype_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,300 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,302 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,304 copying build/lib/flashinfer/data/csrc/flashinfer_rope_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,306 copying build/lib/flashinfer/data/csrc/batch_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,308 copying build/lib/flashinfer/data/csrc/fmha_v2_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,310 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,314 copying build/lib/flashinfer/data/csrc/fmha_v2_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,317 copying build/lib/flashinfer/data/csrc/page.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,319 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,321 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,323 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,326 copying build/lib/flashinfer/data/csrc/batch_decode_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,328 copying build/lib/flashinfer/data/csrc/batch_decode_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,330 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,333 copying build/lib/flashinfer/data/csrc/pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,335 copying build/lib/flashinfer/data/csrc/flashinfer_page_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,338 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,340 copying build/lib/flashinfer/data/csrc/gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,342 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,343 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,346 copying build/lib/flashinfer/data/csrc/trtllm_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,349 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal 2026-04-03T10:33:41,351 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp 2026-04-03T10:33:41,353 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:41,354 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:41,356 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:41,358 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:41,361 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:41,363 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-03T10:33:41,366 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-03T10:33:41,367 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-03T10:33:41,370 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm 2026-04-03T10:33:41,372 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,373 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,375 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,377 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,379 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,381 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,383 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,386 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,388 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,390 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-03T10:33:41,393 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,394 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,396 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,398 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,401 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,404 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,407 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,410 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,412 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,415 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,417 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,419 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-03T10:33:41,421 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,422 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,424 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,426 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,428 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,430 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-03T10:33:41,431 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-03T10:33:41,434 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-03T10:33:41,437 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-03T10:33:41,438 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-03T10:33:41,440 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-03T10:33:41,442 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,445 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:41,446 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,447 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,449 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,452 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,454 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,456 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,458 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,459 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,461 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,463 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,465 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,466 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,468 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,473 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,475 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,477 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,479 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-03T10:33:41,481 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,483 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,485 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,486 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,488 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,490 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,492 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,495 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,496 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,499 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,501 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-03T10:33:41,503 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:41,505 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:41,507 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:41,508 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:41,511 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-03T10:33:41,512 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-03T10:33:41,514 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-03T10:33:41,517 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:41,520 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:41,522 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:41,525 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-03T10:33:41,527 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-03T10:33:41,531 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,532 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,534 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,535 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,537 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,539 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,541 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,542 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,544 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,546 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,547 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,549 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,551 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,553 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,556 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-03T10:33:41,557 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-03T10:33:41,560 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-03T10:33:41,562 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,564 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,566 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,568 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,570 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,572 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,575 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-03T10:33:41,577 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:41,579 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:41,581 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:41,584 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:41,586 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:41,589 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-03T10:33:41,591 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,594 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-03T10:33:41,595 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-03T10:33:41,597 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-03T10:33:41,599 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-03T10:33:41,603 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,604 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,606 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,607 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,610 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,612 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,614 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,616 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-03T10:33:41,619 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions 2026-04-03T10:33:41,620 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include 2026-04-03T10:33:41,622 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,624 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-03T10:33:41,625 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-03T10:33:41,628 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm 2026-04-03T10:33:41,630 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:41,631 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:41,633 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:41,635 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-03T10:33:41,639 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,640 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,642 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,644 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,647 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,649 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,652 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,654 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,657 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,659 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,662 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,664 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,667 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-03T10:33:41,671 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,672 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,674 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,678 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,680 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,682 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,684 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,687 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,689 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,692 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,694 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,697 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,699 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-03T10:33:41,703 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,704 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:41,706 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:41,709 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:41,712 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-03T10:33:41,714 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,717 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,721 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,723 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,725 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,727 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,730 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,734 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,736 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,738 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-03T10:33:41,740 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,743 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue 2026-04-03T10:33:41,745 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-03T10:33:41,746 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-03T10:33:41,750 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-03T10:33:41,751 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-03T10:33:41,754 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-03T10:33:41,755 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-03T10:33:41,758 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-03T10:33:41,761 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,765 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail 2026-04-03T10:33:41,766 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-03T10:33:41,767 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-03T10:33:41,771 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,773 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication 2026-04-03T10:33:41,775 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-03T10:33:41,776 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-03T10:33:41,779 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,781 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform 2026-04-03T10:33:41,783 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-03T10:33:41,784 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-03T10:33:41,786 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,789 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:41,790 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:41,792 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:41,795 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:41,797 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:41,799 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-03T10:33:41,801 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,804 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-03T10:33:41,807 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include 2026-04-03T10:33:41,809 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm 2026-04-03T10:33:41,811 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,812 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,815 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,817 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,819 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,821 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,823 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,825 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,828 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,830 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,832 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,835 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-03T10:33:41,837 copying build/lib/flashinfer/data/csrc/flashinfer_cascade_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,839 copying build/lib/flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,841 copying build/lib/flashinfer/data/csrc/flashinfer_mamba_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,843 copying build/lib/flashinfer/data/csrc/batch_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,845 copying build/lib/flashinfer/data/csrc/batch_pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,847 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,849 copying build/lib/flashinfer/data/csrc/batch_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,851 copying build/lib/flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,853 copying build/lib/flashinfer/data/csrc/batch_decode_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,855 copying build/lib/flashinfer/data/csrc/trtllm_alltoall_prepare.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,858 copying build/lib/flashinfer/data/csrc/norm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,861 copying build/lib/flashinfer/data/csrc/selective_state_update.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,863 copying build/lib/flashinfer/data/csrc/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,865 copying build/lib/flashinfer/data/csrc/batch_attention_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,867 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,870 copying build/lib/flashinfer/data/csrc/single_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,872 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,874 copying build/lib/flashinfer/data/csrc/group_gemm_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,877 copying build/lib/flashinfer/data/csrc/flashinfer_quantization_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,879 copying build/lib/flashinfer/data/csrc/bmm_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:41,881 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:41,883 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,885 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,887 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,892 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,895 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,898 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,902 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,905 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,908 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,910 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,914 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,916 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/mask.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,919 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,924 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,927 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,931 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:41,932 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:41,935 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:41,938 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:41,941 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:41,943 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-03T10:33:41,946 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,949 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:41,953 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,954 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,956 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,959 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,961 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,963 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,967 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,970 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,973 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,975 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,980 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,983 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,986 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,988 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,993 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,996 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:41,998 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:42,001 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:42,003 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-03T10:33:42,005 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:42,007 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:42,011 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-03T10:33:42,014 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,017 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,020 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:42,021 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:42,024 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:42,026 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:42,029 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-03T10:33:42,032 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,034 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,037 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,040 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,042 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,045 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,047 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,050 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,053 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,056 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,058 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,061 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,064 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,066 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,069 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,071 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-03T10:33:42,073 copying build/lib/flashinfer/data/csrc/fp4_kv_quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,076 copying build/lib/flashinfer/data/csrc/flashinfer_topk_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,078 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,080 copying build/lib/flashinfer/data/csrc/single_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,082 copying build/lib/flashinfer/data/csrc/trtllm_batched_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,084 copying build/lib/flashinfer/data/csrc/blackwell_fmha_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,086 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,088 copying build/lib/flashinfer/data/csrc/batch_pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,090 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,092 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,094 copying build/lib/flashinfer/data/csrc/single_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,096 copying build/lib/flashinfer/data/csrc/logging.cc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,098 copying build/lib/flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,100 copying build/lib/flashinfer/data/csrc/flashinfer_xqa_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,102 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,104 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,106 copying build/lib/flashinfer/data/csrc/cutlass_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,108 copying build/lib/flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,110 copying build/lib/flashinfer/data/csrc/batch_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,112 copying build/lib/flashinfer/data/csrc/single_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,114 copying build/lib/flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,116 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,118 copying build/lib/flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,120 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,122 copying build/lib/flashinfer/data/csrc/tvm_ffi_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,126 copying build/lib/flashinfer/data/csrc/renorm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,128 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,131 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,133 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,135 copying build/lib/flashinfer/data/csrc/sampling_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,137 copying build/lib/flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,140 copying build/lib/flashinfer/data/csrc/nvshmem_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,142 copying build/lib/flashinfer/data/csrc/runtime_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,143 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,145 copying build/lib/flashinfer/data/csrc/single_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,147 copying build/lib/flashinfer/data/csrc/batch_pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,150 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,152 copying build/lib/flashinfer/data/csrc/cascade.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,154 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,157 copying build/lib/flashinfer/data/csrc/fmhaReduction.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,159 copying build/lib/flashinfer/data/csrc/moe_utils_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,162 copying build/lib/flashinfer/data/csrc/fp4_kv_dequantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,164 copying build/lib/flashinfer/data/csrc/group_gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,166 copying build/lib/flashinfer/data/csrc/batch_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,168 copying build/lib/flashinfer/data/csrc/batch_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,170 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,173 copying build/lib/flashinfer/data/csrc/batch_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,175 copying build/lib/flashinfer/data/csrc/selective_state_update_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,177 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,179 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,181 copying build/lib/flashinfer/data/csrc/trtllm_moe_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,185 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/xqa 2026-04-03T10:33:42,186 copying build/lib/flashinfer/data/csrc/xqa/tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,189 copying build/lib/flashinfer/data/csrc/xqa/barriers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,191 copying build/lib/flashinfer/data/csrc/xqa/mha.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,194 copying build/lib/flashinfer/data/csrc/xqa/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,196 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,199 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,201 copying build/lib/flashinfer/data/csrc/xqa/specDec.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,203 copying build/lib/flashinfer/data/csrc/xqa/mha.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,207 copying build/lib/flashinfer/data/csrc/xqa/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,209 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,211 copying build/lib/flashinfer/data/csrc/xqa/gmma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,214 copying build/lib/flashinfer/data/csrc/xqa/gmma_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,223 copying build/lib/flashinfer/data/csrc/xqa/mhaUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,226 copying build/lib/flashinfer/data/csrc/xqa/hostUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,227 copying build/lib/flashinfer/data/csrc/xqa/mha_components.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,230 copying build/lib/flashinfer/data/csrc/xqa/ldgsts.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,232 copying build/lib/flashinfer/data/csrc/xqa/mha_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,236 copying build/lib/flashinfer/data/csrc/xqa/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,238 copying build/lib/flashinfer/data/csrc/xqa/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,241 copying build/lib/flashinfer/data/csrc/xqa/mha_stdheaders.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,244 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,246 copying build/lib/flashinfer/data/csrc/xqa/defines.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,248 copying build/lib/flashinfer/data/csrc/xqa/cuda_hint.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,250 copying build/lib/flashinfer/data/csrc/xqa/xqa_wrapper.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-03T10:33:42,253 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,255 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,258 copying build/lib/flashinfer/data/csrc/vllm_custom_all_reduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,260 copying build/lib/flashinfer/data/csrc/batch_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,263 copying build/lib/flashinfer/data/csrc/single_prefill_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,265 copying build/lib/flashinfer/data/csrc/group_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,266 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,269 copying build/lib/flashinfer/data/csrc/single_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,271 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,272 copying build/lib/flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,275 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,278 copying build/lib/flashinfer/data/csrc/flashinfer_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,279 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,281 copying build/lib/flashinfer/data/csrc/batch_pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,283 copying build/lib/flashinfer/data/csrc/batch_attention.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,285 copying build/lib/flashinfer/data/csrc/batch_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,287 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,291 copying build/lib/flashinfer/data/csrc/group_gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,293 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,295 copying build/lib/flashinfer/data/csrc/sampling.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,297 copying build/lib/flashinfer/data/csrc/single_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,299 copying build/lib/flashinfer/data/csrc/gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,301 copying build/lib/flashinfer/data/csrc/batch_attention_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,303 copying build/lib/flashinfer/data/csrc/trtllm_fmha_v2_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,305 copying build/lib/flashinfer/data/csrc/selective_state_update_kernel_inst.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,307 copying build/lib/flashinfer/data/csrc/flashinfer_sampling_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,309 copying build/lib/flashinfer/data/csrc/rope.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,312 copying build/lib/flashinfer/data/csrc/pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,314 copying build/lib/flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,317 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe 2026-04-03T10:33:42,318 copying build/lib/flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-03T10:33:42,321 copying build/lib/flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-03T10:33:42,324 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:42,325 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:42,331 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:42,333 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:42,336 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-03T10:33:42,339 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:42,340 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,342 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/RoutingRenormalizeCommon.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,344 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramScoresKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,346 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchInitExpertCounts.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,348 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchClusterKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,351 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,353 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchBlockKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,355 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchOffsetsKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize 2026-04-03T10:33:42,357 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,358 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchMainKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,361 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/RoutingDeepSeekCommon.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,363 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchCoopKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,366 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchInitExpertCounts.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,368 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchClusterKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,370 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchHistogramKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,372 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchOffsetsKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek 2026-04-03T10:33:42,374 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_renormalize.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:42,376 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:42,378 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:42,381 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-03T10:33:42,383 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,385 copying build/lib/flashinfer/data/csrc/trtllm_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,388 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,390 copying build/lib/flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,393 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,394 copying build/lib/flashinfer/data/csrc/concat_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,396 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,398 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,400 copying build/lib/flashinfer/data/csrc/flashinfer_norm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,402 copying build/lib/flashinfer/data/csrc/single_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,404 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,406 copying build/lib/flashinfer/data/csrc/tgv_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,409 copying build/lib/flashinfer/data/csrc/batch_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,410 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,413 copying build/lib/flashinfer/data/csrc/pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,415 copying build/lib/flashinfer/data/csrc/pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,417 copying build/lib/flashinfer/data/csrc/tgv_gemm.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,419 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,421 copying build/lib/flashinfer/data/csrc/gdn_prefill_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,424 copying build/lib/flashinfer/data/csrc/tinygemm2.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,426 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,428 copying build/lib/flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,431 copying build/lib/flashinfer/data/csrc/topk.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,433 copying build/lib/flashinfer/data/csrc/batch_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,434 copying build/lib/flashinfer/data/csrc/dsv3_router_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,437 copying build/lib/flashinfer/data/csrc/batch_decode_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-03T10:33:42,439 copying build/lib/flashinfer/_build_meta.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,441 copying build/lib/flashinfer/trtllm_low_latency_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,443 copying build/lib/flashinfer/tllm_enums.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,446 copying build/lib/flashinfer/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,449 copying build/lib/flashinfer/compilation_context.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,452 copying build/lib/flashinfer/version.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,454 copying build/lib/flashinfer/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,465 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe 2026-04-03T10:33:42,466 copying build/lib/flashinfer/fused_moe/fused_routing_dsv3.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-03T10:33:42,469 copying build/lib/flashinfer/fused_moe/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-03T10:33:42,472 copying build/lib/flashinfer/fused_moe/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-03T10:33:42,475 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,478 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:42,479 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:42,487 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:42,494 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:42,497 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:42,501 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-03T10:33:42,505 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,509 copying build/lib/flashinfer/fused_moe/cute_dsl/tuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,513 copying build/lib/flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,517 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,522 copying build/lib/flashinfer/fused_moe/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,525 copying build/lib/flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-03T10:33:42,529 copying build/lib/flashinfer/fused_moe/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-03T10:33:42,537 copying build/lib/flashinfer/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,540 copying build/lib/flashinfer/sparse.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,546 copying build/lib/flashinfer/__main__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,549 copying build/lib/flashinfer/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,551 copying build/lib/flashinfer/py.typed -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,553 copying build/lib/flashinfer/artifacts.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-03T10:33:42,556 creating build/bdist.linux-armv7l/wheel/flashinfer/testing 2026-04-03T10:33:42,557 copying build/lib/flashinfer/testing/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-03T10:33:42,560 copying build/lib/flashinfer/testing/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-03T10:33:42,564 running install_egg_info 2026-04-03T10:33:42,577 running egg_info 2026-04-03T10:33:42,583 writing flashinfer_python.egg-info/PKG-INFO 2026-04-03T10:33:42,587 writing dependency_links to flashinfer_python.egg-info/dependency_links.txt 2026-04-03T10:33:42,590 writing entry points to flashinfer_python.egg-info/entry_points.txt 2026-04-03T10:33:42,592 writing requirements to flashinfer_python.egg-info/requires.txt 2026-04-03T10:33:42,594 writing top-level names to flashinfer_python.egg-info/top_level.txt 2026-04-03T10:33:43,390 reading manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-03T10:33:43,508 adding license file 'LICENSE' 2026-04-03T10:33:43,630 writing manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-03T10:33:43,637 Copying flashinfer_python.egg-info to build/bdist.linux-armv7l/wheel/./flashinfer_python-0.6.7.post1-py3.11.egg-info 2026-04-03T10:33:43,656 running install_scripts 2026-04-03T10:33:43,667 creating build/bdist.linux-armv7l/wheel/flashinfer_python-0.6.7.post1.dist-info/WHEEL 2026-04-03T10:33:43,670 creating '/tmp/pip-wheel-6s_9v3jz/.tmp-dv5axhl2/flashinfer_python-0.6.7.post1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-03T10:33:43,672 adding 'build_backend.py' 2026-04-03T10:33:43,673 adding 'build_utils.py' 2026-04-03T10:33:43,676 adding 'flashinfer/__init__.py' 2026-04-03T10:33:43,679 adding 'flashinfer/__main__.py' 2026-04-03T10:33:43,680 adding 'flashinfer/_build_meta.py' 2026-04-03T10:33:43,682 adding 'flashinfer/activation.py' 2026-04-03T10:33:43,685 adding 'flashinfer/aot.py' 2026-04-03T10:33:43,692 adding 'flashinfer/api_logging.py' 2026-04-03T10:33:43,694 adding 'flashinfer/artifacts.py' 2026-04-03T10:33:43,696 adding 'flashinfer/attention.py' 2026-04-03T10:33:43,702 adding 'flashinfer/autotuner.py' 2026-04-03T10:33:43,706 adding 'flashinfer/cascade.py' 2026-04-03T10:33:43,708 adding 'flashinfer/compilation_context.py' 2026-04-03T10:33:43,709 adding 'flashinfer/concat_ops.py' 2026-04-03T10:33:43,711 adding 'flashinfer/cuda_utils.py' 2026-04-03T10:33:43,721 adding 'flashinfer/decode.py' 2026-04-03T10:33:43,727 adding 'flashinfer/deep_gemm.py' 2026-04-03T10:33:43,728 adding 'flashinfer/fp4_quantization.py' 2026-04-03T10:33:43,729 adding 'flashinfer/fp8_quantization.py' 2026-04-03T10:33:43,732 adding 'flashinfer/gdn_decode.py' 2026-04-03T10:33:43,734 adding 'flashinfer/gdn_prefill.py' 2026-04-03T10:33:43,736 adding 'flashinfer/green_ctx.py' 2026-04-03T10:33:43,740 adding 'flashinfer/mla.py' 2026-04-03T10:33:43,742 adding 'flashinfer/page.py' 2026-04-03T10:33:43,745 adding 'flashinfer/pod.py' 2026-04-03T10:33:43,762 adding 'flashinfer/prefill.py' 2026-04-03T10:33:43,764 adding 'flashinfer/py.typed' 2026-04-03T10:33:43,769 adding 'flashinfer/rope.py' 2026-04-03T10:33:43,775 adding 'flashinfer/sampling.py' 2026-04-03T10:33:43,779 adding 'flashinfer/sparse.py' 2026-04-03T10:33:43,781 adding 'flashinfer/tllm_enums.py' 2026-04-03T10:33:43,782 adding 'flashinfer/tllm_utils.py' 2026-04-03T10:33:43,784 adding 'flashinfer/topk.py' 2026-04-03T10:33:43,785 adding 'flashinfer/trtllm_low_latency_gemm.py' 2026-04-03T10:33:43,790 adding 'flashinfer/utils.py' 2026-04-03T10:33:43,792 adding 'flashinfer/version.py' 2026-04-03T10:33:43,794 adding 'flashinfer/xqa.py' 2026-04-03T10:33:43,796 adding 'flashinfer/comm/__init__.py' 2026-04-03T10:33:43,799 adding 'flashinfer/comm/allreduce.py' 2026-04-03T10:33:43,801 adding 'flashinfer/comm/cuda_ipc.py' 2026-04-03T10:33:43,803 adding 'flashinfer/comm/dlpack_utils.py' 2026-04-03T10:33:43,805 adding 'flashinfer/comm/mapping.py' 2026-04-03T10:33:43,811 adding 'flashinfer/comm/mnnvl.py' 2026-04-03T10:33:43,812 adding 'flashinfer/comm/nvshmem.py' 2026-04-03T10:33:43,814 adding 'flashinfer/comm/nvshmem_allreduce.py' 2026-04-03T10:33:43,816 adding 'flashinfer/comm/trtllm_alltoall.py' 2026-04-03T10:33:43,820 adding 'flashinfer/comm/trtllm_ar.py' 2026-04-03T10:33:43,824 adding 'flashinfer/comm/trtllm_mnnvl_ar.py' 2026-04-03T10:33:43,827 adding 'flashinfer/comm/trtllm_moe_alltoall.py' 2026-04-03T10:33:43,828 adding 'flashinfer/comm/vllm_ar.py' 2026-04-03T10:33:43,830 adding 'flashinfer/comm/workspace_base.py' 2026-04-03T10:33:43,832 adding 'flashinfer/cudnn/__init__.py' 2026-04-03T10:33:43,834 adding 'flashinfer/cudnn/decode.py' 2026-04-03T10:33:43,837 adding 'flashinfer/cudnn/prefill.py' 2026-04-03T10:33:43,838 adding 'flashinfer/cudnn/utils.py' 2026-04-03T10:33:43,840 adding 'flashinfer/cute_dsl/__init__.py' 2026-04-03T10:33:43,845 adding 'flashinfer/cute_dsl/add_rmsnorm_fp4quant.py' 2026-04-03T10:33:43,847 adding 'flashinfer/cute_dsl/blockscaled_gemm.py' 2026-04-03T10:33:43,850 adding 'flashinfer/cute_dsl/fp4_common.py' 2026-04-03T10:33:43,858 adding 'flashinfer/cute_dsl/gemm_allreduce_two_shot.py' 2026-04-03T10:33:43,862 adding 'flashinfer/cute_dsl/rmsnorm_fp4quant.py' 2026-04-03T10:33:43,864 adding 'flashinfer/cute_dsl/utils.py' 2026-04-03T10:33:43,867 adding 'flashinfer/data/build_backend.py' 2026-04-03T10:33:43,868 adding 'flashinfer/data/build_utils.py' 2026-04-03T10:33:43,873 adding 'flashinfer/data/csrc/batch_attention.cu' 2026-04-03T10:33:43,874 adding 'flashinfer/data/csrc/batch_attention_customize_config.jinja' 2026-04-03T10:33:43,876 adding 'flashinfer/data/csrc/batch_attention_jit_binding.cu' 2026-04-03T10:33:43,877 adding 'flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja' 2026-04-03T10:33:43,879 adding 'flashinfer/data/csrc/batch_decode.cu' 2026-04-03T10:33:43,880 adding 'flashinfer/data/csrc/batch_decode_customize_config.jinja' 2026-04-03T10:33:43,882 adding 'flashinfer/data/csrc/batch_decode_jit_binding.cu' 2026-04-03T10:33:43,883 adding 'flashinfer/data/csrc/batch_decode_kernel_inst.jinja' 2026-04-03T10:33:43,884 adding 'flashinfer/data/csrc/batch_decode_mla_binding.cu' 2026-04-03T10:33:43,885 adding 'flashinfer/data/csrc/batch_decode_mla_config.jinja' 2026-04-03T10:33:43,887 adding 'flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu' 2026-04-03T10:33:43,888 adding 'flashinfer/data/csrc/batch_decode_mla_plan.cu' 2026-04-03T10:33:43,890 adding 'flashinfer/data/csrc/batch_decode_mla_run.cu' 2026-04-03T10:33:43,891 adding 'flashinfer/data/csrc/batch_mla_binding.cu' 2026-04-03T10:33:43,892 adding 'flashinfer/data/csrc/batch_mla_config.jinja' 2026-04-03T10:33:43,894 adding 'flashinfer/data/csrc/batch_mla_plan.cu' 2026-04-03T10:33:43,895 adding 'flashinfer/data/csrc/batch_mla_run.cu' 2026-04-03T10:33:43,897 adding 'flashinfer/data/csrc/batch_mla_sm90_binding.cu' 2026-04-03T10:33:43,898 adding 'flashinfer/data/csrc/batch_mla_sm90_plan.cu' 2026-04-03T10:33:43,900 adding 'flashinfer/data/csrc/batch_mla_sm90_run.cu' 2026-04-03T10:33:43,902 adding 'flashinfer/data/csrc/batch_pod.cu' 2026-04-03T10:33:43,903 adding 'flashinfer/data/csrc/batch_pod_customize_config.jinja' 2026-04-03T10:33:43,904 adding 'flashinfer/data/csrc/batch_pod_jit_binding.cu' 2026-04-03T10:33:43,906 adding 'flashinfer/data/csrc/batch_pod_kernel_inst.jinja' 2026-04-03T10:33:43,908 adding 'flashinfer/data/csrc/batch_prefill.cu' 2026-04-03T10:33:43,909 adding 'flashinfer/data/csrc/batch_prefill_customize_config.jinja' 2026-04-03T10:33:43,910 adding 'flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja' 2026-04-03T10:33:43,912 adding 'flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja' 2026-04-03T10:33:43,913 adding 'flashinfer/data/csrc/batch_prefill_fp8_sm90.cu' 2026-04-03T10:33:43,915 adding 'flashinfer/data/csrc/batch_prefill_jit_binding.cu' 2026-04-03T10:33:43,916 adding 'flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja' 2026-04-03T10:33:43,918 adding 'flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja' 2026-04-03T10:33:43,919 adding 'flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja' 2026-04-03T10:33:43,920 adding 'flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja' 2026-04-03T10:33:43,922 adding 'flashinfer/data/csrc/batch_prefill_sm90.cu' 2026-04-03T10:33:43,923 adding 'flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja' 2026-04-03T10:33:43,924 adding 'flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu' 2026-04-03T10:33:43,926 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.cu' 2026-04-03T10:33:43,927 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.jinja' 2026-04-03T10:33:43,928 adding 'flashinfer/data/csrc/blackwell_fmha_plan.cu' 2026-04-03T10:33:43,930 adding 'flashinfer/data/csrc/bmm_fp8.cu' 2026-04-03T10:33:43,931 adding 'flashinfer/data/csrc/cascade.cu' 2026-04-03T10:33:43,933 adding 'flashinfer/data/csrc/concat_mla.cu' 2026-04-03T10:33:43,937 adding 'flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu' 2026-04-03T10:33:43,940 adding 'flashinfer/data/csrc/cudnn_sdpa_utils.h' 2026-04-03T10:33:43,941 adding 'flashinfer/data/csrc/cutlass_mla.cu' 2026-04-03T10:33:43,943 adding 'flashinfer/data/csrc/dsv3_router_gemm.cu' 2026-04-03T10:33:43,944 adding 'flashinfer/data/csrc/flashinfer_cascade_binding.cu' 2026-04-03T10:33:43,945 adding 'flashinfer/data/csrc/flashinfer_gemm_binding.cu' 2026-04-03T10:33:43,947 adding 'flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu' 2026-04-03T10:33:43,948 adding 'flashinfer/data/csrc/flashinfer_mamba_binding.cu' 2026-04-03T10:33:43,949 adding 'flashinfer/data/csrc/flashinfer_mla_binding.cu' 2026-04-03T10:33:43,951 adding 'flashinfer/data/csrc/flashinfer_norm_binding.cu' 2026-04-03T10:33:43,952 adding 'flashinfer/data/csrc/flashinfer_page_binding.cu' 2026-04-03T10:33:43,953 adding 'flashinfer/data/csrc/flashinfer_quantization_binding.cu' 2026-04-03T10:33:43,955 adding 'flashinfer/data/csrc/flashinfer_rope_binding.cu' 2026-04-03T10:33:43,956 adding 'flashinfer/data/csrc/flashinfer_sampling_binding.cu' 2026-04-03T10:33:43,958 adding 'flashinfer/data/csrc/flashinfer_topk_binding.cu' 2026-04-03T10:33:43,959 adding 'flashinfer/data/csrc/flashinfer_xqa_binding.cu' 2026-04-03T10:33:43,960 adding 'flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc' 2026-04-03T10:33:43,963 adding 'flashinfer/data/csrc/fmhaReduction.cu' 2026-04-03T10:33:43,965 adding 'flashinfer/data/csrc/fmha_cutlass_sm100.cu' 2026-04-03T10:33:43,966 adding 'flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu' 2026-04-03T10:33:43,968 adding 'flashinfer/data/csrc/fmha_v2_jit_binding.cu' 2026-04-03T10:33:43,971 adding 'flashinfer/data/csrc/fmha_v2_run.cu' 2026-04-03T10:33:43,973 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.cu' 2026-04-03T10:33:43,974 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.jinja' 2026-04-03T10:33:43,976 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu' 2026-04-03T10:33:43,977 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja' 2026-04-03T10:33:43,979 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu' 2026-04-03T10:33:43,980 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja' 2026-04-03T10:33:43,981 adding 'flashinfer/data/csrc/fp4_kv_dequantization.cu' 2026-04-03T10:33:43,983 adding 'flashinfer/data/csrc/fp4_kv_quantization.cu' 2026-04-03T10:33:43,985 adding 'flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu' 2026-04-03T10:33:43,987 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.cu' 2026-04-03T10:33:43,988 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.jinja' 2026-04-03T10:33:43,989 adding 'flashinfer/data/csrc/gdn_prefill_launcher.cu' 2026-04-03T10:33:43,991 adding 'flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja' 2026-04-03T10:33:43,992 adding 'flashinfer/data/csrc/gemm_groupwise_sm100.cu' 2026-04-03T10:33:43,994 adding 'flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja' 2026-04-03T10:33:43,995 adding 'flashinfer/data/csrc/gemm_groupwise_sm120.cu' 2026-04-03T10:33:43,997 adding 'flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja' 2026-04-03T10:33:43,998 adding 'flashinfer/data/csrc/gemm_sm100_binding.cu' 2026-04-03T10:33:43,999 adding 'flashinfer/data/csrc/gemm_sm120_binding.cu' 2026-04-03T10:33:44,000 adding 'flashinfer/data/csrc/group_gemm.cu' 2026-04-03T10:33:44,002 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu' 2026-04-03T10:33:44,003 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja' 2026-04-03T10:33:44,005 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu' 2026-04-03T10:33:44,006 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja' 2026-04-03T10:33:44,008 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu' 2026-04-03T10:33:44,009 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja' 2026-04-03T10:33:44,010 adding 'flashinfer/data/csrc/group_gemm_sm100_binding.cu' 2026-04-03T10:33:44,012 adding 'flashinfer/data/csrc/group_gemm_sm120_binding.cu' 2026-04-03T10:33:44,013 adding 'flashinfer/data/csrc/group_gemm_sm90.cu' 2026-04-03T10:33:44,014 adding 'flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja' 2026-04-03T10:33:44,016 adding 'flashinfer/data/csrc/logging.cc' 2026-04-03T10:33:44,018 adding 'flashinfer/data/csrc/moe_utils_binding.cu' 2026-04-03T10:33:44,020 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.cu' 2026-04-03T10:33:44,021 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja' 2026-04-03T10:33:44,022 adding 'flashinfer/data/csrc/norm.cu' 2026-04-03T10:33:44,024 adding 'flashinfer/data/csrc/nvshmem_binding.cu' 2026-04-03T10:33:44,026 adding 'flashinfer/data/csrc/page.cu' 2026-04-03T10:33:44,028 adding 'flashinfer/data/csrc/pod.cu' 2026-04-03T10:33:44,029 adding 'flashinfer/data/csrc/pod_customize_config.jinja' 2026-04-03T10:33:44,030 adding 'flashinfer/data/csrc/pod_jit_binding.cu' 2026-04-03T10:33:44,031 adding 'flashinfer/data/csrc/pod_kernel_inst.jinja' 2026-04-03T10:33:44,033 adding 'flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu' 2026-04-03T10:33:44,034 adding 'flashinfer/data/csrc/quantization.cu' 2026-04-03T10:33:44,036 adding 'flashinfer/data/csrc/renorm.cu' 2026-04-03T10:33:44,039 adding 'flashinfer/data/csrc/rope.cu' 2026-04-03T10:33:44,040 adding 'flashinfer/data/csrc/runtime_utils.h' 2026-04-03T10:33:44,042 adding 'flashinfer/data/csrc/sampling.cu' 2026-04-03T10:33:44,043 adding 'flashinfer/data/csrc/sampling_utils.h' 2026-04-03T10:33:44,046 adding 'flashinfer/data/csrc/selective_state_update.cu' 2026-04-03T10:33:44,048 adding 'flashinfer/data/csrc/selective_state_update_customize_config.jinja' 2026-04-03T10:33:44,049 adding 'flashinfer/data/csrc/selective_state_update_dtype_inst.jinja' 2026-04-03T10:33:44,050 adding 'flashinfer/data/csrc/selective_state_update_kernel_inst.cu' 2026-04-03T10:33:44,052 adding 'flashinfer/data/csrc/seq_chunk_cumsum.cu' 2026-04-03T10:33:44,053 adding 'flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu' 2026-04-03T10:33:44,054 adding 'flashinfer/data/csrc/single_decode.cu' 2026-04-03T10:33:44,055 adding 'flashinfer/data/csrc/single_decode_customize_config.jinja' 2026-04-03T10:33:44,056 adding 'flashinfer/data/csrc/single_decode_jit_binding.cu' 2026-04-03T10:33:44,057 adding 'flashinfer/data/csrc/single_decode_kernel_inst.jinja' 2026-04-03T10:33:44,059 adding 'flashinfer/data/csrc/single_prefill.cu' 2026-04-03T10:33:44,060 adding 'flashinfer/data/csrc/single_prefill_customize_config.jinja' 2026-04-03T10:33:44,061 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90.cu' 2026-04-03T10:33:44,063 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja' 2026-04-03T10:33:44,064 adding 'flashinfer/data/csrc/single_prefill_jit_binding.cu' 2026-04-03T10:33:44,065 adding 'flashinfer/data/csrc/single_prefill_kernel_inst.jinja' 2026-04-03T10:33:44,066 adding 'flashinfer/data/csrc/single_prefill_sm90.cu' 2026-04-03T10:33:44,068 adding 'flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja' 2026-04-03T10:33:44,069 adding 'flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu' 2026-04-03T10:33:44,070 adding 'flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja' 2026-04-03T10:33:44,072 adding 'flashinfer/data/csrc/tgv_gemm.cu' 2026-04-03T10:33:44,073 adding 'flashinfer/data/csrc/tgv_gemm.jinja' 2026-04-03T10:33:44,076 adding 'flashinfer/data/csrc/tinygemm2.cu' 2026-04-03T10:33:44,077 adding 'flashinfer/data/csrc/topk.cu' 2026-04-03T10:33:44,079 adding 'flashinfer/data/csrc/trtllm_allreduce.cu' 2026-04-03T10:33:44,080 adding 'flashinfer/data/csrc/trtllm_allreduce_fusion.cu' 2026-04-03T10:33:44,082 adding 'flashinfer/data/csrc/trtllm_alltoall.cu' 2026-04-03T10:33:44,085 adding 'flashinfer/data/csrc/trtllm_alltoall_prepare.cu' 2026-04-03T10:33:44,088 adding 'flashinfer/data/csrc/trtllm_batched_gemm_runner.cu' 2026-04-03T10:33:44,091 adding 'flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu' 2026-04-03T10:33:44,093 adding 'flashinfer/data/csrc/trtllm_fmha_v2_binding.cu' 2026-04-03T10:33:44,102 adding 'flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu' 2026-04-03T10:33:44,106 adding 'flashinfer/data/csrc/trtllm_fused_moe_runner.cu' 2026-04-03T10:33:44,108 adding 'flashinfer/data/csrc/trtllm_gemm_runner.cu' 2026-04-03T10:33:44,110 adding 'flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu' 2026-04-03T10:33:44,112 adding 'flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu' 2026-04-03T10:33:44,113 adding 'flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu' 2026-04-03T10:33:44,115 adding 'flashinfer/data/csrc/trtllm_moe_alltoall.cu' 2026-04-03T10:33:44,117 adding 'flashinfer/data/csrc/tvm_ffi_utils.h' 2026-04-03T10:33:44,119 adding 'flashinfer/data/csrc/vllm_custom_all_reduce.cu' 2026-04-03T10:33:44,122 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h' 2026-04-03T10:33:44,123 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h' 2026-04-03T10:33:44,125 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h' 2026-04-03T10:33:44,127 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h' 2026-04-03T10:33:44,129 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h' 2026-04-03T10:33:44,131 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h' 2026-04-03T10:33:44,133 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h' 2026-04-03T10:33:44,136 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h' 2026-04-03T10:33:44,139 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h' 2026-04-03T10:33:44,141 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h' 2026-04-03T10:33:44,143 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h' 2026-04-03T10:33:44,148 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h' 2026-04-03T10:33:44,150 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h' 2026-04-03T10:33:44,152 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h' 2026-04-03T10:33:44,154 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h' 2026-04-03T10:33:44,156 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h' 2026-04-03T10:33:44,159 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h' 2026-04-03T10:33:44,162 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h' 2026-04-03T10:33:44,164 adding 'flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h' 2026-04-03T10:33:44,168 adding 'flashinfer/data/csrc/fmha_v2/fmha/fragment.h' 2026-04-03T10:33:44,170 adding 'flashinfer/data/csrc/fmha_v2/fmha/gemm.h' 2026-04-03T10:33:44,172 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h' 2026-04-03T10:33:44,176 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h' 2026-04-03T10:33:44,179 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h' 2026-04-03T10:33:44,180 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h' 2026-04-03T10:33:44,184 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h' 2026-04-03T10:33:44,187 adding 'flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h' 2026-04-03T10:33:44,190 adding 'flashinfer/data/csrc/fmha_v2/fmha/mask.h' 2026-04-03T10:33:44,191 adding 'flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h' 2026-04-03T10:33:44,192 adding 'flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h' 2026-04-03T10:33:44,197 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h' 2026-04-03T10:33:44,202 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h' 2026-04-03T10:33:44,205 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h' 2026-04-03T10:33:44,207 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h' 2026-04-03T10:33:44,218 adding 'flashinfer/data/csrc/fmha_v2/fmha/softmax.h' 2026-04-03T10:33:44,222 adding 'flashinfer/data/csrc/fmha_v2/fmha/traits.h' 2026-04-03T10:33:44,228 adding 'flashinfer/data/csrc/fmha_v2/fmha/utils.h' 2026-04-03T10:33:44,231 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h' 2026-04-03T10:33:44,233 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h' 2026-04-03T10:33:44,235 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h' 2026-04-03T10:33:44,239 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h' 2026-04-03T10:33:44,241 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h' 2026-04-03T10:33:44,243 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h' 2026-04-03T10:33:44,246 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h' 2026-04-03T10:33:44,252 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h' 2026-04-03T10:33:44,254 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h' 2026-04-03T10:33:44,256 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h' 2026-04-03T10:33:44,258 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h' 2026-04-03T10:33:44,259 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h' 2026-04-03T10:33:44,262 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h' 2026-04-03T10:33:44,264 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h' 2026-04-03T10:33:44,266 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h' 2026-04-03T10:33:44,270 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h' 2026-04-03T10:33:44,273 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h' 2026-04-03T10:33:44,274 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h' 2026-04-03T10:33:44,276 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h' 2026-04-03T10:33:44,279 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h' 2026-04-03T10:33:44,283 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h' 2026-04-03T10:33:44,287 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h' 2026-04-03T10:33:44,290 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h' 2026-04-03T10:33:44,292 adding 'flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja' 2026-04-03T10:33:44,294 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel.jinja' 2026-04-03T10:33:44,296 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja' 2026-04-03T10:33:44,298 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja' 2026-04-03T10:33:44,300 adding 'flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh' 2026-04-03T10:33:44,302 adding 'flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu' 2026-04-03T10:33:44,304 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu' 2026-04-03T10:33:44,327 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh' 2026-04-03T10:33:44,330 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu' 2026-04-03T10:33:44,335 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu' 2026-04-03T10:33:44,340 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu' 2026-04-03T10:33:44,342 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu' 2026-04-03T10:33:44,344 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu' 2026-04-03T10:33:44,346 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_renormalize.cu' 2026-04-03T10:33:44,348 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/RoutingDeepSeekCommon.cuh' 2026-04-03T10:33:44,349 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchClusterKernel.cu' 2026-04-03T10:33:44,351 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchCoopKernel.cu' 2026-04-03T10:33:44,352 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchHistogramKernel.cu' 2026-04-03T10:33:44,354 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchInitExpertCounts.cu' 2026-04-03T10:33:44,356 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchMainKernel.cu' 2026-04-03T10:33:44,357 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingDeepSeek/launchOffsetsKernel.cu' 2026-04-03T10:33:44,359 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/RoutingRenormalizeCommon.cuh' 2026-04-03T10:33:44,361 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchBlockKernel.cu' 2026-04-03T10:33:44,362 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchClusterKernel.cu' 2026-04-03T10:33:44,364 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramKernel.cu' 2026-04-03T10:33:44,365 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchHistogramScoresKernel.cu' 2026-04-03T10:33:44,367 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchInitExpertCounts.cu' 2026-04-03T10:33:44,368 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/routingRenormalize/launchOffsetsKernel.cu' 2026-04-03T10:33:44,373 adding 'flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp' 2026-04-03T10:33:44,374 adding 'flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp' 2026-04-03T10:33:44,378 adding 'flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu' 2026-04-03T10:33:44,380 adding 'flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp' 2026-04-03T10:33:44,382 adding 'flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp' 2026-04-03T10:33:44,386 adding 'flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu' 2026-04-03T10:33:44,390 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h' 2026-04-03T10:33:44,391 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h' 2026-04-03T10:33:44,392 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h' 2026-04-03T10:33:44,394 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h' 2026-04-03T10:33:44,395 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h' 2026-04-03T10:33:44,399 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h' 2026-04-03T10:33:44,401 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h' 2026-04-03T10:33:44,402 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h' 2026-04-03T10:33:44,404 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h' 2026-04-03T10:33:44,405 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h' 2026-04-03T10:33:44,407 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h' 2026-04-03T10:33:44,409 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h' 2026-04-03T10:33:44,411 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh' 2026-04-03T10:33:44,412 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h' 2026-04-03T10:33:44,414 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh' 2026-04-03T10:33:44,416 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h' 2026-04-03T10:33:44,418 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h' 2026-04-03T10:33:44,419 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh' 2026-04-03T10:33:44,421 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh' 2026-04-03T10:33:44,423 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h' 2026-04-03T10:33:44,426 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h' 2026-04-03T10:33:44,427 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h' 2026-04-03T10:33:44,430 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h' 2026-04-03T10:33:44,432 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h' 2026-04-03T10:33:44,433 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h' 2026-04-03T10:33:44,435 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h' 2026-04-03T10:33:44,436 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h' 2026-04-03T10:33:44,438 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp' 2026-04-03T10:33:44,440 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp' 2026-04-03T10:33:44,441 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp' 2026-04-03T10:33:44,443 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h' 2026-04-03T10:33:44,444 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h' 2026-04-03T10:33:44,447 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp' 2026-04-03T10:33:44,451 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp' 2026-04-03T10:33:44,455 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp' 2026-04-03T10:33:44,458 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp' 2026-04-03T10:33:44,461 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp' 2026-04-03T10:33:44,463 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h' 2026-04-03T10:33:44,465 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp' 2026-04-03T10:33:44,467 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp' 2026-04-03T10:33:44,468 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp' 2026-04-03T10:33:44,470 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp' 2026-04-03T10:33:44,471 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp' 2026-04-03T10:33:44,472 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp' 2026-04-03T10:33:44,479 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp' 2026-04-03T10:33:44,483 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp' 2026-04-03T10:33:44,486 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-03T10:33:44,494 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-03T10:33:44,496 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl' 2026-04-03T10:33:44,498 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl' 2026-04-03T10:33:44,500 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl' 2026-04-03T10:33:44,503 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h' 2026-04-03T10:33:44,505 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh' 2026-04-03T10:33:44,508 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh' 2026-04-03T10:33:44,510 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh' 2026-04-03T10:33:44,511 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h' 2026-04-03T10:33:44,513 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp' 2026-04-03T10:33:44,515 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h' 2026-04-03T10:33:44,516 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh' 2026-04-03T10:33:44,519 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h' 2026-04-03T10:33:44,521 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h' 2026-04-03T10:33:44,524 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp' 2026-04-03T10:33:44,528 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp' 2026-04-03T10:33:44,530 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h' 2026-04-03T10:33:44,532 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h' 2026-04-03T10:33:44,534 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h' 2026-04-03T10:33:44,536 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h' 2026-04-03T10:33:44,538 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h' 2026-04-03T10:33:44,540 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h' 2026-04-03T10:33:44,541 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h' 2026-04-03T10:33:44,544 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h' 2026-04-03T10:33:44,547 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h' 2026-04-03T10:33:44,549 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h' 2026-04-03T10:33:44,551 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h' 2026-04-03T10:33:44,554 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h' 2026-04-03T10:33:44,556 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h' 2026-04-03T10:33:44,558 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h' 2026-04-03T10:33:44,560 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h' 2026-04-03T10:33:44,563 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h' 2026-04-03T10:33:44,565 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp' 2026-04-03T10:33:44,568 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh' 2026-04-03T10:33:44,570 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh' 2026-04-03T10:33:44,574 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh' 2026-04-03T10:33:44,576 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh' 2026-04-03T10:33:44,579 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh' 2026-04-03T10:33:44,586 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh' 2026-04-03T10:33:44,588 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh' 2026-04-03T10:33:44,590 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh' 2026-04-03T10:33:44,592 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh' 2026-04-03T10:33:44,594 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh' 2026-04-03T10:33:44,595 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh' 2026-04-03T10:33:44,597 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu' 2026-04-03T10:33:44,598 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h' 2026-04-03T10:33:44,600 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu' 2026-04-03T10:33:44,601 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h' 2026-04-03T10:33:44,604 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh' 2026-04-03T10:33:44,606 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h' 2026-04-03T10:33:44,609 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh' 2026-04-03T10:33:44,614 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu' 2026-04-03T10:33:44,616 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h' 2026-04-03T10:33:44,619 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu' 2026-04-03T10:33:44,620 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h' 2026-04-03T10:33:44,624 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp' 2026-04-03T10:33:44,626 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h' 2026-04-03T10:33:44,627 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h' 2026-04-03T10:33:44,630 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu' 2026-04-03T10:33:44,632 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h' 2026-04-03T10:33:44,639 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh' 2026-04-03T10:33:44,642 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh' 2026-04-03T10:33:44,643 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh' 2026-04-03T10:33:44,647 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh' 2026-04-03T10:33:44,649 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh' 2026-04-03T10:33:44,651 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu' 2026-04-03T10:33:44,652 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu' 2026-04-03T10:33:44,653 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu' 2026-04-03T10:33:44,655 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu' 2026-04-03T10:33:44,656 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu' 2026-04-03T10:33:44,657 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu' 2026-04-03T10:33:44,659 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu' 2026-04-03T10:33:44,660 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu' 2026-04-03T10:33:44,661 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu' 2026-04-03T10:33:44,662 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu' 2026-04-03T10:33:44,664 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu' 2026-04-03T10:33:44,665 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu' 2026-04-03T10:33:44,666 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu' 2026-04-03T10:33:44,667 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu' 2026-04-03T10:33:44,669 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu' 2026-04-03T10:33:44,670 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu' 2026-04-03T10:33:44,671 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu' 2026-04-03T10:33:44,673 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h' 2026-04-03T10:33:44,676 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h' 2026-04-03T10:33:44,678 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h' 2026-04-03T10:33:44,680 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h' 2026-04-03T10:33:44,682 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl' 2026-04-03T10:33:44,684 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h' 2026-04-03T10:33:44,685 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h' 2026-04-03T10:33:44,687 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h' 2026-04-03T10:33:44,692 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h' 2026-04-03T10:33:44,694 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h' 2026-04-03T10:33:44,696 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu' 2026-04-03T10:33:44,698 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu' 2026-04-03T10:33:44,699 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu' 2026-04-03T10:33:44,700 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu' 2026-04-03T10:33:44,702 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu' 2026-04-03T10:33:44,703 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu' 2026-04-03T10:33:44,704 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu' 2026-04-03T10:33:44,705 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu' 2026-04-03T10:33:44,707 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu' 2026-04-03T10:33:44,708 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu' 2026-04-03T10:33:44,709 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu' 2026-04-03T10:33:44,710 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu' 2026-04-03T10:33:44,712 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu' 2026-04-03T10:33:44,713 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu' 2026-04-03T10:33:44,717 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h' 2026-04-03T10:33:44,721 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h' 2026-04-03T10:33:44,723 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h' 2026-04-03T10:33:44,724 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu' 2026-04-03T10:33:44,726 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh' 2026-04-03T10:33:44,727 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h' 2026-04-03T10:33:44,729 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h' 2026-04-03T10:33:44,731 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl' 2026-04-03T10:33:44,732 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h' 2026-04-03T10:33:44,739 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl' 2026-04-03T10:33:44,742 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h' 2026-04-03T10:33:44,744 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl' 2026-04-03T10:33:44,746 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp' 2026-04-03T10:33:44,748 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h' 2026-04-03T10:33:44,751 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp' 2026-04-03T10:33:44,752 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp' 2026-04-03T10:33:44,754 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h' 2026-04-03T10:33:44,756 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp' 2026-04-03T10:33:44,757 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h' 2026-04-03T10:33:44,758 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h' 2026-04-03T10:33:44,760 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h' 2026-04-03T10:33:44,762 adding 'flashinfer/data/csrc/xqa/barriers.cuh' 2026-04-03T10:33:44,763 adding 'flashinfer/data/csrc/xqa/cuda_hint.cuh' 2026-04-03T10:33:44,765 adding 'flashinfer/data/csrc/xqa/defines.h' 2026-04-03T10:33:44,766 adding 'flashinfer/data/csrc/xqa/gmma.cuh' 2026-04-03T10:33:44,776 adding 'flashinfer/data/csrc/xqa/gmma_impl.cuh' 2026-04-03T10:33:44,779 adding 'flashinfer/data/csrc/xqa/hostUtils.h' 2026-04-03T10:33:44,780 adding 'flashinfer/data/csrc/xqa/ldgsts.cuh' 2026-04-03T10:33:44,794 adding 'flashinfer/data/csrc/xqa/mha.cu' 2026-04-03T10:33:44,797 adding 'flashinfer/data/csrc/xqa/mha.h' 2026-04-03T10:33:44,799 adding 'flashinfer/data/csrc/xqa/mhaUtils.cuh' 2026-04-03T10:33:44,801 adding 'flashinfer/data/csrc/xqa/mha_components.cuh' 2026-04-03T10:33:44,814 adding 'flashinfer/data/csrc/xqa/mha_sm90.cu' 2026-04-03T10:33:44,818 adding 'flashinfer/data/csrc/xqa/mha_stdheaders.cuh' 2026-04-03T10:33:44,826 adding 'flashinfer/data/csrc/xqa/mla_sm120.cu' 2026-04-03T10:33:44,827 adding 'flashinfer/data/csrc/xqa/mla_sm120.cuh' 2026-04-03T10:33:44,829 adding 'flashinfer/data/csrc/xqa/mma.cuh' 2026-04-03T10:33:44,830 adding 'flashinfer/data/csrc/xqa/platform.h' 2026-04-03T10:33:44,831 adding 'flashinfer/data/csrc/xqa/specDec.h' 2026-04-03T10:33:44,832 adding 'flashinfer/data/csrc/xqa/tensorMap.cpp' 2026-04-03T10:33:44,833 adding 'flashinfer/data/csrc/xqa/tensorMap.h' 2026-04-03T10:33:44,835 adding 'flashinfer/data/csrc/xqa/tma.h' 2026-04-03T10:33:44,839 adding 'flashinfer/data/csrc/xqa/utils.cuh' 2026-04-03T10:33:44,841 adding 'flashinfer/data/csrc/xqa/utils.h' 2026-04-03T10:33:44,842 adding 'flashinfer/data/csrc/xqa/xqa_wrapper.cu' 2026-04-03T10:33:44,846 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py' 2026-04-03T10:33:44,847 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py' 2026-04-03T10:33:44,849 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py' 2026-04-03T10:33:44,852 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py' 2026-04-03T10:33:44,854 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py' 2026-04-03T10:33:44,856 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py' 2026-04-03T10:33:44,859 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py' 2026-04-03T10:33:44,860 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py' 2026-04-03T10:33:44,863 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py' 2026-04-03T10:33:44,864 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py' 2026-04-03T10:33:44,866 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py' 2026-04-03T10:33:44,868 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py' 2026-04-03T10:33:44,870 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py' 2026-04-03T10:33:44,873 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py' 2026-04-03T10:33:44,875 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py' 2026-04-03T10:33:44,879 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py' 2026-04-03T10:33:44,882 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py' 2026-04-03T10:33:44,883 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py' 2026-04-03T10:33:44,885 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py' 2026-04-03T10:33:44,886 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py' 2026-04-03T10:33:44,889 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py' 2026-04-03T10:33:44,891 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py' 2026-04-03T10:33:44,894 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py' 2026-04-03T10:33:44,896 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py' 2026-04-03T10:33:44,898 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py' 2026-04-03T10:33:44,900 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py' 2026-04-03T10:33:44,902 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py' 2026-04-03T10:33:44,908 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py' 2026-04-03T10:33:44,912 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py' 2026-04-03T10:33:44,914 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py' 2026-04-03T10:33:44,918 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py' 2026-04-03T10:33:44,920 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py' 2026-04-03T10:33:44,924 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py' 2026-04-03T10:33:44,936 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py' 2026-04-03T10:33:44,947 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py' 2026-04-03T10:33:44,957 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-03T10:33:44,965 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py' 2026-04-03T10:33:44,973 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py' 2026-04-03T10:33:44,981 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py' 2026-04-03T10:33:44,989 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py' 2026-04-03T10:33:44,997 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py' 2026-04-03T10:33:45,005 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py' 2026-04-03T10:33:45,016 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py' 2026-04-03T10:33:45,028 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py' 2026-04-03T10:33:45,040 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py' 2026-04-03T10:33:45,050 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py' 2026-04-03T10:33:45,067 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py' 2026-04-03T10:33:45,071 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py' 2026-04-03T10:33:45,074 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py' 2026-04-03T10:33:45,077 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py' 2026-04-03T10:33:45,088 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py' 2026-04-03T10:33:45,099 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py' 2026-04-03T10:33:45,110 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py' 2026-04-03T10:33:45,120 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py' 2026-04-03T10:33:45,124 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py' 2026-04-03T10:33:45,133 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py' 2026-04-03T10:33:45,139 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py' 2026-04-03T10:33:45,142 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py' 2026-04-03T10:33:45,145 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py' 2026-04-03T10:33:45,157 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py' 2026-04-03T10:33:45,160 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py' 2026-04-03T10:33:45,162 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py' 2026-04-03T10:33:45,170 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py' 2026-04-03T10:33:45,178 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py' 2026-04-03T10:33:45,186 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py' 2026-04-03T10:33:45,188 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py' 2026-04-03T10:33:45,198 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py' 2026-04-03T10:33:45,208 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py' 2026-04-03T10:33:45,218 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py' 2026-04-03T10:33:45,221 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py' 2026-04-03T10:33:45,237 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py' 2026-04-03T10:33:45,253 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py' 2026-04-03T10:33:45,256 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py' 2026-04-03T10:33:45,259 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py' 2026-04-03T10:33:45,261 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py' 2026-04-03T10:33:45,265 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py' 2026-04-03T10:33:45,268 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py' 2026-04-03T10:33:45,271 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py' 2026-04-03T10:33:45,276 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py' 2026-04-03T10:33:45,279 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py' 2026-04-03T10:33:45,280 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py' 2026-04-03T10:33:45,282 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py' 2026-04-03T10:33:45,284 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py' 2026-04-03T10:33:45,286 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py' 2026-04-03T10:33:45,289 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py' 2026-04-03T10:33:45,290 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py' 2026-04-03T10:33:45,292 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py' 2026-04-03T10:33:45,293 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py' 2026-04-03T10:33:45,294 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py' 2026-04-03T10:33:45,295 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py' 2026-04-03T10:33:45,297 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py' 2026-04-03T10:33:45,298 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py' 2026-04-03T10:33:45,301 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py' 2026-04-03T10:33:45,303 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py' 2026-04-03T10:33:45,306 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py' 2026-04-03T10:33:45,308 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py' 2026-04-03T10:33:45,318 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py' 2026-04-03T10:33:45,328 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py' 2026-04-03T10:33:45,338 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py' 2026-04-03T10:33:45,341 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py' 2026-04-03T10:33:45,346 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py' 2026-04-03T10:33:45,353 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py' 2026-04-03T10:33:45,356 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py' 2026-04-03T10:33:45,364 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py' 2026-04-03T10:33:45,367 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py' 2026-04-03T10:33:45,369 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py' 2026-04-03T10:33:45,372 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py' 2026-04-03T10:33:45,375 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py' 2026-04-03T10:33:45,381 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py' 2026-04-03T10:33:45,387 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py' 2026-04-03T10:33:45,396 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py' 2026-04-03T10:33:45,399 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py' 2026-04-03T10:33:45,401 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py' 2026-04-03T10:33:45,402 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py' 2026-04-03T10:33:45,404 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py' 2026-04-03T10:33:45,406 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py' 2026-04-03T10:33:45,409 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py' 2026-04-03T10:33:45,412 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py' 2026-04-03T10:33:45,413 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py' 2026-04-03T10:33:45,416 adding 'flashinfer/data/cutlass/include/cute/config.hpp' 2026-04-03T10:33:45,419 adding 'flashinfer/data/cutlass/include/cute/int_tuple.hpp' 2026-04-03T10:33:45,426 adding 'flashinfer/data/cutlass/include/cute/layout.hpp' 2026-04-03T10:33:45,429 adding 'flashinfer/data/cutlass/include/cute/layout_composed.hpp' 2026-04-03T10:33:45,430 adding 'flashinfer/data/cutlass/include/cute/pointer.hpp' 2026-04-03T10:33:45,432 adding 'flashinfer/data/cutlass/include/cute/pointer_base.hpp' 2026-04-03T10:33:45,434 adding 'flashinfer/data/cutlass/include/cute/pointer_flagged.hpp' 2026-04-03T10:33:45,435 adding 'flashinfer/data/cutlass/include/cute/pointer_sparse.hpp' 2026-04-03T10:33:45,437 adding 'flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp' 2026-04-03T10:33:45,439 adding 'flashinfer/data/cutlass/include/cute/stride.hpp' 2026-04-03T10:33:45,442 adding 'flashinfer/data/cutlass/include/cute/swizzle.hpp' 2026-04-03T10:33:45,444 adding 'flashinfer/data/cutlass/include/cute/swizzle_layout.hpp' 2026-04-03T10:33:45,446 adding 'flashinfer/data/cutlass/include/cute/tensor.hpp' 2026-04-03T10:33:45,450 adding 'flashinfer/data/cutlass/include/cute/tensor_impl.hpp' 2026-04-03T10:33:45,452 adding 'flashinfer/data/cutlass/include/cute/tensor_zip.hpp' 2026-04-03T10:33:45,453 adding 'flashinfer/data/cutlass/include/cute/underscore.hpp' 2026-04-03T10:33:45,456 adding 'flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp' 2026-04-03T10:33:45,457 adding 'flashinfer/data/cutlass/include/cute/algorithm/clear.hpp' 2026-04-03T10:33:45,459 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp' 2026-04-03T10:33:45,462 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp' 2026-04-03T10:33:45,464 adding 'flashinfer/data/cutlass/include/cute/algorithm/copy.hpp' 2026-04-03T10:33:45,466 adding 'flashinfer/data/cutlass/include/cute/algorithm/fill.hpp' 2026-04-03T10:33:45,468 adding 'flashinfer/data/cutlass/include/cute/algorithm/functional.hpp' 2026-04-03T10:33:45,470 adding 'flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp' 2026-04-03T10:33:45,471 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp' 2026-04-03T10:33:45,473 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp' 2026-04-03T10:33:45,474 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp' 2026-04-03T10:33:45,476 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp' 2026-04-03T10:33:45,479 adding 'flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp' 2026-04-03T10:33:45,482 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp' 2026-04-03T10:33:45,483 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp' 2026-04-03T10:33:45,485 adding 'flashinfer/data/cutlass/include/cute/arch/config.hpp' 2026-04-03T10:33:45,486 adding 'flashinfer/data/cutlass/include/cute/arch/copy.hpp' 2026-04-03T10:33:45,498 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp' 2026-04-03T10:33:45,503 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp' 2026-04-03T10:33:45,505 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp' 2026-04-03T10:33:45,507 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp' 2026-04-03T10:33:45,508 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp' 2026-04-03T10:33:45,510 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp' 2026-04-03T10:33:45,512 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp' 2026-04-03T10:33:45,516 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp' 2026-04-03T10:33:45,518 adding 'flashinfer/data/cutlass/include/cute/arch/mma.hpp' 2026-04-03T10:33:45,519 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp' 2026-04-03T10:33:45,522 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp' 2026-04-03T10:33:45,526 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp' 2026-04-03T10:33:45,531 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp' 2026-04-03T10:33:45,536 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp' 2026-04-03T10:33:45,538 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp' 2026-04-03T10:33:45,540 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp' 2026-04-03T10:33:45,541 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp' 2026-04-03T10:33:45,544 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp' 2026-04-03T10:33:45,546 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp' 2026-04-03T10:33:45,559 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp' 2026-04-03T10:33:45,563 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp' 2026-04-03T10:33:45,598 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp' 2026-04-03T10:33:45,687 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp' 2026-04-03T10:33:45,741 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp' 2026-04-03T10:33:45,837 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp' 2026-04-03T10:33:45,856 adding 'flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp' 2026-04-03T10:33:45,857 adding 'flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp' 2026-04-03T10:33:45,859 adding 'flashinfer/data/cutlass/include/cute/arch/util.hpp' 2026-04-03T10:33:45,863 adding 'flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp' 2026-04-03T10:33:45,865 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp' 2026-04-03T10:33:45,872 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp' 2026-04-03T10:33:45,875 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp' 2026-04-03T10:33:45,878 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp' 2026-04-03T10:33:45,879 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp' 2026-04-03T10:33:45,881 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp' 2026-04-03T10:33:45,882 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp' 2026-04-03T10:33:45,884 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp' 2026-04-03T10:33:45,887 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp' 2026-04-03T10:33:45,895 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp' 2026-04-03T10:33:45,896 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp' 2026-04-03T10:33:45,899 adding 'flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp' 2026-04-03T10:33:45,901 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp' 2026-04-03T10:33:45,911 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp' 2026-04-03T10:33:45,915 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp' 2026-04-03T10:33:45,917 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp' 2026-04-03T10:33:45,919 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp' 2026-04-03T10:33:45,920 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp' 2026-04-03T10:33:45,922 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp' 2026-04-03T10:33:45,924 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp' 2026-04-03T10:33:45,926 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp' 2026-04-03T10:33:45,927 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp' 2026-04-03T10:33:45,938 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp' 2026-04-03T10:33:45,960 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp' 2026-04-03T10:33:45,973 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp' 2026-04-03T10:33:45,992 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp' 2026-04-03T10:33:45,997 adding 'flashinfer/data/cutlass/include/cute/atom/partitioner.hpp' 2026-04-03T10:33:45,999 adding 'flashinfer/data/cutlass/include/cute/container/alignment.hpp' 2026-04-03T10:33:46,001 adding 'flashinfer/data/cutlass/include/cute/container/array.hpp' 2026-04-03T10:33:46,002 adding 'flashinfer/data/cutlass/include/cute/container/array_aligned.hpp' 2026-04-03T10:33:46,004 adding 'flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp' 2026-04-03T10:33:46,006 adding 'flashinfer/data/cutlass/include/cute/container/bit_field.hpp' 2026-04-03T10:33:46,007 adding 'flashinfer/data/cutlass/include/cute/container/cuda_types.hpp' 2026-04-03T10:33:46,009 adding 'flashinfer/data/cutlass/include/cute/container/tuple.hpp' 2026-04-03T10:33:46,011 adding 'flashinfer/data/cutlass/include/cute/container/type_list.hpp' 2026-04-03T10:33:46,013 adding 'flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp' 2026-04-03T10:33:46,015 adding 'flashinfer/data/cutlass/include/cute/numeric/complex.hpp' 2026-04-03T10:33:46,016 adding 'flashinfer/data/cutlass/include/cute/numeric/int.hpp' 2026-04-03T10:33:46,017 adding 'flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp' 2026-04-03T10:33:46,019 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp' 2026-04-03T10:33:46,021 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp' 2026-04-03T10:33:46,023 adding 'flashinfer/data/cutlass/include/cute/numeric/math.hpp' 2026-04-03T10:33:46,024 adding 'flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp' 2026-04-03T10:33:46,025 adding 'flashinfer/data/cutlass/include/cute/numeric/real.hpp' 2026-04-03T10:33:46,027 adding 'flashinfer/data/cutlass/include/cute/util/debug.hpp' 2026-04-03T10:33:46,029 adding 'flashinfer/data/cutlass/include/cute/util/print.hpp' 2026-04-03T10:33:46,031 adding 'flashinfer/data/cutlass/include/cute/util/print_latex.hpp' 2026-04-03T10:33:46,033 adding 'flashinfer/data/cutlass/include/cute/util/print_svg.hpp' 2026-04-03T10:33:46,034 adding 'flashinfer/data/cutlass/include/cute/util/print_tensor.hpp' 2026-04-03T10:33:46,036 adding 'flashinfer/data/cutlass/include/cute/util/type_traits.hpp' 2026-04-03T10:33:46,039 adding 'flashinfer/data/cutlass/include/cutlass/aligned_buffer.h' 2026-04-03T10:33:46,043 adding 'flashinfer/data/cutlass/include/cutlass/array.h' 2026-04-03T10:33:46,045 adding 'flashinfer/data/cutlass/include/cutlass/array_planar_complex.h' 2026-04-03T10:33:46,047 adding 'flashinfer/data/cutlass/include/cutlass/array_subbyte.h' 2026-04-03T10:33:46,048 adding 'flashinfer/data/cutlass/include/cutlass/barrier.h' 2026-04-03T10:33:46,051 adding 'flashinfer/data/cutlass/include/cutlass/bfloat16.h' 2026-04-03T10:33:46,052 adding 'flashinfer/data/cutlass/include/cutlass/blas3.h' 2026-04-03T10:33:46,053 adding 'flashinfer/data/cutlass/include/cutlass/blas3_types.h' 2026-04-03T10:33:46,055 adding 'flashinfer/data/cutlass/include/cutlass/block_striped.h' 2026-04-03T10:33:46,057 adding 'flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp' 2026-04-03T10:33:46,061 adding 'flashinfer/data/cutlass/include/cutlass/complex.h' 2026-04-03T10:33:46,064 adding 'flashinfer/data/cutlass/include/cutlass/constants.h' 2026-04-03T10:33:46,066 adding 'flashinfer/data/cutlass/include/cutlass/coord.h' 2026-04-03T10:33:46,067 adding 'flashinfer/data/cutlass/include/cutlass/core_io.h' 2026-04-03T10:33:46,070 adding 'flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp' 2026-04-03T10:33:46,071 adding 'flashinfer/data/cutlass/include/cutlass/cutlass.h' 2026-04-03T10:33:46,073 adding 'flashinfer/data/cutlass/include/cutlass/device_kernel.h' 2026-04-03T10:33:46,077 adding 'flashinfer/data/cutlass/include/cutlass/exmy_base.h' 2026-04-03T10:33:46,080 adding 'flashinfer/data/cutlass/include/cutlass/fast_math.h' 2026-04-03T10:33:46,084 adding 'flashinfer/data/cutlass/include/cutlass/float8.h' 2026-04-03T10:33:46,087 adding 'flashinfer/data/cutlass/include/cutlass/float_subbyte.h' 2026-04-03T10:33:46,089 adding 'flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h' 2026-04-03T10:33:46,092 adding 'flashinfer/data/cutlass/include/cutlass/functional.h' 2026-04-03T10:33:46,094 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.h' 2026-04-03T10:33:46,095 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp' 2026-04-03T10:33:46,098 adding 'flashinfer/data/cutlass/include/cutlass/half.h' 2026-04-03T10:33:46,100 adding 'flashinfer/data/cutlass/include/cutlass/integer_subbyte.h' 2026-04-03T10:33:46,101 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h' 2026-04-03T10:33:46,102 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp' 2026-04-03T10:33:46,104 adding 'flashinfer/data/cutlass/include/cutlass/kernel_launch.h' 2026-04-03T10:33:46,125 adding 'flashinfer/data/cutlass/include/cutlass/matrix.h' 2026-04-03T10:33:46,129 adding 'flashinfer/data/cutlass/include/cutlass/matrix_coord.h' 2026-04-03T10:33:46,130 adding 'flashinfer/data/cutlass/include/cutlass/matrix_shape.h' 2026-04-03T10:33:46,146 adding 'flashinfer/data/cutlass/include/cutlass/numeric_conversion.h' 2026-04-03T10:33:46,149 adding 'flashinfer/data/cutlass/include/cutlass/numeric_size.h' 2026-04-03T10:33:46,151 adding 'flashinfer/data/cutlass/include/cutlass/numeric_types.h' 2026-04-03T10:33:46,152 adding 'flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h' 2026-04-03T10:33:46,154 adding 'flashinfer/data/cutlass/include/cutlass/predicate_vector.h' 2026-04-03T10:33:46,157 adding 'flashinfer/data/cutlass/include/cutlass/quaternion.h' 2026-04-03T10:33:46,159 adding 'flashinfer/data/cutlass/include/cutlass/real.h' 2026-04-03T10:33:46,160 adding 'flashinfer/data/cutlass/include/cutlass/relatively_equal.h' 2026-04-03T10:33:46,162 adding 'flashinfer/data/cutlass/include/cutlass/semaphore.h' 2026-04-03T10:33:46,165 adding 'flashinfer/data/cutlass/include/cutlass/subbyte_reference.h' 2026-04-03T10:33:46,167 adding 'flashinfer/data/cutlass/include/cutlass/tensor_coord.h' 2026-04-03T10:33:46,169 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref.h' 2026-04-03T10:33:46,171 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h' 2026-04-03T10:33:46,172 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view.h' 2026-04-03T10:33:46,174 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h' 2026-04-03T10:33:46,176 adding 'flashinfer/data/cutlass/include/cutlass/tfloat32.h' 2026-04-03T10:33:46,178 adding 'flashinfer/data/cutlass/include/cutlass/trace.h' 2026-04-03T10:33:46,179 adding 'flashinfer/data/cutlass/include/cutlass/uint128.h' 2026-04-03T10:33:46,181 adding 'flashinfer/data/cutlass/include/cutlass/uint256.h' 2026-04-03T10:33:46,182 adding 'flashinfer/data/cutlass/include/cutlass/version.h' 2026-04-03T10:33:46,184 adding 'flashinfer/data/cutlass/include/cutlass/wmma_array.h' 2026-04-03T10:33:46,186 adding 'flashinfer/data/cutlass/include/cutlass/workspace.h' 2026-04-03T10:33:46,188 adding 'flashinfer/data/cutlass/include/cutlass/arch/arch.h' 2026-04-03T10:33:46,191 adding 'flashinfer/data/cutlass/include/cutlass/arch/barrier.h' 2026-04-03T10:33:46,192 adding 'flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h' 2026-04-03T10:33:46,194 adding 'flashinfer/data/cutlass/include/cutlass/arch/config.h' 2026-04-03T10:33:46,196 adding 'flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h' 2026-04-03T10:33:46,198 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory.h' 2026-04-03T10:33:46,199 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h' 2026-04-03T10:33:46,201 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h' 2026-04-03T10:33:46,203 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma.h' 2026-04-03T10:33:46,205 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h' 2026-04-03T10:33:46,207 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h' 2026-04-03T10:33:46,208 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h' 2026-04-03T10:33:46,210 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h' 2026-04-03T10:33:46,211 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h' 2026-04-03T10:33:46,213 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h' 2026-04-03T10:33:46,216 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h' 2026-04-03T10:33:46,218 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h' 2026-04-03T10:33:46,220 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h' 2026-04-03T10:33:46,222 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h' 2026-04-03T10:33:46,224 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h' 2026-04-03T10:33:46,225 adding 'flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h' 2026-04-03T10:33:46,227 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd.h' 2026-04-03T10:33:46,228 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h' 2026-04-03T10:33:46,229 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h' 2026-04-03T10:33:46,233 adding 'flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp' 2026-04-03T10:33:46,234 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma.h' 2026-04-03T10:33:46,236 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h' 2026-04-03T10:33:46,237 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h' 2026-04-03T10:33:46,239 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h' 2026-04-03T10:33:46,242 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h' 2026-04-03T10:33:46,244 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h' 2026-04-03T10:33:46,247 adding 'flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp' 2026-04-03T10:33:46,249 adding 'flashinfer/data/cutlass/include/cutlass/conv/convolution.h' 2026-04-03T10:33:46,250 adding 'flashinfer/data/cutlass/include/cutlass/conv/detail.hpp' 2026-04-03T10:33:46,252 adding 'flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp' 2026-04-03T10:33:46,254 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp' 2026-04-03T10:33:46,255 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp' 2026-04-03T10:33:46,257 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp' 2026-04-03T10:33:46,261 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp' 2026-04-03T10:33:46,265 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp' 2026-04-03T10:33:46,268 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl' 2026-04-03T10:33:46,270 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl' 2026-04-03T10:33:46,271 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl' 2026-04-03T10:33:46,273 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl' 2026-04-03T10:33:46,276 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp' 2026-04-03T10:33:46,278 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h' 2026-04-03T10:33:46,280 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h' 2026-04-03T10:33:46,282 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h' 2026-04-03T10:33:46,284 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp' 2026-04-03T10:33:46,286 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h' 2026-04-03T10:33:46,288 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h' 2026-04-03T10:33:46,291 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h' 2026-04-03T10:33:46,293 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h' 2026-04-03T10:33:46,295 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h' 2026-04-03T10:33:46,296 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h' 2026-04-03T10:33:46,298 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h' 2026-04-03T10:33:46,299 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h' 2026-04-03T10:33:46,301 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h' 2026-04-03T10:33:46,303 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h' 2026-04-03T10:33:46,305 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h' 2026-04-03T10:33:46,307 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h' 2026-04-03T10:33:46,309 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h' 2026-04-03T10:33:46,310 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h' 2026-04-03T10:33:46,312 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h' 2026-04-03T10:33:46,314 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h' 2026-04-03T10:33:46,316 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h' 2026-04-03T10:33:46,318 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h' 2026-04-03T10:33:46,319 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h' 2026-04-03T10:33:46,321 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h' 2026-04-03T10:33:46,324 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h' 2026-04-03T10:33:46,326 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h' 2026-04-03T10:33:46,328 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h' 2026-04-03T10:33:46,331 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h' 2026-04-03T10:33:46,333 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h' 2026-04-03T10:33:46,335 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h' 2026-04-03T10:33:46,340 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp' 2026-04-03T10:33:46,341 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp' 2026-04-03T10:33:46,343 adding 'flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h' 2026-04-03T10:33:46,347 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,349 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,352 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,354 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,356 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,358 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h' 2026-04-03T10:33:46,360 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h' 2026-04-03T10:33:46,363 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,364 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,366 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h' 2026-04-03T10:33:46,368 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h' 2026-04-03T10:33:46,370 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,372 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h' 2026-04-03T10:33:46,374 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h' 2026-04-03T10:33:46,376 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,378 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,380 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,382 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,384 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,386 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,387 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,390 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,391 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,394 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,395 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,397 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,399 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h' 2026-04-03T10:33:46,401 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,403 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,405 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-03T10:33:46,407 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-03T10:33:46,408 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h' 2026-04-03T10:33:46,410 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h' 2026-04-03T10:33:46,412 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h' 2026-04-03T10:33:46,415 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h' 2026-04-03T10:33:46,417 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h' 2026-04-03T10:33:46,419 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h' 2026-04-03T10:33:46,420 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h' 2026-04-03T10:33:46,423 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h' 2026-04-03T10:33:46,426 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h' 2026-04-03T10:33:46,429 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h' 2026-04-03T10:33:46,431 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h' 2026-04-03T10:33:46,434 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h' 2026-04-03T10:33:46,436 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-03T10:33:46,438 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-03T10:33:46,440 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h' 2026-04-03T10:33:46,442 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h' 2026-04-03T10:33:46,445 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h' 2026-04-03T10:33:46,446 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h' 2026-04-03T10:33:46,449 adding 'flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp' 2026-04-03T10:33:46,450 adding 'flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp' 2026-04-03T10:33:46,452 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective.hpp' 2026-04-03T10:33:46,453 adding 'flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp' 2026-04-03T10:33:46,455 adding 'flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp' 2026-04-03T10:33:46,457 adding 'flashinfer/data/cutlass/include/cutlass/detail/layout.hpp' 2026-04-03T10:33:46,458 adding 'flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp' 2026-04-03T10:33:46,460 adding 'flashinfer/data/cutlass/include/cutlass/detail/mma.hpp' 2026-04-03T10:33:46,462 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp' 2026-04-03T10:33:46,463 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp' 2026-04-03T10:33:46,465 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp' 2026-04-03T10:33:46,466 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp' 2026-04-03T10:33:46,472 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp' 2026-04-03T10:33:46,473 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp' 2026-04-03T10:33:46,475 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp' 2026-04-03T10:33:46,477 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp' 2026-04-03T10:33:46,479 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp' 2026-04-03T10:33:46,481 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp' 2026-04-03T10:33:46,482 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp' 2026-04-03T10:33:46,484 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp' 2026-04-03T10:33:46,487 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp' 2026-04-03T10:33:46,489 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp' 2026-04-03T10:33:46,493 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp' 2026-04-03T10:33:46,495 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp' 2026-04-03T10:33:46,501 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp' 2026-04-03T10:33:46,509 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp' 2026-04-03T10:33:46,513 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp' 2026-04-03T10:33:46,517 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp' 2026-04-03T10:33:46,523 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp' 2026-04-03T10:33:46,526 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp' 2026-04-03T10:33:46,528 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp' 2026-04-03T10:33:46,534 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp' 2026-04-03T10:33:46,539 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp' 2026-04-03T10:33:46,541 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp' 2026-04-03T10:33:46,548 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl' 2026-04-03T10:33:46,550 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl' 2026-04-03T10:33:46,553 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl' 2026-04-03T10:33:46,554 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl' 2026-04-03T10:33:46,557 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl' 2026-04-03T10:33:46,559 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl' 2026-04-03T10:33:46,561 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp' 2026-04-03T10:33:46,563 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp' 2026-04-03T10:33:46,566 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp' 2026-04-03T10:33:46,569 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp' 2026-04-03T10:33:46,572 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp' 2026-04-03T10:33:46,576 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp' 2026-04-03T10:33:46,580 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp' 2026-04-03T10:33:46,586 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp' 2026-04-03T10:33:46,589 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp' 2026-04-03T10:33:46,594 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp' 2026-04-03T10:33:46,600 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp' 2026-04-03T10:33:46,604 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp' 2026-04-03T10:33:46,608 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp' 2026-04-03T10:33:46,612 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h' 2026-04-03T10:33:46,613 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h' 2026-04-03T10:33:46,615 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp' 2026-04-03T10:33:46,617 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h' 2026-04-03T10:33:46,619 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h' 2026-04-03T10:33:46,621 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h' 2026-04-03T10:33:46,623 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h' 2026-04-03T10:33:46,625 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h' 2026-04-03T10:33:46,627 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h' 2026-04-03T10:33:46,629 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h' 2026-04-03T10:33:46,631 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h' 2026-04-03T10:33:46,632 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h' 2026-04-03T10:33:46,634 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h' 2026-04-03T10:33:46,636 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h' 2026-04-03T10:33:46,637 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h' 2026-04-03T10:33:46,639 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h' 2026-04-03T10:33:46,641 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h' 2026-04-03T10:33:46,643 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h' 2026-04-03T10:33:46,645 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h' 2026-04-03T10:33:46,646 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h' 2026-04-03T10:33:46,648 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h' 2026-04-03T10:33:46,649 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp' 2026-04-03T10:33:46,651 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h' 2026-04-03T10:33:46,653 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h' 2026-04-03T10:33:46,654 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h' 2026-04-03T10:33:46,657 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h' 2026-04-03T10:33:46,659 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h' 2026-04-03T10:33:46,661 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h' 2026-04-03T10:33:46,662 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h' 2026-04-03T10:33:46,665 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h' 2026-04-03T10:33:46,667 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h' 2026-04-03T10:33:46,669 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h' 2026-04-03T10:33:46,671 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h' 2026-04-03T10:33:46,672 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h' 2026-04-03T10:33:46,674 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h' 2026-04-03T10:33:46,675 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h' 2026-04-03T10:33:46,677 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h' 2026-04-03T10:33:46,678 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h' 2026-04-03T10:33:46,680 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h' 2026-04-03T10:33:46,682 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h' 2026-04-03T10:33:46,683 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h' 2026-04-03T10:33:46,685 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h' 2026-04-03T10:33:46,687 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h' 2026-04-03T10:33:46,689 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h' 2026-04-03T10:33:46,691 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h' 2026-04-03T10:33:46,693 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h' 2026-04-03T10:33:46,695 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h' 2026-04-03T10:33:46,697 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h' 2026-04-03T10:33:46,699 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h' 2026-04-03T10:33:46,701 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h' 2026-04-03T10:33:46,703 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h' 2026-04-03T10:33:46,705 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h' 2026-04-03T10:33:46,709 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h' 2026-04-03T10:33:46,714 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h' 2026-04-03T10:33:46,717 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h' 2026-04-03T10:33:46,719 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h' 2026-04-03T10:33:46,721 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h' 2026-04-03T10:33:46,724 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h' 2026-04-03T10:33:46,726 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h' 2026-04-03T10:33:46,728 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h' 2026-04-03T10:33:46,730 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h' 2026-04-03T10:33:46,732 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h' 2026-04-03T10:33:46,736 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h' 2026-04-03T10:33:46,739 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h' 2026-04-03T10:33:46,741 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h' 2026-04-03T10:33:46,743 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h' 2026-04-03T10:33:46,746 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h' 2026-04-03T10:33:46,748 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h' 2026-04-03T10:33:46,750 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h' 2026-04-03T10:33:46,752 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h' 2026-04-03T10:33:46,754 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h' 2026-04-03T10:33:46,756 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h' 2026-04-03T10:33:46,758 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h' 2026-04-03T10:33:46,760 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h' 2026-04-03T10:33:46,763 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp' 2026-04-03T10:33:46,764 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp' 2026-04-03T10:33:46,766 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp' 2026-04-03T10:33:46,769 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp' 2026-04-03T10:33:46,771 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp' 2026-04-03T10:33:46,773 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h' 2026-04-03T10:33:46,775 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h' 2026-04-03T10:33:46,777 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h' 2026-04-03T10:33:46,779 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h' 2026-04-03T10:33:46,780 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h' 2026-04-03T10:33:46,782 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h' 2026-04-03T10:33:46,784 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h' 2026-04-03T10:33:46,785 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h' 2026-04-03T10:33:46,788 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h' 2026-04-03T10:33:46,790 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h' 2026-04-03T10:33:46,793 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h' 2026-04-03T10:33:46,795 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h' 2026-04-03T10:33:46,796 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h' 2026-04-03T10:33:46,798 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h' 2026-04-03T10:33:46,800 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h' 2026-04-03T10:33:46,803 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp' 2026-04-03T10:33:46,806 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp' 2026-04-03T10:33:46,807 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp' 2026-04-03T10:33:46,810 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp' 2026-04-03T10:33:46,811 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp' 2026-04-03T10:33:46,813 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp' 2026-04-03T10:33:46,816 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp' 2026-04-03T10:33:46,818 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp' 2026-04-03T10:33:46,823 adding 'flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp' 2026-04-03T10:33:46,825 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm.h' 2026-04-03T10:33:46,826 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h' 2026-04-03T10:33:46,828 adding 'flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp' 2026-04-03T10:33:46,831 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp' 2026-04-03T10:33:46,832 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp' 2026-04-03T10:33:46,833 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp' 2026-04-03T10:33:46,835 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp' 2026-04-03T10:33:46,837 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp' 2026-04-03T10:33:46,843 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp' 2026-04-03T10:33:46,850 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp' 2026-04-03T10:33:46,855 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-03T10:33:46,861 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp' 2026-04-03T10:33:46,868 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp' 2026-04-03T10:33:46,873 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp' 2026-04-03T10:33:46,879 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp' 2026-04-03T10:33:46,885 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp' 2026-04-03T10:33:46,892 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-03T10:33:46,898 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-03T10:33:46,903 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp' 2026-04-03T10:33:46,907 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp' 2026-04-03T10:33:46,911 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp' 2026-04-03T10:33:46,914 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-03T10:33:46,918 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp' 2026-04-03T10:33:46,924 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp' 2026-04-03T10:33:46,930 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp' 2026-04-03T10:33:46,936 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-03T10:33:46,940 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-03T10:33:46,947 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp' 2026-04-03T10:33:46,951 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp' 2026-04-03T10:33:46,956 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp' 2026-04-03T10:33:46,964 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp' 2026-04-03T10:33:46,971 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp' 2026-04-03T10:33:46,977 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp' 2026-04-03T10:33:46,981 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp' 2026-04-03T10:33:46,988 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp' 2026-04-03T10:33:46,993 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp' 2026-04-03T10:33:46,996 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp' 2026-04-03T10:33:47,000 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp' 2026-04-03T10:33:47,005 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp' 2026-04-03T10:33:47,008 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp' 2026-04-03T10:33:47,010 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp' 2026-04-03T10:33:47,013 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp' 2026-04-03T10:33:47,020 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-03T10:33:47,024 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp' 2026-04-03T10:33:47,028 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-03T10:33:47,034 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-03T10:33:47,037 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp' 2026-04-03T10:33:47,040 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp' 2026-04-03T10:33:47,044 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp' 2026-04-03T10:33:47,049 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-03T10:33:47,052 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp' 2026-04-03T10:33:47,055 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-03T10:33:47,058 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-03T10:33:47,063 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-03T10:33:47,067 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-03T10:33:47,071 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-03T10:33:47,074 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl' 2026-04-03T10:33:47,076 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl' 2026-04-03T10:33:47,079 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl' 2026-04-03T10:33:47,081 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl' 2026-04-03T10:33:47,083 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl' 2026-04-03T10:33:47,085 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl' 2026-04-03T10:33:47,089 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl' 2026-04-03T10:33:47,091 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl' 2026-04-03T10:33:47,092 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl' 2026-04-03T10:33:47,095 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl' 2026-04-03T10:33:47,097 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl' 2026-04-03T10:33:47,098 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl' 2026-04-03T10:33:47,100 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl' 2026-04-03T10:33:47,102 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl' 2026-04-03T10:33:47,104 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl' 2026-04-03T10:33:47,106 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl' 2026-04-03T10:33:47,109 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl' 2026-04-03T10:33:47,111 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl' 2026-04-03T10:33:47,114 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl' 2026-04-03T10:33:47,116 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl' 2026-04-03T10:33:47,117 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl' 2026-04-03T10:33:47,119 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl' 2026-04-03T10:33:47,121 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl' 2026-04-03T10:33:47,125 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl' 2026-04-03T10:33:47,127 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl' 2026-04-03T10:33:47,129 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl' 2026-04-03T10:33:47,133 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl' 2026-04-03T10:33:47,135 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl' 2026-04-03T10:33:47,137 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl' 2026-04-03T10:33:47,141 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h' 2026-04-03T10:33:47,143 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h' 2026-04-03T10:33:47,146 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h' 2026-04-03T10:33:47,148 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h' 2026-04-03T10:33:47,151 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h' 2026-04-03T10:33:47,154 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h' 2026-04-03T10:33:47,157 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h' 2026-04-03T10:33:47,160 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h' 2026-04-03T10:33:47,161 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h' 2026-04-03T10:33:47,163 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h' 2026-04-03T10:33:47,165 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h' 2026-04-03T10:33:47,167 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h' 2026-04-03T10:33:47,168 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h' 2026-04-03T10:33:47,170 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h' 2026-04-03T10:33:47,172 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h' 2026-04-03T10:33:47,174 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h' 2026-04-03T10:33:47,176 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h' 2026-04-03T10:33:47,179 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h' 2026-04-03T10:33:47,182 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h' 2026-04-03T10:33:47,184 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h' 2026-04-03T10:33:47,185 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h' 2026-04-03T10:33:47,187 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h' 2026-04-03T10:33:47,189 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h' 2026-04-03T10:33:47,191 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h' 2026-04-03T10:33:47,193 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h' 2026-04-03T10:33:47,195 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h' 2026-04-03T10:33:47,196 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h' 2026-04-03T10:33:47,198 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h' 2026-04-03T10:33:47,201 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h' 2026-04-03T10:33:47,204 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h' 2026-04-03T10:33:47,209 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h' 2026-04-03T10:33:47,212 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h' 2026-04-03T10:33:47,214 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h' 2026-04-03T10:33:47,215 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h' 2026-04-03T10:33:47,217 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h' 2026-04-03T10:33:47,219 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h' 2026-04-03T10:33:47,220 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h' 2026-04-03T10:33:47,222 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h' 2026-04-03T10:33:47,224 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h' 2026-04-03T10:33:47,225 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h' 2026-04-03T10:33:47,227 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h' 2026-04-03T10:33:47,228 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h' 2026-04-03T10:33:47,230 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h' 2026-04-03T10:33:47,232 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h' 2026-04-03T10:33:47,233 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h' 2026-04-03T10:33:47,235 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h' 2026-04-03T10:33:47,236 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h' 2026-04-03T10:33:47,238 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h' 2026-04-03T10:33:47,240 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h' 2026-04-03T10:33:47,241 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h' 2026-04-03T10:33:47,243 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h' 2026-04-03T10:33:47,244 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h' 2026-04-03T10:33:47,246 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h' 2026-04-03T10:33:47,248 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h' 2026-04-03T10:33:47,250 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h' 2026-04-03T10:33:47,251 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h' 2026-04-03T10:33:47,253 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h' 2026-04-03T10:33:47,255 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h' 2026-04-03T10:33:47,256 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h' 2026-04-03T10:33:47,258 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h' 2026-04-03T10:33:47,260 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h' 2026-04-03T10:33:47,262 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h' 2026-04-03T10:33:47,264 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h' 2026-04-03T10:33:47,265 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h' 2026-04-03T10:33:47,267 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h' 2026-04-03T10:33:47,270 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h' 2026-04-03T10:33:47,272 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h' 2026-04-03T10:33:47,273 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h' 2026-04-03T10:33:47,275 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h' 2026-04-03T10:33:47,277 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h' 2026-04-03T10:33:47,279 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h' 2026-04-03T10:33:47,281 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h' 2026-04-03T10:33:47,282 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h' 2026-04-03T10:33:47,285 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h' 2026-04-03T10:33:47,288 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h' 2026-04-03T10:33:47,289 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h' 2026-04-03T10:33:47,291 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h' 2026-04-03T10:33:47,294 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h' 2026-04-03T10:33:47,296 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h' 2026-04-03T10:33:47,299 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h' 2026-04-03T10:33:47,302 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h' 2026-04-03T10:33:47,304 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h' 2026-04-03T10:33:47,313 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h' 2026-04-03T10:33:47,314 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h' 2026-04-03T10:33:47,317 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h' 2026-04-03T10:33:47,319 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp' 2026-04-03T10:33:47,321 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h' 2026-04-03T10:33:47,322 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h' 2026-04-03T10:33:47,326 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h' 2026-04-03T10:33:47,329 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h' 2026-04-03T10:33:47,332 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h' 2026-04-03T10:33:47,335 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h' 2026-04-03T10:33:47,338 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h' 2026-04-03T10:33:47,341 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h' 2026-04-03T10:33:47,343 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h' 2026-04-03T10:33:47,345 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h' 2026-04-03T10:33:47,349 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h' 2026-04-03T10:33:47,351 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h' 2026-04-03T10:33:47,353 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h' 2026-04-03T10:33:47,354 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h' 2026-04-03T10:33:47,357 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h' 2026-04-03T10:33:47,360 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h' 2026-04-03T10:33:47,361 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h' 2026-04-03T10:33:47,364 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h' 2026-04-03T10:33:47,367 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h' 2026-04-03T10:33:47,374 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp' 2026-04-03T10:33:47,380 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp' 2026-04-03T10:33:47,386 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp' 2026-04-03T10:33:47,390 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp' 2026-04-03T10:33:47,395 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-03T10:33:47,400 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp' 2026-04-03T10:33:47,405 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp' 2026-04-03T10:33:47,410 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp' 2026-04-03T10:33:47,415 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp' 2026-04-03T10:33:47,420 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp' 2026-04-03T10:33:47,422 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp' 2026-04-03T10:33:47,426 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp' 2026-04-03T10:33:47,428 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp' 2026-04-03T10:33:47,432 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp' 2026-04-03T10:33:47,438 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp' 2026-04-03T10:33:47,444 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp' 2026-04-03T10:33:47,448 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp' 2026-04-03T10:33:47,450 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp' 2026-04-03T10:33:47,452 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp' 2026-04-03T10:33:47,457 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp' 2026-04-03T10:33:47,462 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp' 2026-04-03T10:33:47,465 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp' 2026-04-03T10:33:47,467 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp' 2026-04-03T10:33:47,472 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp' 2026-04-03T10:33:47,476 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp' 2026-04-03T10:33:47,479 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp' 2026-04-03T10:33:47,482 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp' 2026-04-03T10:33:47,485 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp' 2026-04-03T10:33:47,487 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp' 2026-04-03T10:33:47,490 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp' 2026-04-03T10:33:47,495 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp' 2026-04-03T10:33:47,497 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h' 2026-04-03T10:33:47,500 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h' 2026-04-03T10:33:47,501 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h' 2026-04-03T10:33:47,504 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp' 2026-04-03T10:33:47,507 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h' 2026-04-03T10:33:47,509 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp' 2026-04-03T10:33:47,510 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp' 2026-04-03T10:33:47,519 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h' 2026-04-03T10:33:47,522 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h' 2026-04-03T10:33:47,524 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h' 2026-04-03T10:33:47,526 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h' 2026-04-03T10:33:47,528 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h' 2026-04-03T10:33:47,530 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h' 2026-04-03T10:33:47,533 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h' 2026-04-03T10:33:47,535 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h' 2026-04-03T10:33:47,538 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h' 2026-04-03T10:33:47,539 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h' 2026-04-03T10:33:47,542 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h' 2026-04-03T10:33:47,544 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h' 2026-04-03T10:33:47,547 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h' 2026-04-03T10:33:47,552 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h' 2026-04-03T10:33:47,555 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h' 2026-04-03T10:33:47,557 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h' 2026-04-03T10:33:47,559 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h' 2026-04-03T10:33:47,561 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h' 2026-04-03T10:33:47,563 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h' 2026-04-03T10:33:47,565 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h' 2026-04-03T10:33:47,566 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h' 2026-04-03T10:33:47,568 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h' 2026-04-03T10:33:47,569 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h' 2026-04-03T10:33:47,571 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h' 2026-04-03T10:33:47,573 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h' 2026-04-03T10:33:47,574 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h' 2026-04-03T10:33:47,577 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h' 2026-04-03T10:33:47,580 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h' 2026-04-03T10:33:47,582 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h' 2026-04-03T10:33:47,584 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h' 2026-04-03T10:33:47,587 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h' 2026-04-03T10:33:47,589 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h' 2026-04-03T10:33:47,591 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h' 2026-04-03T10:33:47,592 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h' 2026-04-03T10:33:47,594 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h' 2026-04-03T10:33:47,597 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h' 2026-04-03T10:33:47,600 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h' 2026-04-03T10:33:47,604 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h' 2026-04-03T10:33:47,606 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h' 2026-04-03T10:33:47,609 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h' 2026-04-03T10:33:47,611 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h' 2026-04-03T10:33:47,613 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h' 2026-04-03T10:33:47,616 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h' 2026-04-03T10:33:47,618 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h' 2026-04-03T10:33:47,621 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h' 2026-04-03T10:33:47,623 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h' 2026-04-03T10:33:47,625 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h' 2026-04-03T10:33:47,628 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h' 2026-04-03T10:33:47,630 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h' 2026-04-03T10:33:47,633 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h' 2026-04-03T10:33:47,637 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h' 2026-04-03T10:33:47,638 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h' 2026-04-03T10:33:47,640 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h' 2026-04-03T10:33:47,642 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h' 2026-04-03T10:33:47,643 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h' 2026-04-03T10:33:47,645 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h' 2026-04-03T10:33:47,646 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h' 2026-04-03T10:33:47,648 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h' 2026-04-03T10:33:47,651 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h' 2026-04-03T10:33:47,654 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h' 2026-04-03T10:33:47,659 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h' 2026-04-03T10:33:47,662 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h' 2026-04-03T10:33:47,664 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h' 2026-04-03T10:33:47,667 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h' 2026-04-03T10:33:47,668 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h' 2026-04-03T10:33:47,670 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h' 2026-04-03T10:33:47,671 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h' 2026-04-03T10:33:47,675 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h' 2026-04-03T10:33:47,677 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h' 2026-04-03T10:33:47,679 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h' 2026-04-03T10:33:47,681 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h' 2026-04-03T10:33:47,684 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h' 2026-04-03T10:33:47,685 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h' 2026-04-03T10:33:47,687 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h' 2026-04-03T10:33:47,689 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h' 2026-04-03T10:33:47,697 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h' 2026-04-03T10:33:47,704 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h' 2026-04-03T10:33:47,709 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h' 2026-04-03T10:33:47,712 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h' 2026-04-03T10:33:47,714 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h' 2026-04-03T10:33:47,716 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h' 2026-04-03T10:33:47,718 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h' 2026-04-03T10:33:47,720 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h' 2026-04-03T10:33:47,721 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h' 2026-04-03T10:33:47,723 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h' 2026-04-03T10:33:47,725 adding 'flashinfer/data/cutlass/include/cutlass/layout/layout.h' 2026-04-03T10:33:47,727 adding 'flashinfer/data/cutlass/include/cutlass/layout/matrix.h' 2026-04-03T10:33:47,730 adding 'flashinfer/data/cutlass/include/cutlass/layout/permute.h' 2026-04-03T10:33:47,732 adding 'flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h' 2026-04-03T10:33:47,733 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor.h' 2026-04-03T10:33:47,736 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h' 2026-04-03T10:33:47,738 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h' 2026-04-03T10:33:47,741 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h' 2026-04-03T10:33:47,742 adding 'flashinfer/data/cutlass/include/cutlass/layout/vector.h' 2026-04-03T10:33:47,744 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp' 2026-04-03T10:33:47,748 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp' 2026-04-03T10:33:47,753 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp' 2026-04-03T10:33:47,756 adding 'flashinfer/data/cutlass/include/cutlass/platform/platform.h' 2026-04-03T10:33:47,758 adding 'flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h' 2026-04-03T10:33:47,761 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h' 2026-04-03T10:33:47,762 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h' 2026-04-03T10:33:47,764 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h' 2026-04-03T10:33:47,766 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h' 2026-04-03T10:33:47,769 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h' 2026-04-03T10:33:47,770 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h' 2026-04-03T10:33:47,773 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h' 2026-04-03T10:33:47,776 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h' 2026-04-03T10:33:47,778 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h' 2026-04-03T10:33:47,779 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h' 2026-04-03T10:33:47,781 adding 'flashinfer/data/cutlass/include/cutlass/thread/matrix.h' 2026-04-03T10:33:47,785 adding 'flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h' 2026-04-03T10:33:47,789 adding 'flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp' 2026-04-03T10:33:47,791 adding 'flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp' 2026-04-03T10:33:47,793 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp' 2026-04-03T10:33:47,797 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp' 2026-04-03T10:33:47,799 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp' 2026-04-03T10:33:47,801 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h' 2026-04-03T10:33:47,802 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h' 2026-04-03T10:33:47,805 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h' 2026-04-03T10:33:47,808 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h' 2026-04-03T10:33:47,811 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h' 2026-04-03T10:33:47,813 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-03T10:33:47,815 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-03T10:33:47,819 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h' 2026-04-03T10:33:47,822 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h' 2026-04-03T10:33:47,824 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h' 2026-04-03T10:33:47,827 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h' 2026-04-03T10:33:47,830 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h' 2026-04-03T10:33:47,833 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h' 2026-04-03T10:33:47,836 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h' 2026-04-03T10:33:47,838 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h' 2026-04-03T10:33:47,840 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h' 2026-04-03T10:33:47,841 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h' 2026-04-03T10:33:47,843 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h' 2026-04-03T10:33:47,845 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h' 2026-04-03T10:33:47,847 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h' 2026-04-03T10:33:47,850 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h' 2026-04-03T10:33:47,852 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h' 2026-04-03T10:33:47,854 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h' 2026-04-03T10:33:47,855 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h' 2026-04-03T10:33:47,858 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h' 2026-04-03T10:33:47,861 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h' 2026-04-03T10:33:47,863 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h' 2026-04-03T10:33:47,865 adding 'flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h' 2026-04-03T10:33:47,867 adding 'flashinfer/data/cutlass/python/setup_cutlass.py' 2026-04-03T10:33:47,868 adding 'flashinfer/data/cutlass/python/setup_library.py' 2026-04-03T10:33:47,870 adding 'flashinfer/data/cutlass/python/setup_pycute.py' 2026-04-03T10:33:47,872 adding 'flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py' 2026-04-03T10:33:47,874 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py' 2026-04-03T10:33:47,875 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py' 2026-04-03T10:33:47,877 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py' 2026-04-03T10:33:47,879 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py' 2026-04-03T10:33:47,880 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py' 2026-04-03T10:33:47,883 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py' 2026-04-03T10:33:47,894 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py' 2026-04-03T10:33:47,897 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py' 2026-04-03T10:33:47,899 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py' 2026-04-03T10:33:47,902 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py' 2026-04-03T10:33:47,910 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py' 2026-04-03T10:33:47,913 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py' 2026-04-03T10:33:47,918 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py' 2026-04-03T10:33:47,926 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py' 2026-04-03T10:33:47,928 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py' 2026-04-03T10:33:47,929 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py' 2026-04-03T10:33:47,932 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py' 2026-04-03T10:33:47,934 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py' 2026-04-03T10:33:47,935 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py' 2026-04-03T10:33:47,936 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py' 2026-04-03T10:33:47,938 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py' 2026-04-03T10:33:47,940 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py' 2026-04-03T10:33:47,942 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py' 2026-04-03T10:33:47,944 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py' 2026-04-03T10:33:47,946 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py' 2026-04-03T10:33:47,949 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py' 2026-04-03T10:33:47,950 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py' 2026-04-03T10:33:47,952 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py' 2026-04-03T10:33:47,953 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py' 2026-04-03T10:33:47,955 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py' 2026-04-03T10:33:47,956 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py' 2026-04-03T10:33:47,958 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py' 2026-04-03T10:33:47,960 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py' 2026-04-03T10:33:47,963 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py' 2026-04-03T10:33:47,966 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py' 2026-04-03T10:33:47,973 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py' 2026-04-03T10:33:47,976 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py' 2026-04-03T10:33:47,977 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py' 2026-04-03T10:33:47,979 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py' 2026-04-03T10:33:47,980 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py' 2026-04-03T10:33:47,982 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py' 2026-04-03T10:33:47,984 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py' 2026-04-03T10:33:47,987 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py' 2026-04-03T10:33:47,989 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py' 2026-04-03T10:33:47,992 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py' 2026-04-03T10:33:47,997 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py' 2026-04-03T10:33:48,016 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py' 2026-04-03T10:33:48,021 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py' 2026-04-03T10:33:48,042 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py' 2026-04-03T10:33:48,047 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py' 2026-04-03T10:33:48,056 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py' 2026-04-03T10:33:48,062 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py' 2026-04-03T10:33:48,064 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py' 2026-04-03T10:33:48,067 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py' 2026-04-03T10:33:48,069 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py' 2026-04-03T10:33:48,071 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py' 2026-04-03T10:33:48,072 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py' 2026-04-03T10:33:48,074 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py' 2026-04-03T10:33:48,076 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py' 2026-04-03T10:33:48,083 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py' 2026-04-03T10:33:48,085 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py' 2026-04-03T10:33:48,086 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py' 2026-04-03T10:33:48,090 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py' 2026-04-03T10:33:48,092 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py' 2026-04-03T10:33:48,093 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py' 2026-04-03T10:33:48,095 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py' 2026-04-03T10:33:48,096 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py' 2026-04-03T10:33:48,099 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py' 2026-04-03T10:33:48,101 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py' 2026-04-03T10:33:48,103 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py' 2026-04-03T10:33:48,105 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py' 2026-04-03T10:33:48,107 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py' 2026-04-03T10:33:48,108 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py' 2026-04-03T10:33:48,109 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py' 2026-04-03T10:33:48,111 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py' 2026-04-03T10:33:48,113 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py' 2026-04-03T10:33:48,115 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py' 2026-04-03T10:33:48,116 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py' 2026-04-03T10:33:48,119 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py' 2026-04-03T10:33:48,121 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py' 2026-04-03T10:33:48,123 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py' 2026-04-03T10:33:48,125 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py' 2026-04-03T10:33:48,127 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py' 2026-04-03T10:33:48,130 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py' 2026-04-03T10:33:48,132 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py' 2026-04-03T10:33:48,134 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py' 2026-04-03T10:33:48,135 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py' 2026-04-03T10:33:48,137 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py' 2026-04-03T10:33:48,138 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py' 2026-04-03T10:33:48,140 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py' 2026-04-03T10:33:48,142 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py' 2026-04-03T10:33:48,144 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py' 2026-04-03T10:33:48,145 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py' 2026-04-03T10:33:48,154 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py' 2026-04-03T10:33:48,158 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py' 2026-04-03T10:33:48,161 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py' 2026-04-03T10:33:48,163 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py' 2026-04-03T10:33:48,165 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py' 2026-04-03T10:33:48,166 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py' 2026-04-03T10:33:48,168 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py' 2026-04-03T10:33:48,170 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py' 2026-04-03T10:33:48,172 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py' 2026-04-03T10:33:48,174 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py' 2026-04-03T10:33:48,177 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py' 2026-04-03T10:33:48,180 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py' 2026-04-03T10:33:48,185 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py' 2026-04-03T10:33:48,187 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py' 2026-04-03T10:33:48,191 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py' 2026-04-03T10:33:48,193 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py' 2026-04-03T10:33:48,194 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py' 2026-04-03T10:33:48,196 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py' 2026-04-03T10:33:48,200 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py' 2026-04-03T10:33:48,202 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py' 2026-04-03T10:33:48,204 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py' 2026-04-03T10:33:48,206 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py' 2026-04-03T10:33:48,208 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py' 2026-04-03T10:33:48,212 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py' 2026-04-03T10:33:48,213 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py' 2026-04-03T10:33:48,215 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py' 2026-04-03T10:33:48,218 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py' 2026-04-03T10:33:48,220 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py' 2026-04-03T10:33:48,221 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py' 2026-04-03T10:33:48,223 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py' 2026-04-03T10:33:48,225 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py' 2026-04-03T10:33:48,228 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py' 2026-04-03T10:33:48,231 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py' 2026-04-03T10:33:48,234 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py' 2026-04-03T10:33:48,235 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/shape.py' 2026-04-03T10:33:48,237 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py' 2026-04-03T10:33:48,239 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py' 2026-04-03T10:33:48,240 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py' 2026-04-03T10:33:48,243 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py' 2026-04-03T10:33:48,245 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py' 2026-04-03T10:33:48,248 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py' 2026-04-03T10:33:48,250 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py' 2026-04-03T10:33:48,252 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py' 2026-04-03T10:33:48,259 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py' 2026-04-03T10:33:48,262 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py' 2026-04-03T10:33:48,264 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py' 2026-04-03T10:33:48,265 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py' 2026-04-03T10:33:48,267 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py' 2026-04-03T10:33:48,269 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py' 2026-04-03T10:33:48,271 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py' 2026-04-03T10:33:48,272 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py' 2026-04-03T10:33:48,275 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py' 2026-04-03T10:33:48,277 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py' 2026-04-03T10:33:48,278 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py' 2026-04-03T10:33:48,280 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py' 2026-04-03T10:33:48,282 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py' 2026-04-03T10:33:48,284 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py' 2026-04-03T10:33:48,285 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py' 2026-04-03T10:33:48,287 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py' 2026-04-03T10:33:48,289 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py' 2026-04-03T10:33:48,291 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py' 2026-04-03T10:33:48,293 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py' 2026-04-03T10:33:48,295 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py' 2026-04-03T10:33:48,296 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py' 2026-04-03T10:33:48,298 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py' 2026-04-03T10:33:48,300 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py' 2026-04-03T10:33:48,302 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py' 2026-04-03T10:33:48,303 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py' 2026-04-03T10:33:48,305 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py' 2026-04-03T10:33:48,307 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py' 2026-04-03T10:33:48,309 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py' 2026-04-03T10:33:48,311 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py' 2026-04-03T10:33:48,312 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py' 2026-04-03T10:33:48,313 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py' 2026-04-03T10:33:48,315 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py' 2026-04-03T10:33:48,317 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py' 2026-04-03T10:33:48,318 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py' 2026-04-03T10:33:48,320 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py' 2026-04-03T10:33:48,321 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py' 2026-04-03T10:33:48,323 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py' 2026-04-03T10:33:48,324 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py' 2026-04-03T10:33:48,326 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py' 2026-04-03T10:33:48,327 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py' 2026-04-03T10:33:48,329 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py' 2026-04-03T10:33:48,331 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py' 2026-04-03T10:33:48,332 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py' 2026-04-03T10:33:48,334 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py' 2026-04-03T10:33:48,336 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py' 2026-04-03T10:33:48,339 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py' 2026-04-03T10:33:48,341 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py' 2026-04-03T10:33:48,343 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py' 2026-04-03T10:33:48,344 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py' 2026-04-03T10:33:48,346 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py' 2026-04-03T10:33:48,351 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py' 2026-04-03T10:33:48,355 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py' 2026-04-03T10:33:48,357 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py' 2026-04-03T10:33:48,359 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py' 2026-04-03T10:33:48,361 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py' 2026-04-03T10:33:48,363 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py' 2026-04-03T10:33:48,365 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py' 2026-04-03T10:33:48,366 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py' 2026-04-03T10:33:48,368 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py' 2026-04-03T10:33:48,370 adding 'flashinfer/data/cutlass/python/cutlass_library/__init__.py' 2026-04-03T10:33:48,372 adding 'flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py' 2026-04-03T10:33:48,375 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py' 2026-04-03T10:33:48,377 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py' 2026-04-03T10:33:48,381 adding 'flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py' 2026-04-03T10:33:48,387 adding 'flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py' 2026-04-03T10:33:48,413 adding 'flashinfer/data/cutlass/python/cutlass_library/generator.py' 2026-04-03T10:33:48,419 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics.py' 2026-04-03T10:33:48,421 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py' 2026-04-03T10:33:48,427 adding 'flashinfer/data/cutlass/python/cutlass_library/library.py' 2026-04-03T10:33:48,431 adding 'flashinfer/data/cutlass/python/cutlass_library/manifest.py' 2026-04-03T10:33:48,433 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py' 2026-04-03T10:33:48,435 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py' 2026-04-03T10:33:48,437 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py' 2026-04-03T10:33:48,439 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py' 2026-04-03T10:33:48,441 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py' 2026-04-03T10:33:48,444 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py' 2026-04-03T10:33:48,446 adding 'flashinfer/data/cutlass/python/cutlass_library/symm_operation.py' 2026-04-03T10:33:48,448 adding 'flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py' 2026-04-03T10:33:48,451 adding 'flashinfer/data/cutlass/python/docs_src/source/conf.py' 2026-04-03T10:33:48,453 adding 'flashinfer/data/cutlass/python/pycute/__init__.py' 2026-04-03T10:33:48,454 adding 'flashinfer/data/cutlass/python/pycute/int_tuple.py' 2026-04-03T10:33:48,456 adding 'flashinfer/data/cutlass/python/pycute/layout.py' 2026-04-03T10:33:48,458 adding 'flashinfer/data/cutlass/python/pycute/swizzle.py' 2026-04-03T10:33:48,459 adding 'flashinfer/data/cutlass/python/pycute/typing.py' 2026-04-03T10:33:48,462 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py' 2026-04-03T10:33:48,464 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py' 2026-04-03T10:33:48,466 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-03T10:33:48,468 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py' 2026-04-03T10:33:48,469 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py' 2026-04-03T10:33:48,470 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py' 2026-04-03T10:33:48,473 adding 'flashinfer/data/cutlass/test/python/cutlass/installation.py' 2026-04-03T10:33:48,475 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py' 2026-04-03T10:33:48,477 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py' 2026-04-03T10:33:48,479 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py' 2026-04-03T10:33:48,481 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py' 2026-04-03T10:33:48,483 adding 'flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py' 2026-04-03T10:33:48,485 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py' 2026-04-03T10:33:48,486 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py' 2026-04-03T10:33:48,488 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py' 2026-04-03T10:33:48,489 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py' 2026-04-03T10:33:48,491 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py' 2026-04-03T10:33:48,492 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py' 2026-04-03T10:33:48,494 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py' 2026-04-03T10:33:48,497 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py' 2026-04-03T10:33:48,498 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py' 2026-04-03T10:33:48,500 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py' 2026-04-03T10:33:48,501 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py' 2026-04-03T10:33:48,503 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py' 2026-04-03T10:33:48,504 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py' 2026-04-03T10:33:48,505 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py' 2026-04-03T10:33:48,507 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py' 2026-04-03T10:33:48,508 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py' 2026-04-03T10:33:48,510 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py' 2026-04-03T10:33:48,512 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py' 2026-04-03T10:33:48,513 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py' 2026-04-03T10:33:48,515 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py' 2026-04-03T10:33:48,518 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py' 2026-04-03T10:33:48,521 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py' 2026-04-03T10:33:48,523 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py' 2026-04-03T10:33:48,525 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/utils.py' 2026-04-03T10:33:48,527 adding 'flashinfer/data/cutlass/test/python/pycute/run_all_tests.py' 2026-04-03T10:33:48,528 adding 'flashinfer/data/cutlass/test/python/pycute/test_coalesce.py' 2026-04-03T10:33:48,529 adding 'flashinfer/data/cutlass/test/python/pycute/test_complement.py' 2026-04-03T10:33:48,531 adding 'flashinfer/data/cutlass/test/python/pycute/test_composition.py' 2026-04-03T10:33:48,532 adding 'flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py' 2026-04-03T10:33:48,534 adding 'flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py' 2026-04-03T10:33:48,535 adding 'flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py' 2026-04-03T10:33:48,536 adding 'flashinfer/data/cutlass/test/python/pycute/test_typing.py' 2026-04-03T10:33:48,540 adding 'flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py' 2026-04-03T10:33:48,543 adding 'flashinfer/data/cutlass/test/utils/test_sharding.py' 2026-04-03T10:33:48,547 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp' 2026-04-03T10:33:48,549 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h' 2026-04-03T10:33:48,552 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp' 2026-04-03T10:33:48,553 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h' 2026-04-03T10:33:48,555 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h' 2026-04-03T10:33:48,557 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h' 2026-04-03T10:33:48,559 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h' 2026-04-03T10:33:48,561 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h' 2026-04-03T10:33:48,563 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h' 2026-04-03T10:33:48,564 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h' 2026-04-03T10:33:48,566 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h' 2026-04-03T10:33:48,568 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h' 2026-04-03T10:33:48,570 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h' 2026-04-03T10:33:48,571 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h' 2026-04-03T10:33:48,573 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h' 2026-04-03T10:33:48,574 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h' 2026-04-03T10:33:48,576 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp' 2026-04-03T10:33:48,578 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp' 2026-04-03T10:33:48,579 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h' 2026-04-03T10:33:48,581 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h' 2026-04-03T10:33:48,583 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h' 2026-04-03T10:33:48,585 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h' 2026-04-03T10:33:48,586 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h' 2026-04-03T10:33:48,589 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp' 2026-04-03T10:33:48,591 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp' 2026-04-03T10:33:48,593 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp' 2026-04-03T10:33:48,595 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h' 2026-04-03T10:33:48,597 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h' 2026-04-03T10:33:48,599 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h' 2026-04-03T10:33:48,601 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h' 2026-04-03T10:33:48,604 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h' 2026-04-03T10:33:48,606 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h' 2026-04-03T10:33:48,608 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h' 2026-04-03T10:33:48,610 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h' 2026-04-03T10:33:48,612 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp' 2026-04-03T10:33:48,614 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h' 2026-04-03T10:33:48,615 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h' 2026-04-03T10:33:48,619 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h' 2026-04-03T10:33:48,621 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h' 2026-04-03T10:33:48,623 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h' 2026-04-03T10:33:48,624 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h' 2026-04-03T10:33:48,627 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h' 2026-04-03T10:33:48,628 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h' 2026-04-03T10:33:48,630 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h' 2026-04-03T10:33:48,631 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h' 2026-04-03T10:33:48,635 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp' 2026-04-03T10:33:48,637 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h' 2026-04-03T10:33:48,639 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h' 2026-04-03T10:33:48,641 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h' 2026-04-03T10:33:48,642 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h' 2026-04-03T10:33:48,644 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h' 2026-04-03T10:33:48,648 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp' 2026-04-03T10:33:48,650 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h' 2026-04-03T10:33:48,652 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h' 2026-04-03T10:33:48,653 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h' 2026-04-03T10:33:48,655 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h' 2026-04-03T10:33:48,657 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h' 2026-04-03T10:33:48,659 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h' 2026-04-03T10:33:48,660 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp' 2026-04-03T10:33:48,662 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h' 2026-04-03T10:33:48,663 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h' 2026-04-03T10:33:48,667 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h' 2026-04-03T10:33:48,669 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp' 2026-04-03T10:33:48,671 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h' 2026-04-03T10:33:48,672 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h' 2026-04-03T10:33:48,674 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h' 2026-04-03T10:33:48,675 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp' 2026-04-03T10:33:48,677 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h' 2026-04-03T10:33:48,678 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h' 2026-04-03T10:33:48,681 adding 'flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py' 2026-04-03T10:33:48,684 adding 'flashinfer/data/include/flashinfer/activation.cuh' 2026-04-03T10:33:48,687 adding 'flashinfer/data/include/flashinfer/air_top_p.cuh' 2026-04-03T10:33:48,688 adding 'flashinfer/data/include/flashinfer/allocator.h' 2026-04-03T10:33:48,690 adding 'flashinfer/data/include/flashinfer/arch_condition.h' 2026-04-03T10:33:48,691 adding 'flashinfer/data/include/flashinfer/attention_impl.cuh' 2026-04-03T10:33:48,693 adding 'flashinfer/data/include/flashinfer/concat_mla.cuh' 2026-04-03T10:33:48,694 adding 'flashinfer/data/include/flashinfer/cp_async.cuh' 2026-04-03T10:33:48,695 adding 'flashinfer/data/include/flashinfer/cubin_loader.h' 2026-04-03T10:33:48,697 adding 'flashinfer/data/include/flashinfer/cutlass_utils.cuh' 2026-04-03T10:33:48,698 adding 'flashinfer/data/include/flashinfer/exception.h' 2026-04-03T10:33:48,699 adding 'flashinfer/data/include/flashinfer/fastdiv.cuh' 2026-04-03T10:33:48,701 adding 'flashinfer/data/include/flashinfer/fp16.h' 2026-04-03T10:33:48,702 adding 'flashinfer/data/include/flashinfer/fp4_layout.cuh' 2026-04-03T10:33:48,703 adding 'flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh' 2026-04-03T10:33:48,705 adding 'flashinfer/data/include/flashinfer/layout.cuh' 2026-04-03T10:33:48,706 adding 'flashinfer/data/include/flashinfer/logging.h' 2026-04-03T10:33:48,707 adding 'flashinfer/data/include/flashinfer/math.cuh' 2026-04-03T10:33:48,709 adding 'flashinfer/data/include/flashinfer/mma.cuh' 2026-04-03T10:33:48,713 adding 'flashinfer/data/include/flashinfer/norm.cuh' 2026-04-03T10:33:48,715 adding 'flashinfer/data/include/flashinfer/page.cuh' 2026-04-03T10:33:48,717 adding 'flashinfer/data/include/flashinfer/permuted_smem.cuh' 2026-04-03T10:33:48,722 adding 'flashinfer/data/include/flashinfer/pos_enc.cuh' 2026-04-03T10:33:48,724 adding 'flashinfer/data/include/flashinfer/profiler.cuh' 2026-04-03T10:33:48,726 adding 'flashinfer/data/include/flashinfer/quantization.cuh' 2026-04-03T10:33:48,732 adding 'flashinfer/data/include/flashinfer/sampling.cuh' 2026-04-03T10:33:48,742 adding 'flashinfer/data/include/flashinfer/topk.cuh' 2026-04-03T10:33:48,745 adding 'flashinfer/data/include/flashinfer/utils.cuh' 2026-04-03T10:33:48,750 adding 'flashinfer/data/include/flashinfer/vec_dtypes.cuh' 2026-04-03T10:33:48,753 adding 'flashinfer/data/include/flashinfer/attention/batch_pod.cuh' 2026-04-03T10:33:48,756 adding 'flashinfer/data/include/flashinfer/attention/cascade.cuh' 2026-04-03T10:33:48,758 adding 'flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh' 2026-04-03T10:33:48,762 adding 'flashinfer/data/include/flashinfer/attention/decode.cuh' 2026-04-03T10:33:48,765 adding 'flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh' 2026-04-03T10:33:48,767 adding 'flashinfer/data/include/flashinfer/attention/default_decode_params.cuh' 2026-04-03T10:33:48,769 adding 'flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh' 2026-04-03T10:33:48,770 adding 'flashinfer/data/include/flashinfer/attention/heap.h' 2026-04-03T10:33:48,772 adding 'flashinfer/data/include/flashinfer/attention/hopper.cuh' 2026-04-03T10:33:48,773 adding 'flashinfer/data/include/flashinfer/attention/mask.cuh' 2026-04-03T10:33:48,777 adding 'flashinfer/data/include/flashinfer/attention/mla.cuh' 2026-04-03T10:33:48,782 adding 'flashinfer/data/include/flashinfer/attention/mla_hopper.cuh' 2026-04-03T10:33:48,783 adding 'flashinfer/data/include/flashinfer/attention/mla_params.cuh' 2026-04-03T10:33:48,786 adding 'flashinfer/data/include/flashinfer/attention/persistent.cuh' 2026-04-03T10:33:48,788 adding 'flashinfer/data/include/flashinfer/attention/persistent_template.cuh' 2026-04-03T10:33:48,791 adding 'flashinfer/data/include/flashinfer/attention/pod.cuh' 2026-04-03T10:33:48,800 adding 'flashinfer/data/include/flashinfer/attention/prefill.cuh' 2026-04-03T10:33:48,807 adding 'flashinfer/data/include/flashinfer/attention/scheduler.cuh' 2026-04-03T10:33:48,809 adding 'flashinfer/data/include/flashinfer/attention/state.cuh' 2026-04-03T10:33:48,811 adding 'flashinfer/data/include/flashinfer/attention/variant_helper.cuh' 2026-04-03T10:33:48,812 adding 'flashinfer/data/include/flashinfer/attention/variants.cuh' 2026-04-03T10:33:48,814 adding 'flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh' 2026-04-03T10:33:48,816 adding 'flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh' 2026-04-03T10:33:48,818 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp' 2026-04-03T10:33:48,820 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp' 2026-04-03T10:33:48,822 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp' 2026-04-03T10:33:48,826 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp' 2026-04-03T10:33:48,828 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp' 2026-04-03T10:33:48,832 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp' 2026-04-03T10:33:48,834 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp' 2026-04-03T10:33:48,836 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp' 2026-04-03T10:33:48,838 adding 'flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp' 2026-04-03T10:33:48,840 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp' 2026-04-03T10:33:48,842 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp' 2026-04-03T10:33:48,844 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp' 2026-04-03T10:33:48,845 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp' 2026-04-03T10:33:48,847 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp' 2026-04-03T10:33:48,850 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp' 2026-04-03T10:33:48,852 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp' 2026-04-03T10:33:48,854 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp' 2026-04-03T10:33:48,861 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp' 2026-04-03T10:33:48,863 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp' 2026-04-03T10:33:48,865 adding 'flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh' 2026-04-03T10:33:48,866 adding 'flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh' 2026-04-03T10:33:48,869 adding 'flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh' 2026-04-03T10:33:48,870 adding 'flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh' 2026-04-03T10:33:48,872 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh' 2026-04-03T10:33:48,874 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh' 2026-04-03T10:33:48,876 adding 'flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh' 2026-04-03T10:33:48,878 adding 'flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh' 2026-04-03T10:33:48,881 adding 'flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh' 2026-04-03T10:33:48,883 adding 'flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh' 2026-04-03T10:33:48,884 adding 'flashinfer/data/include/flashinfer/attention/hopper/utils.cuh' 2026-04-03T10:33:48,886 adding 'flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh' 2026-04-03T10:33:48,887 adding 'flashinfer/data/include/flashinfer/attention/hopper/variants.cuh' 2026-04-03T10:33:48,890 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh' 2026-04-03T10:33:48,891 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh' 2026-04-03T10:33:48,893 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh' 2026-04-03T10:33:48,895 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh' 2026-04-03T10:33:48,898 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh' 2026-04-03T10:33:48,901 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh' 2026-04-03T10:33:48,907 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh' 2026-04-03T10:33:48,913 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh' 2026-04-03T10:33:48,917 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh' 2026-04-03T10:33:48,919 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh' 2026-04-03T10:33:48,924 adding 'flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh' 2026-04-03T10:33:48,929 adding 'flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh' 2026-04-03T10:33:48,932 adding 'flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh' 2026-04-03T10:33:48,934 adding 'flashinfer/data/include/flashinfer/flat/common.hpp' 2026-04-03T10:33:48,936 adding 'flashinfer/data/include/flashinfer/flat/cute_ext.hpp' 2026-04-03T10:33:48,937 adding 'flashinfer/data/include/flashinfer/flat/debug.hpp' 2026-04-03T10:33:48,938 adding 'flashinfer/data/include/flashinfer/flat/math.hpp' 2026-04-03T10:33:48,940 adding 'flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp' 2026-04-03T10:33:48,941 adding 'flashinfer/data/include/flashinfer/flat/type_traits.hpp' 2026-04-03T10:33:48,942 adding 'flashinfer/data/include/flashinfer/flat/unused.hpp' 2026-04-03T10:33:48,945 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp' 2026-04-03T10:33:48,947 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp' 2026-04-03T10:33:48,949 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp' 2026-04-03T10:33:48,951 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp' 2026-04-03T10:33:48,957 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp' 2026-04-03T10:33:48,958 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp' 2026-04-03T10:33:48,960 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp' 2026-04-03T10:33:48,962 adding 'flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp' 2026-04-03T10:33:48,964 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp' 2026-04-03T10:33:48,967 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp' 2026-04-03T10:33:48,968 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp' 2026-04-03T10:33:48,970 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp' 2026-04-03T10:33:48,971 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp' 2026-04-03T10:33:48,973 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh' 2026-04-03T10:33:48,976 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h' 2026-04-03T10:33:48,977 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h' 2026-04-03T10:33:48,979 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h' 2026-04-03T10:33:48,981 adding 'flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh' 2026-04-03T10:33:48,983 adding 'flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h' 2026-04-03T10:33:48,984 adding 'flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh' 2026-04-03T10:33:48,986 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h' 2026-04-03T10:33:48,988 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h' 2026-04-03T10:33:48,990 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h' 2026-04-03T10:33:48,992 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h' 2026-04-03T10:33:48,994 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h' 2026-04-03T10:33:48,997 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h' 2026-04-03T10:33:48,999 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h' 2026-04-03T10:33:49,000 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h' 2026-04-03T10:33:49,002 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h' 2026-04-03T10:33:49,004 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h' 2026-04-03T10:33:49,006 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh' 2026-04-03T10:33:49,007 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh' 2026-04-03T10:33:49,009 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm.cuh' 2026-04-03T10:33:49,011 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh' 2026-04-03T10:33:49,013 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh' 2026-04-03T10:33:49,014 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh' 2026-04-03T10:33:49,016 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh' 2026-04-03T10:33:49,018 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh' 2026-04-03T10:33:49,020 adding 'flashinfer/data/include/flashinfer/gemm/group_gemv.cuh' 2026-04-03T10:33:49,021 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h' 2026-04-03T10:33:49,023 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h' 2026-04-03T10:33:49,026 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h' 2026-04-03T10:33:49,035 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh' 2026-04-03T10:33:49,037 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h' 2026-04-03T10:33:49,038 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h' 2026-04-03T10:33:49,040 adding 'flashinfer/data/include/flashinfer/mamba/common.cuh' 2026-04-03T10:33:49,042 adding 'flashinfer/data/include/flashinfer/mamba/conversion.cuh' 2026-04-03T10:33:49,044 adding 'flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh' 2026-04-03T10:33:49,047 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp.cuh' 2026-04-03T10:33:49,052 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh' 2026-04-03T10:33:49,054 adding 'flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh' 2026-04-03T10:33:49,056 adding 'flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh' 2026-04-03T10:33:49,058 adding 'flashinfer/data/include/flashinfer/trtllm/common.h' 2026-04-03T10:33:49,061 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h' 2026-04-03T10:33:49,063 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh' 2026-04-03T10:33:49,064 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h' 2026-04-03T10:33:49,066 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h' 2026-04-03T10:33:49,068 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh' 2026-04-03T10:33:49,070 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h' 2026-04-03T10:33:49,072 adding 'flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh' 2026-04-03T10:33:49,075 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h' 2026-04-03T10:33:49,077 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h' 2026-04-03T10:33:49,082 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh' 2026-04-03T10:33:49,084 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h' 2026-04-03T10:33:49,085 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh' 2026-04-03T10:33:49,087 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h' 2026-04-03T10:33:49,092 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h' 2026-04-03T10:33:49,093 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h' 2026-04-03T10:33:49,094 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh' 2026-04-03T10:33:49,097 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h' 2026-04-03T10:33:49,099 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h' 2026-04-03T10:33:49,101 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh' 2026-04-03T10:33:49,103 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h' 2026-04-03T10:33:49,105 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh' 2026-04-03T10:33:49,106 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h' 2026-04-03T10:33:49,108 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h' 2026-04-03T10:33:49,112 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/Enums.h' 2026-04-03T10:33:49,115 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmInterface.h' 2026-04-03T10:33:49,123 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/GemmOptions.h' 2026-04-03T10:33:49,126 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParams.h' 2026-04-03T10:33:49,128 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelParamsDecl.h' 2026-04-03T10:33:49,131 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/KernelTraits.h' 2026-04-03T10:33:49,133 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/TmaDescriptor.h' 2026-04-03T10:33:49,136 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CommonUtils.h' 2026-04-03T10:33:49,137 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaArchDecl.h' 2026-04-03T10:33:49,139 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/CudaKernelLauncher.h' 2026-04-03T10:33:49,140 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/DtypeDecl.h' 2026-04-03T10:33:49,142 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/MmaDecl.h' 2026-04-03T10:33:49,143 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SfLayoutDecl.h' 2026-04-03T10:33:49,145 adding 'flashinfer/data/include/flashinfer/trtllm/gemm/trtllmGen_gemm_export/trtllm/gen/SparsityDecl.h' 2026-04-03T10:33:49,147 adding 'flashinfer/data/spdlog/include/spdlog/async.h' 2026-04-03T10:33:49,149 adding 'flashinfer/data/spdlog/include/spdlog/async_logger-inl.h' 2026-04-03T10:33:49,150 adding 'flashinfer/data/spdlog/include/spdlog/async_logger.h' 2026-04-03T10:33:49,151 adding 'flashinfer/data/spdlog/include/spdlog/common-inl.h' 2026-04-03T10:33:49,153 adding 'flashinfer/data/spdlog/include/spdlog/common.h' 2026-04-03T10:33:49,155 adding 'flashinfer/data/spdlog/include/spdlog/formatter.h' 2026-04-03T10:33:49,156 adding 'flashinfer/data/spdlog/include/spdlog/fwd.h' 2026-04-03T10:33:49,158 adding 'flashinfer/data/spdlog/include/spdlog/logger-inl.h' 2026-04-03T10:33:49,160 adding 'flashinfer/data/spdlog/include/spdlog/logger.h' 2026-04-03T10:33:49,162 adding 'flashinfer/data/spdlog/include/spdlog/mdc.h' 2026-04-03T10:33:49,166 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h' 2026-04-03T10:33:49,168 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter.h' 2026-04-03T10:33:49,169 adding 'flashinfer/data/spdlog/include/spdlog/spdlog-inl.h' 2026-04-03T10:33:49,171 adding 'flashinfer/data/spdlog/include/spdlog/spdlog.h' 2026-04-03T10:33:49,172 adding 'flashinfer/data/spdlog/include/spdlog/stopwatch.h' 2026-04-03T10:33:49,174 adding 'flashinfer/data/spdlog/include/spdlog/tweakme.h' 2026-04-03T10:33:49,175 adding 'flashinfer/data/spdlog/include/spdlog/version.h' 2026-04-03T10:33:49,176 adding 'flashinfer/data/spdlog/include/spdlog/cfg/argv.h' 2026-04-03T10:33:49,178 adding 'flashinfer/data/spdlog/include/spdlog/cfg/env.h' 2026-04-03T10:33:49,179 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h' 2026-04-03T10:33:49,180 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers.h' 2026-04-03T10:33:49,182 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h' 2026-04-03T10:33:49,183 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer.h' 2026-04-03T10:33:49,185 adding 'flashinfer/data/spdlog/include/spdlog/details/circular_q.h' 2026-04-03T10:33:49,186 adding 'flashinfer/data/spdlog/include/spdlog/details/console_globals.h' 2026-04-03T10:33:49,187 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h' 2026-04-03T10:33:49,189 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper.h' 2026-04-03T10:33:49,190 adding 'flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h' 2026-04-03T10:33:49,191 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h' 2026-04-03T10:33:49,193 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg.h' 2026-04-03T10:33:49,194 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h' 2026-04-03T10:33:49,195 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h' 2026-04-03T10:33:49,196 adding 'flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h' 2026-04-03T10:33:49,198 adding 'flashinfer/data/spdlog/include/spdlog/details/null_mutex.h' 2026-04-03T10:33:49,200 adding 'flashinfer/data/spdlog/include/spdlog/details/os-inl.h' 2026-04-03T10:33:49,202 adding 'flashinfer/data/spdlog/include/spdlog/details/os.h' 2026-04-03T10:33:49,203 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h' 2026-04-03T10:33:49,204 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h' 2026-04-03T10:33:49,206 adding 'flashinfer/data/spdlog/include/spdlog/details/registry-inl.h' 2026-04-03T10:33:49,207 adding 'flashinfer/data/spdlog/include/spdlog/details/registry.h' 2026-04-03T10:33:49,209 adding 'flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h' 2026-04-03T10:33:49,210 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h' 2026-04-03T10:33:49,212 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client.h' 2026-04-03T10:33:49,213 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h' 2026-04-03T10:33:49,215 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool.h' 2026-04-03T10:33:49,216 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h' 2026-04-03T10:33:49,217 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client.h' 2026-04-03T10:33:49,218 adding 'flashinfer/data/spdlog/include/spdlog/details/windows_include.h' 2026-04-03T10:33:49,221 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h' 2026-04-03T10:33:49,222 adding 'flashinfer/data/spdlog/include/spdlog/fmt/chrono.h' 2026-04-03T10:33:49,223 adding 'flashinfer/data/spdlog/include/spdlog/fmt/compile.h' 2026-04-03T10:33:49,224 adding 'flashinfer/data/spdlog/include/spdlog/fmt/fmt.h' 2026-04-03T10:33:49,226 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ostr.h' 2026-04-03T10:33:49,227 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ranges.h' 2026-04-03T10:33:49,228 adding 'flashinfer/data/spdlog/include/spdlog/fmt/std.h' 2026-04-03T10:33:49,230 adding 'flashinfer/data/spdlog/include/spdlog/fmt/xchar.h' 2026-04-03T10:33:49,232 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h' 2026-04-03T10:33:49,240 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h' 2026-04-03T10:33:49,243 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h' 2026-04-03T10:33:49,246 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h' 2026-04-03T10:33:49,258 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h' 2026-04-03T10:33:49,260 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst' 2026-04-03T10:33:49,269 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h' 2026-04-03T10:33:49,290 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h' 2026-04-03T10:33:49,292 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h' 2026-04-03T10:33:49,294 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h' 2026-04-03T10:33:49,296 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h' 2026-04-03T10:33:49,299 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h' 2026-04-03T10:33:49,302 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h' 2026-04-03T10:33:49,304 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h' 2026-04-03T10:33:49,306 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h' 2026-04-03T10:33:49,308 adding 'flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h' 2026-04-03T10:33:49,310 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h' 2026-04-03T10:33:49,311 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h' 2026-04-03T10:33:49,313 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h' 2026-04-03T10:33:49,314 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h' 2026-04-03T10:33:49,315 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h' 2026-04-03T10:33:49,316 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h' 2026-04-03T10:33:49,318 adding 'flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h' 2026-04-03T10:33:49,320 adding 'flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h' 2026-04-03T10:33:49,321 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h' 2026-04-03T10:33:49,323 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h' 2026-04-03T10:33:49,324 adding 'flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h' 2026-04-03T10:33:49,326 adding 'flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h' 2026-04-03T10:33:49,327 adding 'flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h' 2026-04-03T10:33:49,329 adding 'flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h' 2026-04-03T10:33:49,330 adding 'flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h' 2026-04-03T10:33:49,331 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h' 2026-04-03T10:33:49,333 adding 'flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h' 2026-04-03T10:33:49,335 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h' 2026-04-03T10:33:49,336 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h' 2026-04-03T10:33:49,338 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h' 2026-04-03T10:33:49,339 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h' 2026-04-03T10:33:49,340 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink.h' 2026-04-03T10:33:49,341 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h' 2026-04-03T10:33:49,343 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h' 2026-04-03T10:33:49,344 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h' 2026-04-03T10:33:49,346 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h' 2026-04-03T10:33:49,347 adding 'flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h' 2026-04-03T10:33:49,349 adding 'flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h' 2026-04-03T10:33:49,350 adding 'flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h' 2026-04-03T10:33:49,351 adding 'flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h' 2026-04-03T10:33:49,353 adding 'flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h' 2026-04-03T10:33:49,355 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h' 2026-04-03T10:33:49,356 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h' 2026-04-03T10:33:49,358 adding 'flashinfer/data/spdlog/scripts/extract_version.py' 2026-04-03T10:33:49,360 adding 'flashinfer/dsv3_ops/__init__.py' 2026-04-03T10:33:49,361 adding 'flashinfer/fused_moe/__init__.py' 2026-04-03T10:33:49,369 adding 'flashinfer/fused_moe/core.py' 2026-04-03T10:33:49,371 adding 'flashinfer/fused_moe/fused_routing_dsv3.py' 2026-04-03T10:33:49,373 adding 'flashinfer/fused_moe/utils.py' 2026-04-03T10:33:49,375 adding 'flashinfer/fused_moe/cute_dsl/__init__.py' 2026-04-03T10:33:49,378 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-03T10:33:49,381 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-03T10:33:49,384 adding 'flashinfer/fused_moe/cute_dsl/fused_moe.py' 2026-04-03T10:33:49,387 adding 'flashinfer/fused_moe/cute_dsl/moe_utils.py' 2026-04-03T10:33:49,390 adding 'flashinfer/fused_moe/cute_dsl/tuner.py' 2026-04-03T10:33:49,391 adding 'flashinfer/fused_moe/cute_dsl/blackwell/__init__.py' 2026-04-03T10:33:49,404 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-03T10:33:49,415 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-03T10:33:49,418 adding 'flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py' 2026-04-03T10:33:49,420 adding 'flashinfer/fused_moe/cute_dsl/blackwell/utils.py' 2026-04-03T10:33:49,422 adding 'flashinfer/gdn_kernels/__init__.py' 2026-04-03T10:33:49,427 adding 'flashinfer/gdn_kernels/gdn_decode_bf16_state.py' 2026-04-03T10:33:49,435 adding 'flashinfer/gdn_kernels/gdn_decode_mtp.py' 2026-04-03T10:33:49,438 adding 'flashinfer/gdn_kernels/gdn_decode_nontranspose.py' 2026-04-03T10:33:49,441 adding 'flashinfer/gdn_kernels/gdn_decode_pretranspose.py' 2026-04-03T10:33:49,443 adding 'flashinfer/gdn_kernels/blackwell_prefill/__init__.py' 2026-04-03T10:33:49,456 adding 'flashinfer/gdn_kernels/blackwell_prefill/gdn.py' 2026-04-03T10:33:49,459 adding 'flashinfer/gdn_kernels/blackwell_prefill/gdn_helpers.py' 2026-04-03T10:33:49,460 adding 'flashinfer/gdn_kernels/blackwell_prefill/gdn_tile_scheduler.py' 2026-04-03T10:33:49,462 adding 'flashinfer/gemm/__init__.py' 2026-04-03T10:33:49,483 adding 'flashinfer/gemm/gemm_base.py' 2026-04-03T10:33:49,486 adding 'flashinfer/gemm/routergemm.py' 2026-04-03T10:33:49,488 adding 'flashinfer/gemm/kernels/__init__.py' 2026-04-03T10:33:49,496 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py' 2026-04-03T10:33:49,506 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py' 2026-04-03T10:33:49,517 adding 'flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py' 2026-04-03T10:33:49,520 adding 'flashinfer/jit/__init__.py' 2026-04-03T10:33:49,521 adding 'flashinfer/jit/activation.py' 2026-04-03T10:33:49,523 adding 'flashinfer/jit/cascade.py' 2026-04-03T10:33:49,524 adding 'flashinfer/jit/comm.py' 2026-04-03T10:33:49,526 adding 'flashinfer/jit/core.py' 2026-04-03T10:33:49,528 adding 'flashinfer/jit/cpp_ext.py' 2026-04-03T10:33:49,530 adding 'flashinfer/jit/cubin_loader.py' 2026-04-03T10:33:49,532 adding 'flashinfer/jit/dsv3_optimizations.py' 2026-04-03T10:33:49,533 adding 'flashinfer/jit/env.py' 2026-04-03T10:33:49,535 adding 'flashinfer/jit/fp4_kv_dequantization.py' 2026-04-03T10:33:49,536 adding 'flashinfer/jit/fp4_kv_quantization.py' 2026-04-03T10:33:49,537 adding 'flashinfer/jit/fp4_quantization.py' 2026-04-03T10:33:49,539 adding 'flashinfer/jit/fp8_quantization.py' 2026-04-03T10:33:49,540 adding 'flashinfer/jit/fused_moe.py' 2026-04-03T10:33:49,542 adding 'flashinfer/jit/gdn.py' 2026-04-03T10:33:49,543 adding 'flashinfer/jit/mla.py' 2026-04-03T10:33:49,544 adding 'flashinfer/jit/moe_utils.py' 2026-04-03T10:33:49,545 adding 'flashinfer/jit/norm.py' 2026-04-03T10:33:49,547 adding 'flashinfer/jit/page.py' 2026-04-03T10:33:49,548 adding 'flashinfer/jit/quantization.py' 2026-04-03T10:33:49,549 adding 'flashinfer/jit/rope.py' 2026-04-03T10:33:49,550 adding 'flashinfer/jit/sampling.py' 2026-04-03T10:33:49,551 adding 'flashinfer/jit/spdlog.py' 2026-04-03T10:33:49,552 adding 'flashinfer/jit/tinygemm2.py' 2026-04-03T10:33:49,553 adding 'flashinfer/jit/tllm_utils.py' 2026-04-03T10:33:49,554 adding 'flashinfer/jit/topk.py' 2026-04-03T10:33:49,556 adding 'flashinfer/jit/utils.py' 2026-04-03T10:33:49,557 adding 'flashinfer/jit/xqa.py' 2026-04-03T10:33:49,559 adding 'flashinfer/jit/attention/__init__.py' 2026-04-03T10:33:49,563 adding 'flashinfer/jit/attention/modules.py' 2026-04-03T10:33:49,565 adding 'flashinfer/jit/attention/utils.py' 2026-04-03T10:33:49,566 adding 'flashinfer/jit/attention/variants.py' 2026-04-03T10:33:49,572 adding 'flashinfer/jit/attention/fmha_v2/fmha_library.py' 2026-04-03T10:33:49,573 adding 'flashinfer/jit/attention/fmha_v2/generate_kernels.py' 2026-04-03T10:33:49,589 adding 'flashinfer/jit/attention/fmha_v2/generator_utils.py' 2026-04-03T10:33:49,594 adding 'flashinfer/jit/attention/fmha_v2/utils.py' 2026-04-03T10:33:49,596 adding 'flashinfer/jit/gemm/__init__.py' 2026-04-03T10:33:49,598 adding 'flashinfer/jit/gemm/core.py' 2026-04-03T10:33:49,599 adding 'flashinfer/jit/gemm/deepgemm.py' 2026-04-03T10:33:49,601 adding 'flashinfer/jit/gemm/fp8_blockscale.py' 2026-04-03T10:33:49,602 adding 'flashinfer/jit/gemm/cutlass/__init__.py' 2026-04-03T10:33:49,607 adding 'flashinfer/jit/gemm/cutlass/cutlass_library.py' 2026-04-03T10:33:49,611 adding 'flashinfer/jit/gemm/cutlass/generate_kernels.py' 2026-04-03T10:33:49,613 adding 'flashinfer/jit/mamba/__init__.py' 2026-04-03T10:33:49,614 adding 'flashinfer/jit/mamba/selective_state_update.py' 2026-04-03T10:33:49,615 adding 'flashinfer/jit/mamba/seq_chunk_cumsum.py' 2026-04-03T10:33:49,617 adding 'flashinfer/logits_processor/__init__.py' 2026-04-03T10:33:49,619 adding 'flashinfer/logits_processor/compiler.py' 2026-04-03T10:33:49,620 adding 'flashinfer/logits_processor/fusion_rules.py' 2026-04-03T10:33:49,622 adding 'flashinfer/logits_processor/legalization.py' 2026-04-03T10:33:49,623 adding 'flashinfer/logits_processor/op.py' 2026-04-03T10:33:49,625 adding 'flashinfer/logits_processor/operators.py' 2026-04-03T10:33:49,626 adding 'flashinfer/logits_processor/pipeline.py' 2026-04-03T10:33:49,628 adding 'flashinfer/logits_processor/processors.py' 2026-04-03T10:33:49,630 adding 'flashinfer/logits_processor/types.py' 2026-04-03T10:33:49,631 adding 'flashinfer/logits_processor/validators.py' 2026-04-03T10:33:49,633 adding 'flashinfer/mamba/__init__.py' 2026-04-03T10:33:49,635 adding 'flashinfer/mamba/selective_state_update.py' 2026-04-03T10:33:49,638 adding 'flashinfer/mamba/ssd_combined.py' 2026-04-03T10:33:49,652 adding 'flashinfer/mamba/ssd_kernel.py' 2026-04-03T10:33:49,655 adding 'flashinfer/mamba/ssd_tile_scheduler.py' 2026-04-03T10:33:49,657 adding 'flashinfer/norm/__init__.py' 2026-04-03T10:33:49,660 adding 'flashinfer/norm/utils.py' 2026-04-03T10:33:49,662 adding 'flashinfer/norm/kernels/__init__.py' 2026-04-03T10:33:49,665 adding 'flashinfer/norm/kernels/fused_add_rmsnorm.py' 2026-04-03T10:33:49,667 adding 'flashinfer/norm/kernels/layernorm.py' 2026-04-03T10:33:49,671 adding 'flashinfer/norm/kernels/rmsnorm.py' 2026-04-03T10:33:49,673 adding 'flashinfer/profiler/__init__.py' 2026-04-03T10:33:49,675 adding 'flashinfer/quantization/__init__.py' 2026-04-03T10:33:49,680 adding 'flashinfer/quantization/fp4_quantization.py' 2026-04-03T10:33:49,682 adding 'flashinfer/quantization/fp8_quantization.py' 2026-04-03T10:33:49,683 adding 'flashinfer/quantization/packbits.py' 2026-04-03T10:33:49,686 adding 'flashinfer/quantization/quantization_cute_dsl_utils.py' 2026-04-03T10:33:49,688 adding 'flashinfer/quantization/kernels/__init__.py' 2026-04-03T10:33:49,691 adding 'flashinfer/quantization/kernels/mxfp4_quantize.py' 2026-04-03T10:33:49,694 adding 'flashinfer/quantization/kernels/mxfp8_quantize.py' 2026-04-03T10:33:49,695 adding 'flashinfer/testing/__init__.py' 2026-04-03T10:33:49,702 adding 'flashinfer/testing/utils.py' 2026-04-03T10:33:49,704 adding 'flashinfer/triton/__init__.py' 2026-04-03T10:33:49,705 adding 'flashinfer/triton/activation.py' 2026-04-03T10:33:49,706 adding 'flashinfer/triton/cascade.py' 2026-04-03T10:33:49,708 adding 'flashinfer/triton/gemm.py' 2026-04-03T10:33:49,709 adding 'flashinfer/triton/norm.py' 2026-04-03T10:33:49,710 adding 'flashinfer/triton/page.py' 2026-04-03T10:33:49,712 adding 'flashinfer/triton/sm_constraint_gemm.py' 2026-04-03T10:33:49,713 adding 'flashinfer/triton/utils.py' 2026-04-03T10:33:49,714 adding 'flashinfer/triton/kernels/__init__.py' 2026-04-03T10:33:49,716 adding 'flashinfer/triton/kernels/activation.py' 2026-04-03T10:33:49,717 adding 'flashinfer/triton/kernels/cascade.py' 2026-04-03T10:33:49,718 adding 'flashinfer/triton/kernels/norm.py' 2026-04-03T10:33:49,720 adding 'flashinfer/triton/kernels/quant.py' 2026-04-03T10:33:49,721 adding 'flashinfer/triton/kernels/sm_constraint_gemm.py' 2026-04-03T10:33:49,723 adding 'flashinfer/triton/kernels/ssd_chunk_state.py' 2026-04-03T10:33:49,725 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py' 2026-04-03T10:33:49,726 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py' 2026-04-03T10:33:49,730 adding 'flashinfer_python-0.6.7.post1.dist-info/licenses/LICENSE' 2026-04-03T10:33:49,732 adding 'flashinfer_python-0.6.7.post1.dist-info/METADATA' 2026-04-03T10:33:49,733 adding 'flashinfer_python-0.6.7.post1.dist-info/WHEEL' 2026-04-03T10:33:49,734 adding 'flashinfer_python-0.6.7.post1.dist-info/entry_points.txt' 2026-04-03T10:33:49,735 adding 'flashinfer_python-0.6.7.post1.dist-info/top_level.txt' 2026-04-03T10:33:49,777 adding 'flashinfer_python-0.6.7.post1.dist-info/RECORD' 2026-04-03T10:33:49,925 removing build/bdist.linux-armv7l/wheel 2026-04-03T10:33:50,990 Building wheel for flashinfer-python (pyproject.toml): finished with status 'done' 2026-04-03T10:33:51,207 Created wheel for flashinfer-python: filename=flashinfer_python-0.6.7.post1-py3-none-any.whl size=9185339 sha256=c9bf5183228f6636ddb26d7354f250af4b2385876527538a0ff7f94fd48207d2 2026-04-03T10:33:51,208 Stored in directory: /tmp/pip-ephem-wheel-cache-v9jb6c71/wheels/53/91/5d/928fedbdc0f74f9108b12952b2459637053d0c938a75f9b2af 2026-04-03T10:33:51,294 Successfully built flashinfer-python 2026-04-03T10:33:51,503 Removed build tracker: '/tmp/pip-build-tracker-ni7c15lb'